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^ (54) Title: NUCLEIC ACID SEQUENCES DIFFERENTIALLY EXPRESSED IN CANCER TISSUE 



(57) Abstract: This invention relates to novel nucleic acid sequences which are differentially expressed in cancer cells. The in- 
vention also relates to proteins and peptides encoded by the sequences, to diagnostic assays and therapeutic agents based on the 
sequences and proteins, and to probes, antisense constructs, and antibodies derived from the sequences and proteins or peptides. The 
subject nucleic acids have been found to be differentially expressed by tumor cells, particularly in colon cancer tissue. 
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NUCLEIC ACID SEQT JENCES DIFFERENTIALLY Ey PRRSSED IN CANCER TISSUE 

fi^U Qffegluveqtion 

The present invention provides nucleic acid sequences and proteins encoded thereby 
5 which are differentially expressed in cancer tissues, as well as probes derived from the nucleic 
acid sequences, antibodies directed to the encoded proteins, and diagnostic methods for 
determining the presence and state of cancerous cells, especially colon cancer cells. 

Packground of the Invention 

Colorectal carcinoma is a malignant neoplastic disease. There is a high incidence of 
10 colorectal carcinoma in the Western world, particularly in the United States. Tumors of this type 
often metastasize through lymphatic and vascular channels. Many patients with colorectal 
carcinoma eventually die from this disease. In fact, it is estimated that 62,000 persons in the 
United States alone die of colorectal carcinoma annually. 

However, if diagnosed early, colon cancer may be treated effectively by surgical removal 
15 of the cancerous tissue. Colorectal cancers originate in the colorectal epithelium and typically 
are not extensively vascularized (and therefore not invasive) during the early stages of 
development. Colorectal cancer is thought to result from the clonal expansion of a single mutant 
cell m the epithelial lining of the colon or rectum. The transition to a highly vascularized; 
invasive and ultimately metastatic cancer which spreads throughout the body commonly takes 
20 ten years or longer. If the cancer is detected prior to invasion, surgical removal of the cancerous 
tissue is an effective cure. However, colorectal cancer is often detected only upon manifestation 
of cUnical symptoms, such as pain and black tarry stool. Generally, such symptoms are present 
only when the disease is well established, often after metastasis has occurred, and the prognosis 
for the patient is poor, even after surgical resection of the cancerous tissue. Early detection of 
25 colorectal cancer therefore is important in that detection may significantly reduce its morbidity. 

Invasive diagnostic methods such as endoscopic examination allow for dkect visual 
identification, removal, and biopsy of potentially cancerous growths such as polyps. Endoscopy 
is expensive, imcomfortable, inherently risky, and therefore not a practical tool for screening 
populations to identify those with colorectal cancer. Non-invasive analysis of stool samples for 
30 characteristics indicative of the presence of colorectal cancer or precancer is a preferred 
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alternative for early diagnosis, but no known diagnostic method is available which reliably 
achieves this goal, 

Summary of the Invention 

The present invention provides nucleic acid sequences and proteins encoded thereby, as 
5 well as probes derived from the nucleic acid sequences, antibodies directed to the encoded 

proteins, and diagnostic methods for detecting cancerous cells, especially colon cancer cells. The 
sequences disclosed herein have been found to be differentially expressed in colon cancer cell 
Unes and/or colon cancer tissue. 

In one aspect, the invention provides an isolated nucleic acid sequence comprising SEQ 
10 ID Nos 1-503, or a sequence complementary thereto. 

In another aspect, the invention provides an isolated nucleic acid comprising a nucleotide 
sequence which hybridizes under stringent conditions to a sequence of SEQ ED Nos. 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 
complementary thereto. 

15 In another embodiment, the nucleic acid is at least about 80% to about 100% identical to 

a sequence corresponding to at least about 12, at least about 15, at least about 25, or at least 
about 40 consecutive nucleotides up to the fiill length of one of SEQ ID Nos. 1-4470, 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 
complementary thereto. 

20 In another aspect, the invention provides an isolated nucleic acid comprising a nucleotide 

sequence which hybridizes under struigent conditions to a sequence of SEQ ID Nos. 1-1 103, 
preferably SEQ ID Nos. 1-503, or a sequence complementary thereto. In a related embodunent, 
the nucleic acid is at least about 80% or about 100% identical to a sequence corresponding to at 
least about 12, at least about 15, at least about 25, or at least about 40 consecutive nucleotides up 

25 to the full length of one of SEQ ID Nos. 1-1 103, preferably SEQ ID Nos. 1-503 or a sequence 
complementary thereto. 

In one embodiment, the invention provides a nucleic acid comprising a nucleotide 
sequence which hybridizes under stringent conditions to a sequence of SEQ ID Nos. 1-1 103, 
preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, and a transcriptional 
30 regulatory sequence operably linked to the nucleotide sequence to render the nucleotide sequence 
suitable for use as an expression vector. In another embodiment, the nucleic acid may be 
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included in an expression vector capable of replicating in a prokaryotic or enkaryotic celL In a 
related embodiment, the invention provides a host cell transfected with the expression vector. 

In another embodiment, the invention provides a transgenic animal having a transgene of 
a nucleic acid comprising a nucleotide sequence which hybridizes under stringent conditions to a 
5 sequence of SEQ ID Nos. 1-1103, preferably SEQ ID Nos 1-503, or a sequence complementary 
thereto incorporated in cells thereof. The transgene modifies the level of expression of the 
nucleic acid, the stability of a mRNA transcript of the nucleic acid, or the activity of the encoded 
product of the nucleic acid. 

In yet another embodiment, the invention provides a substantially pure nucleic acid 
10 comprising the nucleotide sequence of SEQ ID Nos 1-1 103, or a sequence complementary 
thereto. 

In yet another embodiment, the invention provides a substantially pure nucleic acid 
which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 
about 12, at least about 15, at least about 25, or at least about 40 consecutive nucleotides up to 
15 the full length of one of SEQ ID Nos. 1-1 103, preferably SEQ ID Nos 1-503, or a sequence 
complementary thereto. 

The invention also provides an antisense oligonucleotide analog which hybridizes 
under stringent conditions to at least 12, at least 25, or at least 50 consecutive nucleotides of one 
of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
20 4494 up to the full length of one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480. 4482, 
4484, 4486, 4488, 4490, 4492, and 4494 or a sequence complementary thereto, and which is 
resistant to cleavage by a nuclease, preferably an endogenous endonuclease or exonuclease. 

In another embodiment, the invention provides a probe/primer comprising a substantially 
purified oligonucleotide comprising at least about 12, at least about 15, at least about 25, or at 
25 least about 40 consecutive nucleotides of SEQ ED Nos 1-1 103, or a sequence complementary 
thereto. 

In another embodiment, the invention provides a probe/primer comprising a substantially 
purified oligonucleotide, said oligonucleotide containing a region of nucleotide sequence which 
hybridizes xmder stringent conditions to at least about 12, at least about 15, at least about 25, or 
30 at least about 40 consecutive nucleotides of sense or antisense sequence selected from SEQ ID 
Nos. 1-1 103 up to the full length of one of SEQ ID Nos. 1-1 103 or a sequence complementary 
thereto. In preferred embodiments, the probe selectively hybridizes with a target nucleic acid. In 
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another embodiment, the probe may include a label group attached thereto and able to be 
detected. The label group may be selected from radioisotopes, fluorescent compoxmds, enzymes, 
and enzyme co-factors. The invention further provides arrays of at least about 10, at least about 
25, at least about 50, or at least about 100 different probes as described above attached to a solid 
5 support. 

In yet another embodiment, the invention pertains to a method of determining the 
phenotype of a cell comprising detecting the differential expression, relative to a normal cell, of 
at least one nucleic acid of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 
4486, 4488, 4490, 4492, and 4494, wherein the nucleic acid is differentially expressed by at least 
10 a factor of two, at least a factor of five, at least a factor of twenty, or at least a factor of fifty. 

In a still further embodiment, the invention pertains to a method of determining the 
phenotype of cell, comprising detecting the differential expression, relative to a normal cell, of at 
least one protein encoded by a nucleic acid which hybridizes under stringent conditions to a 
sequence selected from the group consisting of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 
15 4482, 4484, 4486, 4488, 4490, 4492, and 4494, wherein the protein is differentially expressed by 
at least a factor of two, at least a factor of five, at least a factor of twenty, an up to at least a 
factor of 50. 

The invention further provides a method of determining the phenotype of cell, 
comprising detecting the differential expression, relative to a normal cell, of at least one 
20 polypeptide selected from the group of polypeptides of SEQ ID Nos. 4471 , 4473, 4475, 4477, 
4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493, wherein the polypeptide is differentially 
expressed by at least a factor of two, at least a factor of five, at least a factor of twenty, an up to 
at least a fector of 50. 

In yet another embodiment, the invention pertains to a method of determining the 
25 phenotype of a cell comprising detecting the differential expression, relative to a normal cell, of 
at least one nucleic acid which hybridizes imder stringent conditions to one of SEQ ID Nos. 1- 
4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, wherein the 
nucleic acid is differentially expressed by at least a factor of two, at least a factor of five, at least 
a factor of twenty, or at least a factor of fifty. 

30 In another aspect, the invention provides polypeptides encoded by the subject nucleic 

acids. In one embodiment, the invention pertains to a polypeptide including an amino acid 
sequence encoded by a nucleic acid comprising a nucleotide sequence which hybridizes under 
stringent conditions to a sequence of SEQ ID Nos. 1-1103 or a sequence complementary thereto. 
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or a fragment comprising at least about 25, or at least about 40 amino acids thereof. Further 
provided are antibodies immunoreactive with these polypeptides. 

In a further aspect the invention pertains to a polypeptide encoded by one or more of the 
sequences of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, 
5 and 4494. 

In a still further aspect the invention pertains to a polypeptide having the sequence of one 
or SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 44857, 4489, 4491, and 4493. 

In still another aspect, the invention provides diagnostic methods. In one embodiment, 
the invention pertains to a method for determining the phenotype of cells from a patient by 

10 providing a nucleic acid probe comprising a nucleotide sequence having at least 10, at least about 
15, at least about 25, or at least about 40 consecutive nucleotides represented in a sequence of 
SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494 up to the full length of one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 
4484, 4486, 4488, 4490, 4492, and 4494 or a sequence complementary thereto, obtaining a 

15 sample of cells from a patient, optionally providing a second sample of cells substantially all of 
which are non-cancerous, contacting the nucleic acid probe under stringent conditions with 
mRNA of each of said first and second cell samples, and comparing (a) the amount of 
hybridization of the probe with mRNA of the first cell sample, with (b) the amount of 
hybridization of the probe with mKNA of the second cell sample, wherein a difference of at least 

20 a factor of two, at least a factor of five, at least a factor of twenty, or at least a factor of fifty in 
the amount of hybridization with the mRNA of the first cell sample as compared to the amount 
of hybridization with the mRNA of the second cell sample is indicative of the phenotype of cells 
in the first cell sample. Determining the phenotype includes determining the genotype, as the 
term is used herein. 

25 In another embodiment, the invention provides a test kit for identifying the presence of 

cancerous cells or tissues, comprising a probe/primer as described above, for measuring a level 
of a nucleic acid which hybridizes under stringent conditions to a nucleic acid of SEQ ID Nos, 1- 
4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 in a sample 
of cells isolated from a patient. In certain embodiments, the kit may further include instructions 

30 for using the kit, solutions for suspending or fixing the cells, detectable tags or labels, solutions 
for rendering a nucleic acid susceptible to hybridization, solutions for lysing cells, or solutions 
for the purification of nucleic acids. 
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In another embodiment, the invention provides a method of determining the phenotype of 
a cell, comprising detecting the differential expression, relative to a normal or control cell, of at 
least one protein encoded by a nucleic acid which hybridizes under stringent conditions to one of 
SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 

5 4494^ or a sequence complementary thereto, wherein the protein is differentially expressed by at 
least a factor of two, at least a factor of five, at least a factor of twenty, or at least a factor of 
fifty. In one embodiment, the level of the protein is detected in an immunoassay. The invention 
also pertains to a method for determining the presence or absence of a nucleic acid, such as 
mRNA, which hybridizes under stringent conditions to one of SEQ ID Nos. 1-11 03 in a cell, 

1 0 comprising contacting the cell with a probe as described above. The invention fiirther provides a 
method for deteimining the presence or absence of a subject polypeptide encoded by a nucleic 
acid which hybridizes under stringent conditions to one of SEQ ID Nos. 1- 1103 in a cell, 
comprising contacting the cell with an antibody as described above. 

In yet another embodiment, the invention provides a method for determining the presence 
15 of an aberrant mutation (e.g., deletion, insertion, or substitution of nucleic acids) or aberrant 
methylation in a sequence which hybridizes under stringent conditions to a sequence of SEQ ID 
Nos. 1-1 103 or a sequence complementary thereto, comprising collecting a sample of cells fi:om 
a patient, isolating nucleic acid from the cells of the sample, contacting the nucleic acid sample 
with one or more probe/primers which specifically hybridize to a nucleic acid sequence of SEQ 
20 ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, 
or a sequence complementary thereto, under conditions such that hybridization and/or 
amplification of the nucleic acid occurs, and comparing the presence, absence, or size of an 
amplification product to the amplification product of a normal cell. 

In one embodiment, the invention provides a test kit for identifying the presence of 
25 cancer cells, comprising an antibody specific for a protein encoded by a nucleic acid which 
hybridizes under stringent conditions to any one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 
4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, or a sequence complementary 
thereto. In certain embodiments, the kit fiirther includes instructions for using the kit. In certam 
embodiments, the kit may fiirther include solutions for suspending or fixing the cells, detectable 
" 30 tags or labels, solutions for rendering a polypeptide susceptible to the binding of an antibody, 
solutions for lysing cells, or solutions for the purification of polypeptides. 

In yet another aspect, the invention provides pharmaceutical compositions including the 
subject nucleic acids. In one embodiment, an agent which alters the level of expression in a cell 
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of a nucleic acid which hybridizes under stringent conditions to one of SEQ ID Nos, 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 
complementary thereto is identified by providing a cell, treating the cell with a test agent, 
detennining the level of expression in the cell of a nucleic acid which hybridizes under stringent 

5 conditions to one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 
4488, 4490, 4492, and 4494 or a sequence complementary thereto, and comparing the level of 
expression of the nucleic acid in the treated cell with the level of expression of the nucleic acid in 
an untreated cell, wherein a change in the level of expression of the nucleic acid in the treated 
cell relative to the level of expression of the nucleic acid in the untreated cell is indicative of an 

10 agent which alters the level of expression of the nucleic acid in a cell The invention further 
provides a pharmaceutical composition comprising an agent identified by this method. In 
another embodiment, the invention provides a pharmaceutical composition which includes a 
polypeptide encoded by a nucleic acid having a nucleotide sequence that hybridizes under 
stringent conditions to one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 

15 4486, 4488, 4490, 4492, and 4494 or a sequence complementary thereto. In one embodiment, 
the invention pertains to a pharmaceutical composition comprising a nucleic acid including a 
sequence which hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470, 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 
complementary thereto. 

20 In yet another aspect, the invention provides pharmaceutical compositions including the 

subject nucleic acids. In one embodiment, an agent which alters the level of expression in a cell 
of a nucleic acid which hybridizes under stringent conditions to one of SEQ ID Nos. 4472, 4474, 
4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence complementary 
thereto is identified by providing a cell, treating the cell with a test agent, determining the level 

25 of expression m the cell of a nucleic acid which hybridizes under stringent conditions to one of 
SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a 
sequence complementary thereto, and comparing the level of expression of the nucleic acid in 
the treated cell with the level of expression of the nucleic acid in an xmtreated cell, wherein a 
change in the level of expression of the nucleic acid in the treated cell relative to the level of 

30 expression of the nucleic acid in the xmtreated cell is mdicative of an agent which alters the level 
of expression of the nucleic acid in a cell. 

The invention further provides a method for identifying an agent which alters the level of 
expression in a cell of a polypeptide having a sequence of SEQ ID Nos. 447 1 , 4473 , 4475, 4477, 
4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493 comprismg providing a cell; treating the 
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cell with the test agent; determining the level of expression of one or more polypeptides of SEQ 
ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493 in the cell 
by reacting the cell with an antibody specific for one or more of the polypeptides of SEQ ID 
Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493; and 
5 comparing the level of expression of the polypeptide in the treated cell with the level of 
expression of the same polypeptide in an untreated cell, wherein a change in the level of 
expression of the nucleic acid in the treated cell relative to the level of expression of the nucleic 
acid in the untreated cell is indicative of an agen twhich alters the level of expression of the 
polypeptide in a cell. 

10 The invention further provides a pharmaceutical composition comprising an agent 

identified by the above methods. In another embodiment, the invention provides a 
pharmaceutical composition which includes a polypeptide encoded by a nucleic acid having a 
nucleotide sequence that hybridizes under stringent conditions to one of SEQ ID Nos. 4472, 
4474^ 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 

1 5 complementary thereto. In a further embodiment the invention provides a pharmaceutical 

composition comprising one or more antibodies which bind to a polypeptide encoded by one or 
more of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494, In a still further embodiment, the invention provides a pharmaceutical composition 
comprising one or more antibodies which binds to a polypeptide of one or more of SEQ ID Nos, 

20 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. In one 

embodiment, the invention pertains to a pharmaceutical composition comprising a nucleic acid 
including a sequence which hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or a sequence 
complementary thereto. ' 

25 In one embodiment the invention relates to a method for detecting cancer in a patient 

sample in which an antibody to a protein encoded by SEQ ID Nos 1-4470 is used to react with 
proteins in the patient sample. In a further embodiment, the invention relates to a method for 
detecting cancer in a patient sample in which an antibody to a protein encoded by one or more of 
SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 is 

30 used to react with proteins in the patient sample. In a still further embodiment, the invention 
provides a method for detecting cancer in a patient sample ui which an antibody to a protein 
having the sequence of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 
4489, 4491, and 4493 is used to react with protein in the patient sample. 
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Brief Description of the Figure 

Figure 1 depicts the nucleic acid sequence of SEQ ID Nos: 1-4470. 

Figure 2 depicts the nucleic acid sequence of SEQ ID Nos. 4472, 4474, 4476, 4478, 
4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494. 

5 Figure 3 depicts the amino acid sequence of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 

4481, 4483, 4485, 4487, 4489, 4491, and 4493. 

Detailed Description of the Invention 

The invention relates to nucleic acids having the disclosed nucleotide sequences (SEQ ID 
Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494), as 

10 well as full length cDNA, mRNA, and genes corresponding to these sequences, and to 

polypeptides and proteins encoded by these nucleic acids and genes, and portions thereof. In 
particular the invention relats to the full length cDNA sequence of SEQ ID Nos, 4472, 4474, 
4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 and the polypeptide sequence 
encoded thereby and shown in SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 

15 4487, 4489, 4491, and 4493, respectively. The 4494 sequences disclosed herem were analyzed 
by comparing the sequences to those disclosed in publicly available databases. Based upon the 
search results, it was found that SEQ ID Nos: 1-503 contained novel sequences, SEQ ID Nos: 
504-1 103 contained known EST sequences, and SEQ ID Nos: 1 104-4494 contained known 
sequences. 

20 Also included in the present invention are polypeptides and proteins encoded by the 

nucleic acids of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 
4490, 4492, and 4494, and in particular the polypeptide sequences of SEQ ID Nos. 4471, 4473, 
4475^ 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. The various nucleic acids tiiat 
can encode these polypeptides and proteins differ because of the degeneracy of the genetic code, 

25 in that most amino acids are encoded by more than one triplet codon. The identity of such 
codpns is well known in this art, and this information can be used for the construction of the 
nucleic acids within the scope of the invention. In one embodiment, the polypeptide sequences 
of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493 
are encoded by the full length cDNA sequences of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 

30 4482, 4484, 4486, 4488, 4490, 4492, and 4494, respectively. 
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Nucleic acids encoding polypeptides and proteins that are variants of the polypeptides 
and proteins encoded by the present nucleic acids and related cDNA and genes are also within 
the scope of the invention. The variants differ from wild-type protein in having one or more 
amino acid substitutions that either enhance, add, or diminish a biological activity of the wild- 
5 type protein. Once the amino acid change is selected, a nucleic acid encoding that variant is 
constructed according to the invention. 

The following detailed description discloses how to obtain or make full-length cDNA and 
human genes corresponding to the nucleic acids, how to express these nucleic acids and genes, 
how to identify structural motifs of the genes, how to identify the function of a protein encoded 
10 by a gene corresponding to an nucleic acid, how to use nucleic acids as probes in mapping and in 
tissue profiling, how to use the corresponding polypeptides and proteins to raise antibodies, and 
how to use the nucleic acids, polypeptides, and proteins for diagnostic purposes. 

The sequences disclosed herein have been found to be differentially expressed in colon 
cancer cell lines and/or colon cancer tissue, and thus are useful for determining the presence of 
15 colon cancer in a cell or tissue sample. The present sequences also have utility for determining 
the presence or state of other types of cancer. 

Accordingly, a preferred aspect of the present invention relates to nucleic acids 
differentially expressed in tumor cells or tissue, especially "colon cancer tissue or cells, 
polypeptides encoded by such nucleic acids, and antibodies immunoreactive with these 
20 polypeptides, and preparations of such compositions. Moreover, the present invention provides 
diagnostic and therapeutic assays and reagents for detecting and treating disorders involving, for 
example, expression of the subject nucleic acids. 

L Q^ngral 

This invention relates to compositions and methods for identifying and/or classifying 
25 cancerous cells present in a human tumors, particularly in solid tumors, e.g., carcinomas and 
sarcomas, such as, for example, breast or colon cancers. In its broadest aspect, the method uses 
nucleic acids that are differentially expressed in cancer cell lines and/or cancer tissue, compared 
with related normal cells or tissue, and using them to identify or classify tumor cells by the 
upregulation and/or downregulation of expression of particular genes, an event which is 
30 implicated in tumorigenesis. 

• Upregulation or increased expression of certain genes such as oncogenes, act to promote 
malignant growth. Downregulation or decreased expression of genes, such as tumor suppressor 
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genes, also promotes malignant growth. Thus, alteration in the expression of either type of gene 
is a potential diagnostic indicator for determining whether a subject is at risk of developing or 
has cancer, e.g., colon cancer. 

Accordingly, in one aspect, the invention also provides biomarkers, such as nucleic acid 
5 markers, for human tumor cells and tissue, particularly for colon cancer cells and tissue. The 
invention also provides proteins encoded by these nucleic acid markers. The invention also 
features methods for identifying drugs useful for treatment of such cancer cells, and for treatment 
of a cancerous condition, such as colon cancer. UnUke prior methods, the invention provides a 
means for identifying cancer cells at an early stage of development, so that premalignant cells 
10 can be identified prior to their spreading throughout the human body. This allows early detection 
of potentially cancerous conditions, and treatment of those cancerous conditions prior to spread 
of the cancerous cells throughout the body, or prior to development of an irreversible cancerous 
condition. 

IL Pgfi ni ti o ng 

15 For convenience, the meaning of certain terms and phrases used in the specification, 

examples, and appended claims, are provided below. 

The term "an aberrant expression", as applied to a nucleic acid of the present invention, 
refers to level of expression of that nucleic acid which differs from the level of expression of that 
nucleic acid in healthy tissue, or which differs from the activity of the polypeptide present in a 

20 healthy subject. An activity of a polypeptide can be aberrant because it is stronger than the 

activity of its native coimterpart. Alternatively, an activity can be aberrant because it is weaker 
or absent relative to the activity of its native counterpart. An aberrant activity can also be a 
change in the activity; for example, an aberrant polypeptide can interact with a different target 
peptide. A cell can have an aberrant expression level of a gene due to overexpression or 

25 underexpression of that gene. 

The term "agonist", as used herein, is meant to refer to an agent that mimics or 
upregulates (e.g., potentiates or supplements) the bioactivity of a protein. An agonist can be a 
wild-type protein or derivative thereof having at least one bioactivity of the wild-type protein. 
An agonist can also be a compound that upregulates expression of a gene or which increases at 
30 least one bioactivity of a protein. An agonist can also be a compound which increases the 
interaction of a polypeptide with another molecule, e.g., a target peptide or nucleic acid. 

11 
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The term "allele", which is used interchangeably herein with "allelic variant", refers to 
alternative forms of a gene or portions thereof. Alleles occupy the same locus or position on 
homologous chromosomes. When a subject has two identical alleles of a gene, the subject is said 
to be homozygous for that gene or allele. When a subject has two different alleles of a gene, the 
5 subject is said to be heterozygous for the gene. Alleles of a specific gene can differ fi"om each 
other in a single nucleotide, or several nucleotides, and can include substitutions, deletions, 
and/or insertions of nucleotides. An allele of a gene can also be a form of a gene containing 
mutations. 

The term "allelic variant of a polymorphic region of a gene" refers to a region of a gene 
10 having one of several nucleotide sequences foimd in that region of the gene in other individuals. 

The term "antagonist" as used herein is meant to refer to an agent that downregulates 
(e.g., suppresses or inhibits) at least one bioactivity of a protein. An antagonist can be a 
compound which inhibits or decreases the interaction between a protein and another molecule, 
e.g., a target peptide or enzyme substrate. An antagonist can also be a compound that 
15 downregulates expression of a gene or which reduces the amoimt of expressed protein present. 

The term "antibody" as used herein is intended to include whole antibodies, e.g., of any 
isotype (IgG, IgA, IgM, IgE, etc), and includes fragments thereof which are also specifically 
reactive with a vertebrate, e.g., mammalian, protein. Antibodies can be fragmented using 
conventional techniques and the fragments screened for utility in the same manner as described 

20 above for whole antibodies. Thus, the term includes segments of proteolytically-cleaved or 

recombinantly-prepared portions of an antibody molecule that are capable of selectively reacting 
with a certain protein. Nonlimiting examples of such proteolytic and/or recombinant fragments 
include Fab, F(ab')2, Fab' , Fv, and single chain antibodies (scFv) containing a V[L] and/or 
V[H] domain joined by a peptide linker. The scFv's may be covalently or non-covalently linked 

25 to form antibodies having two or more binding sites. The subject invention includes polyclonal, 
monoclonal, or other purified preparations of antibodies and recombinant antibodies. 

The phenomenon of "apoptosis" is well known, and can be described as a programmed 
death of cells. As is known, apoptosis is contrasted with "necrosis", a phenomenon when cells 
die as a result of being killed by a toxic material, or other external effect. Apoptosis involves 
30 chromatic condensation, membrane blebbing, and fragmentation of DNA, all of which are 
generally visible upon microscopic examination. 
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A disease, disorder, or condition "associated with" or "characterized by" an aberrant 
expression of a nucleic acid refers to a disease, disorder, or condition in a subject which can be 
statistically correlated with the expression of a nucleic acid. 

As used herein the term "bioactive fragment of a polypeptide" refers to a fragment of a 
5 full-length polypeptide, wherein the fragment specifically agonizes (mimics) or antagonizes 
(inhibits) the activity of a wild-type polypeptide. The bioactive fragment preferably is a 
fragment capable of interacting with at least one other molecule, e.g., protein, small molecule, or 
DNA, which a full length protein can bind. 

"Biological activity" or "bioactivity" or "activity" or *T3iological function", which are 
10 used interchangeably, herein mean an effector or antigenic function that is directly or indirectly 
performed by a polypeptide (whether in its native or denatured conformation), or by any 
subsequence thereof Biological activities include binding to polypeptides, binding to other 
proteins or molecules, activity as a DNA binding protein, as a transcription regulator, ability to 
bind damaged DNA, etc. A bioactivity can be modulated by directly affecting the subject 
1 5 polypeptide. Alternatively, a bioactivity can be altered by modulating the level of the 
polypeptide, such as by modulating expression of the corresponding gene. 

The term "biomarker" refers a biological molecule, e.g., a nucleic acid, including DNA, 
cDNA, RNA, mKNA, tRNA, or rRNA, peptide, polypeptide, protein, hormone, etc., whose 
presence or concentration can be detected and correlated with a known condition, such as a 
20 disease state. 

"Cells," "host cells", or "recombinant host cells" are terms used interchangeably herein. 
It is imderstood that such terms refer not only to the particular subject cell but to the progeny or 
potential progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in fact, be 
25 identical to the parent cell, but are still included within the scope of the term as used herein. 

A "chimeric polypeptide" or "fusion polypeptide" is a fusion of a first amino acid 
sequence encoding one of the subject polypeptides with a second amino acid sequence defining a 
domain (e.g., polypeptide portion) foreign to and not substantially homologous with any domain 
of the subject polypeptide. A chimeric polypeptide may present a foreign domain which is found 
30 (albeit in a different polypeptide) in an organism which also expresses the first polypeptide, or it 
may be an "interspecies," "intergenic," etc., fusion of polypeptide structures expressed by 
different kinds of organisms. In general, a fusion polypeptide can be represented by the general 
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formula pC)n-00m-(Z)n, wherein Y represents a portion of the subject polypeptide, and X and Z 
are each independently absent or represent amino acid sequences which are not related to the 
native sequence found in an organism, or which are not found as a polypeptide chain contiguous 
with the subject sequence, where m is an integer greater than or equal to one, and each 
5 occurrence of n is, independently, 0 or an integer greater than or equal to 1 (n and m are 
preferably no greater than 5 or 10), 

A "delivery complex" shall mean a targeting means (e.g., a molecule that results in 
higher affinity binding of a nucleic acid, protein, polypeptide or peptide to a target cell surface 
and/or increased cellular or nuclear uptake by a target cell). Examples of targeting means 

10 include: sterols (e.g., cholesterol), lipids (e.g., a cationic lipid, virosome or liposome), viruses 
(e.g., adenovirus, adeno-associated virus, and retrovirus), or target cell-specific binding agents 
(e.g., Ugands recognized by target cell specific receptors). Preferred complexes are sufficiently 
stable in vivo to prevent significant uncoupling prior to internalization by the target cell. 
However, the complex is cleavable under appropriate conditions within the cell so that the 

1 5 nucleic acid, protein, polypeptide or peptide is released in a functional form. 

As is well known, genes or a particular polypeptide may exist in single or multiple copies 
within the genome of an individual. Such duphcate genes may be identical or may have certain 
modifications, including nucleotide substitutions, additions or deletions, which all still code for 
polypeptides having substantially the same activity. The term "DNA sequence encoding a 
20 polypeptide" may thus refer to one or more genes within a particular individual. Moreover, 
certain differences in nucleotide sequences may exist between individual organisms, which are 
called alleles. Such allelic differences may or may not result in differences in amino acid 
sequence of the encoded polypeptide yet still encode a polypeptide with the same biological 
activity. 

25 The term "equivalent" is understood to include nucleotide sequences encoding 

functionally equivalent polypeptides. Equivalent nucleotide sequences will include sequences 
that differ by one or more nucleotide substitutions, additions or deletions, such as allelic variants; 
and will, therefore, include sequences that differ from the nucleotide sequence of the nucleic 
acids shown in SEQ ID NOs: M494 due to the degeneracy of the genetic code. 

30 As used herein, the terms "gene", "recombinant gene", and "gene construct" refer to a 

nucleic acid of the present invention associated with an open reading frame, including both exon 
and, optionally, intron sequences. 
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A "recombinant gene" refers to nucleic acid encoding a polypeptide and comprising exon 
sequences, though it may optionally include intron sequences which are derived from, for 
example, a related or xmrelated chromosomal gene. The term "intron" refers to a DNA sequence 
present in a given gene which is not translated into protein and is generally found between exons. 

5 The term "growth" or "growth state" of a cell refers to the proliferative state of a cell as 

well as to its differentiative state. Accordingly, the term refers to the phase of the cell cycle in 
which the cell is, e.g.. Go, Gi, G2, or prophase, metaphase, or telophase, or anaphase, as well as 
to its state of differentiation, e.g., undifferentiated, partially differentiated, or fully differentiated. 
Without wanting to be limited, differentiation of a cell is usually accompanied by a decrease in 
1 0 the proliferative rate of a cell. 

"Homology" or "identity" or "similarity" refers to sequence similarity between two 
peptides or between two nucleic acid molecules, with identity being a more strict comparison. 
Homology and identity can each be determined by comparing a position in each sequence which 
may be aligned for purposes of comparison. When a position in the compared sequence is 

15 occupied by the same base or amino acid, then the molecules are identical at that position, A 
degree of homology or similarity or identity between nucleic acid sequences is a function of the 
number of identical or matching nucleotides at positions shared by the nucleic acid sequences. A 
degree of identity of amino acid sequences is a function of the number of identical amino acids at 
positions shared by the amino acid sequences. A degree of homology or similarity of amino acid 

20 sequences is a function of the number of amino acids, i.e., structurally related, at positions shared 
by the amino acid sequences. An "unrelated" or "non-homologous" sequence shares less than 
40% identity, though preferably less than 25% identity, with one of the sequences of the present 
invention. 

The term "percent identical" refers to sequence identity between two amino acid 
25 sequences or between two nucleotide sequences. Identity can each be determined by comparing 
a position in each sequence which may be aligned for purposes of comparison. When an 
equivalent position in the compared sequences is occupied by the same base or amino acid, then 
the molecules are identical at that position; when the equivalent site occupied by the same or a 
similar amino acid residue (e.g., similar in steric and/or electronic nature), then the molecules 
30 can be referred to as homologous (similar) at that position. Expression as a percentage of 

homology, similarity, or identity refers to a function of the number of identical or similar amino 
acids at positions shared by the compared sequences. Various alignment algorithms and/or 
programs may be used, including FASTA, BLAST, or ENTREZ. FASTA and BLAST are 
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available as a part of the GCG sequence analysis package (University of Wisconsin, Madison, 
Wis.), and can be used with, e.g., default settings- ENTREZ is available through the National 
Center for Biotechnology Infomation, National Library of Medicine, National Institutes of 
Health, Bethesda, Md. In one embodiment, the percent identity of two sequences can be 
5 determined by the GCG program with a gap weight of 1, e.g., each amino acid gap is weighted as 
if it were a single amino acid or nucleotide mismatch between the two sequences. 

Other techniques for alignment are described in Methods in Enzvmology. vol. 266: 
Computer Methods for Macromolecular Sequence Analysis (1996), ed. Doolittle, Academic 
Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. Preferably, an 

10 alignment program that permits gaps in the sequence is utilized to align the sequences. The 
Smith- Waterman is one type of algorithm that permits gaps in sequence alignments. See Meth. 
Mol. 70-187 (1997). Also, the GAP program using the Needleman and Wxmsch alignment 
method can be utilized to align sequences. An alternative search strategy uses MPSRCH 
software, which runs on a MASPAR computer. MPSRCH uses a Smith- Waterman algorithm to 

15 score sequences on a massively parallel computer. This approach improves ability to pick up 
distantly related matches, and is especially tolerant of small gaps and nucleotide sequence errors. 
Nucleic acid-encoded amino acid sequences can be used to search both protein and DNA 
databases. 

Databases with individual sequences are described in Methods in Enzvmology . ed. 
20 Doolittle, supra. Databases include, for example, Genbank, EMBL, and DNA Database of Japan 
(DDB3). 

Preferred nucleic acids have a sequence at least 70%, and more preferably 80% identical 
and more preferably 90% and even more preferably at least 95% identical to an nucleic acid 
sequence of a sequence shown in one of SEQ ID NOS: 1-4494. Nucleic acids at least 90%, more 
25 preferably 95%, and most preferably at least about 98-99% identical with a nucleic sequence 
represented in one of SEQ ID NOS: 1-4494 are of course also within the scope of the invention. 
In preferred embodiments, the nucleic acid is mammalian. 

The term "interact" as used herein is meant to include detectable interactions (e.g., 
biochemical interactions) between molecules, such as interaction between protein-protein, 
30 protein-nucleic acid, nucleic acid-nucleic acid, and protein-small molecule or nucleic acid-small 
molecule in nature. Examples of interactions between protein-protein, protein-nucleic acid, 
nucleic acid-nucleic acid, and protein-small molecule or nucleic acid-small molecule can include 
binding, modifying, cleaving, processing, or catalyzing. 
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The term "isolated" as used herein with respect to nucleic acids, such as DNA or RNA, 
refers to molecules separated from other DNAS, or RNAs, respectively, that are present in the 
natural source of the macromolecule. The term isolated as used herein also refers to a nucleic 
acid or peptide that is substantially free of cellular material, viral material, or culture mediimi 
5 when produced by recombinant DNA techniques, or chemical precursors or other chemicals 
when chemically synthesized. Moreover, an "isolated nucleic acid" is meant to include nucleic 
acid fragments which are not naturally occiuring as fragments and would not be found in the 
natural state. The term "isolated" is also used herein to refer to polypeptides which are isolated 
from other cellular proteins and is meant to encompass both purified and recombinant 
10 polypeptides. 

The terms "modulated" and "differentially regulated" as used herein refer to both 
upregulation (i.e., activation or stimulation e.g., by agonizing or potentiating) and 
downregulation (i.e., inhibition or suppression e.g., by antagonizing, decreasing or inhibiting). 

The term "mutated gene" refers to an alleUc form of a gene, which is capable of altering 
15 the phenotype of a subject having the mutated gene relative to a subject which does not have the 
mutated gene. If a subject must be homozygous for this mutation to have an altered phenotype, 
the mutation is said to be recessive. If one copy of the mutated gene is sufficient to alter the 
genotype of the subject, the mutation is said to be dominant. If a subject has one copy of the 
mutated gene and has a phenotype that is intermediate between that of a homozygous and that of 
20 a heterozygous subject (for that gene), the mutation is said to be co-dominant. 

The designation "N", where it appears in the accompanying Sequence Listing, indicates 
that the identity of the corresponding nucleotide is unknown. "N" should therefore not 
necessarily be interpreted as permitting substitution with any nucleotide, e.g.. A, T, C, or G, but 
rather as holding the place of a nucleotide whose identity has not been conclusively determined. 

25 The "non-human animals" of the invention include mammalians such as rodents, non- 

human primates, sheep, dog, cow, pigs, chickens, amphibians, reptiles, etc. Preferred non- 
human animals are selected from the rodent family including rat and mouse, most preferably 
mouse, though transgenic amphibians, such as members of the Xenopus genus, and transgenic 
chickens can also provide important tools for understanding and identifying agents which can 

30 affect, for example, embryogenesis and tissue formation. The term "chimeric animal" is used 
herein to refer to animals in which the recombinant gene is foimd, or in which the recombinant 
gene is expressed in some but not all cells of the animal. The term "tissue-specific chimeric 
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animal" indicates that one of the recombinant genes is present and/or expressed or disrupted in 
some tissues but not others. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term should 
5 also be understood to include, as equivalents, analogs of either RNA or DNA made from 
nucleotide analogs, and, as appHcable to the embodiment being described, single (sense or 
antisense) and double-stranded polynucleotides. ESTs, chromosomes, cDNAs, mRNAs, and 
rRNAs are representative examples of molecules that may be referred to as nucleic acids. 

The term "nucleotide sequence complementary to the nucleotide sequence of SEQ ID 
10 NO. x" refers to the nucleotide sequence of the complementary strand of a nucleic acid strand 
having SEQ ID NO. x. The term "complementary strand" is used herein interchangeably with 
the term "complement". The complement of a nucleic acid strand can be the complement of a 
coding strand or the complement of a non-coding strand. As used herein, a "complementary 
strand" to SEQ ID NO. x is a nucleic acid sequence which hybridizes under stringent conditions 
15 to SEQ ID NO. X, 

The term "polymorphism" refers to the coexistence of more than one form of a gene or 
portion (e.g., allelic variant) thereof A portion of a gene of which there are at least two different 
forms, i.e., two different nucleotide sequences, is referred to as a "polymorphic region of a 
gene". A polymorphic region can be a single nucleotide, the identity of which differs in different 
20 alleles. A polymorphic region can also be several nucleotides long. 

A "polymorphic gene" refers to a gene having at least one polymorphic region. 

As used herein, the term "promoter" means a DNA sequence that regulates expression of 
a selected DNA sequence operably linked to the promoter, and which effects expression of the 
selected DNA sequence in cells. The term encompasses "tissue specific" promoters, i.e., 
25 promoters which effect expression of the selected DNA sequence only in specific cells (e.g., 
cells of a specific tissue). The tenn also covers so-called "leaky" promoters, which regulate 
expression of a selected DNA primarily in one tissue, but cause expression in other tissues as 
well. The term also encompasses non-tissue specific promoters and promoters that constitutively 
expressed or that are inducible (i.e., expression levels can be controlled). 

30 The terms "protein", "polypeptide", and "peptide" are \ised interchangeably herein when 

referring to a gene product. 
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The terni "recombinant protein" refers to a polypeptide of the present invention which is 
produced by recombinant DNA techniques, wherein generally, DNA encoding a polypeptide is 
inserted into a suitable expression vector which is in turn used to transform a host cell to produce 
the heterologous protein. Moreover, the phrase "derived from", with respect to a recombinant 
5 gene, is meant to include within the meaning of "recombinant protein" those proteins having an 
amino acid sequence of a native polypeptide, or an amino acid sequence similar thereto which is 
generated by mutations including substitutions and deletions (including truncation) of a naturally 
occurring form of the polypeptide. 

"Small molecule" as used herein, is meant to refer to a composition, which has a 
10 molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 

molecules can be nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, lipids or 
other organic (carbon-containing) or inorganic molecules. Many pharmaceutical companies 
have extensive libraries of chemical and/or biological mixtures, often fungal, bacterial, or algal 
extracts, which can be screened with any of the assays of the invention to identify compounds 
1 5 that modulate a bioactivity . 

As used herein, the term "specifically hybridizes" or "specifically detects" refers to the 
ability of a nucleic acid molecule of the invention to hybridize to at least a portion of, for 
example approximately 6, 12, 15, 20, 30, 50, 100, 150, 200, 300, 350, 400, 500, 750, or 1000 
contiguous nucleotides of a nucleic acid designated in any one of SEQ ID Nos: 1-4494, or a 
20 sequence complementary thereto, or naturally occurring mutants thereof, such that it has less 

than 15%, preferably less than 10%, and more preferably less than 5% backgroxind hybridization 
to a cellular nucleic acid (e.g., mRNA or genomic DNA) encoding a different protein. In 
preferred embodiments, the oligonucleotide probe detects only a specific nucleic acid, e.g., it 
does not substantially hybridize to similar or related nucleic acids, or complements thereof 

25 "Transcriptional regulatory sequence" is a generic term used throughout the specification 

to refer to DNA sequences, such as initiation signals, enhancers, and promoters, which induce or 
control transcription of protein coding sequences with which they are operably linked. In 
preferred embodiments, transcription of one of the genes is imder the control of a promoter 
sequence (or other transcriptional regulatory sequence) which controls the expression of the 

30 recombinant gene in a cell-type in which expression is intended. It will also be understood that 
the recombinant gene can be under the control of transcriptional regulatory sequences which are 
the same or which are different from those sequences which control transcription of the naturally 
occurring forms of the polypeptide. 
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As used herein, the term "transfection" means the introduction of a nucleic acid, e.g., via 
an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 
"Transformation", as used herein, refers to a process in which a cell's genotype is changed as a 
result of the cellular uptake of exogenous DNA or RNA, and, for example, the transformed cell 
5 expresses a recombinant form of a polypeptide or, m the case of anti-sense expression from the 
transferred gene, the expression of the target gene is disrupted. 

The term "treating" as used herein is intended to encompass curing as well as 
ameliorating at least one symptom of the condition or disease. 

The term "vector" refers to a nucleic acid molecule capable of transporting another 
10 nucleic acid to which it has been linked. One type of preferred vector is an episome, i.e., a 
nucleic acid capable of extra-chromosomal repUcation. Preferred vectors are those capable of 
autonomous replication and/or expression of nucleic acids to which they are linked. Vectors 
capable of directing the expression of genes to which they are operatively linked are referred to 
herein as "expression vectors". In general, expression vectors of utiUty in recombinant DNA 
15 techniques are often in the form of "plasmids" which refer generally to circular double stranded 
DNA loops which, in their vector form are not bound to the chromosome. In the present 
specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most 
commonly used form of vector. However, the invention is intended to include such other forms 
of expression vectors which serve equivalent functions and which become known in the art 
20 subsequently hereto. 

The term "wild-type allele" refers to an allele of a gene which, when present in two 
copies in a subject results in a wild-type phenotype. There can be several different wild-type 
alleles of a specific gene, since certain nucleotide changes in a gene may not affect the 
phenotype of a subject having two copies of the gene with the nucleotide changes. 

25 III. Nucleic Acids of the Present Invention 

As described below, one aspect of the invention pertains to isolated nucleic acids, 
variants, and/or equivalents of such nucleic acids. 

Nucleic acids of the present invention have been identified as differentially expressed in 
tumor cells, e.g., colon cancer-derived cell lines and colon cancer tissue (relative to the 
30 expression levels in normal cells or tissue, e.g., normal colon tissue and/or normal non-colon 
tissue). The present differentially expressed sequences comprise SEQ ID Nos. 1-4470, 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 
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1-1103, even more preferably SEQ ID Nos. 1-503, or sequence complementary thereto. In 
another embodiment, the invention comprises sequences which hybridize under stringent 
conditions with any of the sequences of SEQ ID Nos 1-4494. In a preferred aspect, sequences of 
the invention hybridize to SEQ ID Nos 1-4494 with about 50% identity, preferably about 70% 
5 identity, more preferably about 90% identity, and still more preferably about 100% identity. In 
preferred embodiments, the subject nucleic acids are differentially expressed by at least a factor 
of two, preferably at least a factor of five, even more preferably at least a factor of twenty, still 
more preferably at least a factor of fifty. Preferred nucleic acids are those sequences identified 
as differentially expressed both in colon cancer tissue and colon cancer cell lines. In preferred 
10 embodiments, nucleic acids of the present invention are upregulated in tumor cells, especially 
colon cancer tissue and/or colon cancer-derived cell lines. La another embodiment, nucleic acids 
of the present invention are downregulated in tumor cells, especially colon cancer tissue and/or 
colon cancer-derived cell lines. 

Genes which are upregulated, such as oncogenes, or downregulated, such as tumor 
15 suppressors, in aberrantly proUferating cells can be used as targets for diagnostic or therapeutic 
appUcations. For example, upregulation of the cdc2 gene induces mitosis. Overexpression of 
the mytl gene, a mitotic deactivator, negatively regulates the activity of cdc2. Aberrant 
proliferation may thus be induced either by upregulating cdc2 or by downregulating mytl. 
Similarly, downregulation of tumor suppressors such as p53 and Rb have been impUcated in 
20 tumorigenesis. 

Particularly preferred polypeptides are tiiose that are encoded by nucleic acid sequences 
at least about 70%, 75%, 80%, 90%, 95%, 97%, or 98% similar to a nucleic acid sequence of 
SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494. Preferably, the nucleic acid includes all or a portion (e g, at least about 10, at least about 
25 15, at least about 25, or at least about 40 nucleotides) of the nucleotide sequence corresponding 
to the nucleic acid of SEQ ID Nos. 1-1 103, most preferably SEQ ID Nos. 1-503, or a sequence 
complementary thereto. 

Still other preferred nucleic acids of the present invention encode a polypeptide 
comprising at least a portion of a polypeptide encoded by one of SEQ ID Nos. 1-4470, 4472, 
30 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494. For example, preferred 
nucleic acid molecules for use as probes/primers or antisense molecules (i.e., noncoding nucleic 
acid molecules) can comprise at least about 10, 20, 30, 50, 60, 70, 80, 90, or 100 base pairs in 
length up to the length of the complete sequence of any of SEQ ID Nos 1 -4494. Coding nucleic 
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acid molecules can comprise, for example, from about 50, 60,70,80,90, or 100 base pairs up to 
the fiill length of the entire sequence of any of SEQ ID Nos 1-4494. 

Another aspect of the invention provides a nucleic acid which hybridizes under low, 
medium, or high stringency conditions to a nucleic acid sequence represented by one of SEQ ID 
5 Nos. 1-1 103, preferably SEQ ID Nos. 1-503, or a sequence complementary thereto. Appropriate 
stringency conditions which promote DNA hybridization, for example, about 6.0 x sodium 
chloride/sodium citrate (SSC) at about 45 °C, followed by a wash of about 2.0 x SSC at about 
50°C, are known to those skilled in the art or can be found in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 6.3.M2.3.6. For example, the salt concentration in 

10 the wash step can be selected from a low stringency of about 2.0 x SSC at about 50°C to a high 
stringency of about 0.2 x SSC at about 50°C. In addition, the temperature in the wash step can 
be increased from low stringency conditions at room temperature, about 22 °C, to high 
stringency conditions at about 65 °C. Both temperature and salt may be varied, or temperature or 
salt concentration may be held constant while the other variable is changed. In a preferred 

15 embodiment, a nucleic acid of the present invention will bind to one of SEQ ED Nos. 1-1 103, 

preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, under moderately stringent 
conditions, for example at about 2.0 x SSC and about 40*'C. In a particularly preferred 
embodiment, a nucleic acid of the present invention will bind to one of SEQ ID Nos. 1-1 103, 
preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, under high stringency 

20 conditions. 

In one embodiment, the invention provides nucleic acids which hybridize imder low 
stringency conditions of about 6 x SSC at about room temperature followed by a wash at about 2 
X SSC at about room temperature. 

In another embodiment, the invention provides nucleic acids which hybridize under high 
25 , stringency conditions of about 2 x SSC at about 65 *^C followed by a wash at about 0.2 x SSC at 
about 65 °C. 

Nucleic acids having a sequence that differs from the nucleotide sequences shown in one 
of SEQ ID Nos. 1-1 103, preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, 
due to degeneracy in the genetic code, are also within the scope of the invention. Such nucleic 
30 acids encode fimctionally equivalent peptides (i.e., a peptide having equivalent or similar 

biological activity) but differ in sequence from the sequence shown in the sequence Hsting due to 
degeneracy in the genetic code. For example, a nimiber of amino acids are designated by more 
than one triplet Codons that specify the same amino acid, or synonyms (for example, CAU and 
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CAC each encode histidine) may result in "silent" mutations which do not affect the amino acid 
sequence of a polypeptide. However, it is expected that DNA sequence polymorphisms that do 
lead to changes in the amino acid sequences of the subject polypeptides will exist among 
mammals. One skilled in the art will appreciate that these variations in one or more nucleotides 
5 (e.g., up to about 3-5% of the nucleotides) of the nucleic acids encoding polypeptides having an 
activity of a polypeptide may exist among individuals of a given species due to natural allelic 
variation. 

Also within the scope of the invention are nucleic acids encoding splicing variants of 
proteins encoded by a nucleic acid of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 
10 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably 
SEQ ID Nos. 1-503, or a sequence complementary thereto, or natural homologs of such proteins. 
Such homologs can be cloned by hybridization or PGR, as further described herein. 

The polynucleotide sequence may also encode for a leader sequence, e.g., the natural 
leader sequence or a heterologous leader sequence, for a subject polypeptide. For example, the 
1 5 desired DNA sequence may be fused in the same reading frame to a DNA sequence which aids 
in expression and secretion of the polypeptide from the host cell, for example, a leader sequence 
which functions as a secretory sequence for controlling transport of the polypeptide from the 
cell. The protein having a leader sequence is a preprotein and may have the leader sequence 
cleaved by the host cell to form the mature form of the protein. 

20 The polynucleotide of the present mvention may also be fused in frame to a marker 

sequence, also referred to herein as 'Tag sequence" encoding a "Tag peptide", which allows for 
marking and/or purification of the present invention. In a preferred embodunent, the market 
sequence is a hexahistidine tag, e g, suppUed by a PQE-9 vector. Numerous other Tag peptides 
are available commercially Other frequently used Tags include myc-epitopes (e g, see Ellison et 

25 al. (1991) J Biol hem 266:21 150-2 1157) which includes a 10-residue sequence from c-myc, the 
pFLAG system (International Biotechnologies, Inc.), the pEZZ-protem A system (Pharmacia, 
NJ), and a 16 amino acid portion of the Haemophilus influenza hemagglutinin protein. 
Furthermore, any polypeptide can be used as a Tag so long as a reagent, e.g., an antibody 
interacting specifically with the Tag polypeptide is available or can be prepared or identified. 

30 As indicated by the examples set out below, nucleic acids can be obtained from mRNA 

present in any of a number of eukaryotic cells or tissue, e.g., and are preferably obtained from 
metazoan cells or tissue, more preferably from vertebrate cells or tissue, and even more 
preferably from mammalian cells and tissue, and most preferably from human cells or tissue. It 
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also is possible to obtain nucleic acids of the present invention from genomic DNA from both 
adults and embryos. For example, a gene can be cloned from either a cDNA or a genomic 
library in accordance with protocols generally known to persons skilled in the art. cDNA can be 
obtained by isolating total mRNA from a cell, e.g., a vertebrate cell, a mammalian cell, or a 
5 human cell, including embryonic cells. Double stranded cDNAs can then be prepared from the 
total mRNA, and subsequently inserted into a suitable plasmid or bacteriophage vector using any 
one of a number of known techniques. The gene can also be cloned using established 
polymerase chain reaction techniques in accordance with the nucleotide sequence information 
provided by the invention. 

10 The invention includes within its scope a polynucleotide having the nucleotide sequence 

of nucleic acid obtained from this biological material, wherein the nucleic acid hybridizes under 
stringent conditions (at least about 4 x SSC at 65 ^C, or at least about 4 x SSC at 42 °C; see, for 
example, U.S. Patent No. 5,707,829, incorporated herein by reference) with at least 15 
contiguous nucleotides of at least one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 

15 4482, 4484, 4486, 4488, 4490, 4492, and 4494. By this is intended that when at least 15 
contiguous nucleotides of one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 
4484, 4486, 4488, 4490, 4492, and 4494 is used as a probe, the probe will preferentially 
hybridize with a gene or mRNA (of the biological material) comprising the complementary 
sequence, allowing the identification and retrieval of the nucleic acids of the biological material 

20 that uniquely hybridize to the selected probe. Probes from more than one of SEQ ID Nos. 1- 
4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 will 
hybridize with the same gene or mRNA if the cDNA from which they were derived corresponds 
to one mRNA. Probes of more than 15 nucleotides can be used, but 15 nucleotides represents 
enough sequence for unique identification. 

25 Because the present nucleic acids are cDNAs which represent partial mRNA transcripts, 

two or more nucleic acids of the invention may represent different regions of the same mRNA 
transcript and the same gene. Thus, if two or more of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 
4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 are identified as belonging to the 
same clone, then either sequence can be used to obtain the ftilHength mRNA or gene. Nucleic 

30 acid-related polynucleotides can also be isolated from cDNA libraries. These libraries are 
preferably prepared from mRNA of human colon cells, more preferably, human colon cancer 
specific tissue, designated as the 100-101, and 103-1 12 clones in Table 1. In another 
embodiment the nucleic acids are isolated from libraries prepared from normal colon specific 
tissue, designated herein as the 102 clones in Table 1. AUgmnent of SEQ ID Nos. 1-4470, 4472, 



24 



wo 02/29086 



PCT/USOl/30732 



4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, as described above, 
indicated that a cell line or tissue source of a related protein or polynucleotide can also be used as 
a source of the nucleic acid-related cDNA. 

Techniques for producing and probing nucleic acid sequence libraries are described, for 
5 example, in Sambrook et al., "Molecular Cloning: A Laboratory Manual'* (New York, Cold 
Spring Harbor Laboratory, 1989). The cDNA can be prepared by using primers based on a 
sequence from SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 
4490, 4492, and 4494. In one embodiment, the cDNA library can be made from only poly- 
adenylated mRNA. Thus, poly-T primers can be used to prepare cDNA from the mRNA. 
10 Alignment of SEQ ID Nos, 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 
4490, 4492, and 4494 can result in identification of a related polypeptide or polynucleotide. 
Some of the polynucleotides disclosed herein contains repetitive regions that were subject to 
masking during the search procedures. The infonnation about the repetitive regions is discussed 
below. 

15 Constructs of polynucleotides having sequences of SEQ ID Nos. 1-4470, 4472, 4474, 

4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 can be generated synthetically. 
Alternatively, single-step assembly of a gene and entire plasmid from large numbers of 
oligodeoxyribonucleotides is described by Stemmer et at, Gene (Amsterdam) (1995) 164(i):49- 
53. In this method, assembly PGR (the synthesis of long DNA sequences from large numbers of 

20 oligodeoxyribonucleotides (oligos)) is described. The method is derived from DNA shuffling 
(Stemmer, Nature (1994) 370:389-391), and does not rely on DNA ligase, but instead relies on 
DNA polymerase to build increasingly longer DNA fragments during the assembly process. For 
example, a 1 . 1 -kb fragment containing the TEM- 1 beta-lactamase-encoding gene (bla) can be 
assembled in a single reaction from a total of 56 oligos, each 40 nucleotides (nt) in length. The 

25 synthetic gene can be PGR amplified and cloned in a vector containing the tetracycUne- 
resistance gene (Tc^R) as the sole selectable marker. Without relying on ampicillin (Ap) 
selection, 76% of the Tc-R colonies were Ap-R, making this approach a general method for the 
rapid and cost-effective synthesis of any gene. 

IV. Identification of Functional and Structural Motifs of Novel Genes Using Art-Recognized 
30 Mfithodfi 

Translations of the nucleotide sequence of the nucleic acids, cDNAs, or full genes can be 
aligned with individual known sequences. Similarity with individual sequences can be used to 
determine the activity of the polypeptides encoded by the polynucleotides of the invention. For 

25 
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example, sequences that show similarity with a chemokine sequence may exhibit chemokine 
activities. Also, sequences exhibiting similarity with more than one individual sequence may 
exhibit activities that are characteristic of either or both individual sequences. 

The full length sequences and fragments of the polynucleotide sequences of the nearest 
5 neighbors can be used as probes and primers to identify and isolate the full length sequence of 
the nucleic acid. The nearest neighbors can indicate a tissue or cell type to be used to construct a 
library for the full-length sequences of the nucleic acid. 

Typically, the nucleic acids are translated in all six frames to determine the best 
alignment with the individual sequences. The sequences disclosed herein in the Sequence 
10 Listing are in a 5' to 3' orientation and translation in three frames can be sufficient (with a few 
specific exceptions as described in the Examples). These amino acid sequences are referred to, 
generally, as query sequences, which will be aligned with the individual sequences. 

Nucleic acid sequences can be compared with known genes by any of the methods 
disclosed above. Results of individual and query sequence alignments can be divided into three 
15 categories: high similarity, weak similarity, and no similarity. Individual alignment results 
ranging from high similarity to weak similarity provide a basis for determining polypeptide 
activity and/or structure. 

Parameters for categorizing individual results include: percentage of the alignment region 
length where the strongest alignment is found, percent sequence identity, and p value. 

20 The percentage of the alignment region length is calculated by coimting the number of 

residues of the individual sequence found in the region of strongest alignment. This number is 
divided by the total residue length of the query sequence to find a percentage. 

Percent sequence identity is calculated by coxmting the mmiber of amino acid matches 
between the query and individual sequence and dividing total number of matches by the number 
25 of residues of the individual sequence found in the region of strongest alignment. For the 
. example above, the percent identity would be 10 matches divided by 11 amino acids, or 
approximately 90.9%. 

P value is the probability that the alignment was produced by chance. For a single 
alignment, the p value can be calculated according to Karlin et al., Proc. Natl Acad . Sci. §2: 
30 2264 (1990) and Karlin et al., EiQC. NatL Acad . Sci. 2Q: (1993). The p value of multiple 
alignments using the same query sequence can be calculated using an heuristic approach 
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described in Altschuletal., Genet , fi: 119(1994). Alignment programs such as BLAST program 
can calculate the p value. 

The boxmdaries of the region where the sequences align can be determined according to 
Doolittle, Methods in En2ymology, supra; BLAST or FASTA programs; or by determining the 
5 area where the sequence identity is highest 

Another factor to consider for determining identity or sunilarity is the location of the 
similarity or identity. Strong local alignment can indicate similarity even if the length of 
alignment is short. Sequence identity scattered throughout the length of the query sequence also 
can mdicate a similarity between the query and profile sequences. 

10 High Similarity 

For the alignment results to be considered high sunilarity, the percent of the alignment 
region length, typically, is at least about 55% of total length query sequence; more typically, at 
least about 58%; even more typically; at least about 60% of the total residue length of the query 
sequence. Usually, percent length of the alignment region can be as much as about 62%; more 
15 usually, as much as about 64%; even more usually, as much as about 66%. 

Further, for high similarity, the region of alignment, typically, exhibits at least about 75% 
of sequence identity; more typically, at least about 78%; even more typically; at least about 80% 
sequence identity. Usually, percent sequence identity can be as much as about 82%; more 
usually, as much as about 84%; even more usually, as much as about 86%. 

20 The p value is used in conjunction with these methods. If high similarity is foimd, the 

query sequence is considered to have high similarity with a profile sequence when the p value is 
less than or equal to about 10'^; more usually; less than or equal to about 10*^ even more usually; 
less than or equal to about 10^. More typically, the p value is no more than about 10"^ more 
typically; no more than or equal to about 10'***; even more typically; no more than or equal to 

25 about 10"^^ for the query sequence to be considered high similarity. 

Weak Similarity 

For the alignment results to be considered weak there is no minimum percent length of 
the alignment region no minimum length of alignment. A better showing of weak similarity is 
considered when the region of alignment is, typically, at least about 15 amino acid residues in 
30 length; more typically, at least about 20; even more typically; at least about 25 amino acid 
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residues in length. Usually, length of the alignment region can be as much as about 30 amino 
acid residues; more usually, as much as about 40; even more usually, as much as about 60 amino 
acid residues. 

Further, for weak similarity, the region of alignment, typically, exhibits at least about 
5 35% of sequence identity; more typically, at least about 40%; even more typically; at least about 
45% sequence identity. Usually, percent sequence identity can be as much as about 50%; more 
usually, as much as about 55%; even more usually, as much as about 60%. 

If low similarity is found, the query sequence is considered to have weak similarity with a 
profile sequence when the p value is usually less than or equal to about 10'^; more usually; less 
10 than or equal to about 10'^ even more usually; less than or equal to about 10"^. More typically, 
the p value is no more than about 10*^ more usually; no more than or equal to about 10"^**; even 
more usually; no more than or equal to about 10'*^ for the query sequence to be considered weak 
similarity. 

Similarity Peterrqjped by Sequence Identity 

1 5 Sequence identity alone can be used to determine similarity of a query sequence to an 

individual sequence and can indicate the activity of the sequence. Such an ahgnment, preferably, 
permits gaps to align sequences. Typically, the query sequence is related to the profile sequence 
if the sequence identity over the entire query sequence is at least about 15%; more typically, at 
least about 20%; even more typically, at least about 25%; even more typically, at least about 

20 50%. Sequence identity alone as a measure of similarity is most useful when the query sequence 
is usually, at least 80 residues in length; more usually, 90 residues; even more usually, at least 95 
amino acid residues in length. More typically, similarity can be concluded based on sequence 
identity alone when the query sequence is preferably 100 residues in length; more preferably, 
120 residues in length; even more preferably, 150 amino acid residues in length. 

25 Determining Activity from Alignments with Profile and Multiple Aligned Sequences 

Translations of the nucleic acids can be aligned with amino acid profiles that define either 
protein families or common motifs. Also, translations of the nucleic acids can be aligned to 
multiple sequence alignments (MSA) comprising the polypeptide sequences of members of 
protein famiUes or motifs. Similarity or identity with profile sequences or MSAs can be used to 
30 determine the activity of the polypeptides encoded by nucleic acids or corresponding cDNA or 
genes. For example, sequences that show an identity or similarity with a chemokine profile or 
MSA can exhibit chemokine activities. 
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Profiles can designed manually by (1) creating a MSA, which is an alignment of the 
amino acid sequence of members that belong to the family and (2) constructing a statistical 
representation of the alignment. Such methods are described, for example, in Bimey et al, Nucl. 
Acid Res . ISdA^: 2730-2739 (1996). 

5 MS As of some protein families and motifs are publicly available. For example, these 

include MS As of 547 different famihes and motifs. These MS As are described also in 
Sonnhammer et al, Proteins 28: 405-420 (1997). Other sources are also available in the world 
wide web. A brief description of these MSAs is reported in Pascarella et al, Prot. Eng, 2{3}: 
249-251 (1996). 

10 Techniques for building profiles from MSAs are described in Sonnhammer et al, supra; 

Bimey et al, supra; and Methods in EnTymologv . vol. 266: "Computer Methods for 
Macromolecular Sequence Analysis," 1996, ed. Doolittle, Academic Press, Inc., a division of 
Harcourt Brace & Co., San Diego, California, USA. 

Similarity between a query sequence and a protein family or motif can be determined by 
1 5 (a) comparing the query sequence against the profile and/or (b) aligning the query sequence with 
the members of the family or motif 

Typically, a program such as Searchwise can be used to compare the query sequence to 
the statistical representation of the multiple alignment, also known as a profile. The program is 
described in Bimey et al., supra. Other techniques to compare the sequence and profile are 
20 described in Sonnhanuner et al, supra and Doolittle, supra. 

Next, methods described by Feng et al, L Mol. Evol 2S:35 1-360 (1987) and Higgins et 
al., CABIOS i:151-153 (1989) can be used align the query sequence with the members of a 
family or motif, also known as a MSA. Computer programs, such as PILEUP, can be used. See 
Feng et al., injra. 

25 The following factors are used to determine if a similarity between a query sequence and 

a profile or MSA exists: (1) number of conserved residues found in the query sequence, (2) 
percentage of conserved residues found in the query sequence, (3) number of frameshifts, and (4) 
spacing between conserved residues. 

Some aUgnment programs that both translate and align sequences can make any number 
30 of fi:ameshifts when translating the nucleotide sequence to produce the best alignment. The 

fewer frameshifts needed to produce an aUgnment, the stronger the similarity or identity between 
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the query and profile or MSAs. Foj example, a weak similarity resulting firom no ftameshifts can 
be a better indication of activity or structure of a query sequence, than a strong similarity 
resulting from two frameshifts. 

Preferably, three or fewer frameshifts are found in an alignment; more preferably two or 
5 fewer frameshifts; even more preferably, one or fewer frameshifts; even more preferably, no 
frameshifts are found in an alignment of query and profile or MSAs. 

Conserved residues are those amino acids that are foimd at a particular position in all or 
some of the family or motif members. For example, most known chemokines contain four 
conserved cysteines. Alternatively, a position is considered conserved if only a certain class of 
10 amino acids is found m a particular position in all or some of the family members. For example, 
the N-terminal position may contain a positively charged amino acid, such as lysine, arginine, or 
histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a single 
amino acid is found at a particular position in at least about 40% of all class members; more 
15 typically, at least about 50%; even more typically, at least about 60% of the members. Usually, a 
residue is conserved when a class or single amino acid is found in at least about 70% of the 
members of a family or motif; more usually, at least about 80%; even more usually, at least 
about 90%; even more usually, at least about 95%, 

A residue is considered conserved when three unrelated amino acids are found at a 
20 particular position in the some or all of the members; more usually, two unrelated amino acids. 
These residues are conserved when the unrelated amino acids are found at particular positions in 
at least about 40% of all class member, more typically, at least about 50%; even more typically, 
at least about 60% of the members. Usually, a residue is conserved when a class or single amino 
acid is found in at least about 70% of the members of a family or motif more usually, at least 
25 about 80%; even more usually, at least about 90%; even more usually, at least about 95%. 

A query sequence has similarity to a profile or MSA when the query sequence comprises 
at least about 25% of the conserved residues of the profile or MSA; more usually, at least about 
30%; even more usually; at least about 40%. Typically, the query sequence has a stronger 
similarity to a profile sequence or MSA when the query sequence comprises at least about 45% 
30 of the conserved residues of the profile or MSA more typically, at least about 50%; even more 
typically; at least about 55%. 
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V. Probes and Primers 

The nucleotide sequences determined from the cloning of genes from tumor cells, 
especially colon cancer cell lines and tissues will further allow for the generation of probes and 
primers designed for identifying and/or cloning homologs in other cell types, e.g., from other 
5 tissues, as well as homologs from other mammalian organisms. Nucleotide sequences useful as 
probes/primers may include all or a portion of the sequences listed in SEQ ID Nos. 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or sequences 
complementary thereto or sequences which hybridize under stringent conditions to all or a 
portion of SEQ ID Nos. 1-4470, 4472. 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 

1 0 4492, and 4494. For instance, the present invention also provides a probe/primer comprising a 
substantially piirified oligonucleotide, which oligonucleotide comprising a nucleotide sequence 
that hybridizes under stringent conditions to at least approximately 12, preferably 25, more 
preferably 40, 50, or 75 consecutive nucleotides up to the full length of the sense or anti-sense 
sequence selected from the group consisting of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 

15 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1103, even 
more preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, or naturally 
occurring mutants thereof. For instance, primers based on a nucleic acid represented in SEQ ID 
Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484,4486, 4488, 4490, 4492, and 4494, 
preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, and even still more 

20 preferred SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494, or a sequence complementary thereto, can be used in PGR reactions to clone homologs of 
that sequence. 

In yet another embodiment, the invention provides probes/primers comprising a 
nucleotide sequence that hybridizes under moderately stringent conditions to at least 
25 approximately 12, 16, 25, 40, 50 or 75 consecutive nucleotides up to the full length of the sense 
or antisense sequence selected from the group consisting of SEQ ID Nos. 1-4470, 4472, 4474, 
4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1- 
1 103, even more preferably SEQ ID Nos. 1-503, or naturally occurring mutants thereof 

In particular, these probes are useful because they provide a method for detecting 
30 mutations in wild-type genes of the present invention. Nucleic acid probes which are 

complementary to a wild-type gene of the present invention and can form mismatches with 
mutant genes are provided, allowing for detection by enzymatic or chemical cleavage or by shifts 
in electrophoretic mobility. Likewise, probes based on the subject sequences can be used to 
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detect transcripts or genomic sequences encoding the same or homologous proteins, for use, for 
example, in prognostic or diagnostic assays. In preferred embodiments, the probe further 
comprises a label group attached thereto and able to be detected, e.g., the label group is selected 
from radioisotopes, fluorescent compoxmds, chemiluminescent compounds, enzymes, and 
5 enzyme co-factors. 

Full-length cDNA molecules comprising the disclosed nucleic acids are obtained as 
follows. In a preferred embodiment, the invention provides the fiill length cDNA sequence of 
SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494. A 
subject nucleic acid or a portion thereof comprising at least about 12, 15, 18, or 20 nucleotides 

10 up to the foil length of a sequence represented in SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 
4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even 
more preferably SEQ ID Nos. 1-503, or a sequence complementary thereto, may be used as a 
hybridization probe to detect hybridizing members of a cDNA library using probe design 
methods, cloning methods, and clone selection techniques as described in U.S. Patent No. 

15 5,654,173, "Secreted Proteins and Polynucleotides Encoding Them," incorporated herein by 
reference. Libraries of cDNA may be made from selected tissues, such as normal or tumor 
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent 
Preferably, the tissue is the same as that used to generate the nucleic acids, as both the nucleic 
acid and the cDNA represent expressed genes. Most preferably, the cDNA library is made from 

20 the biological material described herein in the Examples. Alternatively, many cDNA libraries 
are available commercially. (Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2nd 
Ed. (Cold Spring Harbor Press, Cold Spring Harbor, NY 1989). The choice of cell type for 
library construction may be made after the identity of the protein encoded by the nucleic acid- 
related gene is known. This will indicate which tissue and cell types are likely to express the 

25 related gene, thereby containing the mRNA for generating the cDNA. 

Members of the library that are larger than the nucleic acid, and preferably that contain 
the whole sequence of the native message, may be obtained. To confirm that the entire cDNA 
has been obtained, RNA protection experiments may be performed as follows. Hybridization of 
a foil-length cDNA to an mRNA may protect the RNA from RNase degradation. If the cDNA is 
30 not foil length, then the portions of the mRNA that arc not hybridized may be subject to RNase 
degradation. This may be assayed, as is known in the art, by changes in electrophoretic mobility 
on polyacrylamide gels, or by detection of released monoribonucleotides. Sambrook et al, 
Molecular Cloning: A Laboratory Manual, 2nd Ed. (Cold Spring Harbor Press, Cold Spring 
Harbor, NY 1989). In order to obtain additional sequences 5' to the end of a partial cDNA, 5* 
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RACE (PGR Protocols: A Guide to Methods and Applications (Academic Press, Inc. 1990)) may 
be performed. 

Genomic DNA may be isolated using nucleic acids in a manner similar to the isolation of 
full-length cDNAs. Briefly, the nucleic acids, or portions thereof, may be used as probes to 
5 libraries of genomic DNA. Preferably, the library is obtained from the cell type that was used to 
generate the nucleic acids. Most preferably, the genomic DNA is obtained from the biological 
material described herein in the Example. Such libraries may be in vectors suitable for carrying 
large segments of a genome, such as PI or YAC, as described in detail in Sambrook et al, 9.4- 
9.30. In addition, genomic sequences can be isolated from human BAG libraries, which are 
1 0 commercially available from Research Genetics, Inc., Huntville, Alabama, USA, for example. 
In order to obtain additional 5' or 3' sequences, chromosome walking may be performed, as 
described in Sambrook et al., such that adjacent and overlapping fragments of genomic DNA are 
isolated. These may be mapped and pieced together, as is known in the art, using restriction 
digestion enzymes and DNA ligase, 

15 Using the nucleic acids of the invention, corresponding full length genes can be isolated 

using both classical and PGR methods to construct and probe cDNA libraries. Using either 
method. Northern blots, preferably, may be performed on a number of cell types to determine 
which cell lines express the gene of interest at the highest rate. 

Glassical methods of constructing cDNA libraries in Sambrook et al, supra. With these 
20 methods, cDNA can be produced from mRNA and inserted into viral or expression vectors. 
Typically, libraries of mRNA comprising poly(A) tails can be produced with poly(T) primers. 
Similarly, cDNA libraries can be produced using the instant sequences as primers. 

PGR methods may be used to amplify the members of a cDNA library that comprise the 
desired insert. In this case, the desired insert may contain sequence from the full length cDNA 
25 that corresponds to the instant nucleic acids. Such PGR methods include gene trapping and 
RAGE methods. 

Gene trapping may entail mserting a member of a cDNA library into a vector. The vector 
then may be denatured to produce single stranded molecules. Next, a substrate-boimd probe, 
such a biotinylated oligo, may be used to trap cDNA inserts of interest. Biotinylated probes can 
30 be linked to an avidin-boxmd solid substrate. PGR methods can be used to amplify the trapped 
cDNA. To trap sequences corresponding to the full length genes, the labeled probe sequence 
may be based on the nucleic acids of the invention, e.g., SEQ ID Nos. 1-1 103, preferably SEQ 
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ID Nos. 1-503, or a sequence complementary thereto. Random primers or primers specific to the 
library vector can be used to amplify the trapped cDNA. Such gene trapping techniques are 
described in Gruber at al,, PCX WO 95/04745 and Gruber et a!., U.S. Pat. No. 5,500,356. Kits 
are commercially available to perform gene trapping experiments from, for example, Life 
5 Technologies, Gaithersburg, Maryland, USA. 

"Rapid amphfication of cDNA ends," or RACE, is a PGR method of amplifying cDNAs 
from a number of different RNAs. The cDNAs may be ligated to an oligonucleotide linker and 
amplified by PGR using two primers. One primer may be based on sequence from the instant 
nucleic acids, for which full length sequence is desired, and a second primer may comprise a 
10 sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. A description of 
this method is reported, for example, in PCT Pub. No, WO 97/191 10. 

In preferred embodiments of RACE, a common primer may be designed to anneal to an 
arbitrary adaptor sequence ligated to cDNA ends (Apte and Siebert, Biotechniques, 15:890-893, 
1993; Edwards et al., Aeids ££S., 19:5227-5232, 1991). When a single gene-specific 
1 5 RACE primer is paired with the common primer, preferential amplification of sequences 
between the single gene specific primer and the common primer occurs. Commercial cDNA 
pools modified for use in RACE are available. 

Another PGR-based method generates full-length cDNA library with anchored ends 
without specific knowledge of the cDNA sequence. The method uses lock-docking primers (1- 
20 VI), where one primer, poly TV (I-Ill) locks over the polyA tail of eukaryotic mRNA producing 
first strand synthesis and a second primer, polyGH (IV- VI) locks onto the polyG tail added by 
terminal deoxynucleotidyl transferase (TdT). This method is described, for example, in PCT 
Pub. No. WO 96/40998. 

The promoter region of a gene generally is located 5' to the initiation site for RNA 
25 polymerase IL Hundreds of promoter regions contain the "TATA" box, a sequence such as 
TATTA or TATAA, which is sensitive to mutations. The promoter region can be obtained by 
performing 5' RACE using a primer from the coding region of the gene. Alternatively, the 
cDNA can be used as a probe for the genomic sequence, and the region 5' to the coding region is 
identified by "walking up.'* 

30 If the gene is highly expressed or differentially expressed, the promoter from the gene 

may be of use in a regulatory constmct for a heterologous gene. 
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Once the fiiU-length cDNA or gene is obtained, DNA encoding variants can be prepared 
by site-directed mutagenesis, described in detail in Sambrook 15.3-15.63. The choice of codon 
or nucleotide to be replaced can be based on the disclosure herein on optional changes in amino 
acids to achieve altered protein structure and/or function. 

5 As an alternative method to obtaining DNA or RNA from a biological material, nucleic 

acid comprising nucleotides having the sequence of one or more nucleic acids of the invention 
can be synthesized. Thus, the invention encompasses nucleic acid molecules ranging in length 
from 12 nucleotides (corresponding to at least 12 contiguous nucleotides which hybridize imder 
stringent conditions to or are at least 80% identical to a nucleic acid represented by one of SEQ 

10 ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486,- 4488, 4490, 4492, and 4494, 
preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, or a sequence 
complementary thereto) up to a maximum length suitable for one or more biological 
manipulations, including replication and expression, of the nucleic acid molecule. The invention 
includes but is not limited to (a) nucleic acid having the size of a full gene, and comprising at 

15 least one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 
4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, or a 
sequence complementary thereto; (b) the nucleic acid of (a) also comprising at least one 
additional gene, operably linked to permit expression of a fusion protein; (c) an expression 
vector comprising (a) or (b); (d) a plasmid comprising (a) or (b); and (e) a recombinant viral 

20 particle comprising (a) or (b). Construction of (c) can be accomplished as described below in 
part VI. 

The sequence of a nucleic acid of the present invention is not limited and can be any 
sequence of A, T, G, and/or C (for DNA) and A, U, G, and/or C (for RNA) or modified bases 
thereof, including mosine and pseudouridine. The choice of sequence will depend on the desired 
25 function and can be dictated by coding regions desired, the intron-like regions desired, and the 
regulatory regions desked, 

VI. Vectors Carrying Nucleic Acids of the Present Invention 

The invention furthei: provides plasmids and vectors, which can be used to express a gene 
in a host cell. The host cell may be any prokaryotic or eukaryotic cell. Thus, a nucleotide 
30 sequence derived from any one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 
4484^ 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably 
SEQ ID Nos. 1-503, and still more preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 
4484, 4486, 4488, 4490, 4492, and 4494, or a sequence complementary thereto, encoding all or a 
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selected portion of a protein, can be used to produce a recombinant form of an polypeptide via 
microbial or eukaryotic cellular processes. Ligating the polynucleotide sequence into a gene 
construct, such as an expression vector, and transforming or transfecting into hosts, either 
eukaryotic (yeast, avian, insect or mammalian) or prokaryotic (bacterial cells), are standard 
5 procedures well known in the art. 

Vectors that allow expression of a nucleic acid in a cell are referred to as expression 
vectors. Typically, expression vectors contain a nucleic acid operably linked to at least one 
transcriptional regulatory sequence. Regulatory sequences are art-recognized and are selected to 
direct expression of the subject nucleic acids. Transcriptional regulatory sequences are described 
10 in Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). In one embodiment, the expression vector includes a recombinant gene 
encoding a peptide having an agonistic activity of a subject polyp eptide, or alternatively, 
encoding a peptide which is an antagonistic form of a subject polypeptide. 

The choice of plasmid will depend on the type of cell ui which propagation is desired and 
1 5 the purpose of propagation. Certain vectors are useful for amplifying and making large amounts 
of the desured DNA sequence. Other vectors are suitable for expression in cells in culture. Still 
other vectors are suitable for transfer and expression in cells in a whole animal or person. The 
choice of appropriate vector is well within the skill of the art. Many such vectors are available 
commercially. The nucleic acid or full-length gene is inserted into a vector typically by means 
20 of DNA hgase attachment to a cleaved restriction enzyme site in the vector. Alternatively, the 
desired nucleotide sequence may be inserted by homologous recombination in vivo. Typically 
this is accomplished by attaching regions of homology to the vector on the flanks of the desired 
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by 
polymerase chain reaction using primers comprising both the region of homology and a portion 
25 of the desired nucleotide sequence. 

Nucleic acids or full-length genes are linked to regulatory sequences as appropriate to 
obtain the desired expression properties. These may include promoters (attached either at the 5' 
end of the sense strand or at the 3' end of the antisense strand), enhancers, terminators, operators, 
repressors, and inducers. The promoters may be regulated or constitutive. In some situations it 
30 may be desirable to use conditionally active promoters, such as tissue-specific or developmental 
stage-specific promoters. These are luiked to the desired nucleotide sequence using the 
techniques described above for Imkage to vectors. Any techniques known in the art may be 
used. 
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When any of the above host cells, or other appropriate host cells or organisms, are used to 
replicate and/or express the polynucleotides or nucleic acids of the invention, the resulting 
replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope of the 
invention as a product of the host cell or organism. The product is recovered by any appropriate 
5 means known in the art. 

Once the gene corresponding to the nucleic acid is identified, its expression can be 
regulated in the cell to which the gene is native. For example, an endogenous gene of a cell can 
be regulated by an exogenous regulatory sequence as disclosed in US. Patent No. 5,641,670, 
"Protein Production and Protein Delivery." 

10 A number of vectors exist for the expression of recombinant proteins in yeast (see, for 

example. Broach et al (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye, 
Academic Press, p. 83, incorporated by reference herein). In addition, drug resistance markers 
such as ampicillin can be used. In an illustrative embodiment, a polypeptide is produced 
recombinantly utilizing an expression vector generated by sub-cloning one of the nucleic acids 

15 represented in one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 
4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID 
Nos. 1-503, or a sequence complementary thereto. 

The preferred manmialian expression vectors contain both prokaryotic sequences, to 
facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription units 
20 that are expressed in eukaryotic cells. The various methods employed in the preparation of 
plasmids and transformation of host organisms are well known in the art. For other suitable 
expression systems for both prokaryotic and eukaryotic cells, as well as general recombinant 
procedures, see Molecular Cloning: A Laboratory Manual, 2 ' Ed., ed. by Sambrook, Fritsch and 
Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 and 17. 

25 When it is desirable to express only a portion of a gene, e.g., a truncation mutant, it may 

be necessary to add a start codon (ATG) to the oligonucleotide fragment containing the desired 
sequence to be expressed. It is well known in the art that a methionine at the N-terminal position 
can be enzymatically cleaved by the use of the enzyme methionine aminopeptidase (MAP). 
MAP has been cloned from E. coU (Ben-Bassat et a/., (1987) 7. BacterioL 169:751-757) and 

30 Salmonella typhimurium and its in vitro activity has been demonstrated on recombinant proteins 
(Miller et al (1987) PNAS 84:2718-1722). Therefore, removal of an N-terminal methionine, if 
desired, can be achieved either in vivo by expressing polypeptides in a host which produces 
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MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g., procedure 
of Miller et al., supra). 

Moreover, the nucleic acid constructs of the present invention can also be used as part of 
a gene therapy protocol to deliver nucleic acids such as antisense nucleic acids. Thus, another 
5 aspect of the invention features expression vectors for in vivo or in vitro transfection with an 
antisense oligonucleotide. 

In addition to viral transfer methods, non-viral methods can also be employed to 
introduce a subject nucleic acid, e.g., a sequence represented by one of SEQ ID Nos. 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ 

10 ID Nos. 1-1 103, even more preferably SEQ ID Nos, 1-503, or a sequence complementary 
thereto, into the tissue of an animal. Most nonviral methods of gene transfer rely on normal 
mechanisms used by mammalian cells for the uptake and intracellular transport of 
macromolecules. In preferred embodiments, non-viral targeting means of the present invention 
rely on endocytic pathways for the uptake of the subject nucleic acid by tlie targeted cell. 

15 Exemplary targeting means of this type include liposomal derived systems, polylysine 
conjugates, and artificial viral envelopes. 

A nucleic acid of any of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 
4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably 
SEQ ID Nos. 1-503, or a sequence complementary thereto, the corresponding cDNA, or the fiiU- 

20 length gene may be used to express the partial or complete gene product. Appropriate nucleic 
acid constructs are purified using standard recombinant DNA techniques as described in, for 
example, Sambrook et al, (1989) Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold 
Spring Harbor Press, Cold Spring Harbor, New York), and under current regulations described in 
United States Dept. of HHS, National Institute of Health (NIH) Guidelines for Recombinant 

25 DNA research. The polypeptides encoded by the nucleic acid may be expressed in any 

expression system, including, for example, bacterial, yeast, insect, amphibian and mammalian 
systems. Suitable vectors and host cells are described, for example, in U.S. Patent No. 
5,654,173. 

Bacteria . Expression systems in bacteria include those described in Chang et al. Nature 
30 (1978) 275:615, Goeddel et aL, Nature (1979) 281 :544, Goeddel et al, Nucleic Acids Rec, 
(1980) 5:4057; EP 0 036,776, U.S. Patent No. 4,551,433, DeBoer et al, Proc, Natl Acad. Sci, 
(USA) (1983) 30:2125, and Siebenlist et aL, Cell (1980) 20:269. 
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Yeast Expression systems in yeast include those described in Hinnen et aL, Proc, Natl 
Acad, ScL (USA) (1978) 75:1929; Ito et al, J, BacterioL (1983) 755:163; Kurtz et al, Mol Cell 
Biol (1986) 6: 142; Kunze et al, J. Basic Microbiol (1985) 25: 141; Gleeson et al, J. Gen, 
Microbiol (1986) 752:3459, Roggenkamp et al, Mol Gem Genet. (1986) 202:302) Das etal, J. 
5 Bactenol (1984) 755:1 165; De Louvencourt et al, 1 Bacteriol (1983) 75^:737, Van den Berg 
et al, Bio/Technology (1990) 5:135; Kunze et al, J. Basic Microbiol. (1985) 25:141; Gregg et 
al, Mol Cell Biol (1985) 5:3376, U.S. Patent Nos. 4,837,148 and 4,929,555; Beach and Nurse, 
Nature (1981) 500:706; Davidow etal, Curr. Genet (1985) 70:380, Gaillardin et al, Curr. 
Genet (1985) 70:49, Ballance etal,, Biochem, Biophys. Res. Commun. (1983) 772:284289; 
10 Tilbum et al, Gene (1983) 25:205221, Yelton et al, Proc. Natl. Acad. Sci. (USA) (1984) 

57:14701474, Kelly and Hynes, EMBO J. (1985) ^:475479; BP 0 244,234, and WO 91/00357. 

Insect Cells . Expression of heterologous genes in insects is accomplished as described in 
U.S. Patent No. 4,745,051, Friesen et al, (1986) "The Regulation of Baculovirus Gene 
Expression" in: The Molecular Biology Of Baculoviruses (W. Doerfler, ed.), EP 0 127,839, EP 0 

15 155,476, and Vlak et a/., J. Gen, Virol (1988) 59:765776, Miller et al, Ann. Rev, Microbiol 
(1988) 42:177, Carbonell et al. Gene (1988) 75:409, Maeda et al. Nature (1985) 5/5:592594, 
Lebacq Verheyden et at., Mol. Cell. Biol. (1988) 5:3129; Smith et al, Proc. Nail. Acad. Sci. 
(USA) (1985) 52:8404, Miyajima et al. Gene (1987) 58:273; and Martinet al, DNA (1988) 
7:99. Numerous baculoviral strains and variants and corresponding permissive insect host cells 

20 from hosts are described in Luckow et al, Bio/Technology (1988) 5:4755, Miller et al, Generic 
Engineering (Setlow, J.K. et al eds.), Vol. 8 (Plenum Publishing, 1986), pp. 277279, and Maeda 
etal. Nature, (1985) 575:592-594. 

Mammalian Cells . Mammalian expression is accomplished as described in Dijkema et 
al, EMBO J, (1985) ^:761, Gorman et al, Proc, Natl Acad, ScL (USA) (1982) 7P:6777, Boshart 
25 et al. Cell (1985) ^7:52 I and U.S. Patent No. 4,399,216. Other features of mammalian 

e?q)ression are facilitated as described in Ham and Wallace, Meth, Enz. (1979) 55:44, Barnes and 
S^to, Anal Biochem, (1980) 7i?2:255, U.S. Patent Nos. 4,767,704, 4,657,866, 4,927,762, 
4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 30,985. 

VIL Therapeutic Nucleic Acid Constmcts 

30 One aspect of the invention relates to the use of the isolated nucleic acid, e.g., SEQ ID 

Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, 
preferably SEQ ID Nos. 1-1103, even more preferably SEQ ID Nos. 1-503, or a sequence 
complementary thereto, in antisense therapy. As used herein, antisense therapy refers to 
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administration or in situ generation of oligonucleotide molecules or their derivatives which 
specifically hybridize (e.g., bind) under cellular conditions with the cellular mRNA and/or 
genomic DNA, thereby inhibiting transcription and/or translation of that gene. The binding may 
be by conventional base pair complementarity, or, for example, in the case of binding to DNA 
5 duplexes, through specific interactions in the major groove of the double helix. In general, 
antisense therapy refers to the range of techniques generally employed in the art, and includes 
any therapy which relies on specific binding to oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is complementary 

10 to at least a unique portion of the cellular mRNA. Altematively, the antisense construct is an 
ohgonucleotide probe which is generated ex vivo and which, when introduced into the cell, 
causes inhibition of expression by hybridizing with the mRNA and/or genomic sequences of a 
subject nucleic acid. Such oligonucleotide probes are preferably modified oligonucleotides 
which are resistant to endogenous nucleases, e.g., exonucleases and/or endonucleases, and are 

15 therefore stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides 
are phosphoramidate, phosphorothioate and methylphosphonate analogs of DNA (see also U.S. 
Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to constructing 
oligomers usefiil in antisense therapy have been reviewed, for example, by Van der Krol et al 
(1988) BioTechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659-2668. With 

20 respect to antisense DNA, oligodeoxyribonucleotides derived firom the translation initiation site, 
e.g., between the -10 and +10 regions of the nucleotide sequence of interest, are preferred. 

Antisense approaches involve the design of oligonucleotides (either DNA or RNA) that 
are complementary to mRNA, The antisense oligonucleotides will bind to the mRNA transcripts 
and prevent translation. Absolute complementarity, although preferred, is not required. In the 

25 case of double-stranded antisense nucleic acids, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to hybridize will depend on both the 
degree of complementarity and the length of the antisense nucleic acid. Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and still form a 
stable duplex (or triplex, as the case may be). One skilled in the art can ascertain a tolerable 

30 degree of mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the mRNA, e.g., the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
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efficiently at inhibiting translation. However, sequences complementary to the 3* untranslated 
sequences of mRNAs have recently been shown to be effective at inhibiting translation of 
mRNAs as well. (Wagner, R. 1994. Nature 372:333). Therefore, oligonucleotides 
complementary to either the 5' or 3' untranslated, non-coding regions of a gene could be used in 
5 an antisense approach to inhibit translation of endogenous mRNA. Oligonucleotides 

complementary to the 5' mtranslated region of the mRNA should include the complement of the 
AUG start codon. Antisense oligonucleotides complementary to mRNA coding regions are 
typically less efficient inhibitors of translation but could also be used in accordance with the 
invention. Whether designed to hybridize to the 5', 3', or coding region of subject mRNA, 
10 antisense nucleic acids should be at least six nucleotides m length, and are preferably less that 
about 100 and more preferably less than about 50,25, 17 or 10 nucleotides in length. 

Regardless of the choice of target sequence, it is preferred that in vitro studies are furst 
performed to quantitate the ability of the antisense oligonucleotide to quantitate the ability of the 
antisense oligonucleotide to inhibit gene expression. It is preferred that these studies utilize 

15 controls that distinguish between antisense gene inhibition and nonspecific biological effects of 
oUgonucleotides. It is also preferred that these studies compare levels of the target RNA or 
protein with that of an internal control RNA or protein. Additionally, it is envisioned that results 
obtained using the antisense oligonucleotide are compared with those obtained using a control 
oligonucleotide. It is prefened that the control oligonucleotide is of approximately the same 

20 length as the test oligonucleotide and that the nucleotide sequence of the oligonucleotide differs 
firom the antisense sequence no more than is necessary to prevent specific hybridization to the 
target sequence. 

The oUgonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 

25 modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 
stability of the molecule, hybridization, etc. The oligonucleotide may include other appended 
groups such as peptides (e.g., for targeting host cell receptors), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Nati. Acad. Sci. U.S.A. 
86:6553-6556; Lemaitre et al, 1987, Proc. Nati. Acad. Sci. 84:648-652; PCT Publication No. 

30 WO 88/098 10, published December 15, 1988) or tiie blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10 134, published April 25, 1988), hybridization-triggered cleavage 
agents (See, e.g., Krol et a/., 1988, BioTechniques 6:958-976), or intercalating agents (See, e.g., 
Zon, 1988, Pharm. Res. 5:539-549). To this end, the oUgonucleotide may be conjugated to 
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another moleciile, e.g., a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety which is 
selected from the group including but not limited to 5-fluorouracil, 5-bromouracil, 5- 
5 chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetyIcytosiae, 5-(carboxyhydroxytriethyl) 
uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1- 
methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5- 
methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5- 
1 0 methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5 -methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5-methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5- 
methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

1 5 The antisense oligonucleotide may also comprise at least one modified sugar moiety 

selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, and 
hexose. 

The antisense oligonucleotide can also contain a neutral peptide-like backbone. Such 
molecules are termed peptide nucleic acid (PNA)-oligomers and are described, e.g., in Peny- 

20 O'Keefe et al (1996) Proc. Natl Acad. Sci. U,S.A. 93: 14670 and in Eglom et al (1993) Nature 
365:566. One advantage of PNA oligomers is their capabiUty to bind to complementary DNA 
essentially independently from the ionic strength of the medium due to the neutral backbone of 
the DNA. Li yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group consisting of a phosphorothioate, a 

25 phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methyiphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet a ftirther embodiment, the antisense oligonucleotide is an a-anomeric 
oUgonucleotide. An a-anomeric ohgonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual P-units, the strands run parallel to each 
30 other (Gautier et al, 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 

methylribonucleotide (Inoue et al, 1987, Nucl Acids Res, 15:6131-12148), or a chimeric RNA- 
DNA analogue (Jnoue et a/., 1987, FEES Lett. 215:327-330). 



42 



wo 02/29086 



PCT/USOl/30732 



Oligonucleotides of the invention may be synthesized by standard methods known in the 
art, e.g., by use of an automated DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides may be 
synthesized by the method of Stein et al (1 988, Nucl. Acids Res. 1 6:3209), methylphosphonate 
5 olgonucleotides can be prepared by use of controlled pore glass polymer supports (Sarin et al., 
1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

While antisense nucleotides complementary to a coding region sequence can be used, 
those complementary to the transcribed untranslated region and to the region comprising the 
initiating methionine are most preferred. 

10 The antisense molecules can be delivered to cells which express the target nucleic acid in 

vivo. A number of methods have been developed for delivering antisense DNA or RNA to cells; 
e.g., antisense molecules can be injected directly into the tissue site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on flie target cell surface) can be 

1 5 administered systemically. 

However, it is often difficult to achieve intracellular concentrations of the antisense 
sufficient to suppress translation on endogenous mRNAs. Therefore, a preferred approach 
utilizes a recombinant DNA construct in which the antisense oligonucleotide is placed under the 
control of a strong pol 111 or pot II promoter. The use of such a construct to transfect target cells 

20 in the patient will result in the transcription of sufficient amounts of single stranded KNAs that 
will form complementary base pairs with the endogenous transcripts and thereby prevent 
translation of the target mRNA. For example, a vector can be introduced in vivo such that it is 
taken up by a cell and directs the transcription of an antisense RNA. Such a vector can remain 
episomal or become chromosomally integrated, as long as it can be transcribed to produce the 

25 desired antisense RNA. Such vectors can be constructed by recombinant DNA technology 
methods standard in the art. Vectors can be plasmid, viral, or others known in the art for 
replication and expression in mammalian cells. Expression of the sequence encoding the 
antisense RNA can be by any promoter known in the art to act in mammalian, preferably human 
cells. Such promoters can be uiducible or constitutive. Such promoters include but are not 

30 limited to: the S V40 eariy promoter region (Bemoist and Chambon, 1 98 1 , Nature 290:304-3 1 0), 
the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., 
1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl 
Acad. Sci. U.S.A. 78: 1441-1445), the regulatory sequences of the metallothionein gene (Brinster 
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et at, 1982, Nature 296:39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be 
used to prepare the recombinant DNA construct which can be introduced directly into the tissue 
site; e.g., the choroid plexus or hypothalamus. Alternatively, viral vectors can be used which 
selectively infect the desired tissue (e.g., for brain, herpesvirus vectors may be used), in which 
5 case administration may be accomplished by another route (e.g., systemically). 

In another aspect of the invention, ribozyme molecules designed to catalytically cleave 
target mRNA transcripts can be used to prevent translation of target mKNA and expression of a 
target protein (See, e.g., PCT International Publication WO90/11364, published October 4, 1990; 
Sarver et al, 1990, Science 247: 1222-1225 and U.S. Patent No, 5,093,246). While ribozymes 

10 that cleave mRNA at site specific recognition sequences can be used to destroy target mRNAs, 
the use of hammerhead ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at 
locations dictated by flanking regions that form complementary base pairs with the target 
mRNA. The sole requirement is that the target mRNA have the foltowing sequence of two bases: 
5'-UG-3\ The construction and production of hammerhead ribozymes is well known in the art 

15 and is described more fully in Haseloff and Gerlach, 1988, Nature, 334:585-591. Preferably the 
ribozyme is engineered so that the cleavage recognition site is located near the 5* end of the 
target mRNA; i.e., to increase efficiency and minimize the intracellular accximulation of non- 
functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases (hereinafter 
20 "Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been extensively described by Thomas 
Cech and collaborators (Zaug, et al., 1984, Science, 224:574-578; Zaug and Cech, 1986, Science, 
231:470-475; Zaug, et al., 1986, Nature, 324:429-433; published International patent application 
No. W088/04300 by University Patents Inc.; Been and Cech, 1986, CeU, 47:207-216). The 
25 Cech-type ribozymes have an eight base pair active site which hybridizes to a target RNA 

sequence whereafter cleavage of the target RNA takes place. The invention encompasses those 
Cech-type ribozymes which target eight base-pair active site sequences that are present in a 
target gene. 

As in the antisense approach, the ribozymes can be composed of modified 
30 oligonucleotides (e.g., for improved stability, targeting, etc.) and should be delivered to cells 
which express the target gene in vivo. A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme luider the control of a strong constitutive pol III or pol 11 
promoter, so that transfected cells will produce sufficient quantities of the ribozyme to destroy 
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endogenous messages and inhibit translation. Because ribozymes, unlike antisense molecules, 
are catalytic, a lower intracellular concentration is required for efficiency. 

Antisense RNA, DNA, and ribozyme molecules of the invention may be prepared by any 
method known in (he art for the synthesis of DNA and RNA molecules. These include 
5 techniques for chemically synthesizing oligodeoxyribonucleotides and oligoribonucleotides well 
known in the art such as for example solid phase phosphoramidite chcnaical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase promoters such as the 
10 T7 or SP6 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize 
antisense RNA constitutively or inducibly, depending on the promoter used, can be introduced 
stably into cell lines. 

Moreover, various well-known modifications to nucleic acid molecules may be 
introduced as a means of increasing intracellular stability and half-life. Possible modifications 
15 include but are not limited to the addition of flanking sequences of ribonucleotides or 

deoxyribonucleotides to the 5' and/or 3' ends of the molecule or the use of phosphorothioate or 
2' 0-methyl rather than phosphodiesterase linkages within the oligodeoxyribonucleotide 
backbone. 

Vm. Full-length cDNA Sequences of the Present Invention 

20 The present invention also relates to full length cDNA sequences corresponding to one or 

more of the partial sequences of SEQ ID Nos. 1-4470. In particular the invention provides the 
full length cDNA sequences of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 
4488, 4490, 4492, and 4494. The full length sequences may be obtained as described above. 
These sequences are shown in Figure 2, and summarized below in Table 2, Also shown in Table 

25 2 are the SEQ ID Nos and GenBank accession numbers for the polypeptides which are encoded 
by the full length cDNA sequences and which correspond to SEQ ID Nos. 4471, 4473, 4475, 
4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. 



cDNA 
SEQ ID NO. 


Gene Name 


GenBank 
Accession No. 


Protein 
SEQ ID NO. 


GenBank 
Accession No. 


4472 


ReglV 


^JM 032044 


4471 


NP 114433 


4474 


XAG-2 


NM 006408 


4473 


NP 006399 
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4476 


SPARC/Osteonectin 


NM 003 118 


4475 


NP 003109 


4478 


GWl 12 protein 


NM 006418 


4477 


NP 006409 


4480 


HSBPl 


NM 001540 


4479 


NP 001531 


4482 


SKDl Homolog 


NP 004869 


4481 


NP 004860 


4484 


9-27 


NM 003641 


4483 


NP 003632 


4486 


Defensin 5 


NM 021010 


4485 


NP 066290 


4488 


p0071 


NM 003628 


4487 


NP 003619 


4490 


UBE2I 


NM 003345 


4489 


NP 003336 


4492 


Cytoplasmic dynein 
Ught chain 


NM 003746 


4491 


NP 003737 


4494 


lOCkshsl 


NM 001798 


4493 


NP 001789 



IX. Polypeptides of the Present Invention 

The present invention makes available isolated polypeptides which are isolated from, or 
otherwise substantially free of other cellular proteins, especially other signal transduction factors 
5 and/or transcription factors which may normally be associated with the polypeptide. Subject 
polypeptides of the present invention include polypeptides encoded by the nucleic acids of SEQ 
ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, 
preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, and still more 
preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 

10 4494, or a sequence complementary thereto, or polypeptides encoded by genes of which a 

sequence in SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 
4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, or a 
sequence complementary thereto, is a fragment. In a preferred embodiment, polypeptides, usefiil 
in the present invention have the amino acid sequence of one or more of SEQ ID Nos. 4471, 

15 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. Polypeptides of the 
present invention include those proteins which are differentially regulated in tumor cells, 
especially colon cancer-derived cell lines (relative to normal cells, e.g., normal colon tissue and 
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non-colon tissue). In a preferred embodiment the differentially regulated polypeptides are one or 
more of the polypeptides having the sequence set forth in SEQ ID Nos. 4471, 4473, 4475, 4477, 
4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. In preferred embodiments, the 
polypeptides are upregulated in tumor cells, especially colon cancer cancer-derived cell lines. In 
5 other embodiments, the polypeptides are downregulated in tumor cells, especially colon cancer- 
derived cell lines. Proteins which are upregulated, such as oncogenes, or dovmregulated, such as 
tumor suppressors, in aberrantly proliferating cells may be targets for diagnostic or therapeutic 
techniques. For example, upregulation of the cdc2 gene induces mitosis. Overexpression of the 
mytl gene, a mitotic deactivator, negatively regulates the activity of cdc2. Aberrant prohferation 
10 may thus be induced either by upregulating cdc2 or by downregulating mytl. 

The term "substantially free of other cellular proteins" (also referred to herein as 
"contaminating proteins") or "substantially pure or purified preparations" are defined as 
encompassing preparations of polypeptides having less than about 20% (by dry weight) 
contaminating protein, and preferably having less than about 5% contaminating protein. 
1 5 Functional forms of the subj ect polypeptides can be prepared, for the first time, as purified 

preparations by using a cloned nucleic acid as described herem. Full length proteins or fragments 
corresponding to one or more particular motifs and/or domains or to arbitrary sizes, for example, 
at least about 5, 10, 25, 50, 75, or 100 amino acids in length are within the scope of the present 
invention. 

20 For example, isolated polypeptides can be encoded by all or a portion of a nucleic acid 

sequence shown in any of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 
4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more preferably SEQ 
ID Nos. 1-503 and most preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 
4486, 4488, 4490, 4492, and 4494, or a sequence complementary thereto. Isolated peptidyl 

25 portions of proteins can be obtained by screening peptides recombinantly produced from the 

corresponding fragment of the nucleic acid encoding such peptides. In addition, fragments can be 
chemically synthesized using techniques known in the art such as conventional Merrifield solid 
phase f-Moc or t-Boc chemistry. For example, a polypeptide of the present invention may be 
arbitrarily divided into fragments of desired length with no overlap of the fragments, or 

30 preferably divided into overlapping fragments of a desired length. The fragments can be 
produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl 
fragments which can function as either agonists or antagonists of a wild-type (e.g., "authentic") 
protein. 
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Another aspect of the present invention concerns recombinant forms of the subject 
proteins. Recombinant polypeptides preferred by the present invention, in addition to native 
proteins, as described above are encoded by a nucleic acid, which is at least 60%, more 
preferably at least 80%, and more preferably 85%, and more preferably 90%, and more 
5 preferably 95% identical to an amino acid sequence encoded by SEQ ID Nos. 1-4470, 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494. Polypeptides which are 
encoded by a nucleic acid that is at least about 98-99% identical with the sequence of SEQ ID 
Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 are 
also within the scope of the invention. Also included in the present invention are peptide 
10 fragments comprising at least a portion of such a protein. 

In a preferred embodiment, a polypeptide of the present invention is a mammalian 
polypeptide and even more preferably a human polypeptide. In particularly preferred 
embodiment, the polypeptide retains wild-type bioactivity. It will be xmderstood that certain post- 
translational modifications, e.g., phosphorylation and the like, can increase the apparent 
1 5 molecular weight of the polypeptide relative to the immodified polypeptide chain. 

The present invention further pertains to recombinant forms of one of the subject 
polypeptides, Such recombinant polypeptides preferably are capable of functionmg in one of 
either role of agonist or antagonist of at least one biological activity of a wild-type ("authentic") 
polypeptide of the appended sequence listmg. The term "evolutionarily related to", with respect 
20 to amino acid sequences of proteins, refers to both polypeptides having amino acid sequences 
which have arisen naturally, and also to mutational variants of human polypeptides which are 
derived, for example, by combinatorial mutagenesis. 

In general, polypeptides referred to herein as having an activity (e.g., are "bioactive") of a 
protein are defined as polypeptides which include an amino acid sequence encoded by all or a 

25 portion of the nucleic acid sequences shown in one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 
4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, 
even more preferably SEQ ID Nos. 1-503, and most preferably SEQ ID Nos. 4471, 4473, 4475, 
4477^ 4479^ 4481, 4483, 4485, 4487, 4489, 4491, and 4493, or a sequence complementary 
thereto, and which mimic or antagonize all or a portion of the biological/biochemical activities of 

30 a naturally occurring protein. According to the present invention, a polypeptide has biological 
activity if it is a specific agonist or antagonist of a naturally occurring form of a protein. 
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Assays for determining whether a compound, e.g, a protein or variant thereof, has one or 
more of the above biological activities are well known in the art. In certain embodiments, the 
polypeptides of the present invention have activities such as those outlined above. 

In another embodiment, the coding sequences for the polypeptide can be incorporated as 
5 a part of a fusion gene including a nucleotide sequence encoding a different polypeptide. This 
type of expression system can be useful under conditions where it is desirable to produce an 
immunogenic fragment of a polypeptide (see, for example, EP Publication No: 0259149; and 
Evans et al (1989) Nature 339:3 85; Huang et at (1988) J. Virol. 62:3 855; and Schlienger et al, 
(1992) J. Virol 66:2). In addition to utilizing fusion proteins to enhance immunogenicity, it is 

10 widely appreciated that fusion proteins can also facilitate the expression of proteins, and, 

accordingly, can be used in the expression of the polypeptides of the present invention (see, for 
example. Current Protocols in Molecular Biology, eds. Ausubel et at. (N.Y. John Wiley & Sons, 
1991)). In another embodiment, a fusion gene coding for a purification leader sequence, such as 
a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired portion of the 

1 5 recombinant protein, can allow purification of the expressed fusion protein by affinity 
chromatography using a Ni^'*"metal resin. The purification leader sequence can then be 
subsequently removed by treatment with enterokinase to provide the purified protein (e.g., see 
Hochuh et al (1987)J. Chromatography 41 1:177; and Janknecht et al PNAS 88:8972). 

Techniques for making fusion genes are known to those skilled in the art. Essentially, the 
20 joining of various DNA fragments coding for different polypeptide sequences is performed in 
accordance with conventional techniques, employing blunt-ended or stagger-ended termini for 
Ugation, restriction enzyme digestion to provide for appropriate termini, fiUing-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid imdesirable johiing, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques 
25 including automated DNA synthesizers. Alternatively, PGR amplification of nucleic acid 

fragments can be carried out using anchor primers which give rise to complementary overhangs 
between two consecutive nucleic acid fragments which can subsequently be aimealed to generate 
a chimeric nucleic acid sequence (see, for example, Current Protocols in Molecular Biology, eds. 
Ausubel et al John Wiley & Sons: 1992). 

30 The present invention fixrther pertains to methods of producing the subject polypeptides. 

For example, a host cell transfected with a nucleic acid vector directing expression of a 
nucleotide sequence encoding the subject polypeptides can be cultured under appropriate 
conditions to allow expression of the peptide to occur. Suitable media for cell culture are well 
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known in the art. The recombinant polypeptide can be isolated from cell culture medium, host 
cells, or both using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaf&nity purification with antibodies specific for such peptide. In a preferred 
5 embodiment, the recombinant polypeptide is a fusion protein containing a domain which 
facilitates its purification, such as GST fusion protein. 

Moreover, it will be generally appreciated that, under certain circumstances, it may be 
advantageous to provide homologs of one of the subject polypeptides which function in a limited 
capacity as one of either an agonist (mimetic) or an antagonist, in order to promote or inhibit 
10 only a subset of the biological activities of the naturally occurring form of the protein. Thus, 
specific biological effects can be elicited by treatment with a homolog of limited function, and 
with fewer side effects relative to treatment with agonists or antagonists which are directed to all 
of the biological activities of naturally occurring forms of subject proteins. 

Homologs of each of the subject polypeptide can be generated by mutagenesis, such as 
15 by discrete point mutation(s), or by truncation. For instance, mutation can give rise to homologs 
which retain substantially the same, or merely a subset, of the biological activity of the 
polypeptide from which it was derived Alternatively, antagonistic forms of the polypeptide can 
be generated which are able to inhibit the function of the naturally occurring form of the protein, 
such as by competitively binding to a receptor. 

20 The recombinant polypeptides of the present invention also include homologs of the 

wild-type proteins, such as versions of those proteins which are resistant to proteolytic cleavage, 
for example, due to mutations which alter ubiquitination or other enzymatic targeting associated 
with the protein. 

Polypeptides may also be chemically modified to create derivatives by forming covalent 
25 or aggregate conjugates with other chemical moieties, such as glycosyl groups, lipids, phosphate, 
acetyl groups and the like. Covalent derivatives of proteins can be prepared by linking the 
chemical moieties to functional groups on amino acid sidechains of the protein or at the N- 
terminus or at the C-terminus of the polypeptide. 

Modification of the stmcture of the subject polypeptides can be for such purposes as 
30 enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf life and resistance to 
proteolytic degradation), or post-translational modifications (e.g., to alter phosphorylation 
pattern of protein). Such modified peptides, when designed to retain at least one activity of the 
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naturally occurring form of the protein, or to produce specific antagonists thereof, are considered 
functional equivalents of the polypeptides described in more detail herein. Such modified 
peptides can be produced, for instance, by amino acid substitution, deletion, or addition. The 
substitutional variant may be a substituted conserved amino acid or a substituted non-conserved 
5 amino acid. 

For example, it is reasonable to expect that an isolated replacement of a leucine with an 
isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
replacement of an amino acid with a structurally related amino acid (i.e., isosteric and/or 
isoelectric mutations) will not have a major effect on the biological activity of the resulting 

10 molecule. Conservative replacements are those that take place within a family of amino acids 
that are related in then- side chains. Genetically encoded amino acids can be divided into four 
families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) nonpolar = 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) 
uncharged polar = glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine. In 

15 similar fashion, the amino acid repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) 
basic = lysine, arginine histidine, (3) aliphatic = glycine, alanme, valine, leucme, isoleucine, 
serine, threonine, with serine and threonine optionally be grouped separately as aliphatic- 
hydroxyl; (4) aromatic = phenylalanine, tyrosine, tiyptophan; (5) amide = asparagme, glutamme; 
and (6) sulfur -contauiing = cysteine and methionine, (see, for example, Biochemistry, 2 ed., Ed. 

20 by L. Stryer, WH Freeman and Co.: 1981). Whether a change in the amino acid sequence of a 
peptide results in a functional homolog (e.g., functional in the sense that the resulting 
polypeptide munics or antagonizes the wild-type form) can be readily determined by assessing 
the ability of the variant peptide to produce a response in cells in a fashion similar to the wild- 
type protein, or competitively inhibit such a response. 

25 Polypeptides in which more than one replacement has taken place can readily be tested in 

the same manner. The variant may be designed so as to retain biological activity of a particular 
region of the protein. In a non-luniting example, Osawa et al, 1994, Biochemistry and 
Molecular International 34:1003-1009, discusses the actin binduig region of a protein from 
several different species. The actin binding regions of the these species are considered 

30 homologous based on the fact that they have amino acids that fall within "homologous residue 
groups." Homologous residues are judged according to the following groups (using single letter 
ammo acid designations): STAG; ILVMF; HRK; DEQN; and FYW. For example, an S, a T, an 
A or a G can be in a position and the function (in this case actin binding) is retained. 
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Additional guidance on amino acid substitution is available from studies of protein 
evolution. Go et al., 1980, Int J, Peptide Protein Res. 15:21 1-224, classified amino acid residue 
sites as interior or exterior depending on their accessibility. More frequent substitution on 
exterior sites was confirmed to be general in eight sets of homologous protein families regardless 
5 of their biological functions and the presence or absence of a prosthetic group. Virtually all 
types of amino acid residues had higher mutabilities on the exterior than in the interior. No 
correlation between mutabihty and polarity was observed of amino acid residues in the interior 
and exterior, respectively. Amino acid residues were classified into one of three groups 
depending on their polarity: polar (Arg, Lys, His, Gin, Asn, Asp, and Glu); weak polar (Ala, Pro, 
10 Gly, Thr, and Ser), and nonpolar (Cys, Val, Met, He, Leu, Phe, Tyr, and Tip). Amino acid 
replacements during protein evolution were very conservative: 88% and 76% of them in the 
interior or exterior, respectively, were within the same group of the three. Intergroup 
replacements are such that weak polar residues are replaced more often by nonpolar residues in 
the interior and more often by polar residues on the exterior. 

15 Querol et al., 1996, Prot, Eng. 9:265-271, provides general rules for amino acid 

substitutions to enhance protein thermostability. New glycosylation sites can be introduced as 
discussed in Olsen and Thomsen, 1991, J. Gen. Microbiol. 137 :579-585. An additional disulfide 
bridge can be introduced, as discussed by Perry and Wetzel, 1984, Science 226:555-557; 
Pantoliano etal., 1987, Biochemistry 26:2077-20821 Matsumura et al., 1989, A^^mr^ 342:29 1- 

20 293; Nishikawa et al., 1990, Protein Eng. 3:443-448; Takagi et al., 1990, J. Biol. Chem, 

265:6874-6878; Clarke et al., 1993, Biochemistry 32:4322-43299; and Wakarchuk et al, 1994, 
Pro^em Eng. 7:1379-1386. 

An additional metal binding site can be introduced, according to Toma et al., 1991, 
Biochemistry 30:97-106, and Haezerbrouck et al., 1993, Protein Eng. 6:643-649. Substitutions 
25 with prolines in loops can be made according to Masul et al, 1994, Appl Env. Microbiol. 
60:3579-3584; and Hardy et al., FEBSLetL 317:89-92. 

Cysteine-depleted muteins are considered variants within the scope of the invention. 
These variants can be constructed according to methods disclosed in U.S. Patent No. 4,959,314, 
which discloses how to substitute other amino acids for cysteines, and how to determine 
30 biological activity and effect of the substitution. Such methods are suitable for proteins 
according to this invention that have cysteine residues suitable for such substitutions, for 
example to eliminate disulfide bond formation. 
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To learn the identity and function of the gene that correlates with an nucleic acid, the 
nucleic acids or corresponding amino acid sequences can be screened against profiles of protein 
families. Such profiles focus on common structural motifs among proteins of each family. 
Publicly available profiles are described above. 

5 In comparing a new nucleic acid with known sequences, several aligmnent tools are 

available. Examples include PileUp, which creates a multiple sequence aligimient, and is 
described in Feng et a/., J. Mol Evol (1987) 25:35 1-360. Another method, GAP. uses the 
alignment method of Needleman et al, 1 Mol Biol (1970) 48:443-453. GAP is best suited for 
global aligmnent of sequences. A third method, BestFit, functions by inserting gaps to maximize 
10 the number of matches using the local homology algorithm of Smith and Waterman, Adv. Appl 
Matk (im) 2:482-489. 

X. Diagnostic & Prognostic Assays and Drug Scree ning Methods 

The present invention provides method for determining whether a subject is at risk for 
developing a disease or condition characterized by \mwanted cell proliferation by detecting the 
1 5 disclosed biomarkers, i.e., the present nucleic acids (SEQ ID Nos: 1-4494) and/or polypeptide 
markers (preferably SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 
4491, and 4493) for colon cancer encoded thereby. 

In clinical applications, human tissue samples can be screened for the presence and/or 
absence of the biomarkers identified herein. Such samples could consist of needle biopsy cores, 

20 surgical resection samples, lymph node tissue, or serum. For example, these methods include 
obtaining a biopsy, which is optionally fi-actionated by cryostat sectioning to enrich tumor cells 
to about 80% of the total cell population. In certain embodiments, nucleic acids extracted from 
these samples may be amplified using techniques well known in the art. The levels of selected 
markers detected would be compared with statistically valid groups of metastatic, non-metastatic 

25 malignant, benign, or normal colon tissue samples. 

In one embodiment, the diagnostic method comprises determining whether a subject has 
an abnormal mRNA and/or protein level of the disclosed markers, such as by Northern blot 
analysis, reverse transcription-polymerase chain reaction (RT-PCR), in situ hybridization, 
immunoprecipitation, Western blot hybridization, or inmaunohistochemistry. According to the 
30 method, cells are obtained firom a subject and the levels of the disclosed biomarkers, protein or 
mRNA level, is determined and compared to the level of these markers in a healthy subject. An 
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abnonnaJ level of the biomarker polypeptide or mRNA levels is likely to be indicative of cancer 
such as colon cancer. 

Accordingly, in one aspect, the invention provides probes and primers that are specific to 
the unique nucleic acid markers disclosed herein. Accordingly, the nucleic acid probes comprise 
5 a nucleotide sequence at least 10 nucleotides in length, preferably at least 15 nucleotides, more 
preferably, 25 nucleotides, and most preferably at least 40 nucleotides, and up to all or nearly all 
of the coding sequence which is complementary to a portion of the coding sequence of a marker 
nucleic acid sequence, which nucleic acid sequence is represented by SEQ ID Nos: 1-4494 or a 
sequence complementary thereto. 

10 In one embodiment, the method comprises using a nucleic acid probe to determine the 

presence of cancerous cells in a tissue from a patient. Specifically, the method comprises: 



15 



1. 



providing a nucleic acid probe comprising a nucleotide sequence at least 10 
nucleotides in length, preferably at least 15 nucleotides, more preferably, 25 
nucleotides, and most preferably at least 40 nucleotides, and up to all or nearly all 
of the coding sequence which is complementary to a portion of the coding 
sequence of a nucleic acid sequence represented by SEQ ID Nos: 1-4494 or a 
sequence complementary thereto and is differentially expressed in tumors cells, 
such as colon cancer cells; 



2. 



obtaining a tissue sample fi-om a patient potentially comprising cancerous cells; 



20 



3. 



providing a second tissue sample containing cells substantially all of which are 



non-cancerous; 



4. 



contacting the nucleic acid probe under stringent conditions with RNA of each of 
said first and second tissue samples (e.g., in a Northern blot or in situ 
hybridization assay); and 



25 



5. 



comparing (a) the amoimt of hybridization of the probe with RNA of the first 



tissue sample, with (b) the amount of hybridization of the probe with RNA of the 
second tissue sample; wherein a statistically significant difference in the amount 
of hybridization with the RNA of the first tissue sample as compared to the 



30 



amount of hybridization with the RNA of the second tissue sample is indicative of 
the presence of cancerous cells in the first tissue sample. 
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In one aspect, the method comprises in situ hybridization with a probe derived from a 
given marker nucleic acid sequence, which nucleic acid sequence is represented by SEQ ID Nos: 
1-4494 or a sequence complementary thereto. The method comprises contacting the labeled 
hybridization probe with a sample of a given type of tissue potentially containing cancerous or 
5 pre-cancerous cells as well as normal cells, and determining whether the probe labels some cells 
of the given tissue type to a degree significantly different (e.g., by at least a factor of two, or at 
least a factor of five, or at least a factor of twenty, or at least a factor of fifty) than the degree to 
which it labels other cells of the same tissue type. 

Also within the invention is a method of determining the phenotype of a test cell from a 
10 given human tissue, e.g., whether the cell is (a) normal, or (b) cancerous or precancerous, by 
contacting the mKNA of a test cell with a nucleic acid probe at least 12 nucleotides in length, 
preferably at least 15 nucleotides, more preferably at least 25 nucleotides, and most preferably at 
least 40 nucleotides, and up to all or nearly all of a sequence which is complementary to a 
portion of the coding sequence of a nucleic acid sequence represented by SEQ ID Nos: 1-4494 or 
15 a sequence complementary thereto, and which is differentially expressed in tumor cells as 

compared to normal cells of the given tissue type; and determining the approximate amount of 
hybridization of the probe to the mRNA, an amount of hybridization either more or less than that 
seen with the mRNA of a normal cell of that tissue type being indicative that the test cell is 
cancerous or pre-cancerous. 

20 Altematively, the above diagnostic assays may be carried out using antibodies to detect 

the protein product encoded by the marker nucleic acid sequence, which nucleic acid sequence is 
represented by SEQ ID Nos: 1-4494 or a sequence complementary thereto. Accordingly, in one 
embodiment, the assay would include contacting the proteins of the test cell with an antibody 
specific for the gene product of a nucleic acid represented by SEQ ID Nos: 1-4494, preferably 

25 SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, or a 
sequence complementary thereto, the marker nucleic acid being one which is expressed at a 
given control level hi normal cells of the same tissue type as the test cell, and determining the 
approximate amount of immunocomplex formation by the antibody and the proteins of the test 
cell, wherein a statistically significant difference in the amount of the immunocomplex formed 

30 with the proteins of a test cell as compared to a normal cell of the same tissue type is an 

indication that the test cell is cancerous or pre-cancerous. Preferably, the antibody is specific for 
one of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 
4493. 
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The method for producing polyclonal and/or monoclonal antibodies which specifically 
bind to polypeptides useful in the present invention is known to those of skill in the art and can 
be found in, for example Dymecki et al, 1992, J. Biol. Chem., 267:4815; Boersma & Van 
Leeuwen, 1994, J. Neurosci. Methods, 51:317; Green et al, 1982, Cell, 28:477; and Amheiter et 
5 aU 1981, Nature, 294:278. 

Another such method includes the steps of: providing an antibody specific for the gene 
product of a marker nucleic acid sequence represented by SEQ ID Nos 1-4494, the gene product 
being present in cancerous tissue of a given tissue type (e.g., colon tissue) at a level more or less 
than the level of the gene product in non-cancerous tissue of the same tissue type; obtaining from 

10 a patient a first sample of tissue of the given tissue type, which sample potentially includes 

cancerous cells; providing a second sample of tissue of the same tissue type (which may be firom 
the same patient or firom a normal control, e.g. another individual or cultured cells), this second 
sample containing normal cells and essentially no cancerous cells; contacting the antibody with 
protem (which may be partially purified, in lysed but unfiractionated cells, or in situ) of the first 

15 and second samples imder conditions permitting immunocomplex formation between the 

antibody and the marker nucleic acid sequence product present in the samples; and comparing (a) 
the amoimt of immunocomplex formation in the first sample, with (b) the amount of 
immunocomplex formation in the second sample, wherein a statistically significant difference in 
the amoimt of immunocomplex formation in the first sample less as compared to the amount of 

20 immunocomplex formation in the second sample is indicative of the presence of cancerous cells 
in the first sample of tissue. 

The subject invention fiirther provides a method of determining whether a cell sample 
obtained firom a subject possesses an abnormal amount of marker polypeptide which comprises 
(a) obtaining a cell sample from the subject, (b) quantitatively determimng the amount of the 
25 marker polypeptide in the sample so obtained, and (c) comparing the amount of the marker 
polypeptide so determined with a known standard, so as to thereby determine whether the cell 
sample obtained firom the subject possesses an abnormal amoimt of the marker polypeptide. 
Such marker polypeptides may be detected by immunohistochemical assays, dot-blot assays, 
ELISA and the like. 

30 Lmmunoassays are commonly used to quantitate the levels of proteins in cell samples, and 

many other immimoassay techniques are known in the art. The invention is not limited to a 
particular assay procedure, and therefore is intended to include both homogeneous and 
heterogeneous procedures. Exemplary immunoassays which can be conducted according to the 
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invention include fluorescence polarization immunoassay (FPIA), fluorescence immunoassay 
(FIA), en2yme immunoassay (EIA), nephelometric inhibition immunoassay (NIA\ enzyme 
linked immunosorbent assay (ELISA), and radioimmunoassay (RIA). An indicator moiety, or 
label group, can be attached to the subject antibodies and is selected so as to meet the needs of 
5 various uses of the method which are often dictated by the availabiUty of assay equipment and 
compatible immimoassay procedxires. General techniques to be used in performing the various 
immunoassays noted above are known to those of ordinary skill in the art. 

In another embodiment, the level of the encoded product, i.e., the product encoded by 
SEQ ID Nos 1-4494 or a sequence complementary thereto, or altematively the level of the 

10 polypeptide of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, 
and 4493, in a biological fluid (e.g., blood or urme) of a patient may be determined as a way of 
monitoring the level of expression of the marker nucleic acid sequence in cells of that patient. 
Such a method would include the steps of obtaining a sample of a biological fluid from the 
patient, contacting the sample (or proteins from the sample) with an antibody specific for a 

1 5 encoded marker polypeptide, and determining the amount of immime complex formation by the 
antibody, with the amoimt of inamune complex formation being indicative of the level of the 
marker encoded product in the sample. This determination is particularly instructive when 
compared to the amount of immxme complex formation by the same antibody in a control sample 
taken from a normal individual or in one or more samples previously or subsequently obtained 

20 from the same person. 

In another embodiment, the method can be used to determine the amount of marker 
polypeptide present in a cell, which in turn can be correlated with progression of a 
hyperproliferative disorder, e.g., colon cancer. The level of the marker polypeptide can be used 
predictively to evaluate whether a sample of cells contains cells which are, or are predisposed 

25 towards becoming, transformed cells. Moreover, the subject method can be used to assess the 
phenotype of cells which are known to be transformed, the phenotyping results being useful in 
planning a particular therapeutic regimen. For instance, very high levels of the marker 
polypeptide in sample cells is a powerful diagnostic and prognostic marker for a cancer, such as 
colon cancer. The observation of marker polypeptide level can be utilized in decisions 

30 regarding, e.g., the use of more aggressive therapies. 

As set out above, one aspect of the present invention relates to diagnostic assays for 
determining, in the context of cells isolated from a patient, if the level of a marker polypeptide is 
significantly reduced in the sample cells. The term "significantly reduced" refers to a cell 
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phenotype wherein the cell possesses a reduced cellular amount of the marker polypeptide 
relative to a normal cell of similar tissue origin. For example, a cell may have less than about 
50%, 25%, 10%, or 5% of the marker polypeptide that a normal control cell. In particular, the 
assay evaluates the level of marker polypeptide in the test cells, and, preferably, compares the 
5 measured level with marker polypeptide detected in at least one control cell, e.g., a normal cell 
and/or a transformed cell of known phenotype. 

Of particular importance to the subject invention is the ability to quantitate the level of 
marker polypeptide as determined by the number of cells associated with a normal or abnormal 
marker polypeptide level The number of cells with a particular marker polypeptide phenotype 
10 may then be correlated with patient prognosis. In one embodiment of the invention, the marker 
polypeptide phenotype of the lesion is determined as a percentage of cells in a biopsy which are 
found to have abnormally high/low levels of the marker polypeptide. Such expression may be 
detected by immunohistochemical assays, dot-blot assays, ELISA and the like. 

Where tissue samples are employed, immimohistochemical staining may be used to 
15 determine the number of cells having the marker polypeptide phenotype. For such staining, a 
multiblock of tissue is taken from the biopsy or other tissue sample and subjected to proteolytic 
hydrolysis, employing such agents as protease K or pepsin. In certain embodiments, it may be 
desirable to isolate a nuclear fraction from the sample cells and detect the level of the marker 
polypeptide in the nuclear fraction. 

20 The tissue samples are fixed by treatment with a reagent such as formalin, 

glutaraldehyde, methanol, or the like. The samples are then incubated with an antibody, 
preferably a monoclonal antibody, with bhiding specificity for the marker polypeptides. This 
antibody may be conjugated to a label for subsequent detection of binding. Samples are 
incubated for a time sufficient for formation of the immunocomplexes. Binding of the antibody 

25 is then detected by virtue of a label conjugated to this antibody. Where the antibody is unlabeled, 
a second labeled antibody may be employed, e.g., which is specific for the isotype of the anti- 
marker polypeptide antibody. Examples of labels which may be employed include radionuclides, 
fluorescers, chemiluminescers, enzymes and the like. 

Where enzymes are employed, the substrate for the enzyme may be added to the samples 
30 to provide a colored or fluorescent product. Examples of suitable enzymes for use in conjugates 
include horseradish peroxidase, alkaline phosphatase, malate dehydrogenase and the like. Where 
not commercially available, such antibody-enzyme conjugates are readily produced by 
techniques known to those skilled in the art. 



58 



wo 02/29086 



PCT/USOl/30732 



In one embodiment, the assay is performed as a dot blot assay. The dot blot assay finds 
particular application where tissue samples are employed as it allows determination of the 
average amount of the marker polypeptide associated with a single cell by correlating the amount 
of marker polypeptide in a cell-free extract produced from a predetermined number of cells. 

5 It is well established in the cancer literature that tumor cells of the same type (e.g., breast 

and/or colon tumor cells) may not show uniformly increased expression of individual oncogenes 
or nnifnrmly decreased expression of individual tumor suppressor genes. There may also be 
varying levels of expression of a given marker gene even between cells of a given type of cancer, 
further emphasizing the need for reliance on a battery of tests rather than a single test. 
10 Accordingly, in one aspect, the invention provides for a battery of tests utilizing a number of 
probes of the invention, in order to improve the reliability and/or accuracy of the diagnostic test. 

In one embodiment, the present invention also provides a method wherein nucleic acid 
probes are immobilized on a DNA chip in an organized array. Oligonucleotides can be bound to 
a solid support by a variety of processes, including lithography. For example a chip can hold up 

15 to 250,000 oligonucleotides (GeneChip, Affymetrix). These nucleic acid probes comprise a 
nucleotide sequence at least about 12 nucleotides in length, preferably at least about 15 
. nucleotides, more preferably at least about 25 nucleotides, and most preferably at least about 40 
nucleotides, and up to all or nearly all of a sequence which is complementary to a portion of the 
coding sequence of a marker nucleic acid sequence represented by SEQ ID Nos: 1-4494 and is 

20 differentially expressed in tumor cells, such as colon cancer cells. The present mvention provides 
significant advantages over the available tests for various cancers, such as colon cancer, because 
it increases the reliability of the test by providing an array of nucleic acid markers on a single 
chip. 

The method includes obtaining a biopsy, which is optionally fractionated by cryostat 
25 sectioning to enrich tumor cells to about 80% of the total cell population. The DNA or RNA is 
then extracted, amplified, and analyzed with a DNA chip to determine the presence of absence of 
the marker nucleic acid sequences. 

In one embodiment, the nucleic acid probes are spotted onto a substrate in a two- 
dimensional matrix or array. Samples of nucleic acids can be labeled and then hybridized to the 
30 probes. Double-stranded nucleic acids, comprising the labeled sample nucleic acids bound to 
probe nucleic acids, can be detected once the unboxmd portion of the sample is washed away. 
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The probe nucleic acids can be spotted on substrates including glass, nitrocellulose, etc. 
The probes can be bound to the substrate by either covalent bonds or by non-specific 
interactions, such as hydrophobic interactions. The sample nucleic acids can be labeled using 
radioactive labels, fluorophores, chromophores, etc. 

5 Techniques for constructing arrays and methods of using these arrays are described, for 

example, ihEPNo. 0 799 897; PCTNo. WO 97/292 12; PCTNo. WO 97127317; EPNo. 0 785 
280; PCTNo. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP No. 0 728 
520; U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO 
95/22058; and U.S. Pat. No. 5,631,734. 

10 Further, arrays can be used to examine differential expression of genes and can be used to 

determine gene function. For example, arrays of the instant nucleic acid sequences can be used to 
determine if any of the nucleic acid sequences are differentially expressed between normal cells 
and cancer cells, for example. High expression of a particular message in a cancer cell, which is 
not observed in a corresponding normal cell, can indicate a cancer specific protein. 

15 In one embodiment nucleic acid molecules useful in the present invention, such as those 

of SEQ ID Nos 1-4494, preferably those of SEQ ID Nos 4472, 4474, 4476, 4478, 4480, 4482, 
4484^ 4486, 4488, 4490, 4492, and 4494, may be used to generate macroarrays on a solid surface 
such as a membrane such that the arrayed nucleic acid molecules can be used to determine if any 
of the nucleic acids are differentially expressed between normal cells or tissue and cancerous 

20 cells or tissue. In one embodiment, the nucleic acid molecules of the invention are either cDNA 
or may be used to generate cDNA molecules to be subsequently amplified by PGR and spotted 
on nylon membranes. The membranes are then reacted with radiolabeled target nucleic acid 
molecules obtained firom equivalent samples of cancerous and normal tissue or cells. Methods of 
cDNA generation and macroarray preparation are known to those of skill in the art and may be 

25 found, for example in Bertucci et al., 1999 Hum, Mol Genet 8:2129; Nguyen et al, 1995, 

Genomics, 29: 207; Zhao et al, Gene, 156:207; Gress et al,, 1992, Mammalian Genome, 3:609; 
Zhumabayeva et al, 2001, Biotechniques, 30:158; andLennon et al., 1991, Trends Genet 7:314. 

In yet another embodiment, the invention contemplates using a panel of antibodies which 
are generated against the marker polypeptides of this invention, which polypeptides are encoded 
30 by one or more of SEQ ID Nos: 1-4494, preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 
4482, 4484, 4486, 4488, 4490, 4492, and 4494. Preferably, the antibodies are generated against 
one or more polypeptides having the sequence of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 
4481, 4483, 4485, 4487, 4489, 4491, and 4493. Such a panel of antibodies may be used as a 
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reliable diagnostic probe for colon cancer. The assay of the present invention comprises 
contacting a biopsy sample containing cells, e.g., colon cells, with a panel of antibodies to one or 
more of the encoded products to determine the presence or absence of the marker polypeptides. 

The diagnostic methods of the subject invention may also be employed as follow-up to 
5 treatment, e.g., quantitation of the level of marker polypeptides may be indicative of the 
effectiveness of current or previously employed cancer therapies as well as the effect of these 
therapies upon patient prognosis. 

Accordingly, the present invention makes available diagnostic assays and reagents for 
detecting gain and/or loss of marker polypeptides from a cell in order to aid in the diagnosis and 
10 phenotyping of proliferative disorders arising from, for example, tumorigenic transformation of 
cells. 

The diagnostic assays described above can be adapted to be used as prognostic assays, as 
well. Such an application takes advantage of the sensitivity of the assays of the invention to 
events which take place at characteristic stages in the progression of a tumor. For example, a 

15 given marker gene may be up- or downregulated at a very early stage, perhaps before the cell is 
irreversibly committed to developing into a malignancy, while another marker gene may be 
characteristically up or down regulated only at a much later stage. Such a method could involve 
the steps of contacting the mRNA of a test cell with a nucleic acid probe derived from a given 
marker nucleic acid which is expressed at different characteristic levels in cancerous or 

20 precancerous cells at different stages of tumor progression, and determining the approximate 
amount of hybridization of the probe to the mRNA of the cell, such amount being an indication 
of the level of expression of the gene in the cell, and thus an indication of the stage of tumor 
progression of the cell; alternatively, the assay can be carried out with an antibody specific for 
the gene product of the given marker nucleic acid, contacted with the proteins of the test cell. A 

25 battery of such tests will disclose not only the existence and location of a tumor, but also will 
allow the clinician to select the mode of treatment most appropriate for the tumor, and to predict 
the likelihood of success of that treatment. 

The methods of the invention can also be used to follow the clinical course of a tumor. 
For example, the assay of the invention can be applied to a tissue sample from a patient; 
30 following treatment of the patient for the cancer, another tissue sample is taken and the test 
repeated. Successful treatment will result in either removal of all cells which demonstrate 
differential expression characteristic of the cancerous or precancerous cells, or a substantial 
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increase in expression of the gene in those cells, perhaps approaching or even surpassing normal 
levels. 

In yet another embodiment, the invention provides methods for determining whether a 
subject is at risk for developing a disease, such as a predisposition to develop cancer, for 
5 example colon cancer, associated with an aberrant activity of any one of the polypeptides 
encoded by nucleic acids of SEQ ID Nos: 1-4494, preferably, any one of the polypeptides of 
SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493, 
wherein the aberrant activity of the polypeptide is characterized by detecting the presence or 
absence of a genetic lesion characterized by at least one of (i) an alteration affecting the integrity 

10 of a gene encoding a marker polypeptides, or (ii) the mis-expression of the encoding nucleic 
acid. To illustrate, such genetic lesions can be detected by ascertaining the existence of at least 
one of(i) a deletion of one or more nucleotides from the nucleic acid sequence, (ii) an addition of 
one or more nucleotides to the nucleic acid sequence, (iii) a substitution of one or more 
nucleotides of the nucleic acid sequence, (iv) a gross chromosomal rearrangement of the nucleic 

15 acid sequence, (v) a gross alteration in .the level of a messenger RNA transcript of the nucleic 
acid sequence, (vii) aberrant modification of the nucleic acid sequence, such as of the 
methylation pattern of the genomic DNA, (vii) the presence of a non-wild type spHcing pattern 
of a messenger RNA transcript of the gene, (viii) a non-wild type level of the marker 
polypeptide, (ix) allelic loss of the gene, and/or (x) inappropriate post-translational modification 

20 of the marker polypeptide. 

The present invention provides assay techniques for detecting lesions in the encoding 
nucleic acid sequence. These methods include, but are not limited to, methods involving 
sequence analysis. Southern blot hybridization, restriction enzyme site mapping, and methods 
involving detection of absence of nucleotide pairing between the nucleic acid to be analyzed and 
25 a probe. 

Specific diseases or disorders, e.g., genetic diseases or disorders, are associated with 
specific allelic variants of polymorphic regions of certain genes, which do not necessarily encode 
a mutated protein. Thus, the presence of a specific allelic variant of a polymorphic region of a 
gene in a subject can render the subject susceptible to developing a specific disease or disorder. 
30 Polymorphic regions in genes, can be identified, by determining the nucleotide sequence of 
genes in populations of individuals. If a polymorphic region is identified, then the link with a 
specific disease can be determined by studying specific populations of individuals, e.g, 
individuals which developed a specific disease, such as colon cancer. A polymorphic region can 
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be located in any region of a gene, e.g., exons, in coding or non coding regions of exons, introns, 
and promoter region. 

In an exemplary embodiment, there is provided a nucleic acid composition comprising a 
nucleic acid probe including a region of nucleotide sequence which is capable of hybridizing to a 
5 sense or antisense sequence of a gene or naturally occurring mutants thereof, or 5' or 3' flanking 
sequences or intronic sequences naturally associated with the subject genes or naturally 
occurring mutants thereof. The nucleic acid of a cell is rendered accessible for hybridization, the 
probe is contacted with the nucleic acid of the sample, and the hybridization of the probe to the 
sample nucleic acid is detected. Such techniques can be used to detect lesions or allelic variants 
10 at either the genomic or mRNA level, including deletions, substitutions, etc., as well as to 
determine mRNA transcript levels, 

A preferred detection method is allele specific hybridization using probes overlapping the 
mutation or polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around the 
mutation or polymorphic region. In a preferred embodiment of the invention, several probes 

15 capable of hybridizing specifically to allehc variants are attached to a solid phase support, e.g., a 
"chip". Mutation detection analysis using these chips comprising oligonucleotides, also termed 
"DNA probe arrays" is described e.g., in Cronin et al. (1996) Human Mutation 7:244. In one 
embodiment, a chip comprises all the allehc variants of at least one polymorphic region of a 
gene. The solid phase support is then contacted with a test nucleic acid and hybridization to the 

20 specific probes is detected. Accordingly, the identity of numerous allelic variants of one or more 
genes can be identified in a simple hybridization experiment. 

In certain embodiments, detection of the lesion comprises utilizing the probe/primer in a 
polymerase chain reaction (PGR) (see, e.g. U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PGR or RAGE PGR, or, alternatively, in a ligase chain reaction (LGR) (see, e.g., 

25 Landegran et al (1988) Science 241:1077-1080; and Nakazawa et al (1994) PNAS 91:360-364), 
the latter of which can be particularly usefiil for detecting point mutations in the gene (sec 
Abravaya et al (1995) Nuc Acid Res 23:675-682). In a merely illustrative embodiment, the 
method includes the steps of (i) collecting a sample of cells from a patient, (ii) isolating nucleic 
acid (e.g., genomic, mRNA or both) fi-om the cells of the sample, (iii) contacting the nucleic acid 

30 sample with one or more primers which specifically hybridize to a nucleic acid sequence xmder 
conditions such that hybridization and amplification of the nucleic acid (if present) occurs, and 
(iv) detecting the presence or absence of an amplification product, or detecting the size of the 
amplification product and comparing the length to a control sample. It is anticipated that PGR 
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and/or LCR may be desirable to use as a preliminary amplification step in conjunction with any 
of the techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (Guatelli, 
J.C. et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional amplification system 
5 (Kwoh, D.Y. et al., 1989, Proc. Natl. Acad. Sci. USA 86:1173-1 177), Q-Beta Replicase (Lizardi, 
P.M. et ai, 1988, Bio/Technology 6:1 197), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

10 In a preferred embodiment of the subject assay, mutations in, or allelic variants, of a gene 

from a sample cell are identified by alterations in restriction enzyme cleavage patterns. For 
example, sample and control DNA is isolated, amplified (optionally), digested with one or more 
restriction endonucleases, and firagment length sizes are determined by gel electrophoresis. 
Moreover, the use of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) 

15 can be used to score for the presence of specific mutations by development or loss of a ribozyme 
cleavage site. 

Another aspect of the invention is directed to the identification of agents capable of 
modulating the differentiation and proliferation of cells characterized by aberrant proliferation. 
In this regard, the invention provides assays for determining compounds that modulate the 
20 expression of the marker nucleic acids (SEQ ID Nos: 1-4494, preferably SEQ ID Nos 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494) and/or alter for 
example, inhibit the bioactivity of the encoded polypeptide such as those of SEQ ID Nos. 4471, 
4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, and 4493. 

Several in vivo methods can be used to identify compoimds that modulate expression of 
25 the marker nucleic acids (SEQ ID Nos: 1-4494) and/or alter for example, inhibit the bioactivity 
of the encoded polypeptide. 

Drug screening is performed by adding a test compound to a sample of cells, and 
monitoring the effect, A parallel sample which does not receive the test compound is also 
monitored as a control. The treated and untreated cells are then compared by any suitable 
30 phenotypic criteria, including but not limited to microscopic analysis, viability testing, ability to 
replicate, histological examination, the level of a particular RNA or polypeptide associated with 
the cells, the level of enzymatic activity expressed by the cells or cell lysates, and the ability of 
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the cells to interact with other cells or compounds. Differences between treated and untreated 
cells indicates effects attributable to the test compound. 

Desirable effects of a test compound include an effect on any phenotype that was 
conferred by the cancer-associated marker nucleic acid sequence. Examples include a test 
5 compound that limits the overabxmdance of mRNA, limits production of the encoded protein, or 
limits the functional effect of the protein. The effect of the test compound would be apparent 
when comparing results between treated and untreated cells. 

The invention thus also encompasses methods of screening for agents which inhibit 
expression of the nucleic acid markers (SEQ ID Nos: 1-4494, preferably SEQ ID Nos. 4472, 

10 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494) in vitro, comprising 
exposing a cell or tissue in which the marker nucleic acid mRNA is detectable in cultured cells to 
an agent in order to determine whether the agent is capable of inhibiting production of the 
mRNA; and determining the level of mRNA in the exposed cells or tissue, wherein a decrease in 
the level of the mRNA after exposure of the cell line to the agent is indicative of inhibition of the 

1 5 marker nucleic acid mRNA production. 

Alternatively, the screening method may include in vitro screening of a cell or tissue in 
which marker protein is detectable in cultured cells to an agent suspected of inhibiting 
production of the marker protein; and determining the level of the marker protein in the cells or 
tissue, wherein a decrease in the level of marker protein after exposure of the cells or tissue to 
20 the agent is indicative of inhibition of marker protein production. 

The invention also encompasses in vivo methods of screening for agents which inhibit 
expression of the marker nucleic acids, comprising exposing a mammal having tumor cells in 
which marker mRNA or protein is detectable to an agent suspected of mhibiting production of 
marker mRNA or protein; and determining the level of marker mRNA or protein in tumor cells 
25 of the exposed mammal. A decrease in the level of marker mRNA or protein after exposure of 
the manunal to the agent is indicative of inhibition of marker nucleic acid expression. 

Accordingly, the invention provides a method comprising incubating a cell expressing the 
marker nucleic acids (SEQ ED Nos: 1-4494) with a test compound and measuring the mRNA or 
protein level. The mvention further provides a method for quantitatively determioing the level of 
30 expression of the marker nucleic acids in a cell population, and a method for determining 
whether an agent is capable of increasing or decreasing the level of expression of the marker 
nucleic acids in a cell population. The method for determining whether an agent is capable of 
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increasing or decreasing the level of expression of the marker nucleic acids in a cell population 
comprises the steps of (a) preparing cell extracts from control and agent-treated cell populations, 
(b) isolating the marker polypeptides from the cell extracts, (c) quantifying (e.g., in parallel) the 
amount of an immxmocomplex formed between the marker polypeptide and an antibody specific 
5 to said polypeptide. The marker polypeptides of this invention may also be quantified by 

assaying for its bioactivity. Agents that induce increased the marker nucleic acid expression may 
be identified by their ability to increase the amount of immunocomplex formed in the treated cell 
as compared with the amount of the inmnmocomplex formed in the control cell. Li a similar 
manner, agents that decrease expression of the marker nucleic acid may be identified by their 
10 ability to decrease the amount of the immunocomplex formed in the treated cell extract as 
compared to the control cell. 

mRNA levels can be determined by Northem blot hybridization. mKNA levels can also 
be determined by methods involving PGR. Other sensitive methods for measuring mRNA, which 
can be used in high throughput assays, e.g., a method using a DELFIA endpoint detection and 

15 quantification method, are described, e.g., in Webb and Hurskainen (1996) Journal of 
Biomolecular Screening 1:119. Marker protein levels can be determined by 
immunoprecipitations or immunohistochemistiy using an antibody that specifically recognizes 
the protein product encoded by SEQ ED Nos: 1- 4494, and preferably one or more of the proteins 
having the sequence of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485. 4487, 

20 4489, 4491, and 4493. 

Agents that are identified as active in the drug screening assay are candidates to be tested 
for their capacity to block cell proliferation activity. These agents would be usefiil for treating a 
disorder mvolving aberrant growth of cells, especially colon cells. 

A variety of assay formats will suffice and, in light of the present disclosure, those not 
25 expressly described herein will nevertheless be comprehended by one of ordmary skill in the art. 
For instance, the assay can be generated in many different formats, and include assays based on 
cell-free systems, e.g., purified proteins or cell lysates, as well as cell-based assays which utilize 
intact cells. 

In many drug screening programs which test Ubraries of compounds and natural extracts, 
30 high throughput assays are desirable in order to maximize the number of compounds surveyed in 
a given period of time. Assays of the present invention which are performed in cell-firee systems, 
such as may be derived with pxirified or semi-purified proteins or with lysates, are often preferred 
as "primary" screens in that they can be generated to permit rapid development and relatively 
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easy detection of an alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be 
generally ignored in the in vitro system, the assay instead being focused primarily on the effect 
of the drug on the molecular target as may be manifest in an alteration of binding affinity with 
5 other proteins or changes in enzymatic properties of the molecular target. 

A. Use of Nucleic Acids as Probes in Mapping and in Tissue Profiling Probeg 

Polynucleotide probes as described above, e g , comprising at least 12 contiguous 
nucleotides selected from the nucleotide SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 
4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1-1 103, even more 
10 preferably SEQ ID Nos. 1-503, and still more preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 
4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, or a sequence complementary thereto, are 
used for a variety of purposes, including identification of human chromosomes and determining 
transcription levels. Additional disclosure about preferred regions of the nucleic acid sequences 
is found in the accompanying tables. 

15 The nucleotide probes are labeled, for example, with a radioactive, fluorescent, 

biotinylated, or chemiluminescent label, and detected by well known methods appropriate for the 
particular label selected. Protocols for hybridizing nucleotide probes to preparations of 
metaphase chromosomes are also well known in the art. A nucleotide probe will hybridize 
specifically to nucleotide sequences in the chromosome preparations which are complementary 

20 to the nucleotide sequence of the probe. A probe that hybridizes specifically to a nucleic acid 
should provide a detection signal at least 5-, 10-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 

In a non-limiting example, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Nucleic acids of the invention 

25 can be used to probe these regions. For example, if, through profile searching, a nucleic acid is 
identified as corresponding to a gene encoding a kinase, its ability to bmd to a cancer-related 
chromosomal region will suggest its role as a kinase in one or more stages of tumor cell 
development/growth. Although some experimentation would be required to elucidate the role, 
the nucleic acid constitutes a new material for isolating a specific protein that has potential for 

30 developing a cancer diagnostic or therapeutic. 

Nucleotide probes are used to detect expression of a gene corresponding to the nucleic 
acid. For example, in Northern blots, mRNA is separated electrophoretically and contacted with 
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a probe. A probe is detected as hybridizing to an mRNA species of a particular size. The amount 
of hybridization is quantitated to determine relative amounts of expression, for example under a 
particular condition. Probes are also used to detect products of amplification by polymerase 
chain reaction. The products of the reaction are hybridized to the probe and hybrids are detected. 
5 Probes are used for in situ hybridization to cells to detect expression. Probes can also be used in 
vivo for diagnostic detection of hybridizing sequences. Probes are typically labeled with a 
radioactive isotope. Other types of detectable labels may be used such as chromophores, 
fluorophores, and enzymes. 

Expression of specific mRNA can vary in different cell types and can be tissue specific. 

10 This variation of mRNA levels in different cell types can be exploited with nucleic acid probe 
assays to determine tissue types. For example, PGR, branched DNA probe assays, or blotting 
techniques utilizing nucleic acid probes substantially identical or complementary to nucleic acids 
of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494, preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, and still more 

15 preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494, or a sequence complementary thereto, can determine the presence or absence of target 
cDNA or mRNA. 

Examples of a nucleotide hybridization assay are described in Urdea et al, PCT 
W092/02526 and Urdea et al, U.S. Patent No. 5,124,246, both incorporated herein by reference. 
20 The references describe an example of a sandwich nucleotide hybridization assay. 

Alternatively, the Polymerase Chain Reaction (PGR) is another means for detecting small 
amounts of target nucleic acids, as described in MuUis et al, Met/i. Enzymol (1987) /55.*335- 
350; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202, all incorporated herein by 
reference. Two primer polynucleotides nucleotides hybridize with the target nucleic acids and 

25 are used to prime the reaction. The primers may be composed of sequence within or 3 * and 5 * to 
the polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these 
polynucleotides, they need not hybridize to them or the complements. A thermostable 
polymerase creates copies of target nucleic acids from the primers using the original target 
nucleic acids as a template. After a large amount of target nucleic acids is generated by the 

30 polymerase, it is detected by methods such as Southern blots. When using the Southern blot 
method, the labeled probe will hybridize to a polynucleotide of the Sequence Listing or 
complement. 
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Furthermore, mRNA or cDNA can be detected by traditional blotting techniques 
described in Sambrook et al, "Molecular Cloning: A Laboratory Manual" (New York, Cold 
Spring Harbor Laboratory, 1989). mRNA or cDNA generated from mRNA using a polymerase 
enzyme can be purified and separated using gel electrophoresis. The nucleic acids on the gel are 
5 then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a labeled 
probe and then washed to remove any unhybridized probe. Next, the duplexes containing the 
labeled probe are detected. Typically, the probe is labeled with radioactivity, 

Mapping 

Nucleic acids of the present invention are used to identify a chromosome on which the 
corresponding gene resides. Using fluorescence in siiu hybridization (FISH) on normal 
metaphase spreads, comparative genomic hybridization allows total genome assessment of 
changes in relative copy number of DNA sequences. See Schwartz and Samad, Current Opinions 
in Biotechnology (1994) 8:10-1 A\ Kallioniemi et al, Seminars in Cancer Biology (1993) ^:41-46; 
Valdes and Tagle, Methods in Molecular Biology (1997) 68: L Boultwood, ed., Human Press, 
Totowa, NJ. 

Preparations of human metaphase chromosomes are prepared using standard cytogenetic 
techniques from human primary tissues or cell lines. Nucleotide probes comprising at least 12 
contiguous nucleotides selected from the nucleotide sequence of SEQ ID Nos. 1-4470, 4472, 
4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 
1-1103, even more preferably SEQ ID Nos, 1-503, and still more preferably SEQ ID Nos. 4472, 
4474^ 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, or a sequence 
complementary thereto, are used to identify the corresponding chromosome. The nucleotide 
probes are labeled, for example, with a radioactive, fluorescent, biotinylated, or 
cherailuminescent label, and detected by well known methods appropriate for the particular label 
selected. Protocols for hybridizing nucleotide probes to preparations of metaphase chromosomes 
are also well known in the art. A nucleotide probe will hybridize specifically to nucleotide 
sequences in the chromosome preparations that are complementary to the nucleotide sequence of 
the probe. A probe that hybridizes specifically to a target gene provides a detection signal at least 
5-, 10-, or 20-fold higher than the background hybridization provided with unrelated coding 
sequences. 

Nucleic acids are mapped to particular chromosomes using, for example, radiation 
hybrids or chromosome-specific hybrid panels. See Leach et ah. Advances in Genetics, (1995) 
33:63-99; Walter et aL Nature Genetics (1994) 7:22-28; Walter and Goodfellow, Trends in 
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Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from Research 
Genetics, Inc., Hxintsville, Alabama, USA. Databases for markers using various panels are 
available via the v/orld wide web at http:/F/shgc-www.stanford.edu, and other locations. The 
statistical program RHMAP can be used to constmct a map based on the data from radiation 
5 hybridization with a measure of the relative likelihood of one order versus another, RHMAP is 
available via the world wide web at http://www.sph.iimich.edu/group/statgen/software. 

Such mapping can be useful in identifying the function of the target gene by its proximity 
to other genes with known function. Fxmction can also be assigned to the target gene when 
particular syndromes or diseases map to the same chromosome. 

10 Tissue Profiling 

The nucleic acids of the present invention can be used to determine the tissue type from 
which a given sample is derived. For example, a metastatic lesion is identified by its 
developmental organ or tissue source by identifying the expression of a particular marker of that 
organ or tissue. If a nucleic acid is expressed only in a specific tissue type, and a metastatic 

1 5 lesion is found to express that nucleic acid, then the developmental source of the lesion has been 
identified. Expression of a particular nucleic acid is assayed by detection of either the 
corresponding mRNA or the protein product. Immimological methods, such as antibody staining, 
are used to detect a particular protein product. Hybridization methods may be used to detect 
particular mRNA species, including but not limited to in situ hybridization and Northern 

20 blotting. 

Use of Polymorphisms 

A nucleic acid will be useful in forensics, genetic analysis, mapping, and diagnostic 
applications if the corresponding region of a gene is polymorphic in the human population. A 
particular polymorphic form of the nucleic acid may be used to either identify a sample as 
25 deriving from a suspect or rule out the possibility that the sample derives from the suspect. Any 
means for detecting a polymorphism in a gene are used, including but not limited to 
electrophoresis of protein polymorphic variants, differential sensitivity to restriction enzyme 
cleavage, and hybridization to an allele-specific probe. 

B. Use of Nucleic Acids and Encoded Polypeptides to Raise A ntibodies 

30 Expression products of a nucleic acid, the corresponding mRNA or cDNA, or the • 

corresponding complete gene are prepared and used for raising antibodies for experimental, 
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diagnostic, and therapeutic purposes. For nucleic acids to which a corresponding gene has not 
been assigned, this provides an additional method of identifying the corresponding gene. The 
nucleic acid or related cDNA is expressed as described above, and antibodies are prepared. 
These antibodies are specific to an epitope on the encoded polypeptide, and can precipitate or 
5 bind to the corresponding native protein in a cell or tissue preparation or in a cell-free extract of 
an in vitro expression system. 

Immunogens for raising antibodies are prepared by mixing the polypeptides encoded by 
the nucleic acids of the present invention with adjuvants. Alternatively, polypeptides are made as 
fusion proteins to larger immunogenic proteins. Polypeptides are also covalently linked to other 

10 larger immunogenic proteins, such as keyhole limpet hemocyanin. Immimogens are typically 

administered mtradermally, subcutaneously, or intramuscularly, Immunogens are administered to 
experimental animals such as rabbits, sheep, and mice, to generate antibodies. Optionally, the 
animal spleen cells are isolated and fused with myeloma cells to form hybridomas which secrete 
monoclonal antibodies. Such methods are well known in the art According to another method 

1 5 known in the art, the nucleic acid is administered directly, such as by intramuscular injection, 
and expressed in vivo. The expressed protein generates a variety of protein-specific immune 
responses, including production of antibodies, comparable to administration of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for nucleic acid-encoded 
proteins and polypeptides are made using standard methods known in the art. The antibodies 

20 specifically bind to epitopes present in the polypeptides encoded by a nucleic acid of SEQ ID 
Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, 
preferably SEQ ID Nos. 1-1 103, even more preferably SEQ ID Nos. 1-503, and still more 
preferably SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 
4494, or a sequence complementary thereto. In a preferred embodiment the antibodies bind to 

25 epitopes on the polypeptides of SEQ ID Nos. 4471, 4473, 4475, 4479, 4481, 4483, 4485, 4487, 
4489, 4491, and 4493. Typically, at least about 6, 8, 10, or 12 contiguous amino acids are 
required to form an epitope. However, epitopes which involve noncontiguous amino acids may 
require more, for example, at least about 15, 25, or 50 amino acids. A short sequence of a 
nucleic acid may then be unsuitable for use as an epitope to raise antibodies for identifying the 

30 corresponding novel protein, because of the potential for cross-reactivity with a known protein. 
However, the antibodies may be useful for other purposes, particularly if they identify common 
structural features of a known protein and a novel polypeptide encoded by a nucleic acid of the 
invention. 
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Antibodies that specifically bind to human nucleic acid-encoded polypeptides should 
provide a detection signal at least about 5-, 10-, or 20-fold higher than a detection signal 
provided with other proteins when used in Western blots or other immunochemical assays. 
Preferably, antibodies that specifically bind nucleic acid T-encoded polypeptides do not detect 
5 other proteins in immxmochemical assays and can immunoprecipitate nucleic acid-encoded 
proteins from solution. 

To test for the presence of serum antibodies to the nucleic acid-encoded polypeptide in a 
human population, human antibodies are pmfied by methods well known in the art. Preferably, 
the antibodies are affinity purified by passing antiserum over a column to which a nucleic acid- 
10 encoded protein, polypeptide, or fusion protein is bound. The bound antibodies can then be 
eluted from the coliimn, for example using a buffer with a high salt concentration. 

In addition to the antibodies discussed above, genetically engineered antibody derivatives 
are made, such as single chain antibodies. 

Antibodies may be made by using standard protocols known in the art (See, for example, 
1 5 Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)), A 
mammal, such as a mouse, hamster, or rabbit can be immimized with an immunogenic form of 
the peptide (e.g., a mammalian polypeptide or an antigenic fragment which is capable of eliciting 
an antibody response, or a fusion protein as described above). 

In one aspect, this invention includes monoclonal antibodies that show a subject 
20 polypeptide is highly expressed in colorectal tissue or tumor tissue, especially colon cancer tissue 
or colon cancer-derived cell lines. Therefore, in one embodiment, this invention provides a 
diagnostic tool for the analysis of expression of a subject polypeptide in general, and in 
particular, as a diagnostic for colon cancer. 

Techniques for conferring immunogenicity on a protein or peptide include conjugation to 
25 carriers or other techniques well known in the art. An immunogenic portion of a protein can be 
administered in the presence of adjuvant The progress of immunization can be monitored by 
detection of antibody titers in plasma or serum. Standard ELISA or other immunoassays can be 
used with the immunogen as antigen to assess the levels of antibodies. In a preferred 
embodiment, the subject antibodies are immunospecific for antigenic determinants of a protein 
30 of a mammal, e.g., antigenic determinants of a protein encoded by one of SEQ ID Nos. 1 -4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 or closely related 
homologs (e.g., at least 90% identical, and more preferably at least 95% identical). 
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Following immimization of an animal with an antigenic preparation of a polypeptide, 
antisera can be obtained and, if desired, polyclonal antibodies isolated from the serum. To 
produce monoclonal antibodies, antibody-producing cells (lymphocytes) can be harvested from 
an immunized animal and fused by standard somatic cell fusion procedures with immortalizing 
5 cells such as myeloma cells to yield hybridoma cells. Such techniques are well known in the art, 
and include, for example, the hybridoma technique (originally developed by Kohler and 
Milstein, (1975) Nature, 256: 495-497), the human B cell hybridoma technique (Kozbar et aU 
(1983) Immunology Today, 4: 72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al, (1985) Monoclonal Antibodies and Cancer Therapy, Alan R. 
10 Liss, Inc. pp. 77-96). Hybridoma cells can be screened immunochemically for production of 
antibodies specifically reactive with a polypeptide of the present invention and monoclonal 
antibodies isolated from a culture comprising such hybridoma cells. 

The term antibody as used herein is intended to include fragments thereof which are also 
specifically reactive with one of the subject polypeptides. Antibodies can be fragmented using 

15 conventional techniques and the fragments screened for utihty in the same manner as described 
above for whole antibodies. For example, F(ab)2 fragments can be generated by treating antibody 
with pepsin. The resulting F(ab)2 fragment can be treated to reduce disulfide bridges to produce 
Fab fragments. The antibody of the present invention is further intended to include bispecific, 
single-chain, and chimeric and humanized molecules having affinity for a polypeptide conferred 

20 by at least one CDR region of the antibody. In preferred embodiments, the antibodies, the 

antibody further comprises a label attached thereto and able to be detected, (e.g., the label can be 
a radioisotope, fluorescent compound, chemiluminescent compoimd, enzyme, or enzyme co- 
factor). 

Antibodies can be used, e.g., to monitor protein levels in an individual for determining, 
25 e.g., whether a subject has a disease or condition, such as colon cancer, associated with an 

aberrant protein level, or allowing determination of the efficacy of a given treatment regimen for 
an individual afflicted with s^uch a disorder. The level of polypeptides may be measured from 
cells in bodily fluid, such as in blood samples. 

Another application of antibodies of the present invention is in the immunological 
30 screening of cDNA libraries constructed in expression vectors such as gtll, gtl8-23, ZAP, and 
0RF8. Messenger libraries of this type, having coding sequences inserted in the correct reading 
frame and orientation, can produce fusion proteins. For instance, gtl 1 will produce fusion 
proteins whose amino termini consist of P-galactosidase amino acid sequences and whose 
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carboxyl termini consist of a foreign polypeptide. Antigenic epitopes of a protein, e.g., other 
orthologs of a particular protein or other paralogs from the same species, can then be detected 
with antibodies, as, for example, reacting nitrocellulose filters lifted fi-om infected plates with 
antibodies. Positive phage detected by this assay can then be isolated from the infected plate. 
5 Thus, the presence of homologs can be detected and cloned from other animals, as can alternate 
isoforms (including splicing variants) from humans. 

In another embodiment, a panel of monoclonal antibodies may be used, wherein each of 
the epitope's involved ftmctions are represented by a monoclonal antibody. Loss or perturbation 
of binding of a monoclonal antibody in the panel would be indicative of a mutational attention of 
10 the protein and thus of the corresponding gene. 

C. Differential Expression 

The present invention also provides a method to identify abnormal or diseased tissue in a 
human. For nucleic acids corresponding to profiles of protein families as described above, the 
choice of tissue may be dictated by the putative biological function. The expression of a gene 
15 corresponding to a specific nucleic acid is compared between a first tissue that is siispected of 
being diseased and a second, normal tissue of the human. The normal tissue is any tissue of the 
human, especially those that express the target gene including, but not limited to, brain, thymus, 
testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, pancreas, and the 
mucosal lining of the colon. 

20 The tissue suspected of being abnormal or diseased can be derived from a different tissue 

type of the human, but preferably it is derived from the same tissue type; for example an 
intestinal polyp or other abnormal growth should be compared with normal intestinal tissue. A 
difference between the target gene, mRNA, or protein in the two tissues which are compared, for 
example in molecular weight, amino acid or nucleotide sequence, or relative abundance, 

25 indicates a change in the gene, or a gene which regulates it, in the tissue of the human that was 
suspected of being diseased. 

The target genes in the two tissues are compared by any means known in the art For 
example, the two genes are sequenced, and the sequence of the gene in the tissue suspected of 
being diseased is compared with the gene sequence in the normal tissue. The target genes, or 
30 portions thereof, in the two tissues are amplified, for example using nucleotide primers based on 
the nucleotide sequence shown in the Sequence Listing, using the polymerase chain reaction. 
The amplified genes or portions of genes are hybridized to nucleotide probes selected from a 
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corresponding nucleotide sequence shown SEQ ID No. 1-4494, A difference in the nucleotide 
sequence of the target gene in the tissue suspected of being diseased compared with the normal 
nucleotide sequence suggests a role of the nucleic acid-encoded proteins in the disease, and 
provides a lead for preparing a therapeutic agent. The nucleotide probes are labeled by a variety 
5 of methods, such as radiolabeling, biotinylation, or labeling with fluorescent or 
chemiluminescent tags, and detected by standard methods known in the art. 

Alternatively, target mRNA in the two tissues is compared. PolyA"^KNA is isolated from 
the two tissues as is known in the art. For example, one of skill in the art can readily determine 
differences in the size or amount of target mRNA transcripts between the two tissues using 
10 Northern blots and nucleotide probes selected from the nucleotide sequence shown in the 
Sequence Listing. Increased or decreased expression of a target mRNA in a tissue sample 
suspected of being diseased, compared with the expression of the same target mRNA in a normal 
tissue, suggests that the expressed protein has a role in the disease, and also provides a lead for 
preparing a therapeutic agent. 

15 Any method for analyzing proteins is used to compare two nucleic acid-encoded proteins 

from matched samples. The sizes of the proteins in the two tissues are compared, for example, 
using antibodies of the present invention to detect nucleic acid-encoded proteins in Western blots 
of protein extracts from the two tissues. Other changes, such as expression levels and subcellular 
localization, can also be detected immunologically, using antibodies to the corresponding 

20 protein. A higher or lower level of nucleic acid-encoded protein expression in a tissue suspected 
of being diseased, compared with the same nucleic acid-encoded protein expression level in a 
normal tissue, is indicative that the expressed protein has a role in the disease, and provides 
another lead for preparing a therapeutic agent. 

Similarly, comparison of gene sequences or of gene expression products, e.g., mRNA and 
25 protein, between a human tissue that is suspected of being diseased and a normal tissue of a 
human, are used to follow disease progression or remission in the human. Such comparisons of 
genes, mRNA, or protein are made as described above. 

For example, increased or decreased expression of the target gene in the tissue suspected 
of being neoplastic can indicate the presence of neoplastic cells in the tissue. The degree of 
30 increased expression of the target gene in the neoplastic tissue relative to expression of the gene 
in normal tissue, or differences in the amount of increased expression of the target gene in the 
neoplastic tissue over time, is used to assess the progression of the neoplasia ia that tissue or to 
monitor the response of the neoplastic tissue to a therapeutic protocol over time. 
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The expression pattern of any two cell types can be compared, such as low and high 
metastatic tumor cell lines, or cells from tissue which have and have not been exposed to a 
therapeutic agent. A genetic predisposition to disease in a human is detected by comparing an 
target gene, mRNA, or protein in a fetal tissue with a normal target gene, mRNA, or protein. 
5 Fetal tissues that are used for this purpose include, but are not limited to, amniotic fluid, 
chorionic villi, blood, and the blastomere of an in v/rro-fertilized embryo. The comparable 
normal target gene is obtained from any tissue. The mRNA or protein is obtained from a normal 
tissue of a human in which the target gene is expressed. Differences such as alterations in the 
nucleotide sequence or size of the fetal target gene or mRNA, or alterations in the molecular 
10 weight, amino acid sequence, or relative abundance of fetal target protein, can indicate a 
germline mutation in the target gene of the fetus, which indicates a genetic predisposition to 
disease. 

In a preferred embodiment nucleic acid macroarrays comprising the one or more of the 
sequences of SEQ IDNos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 

1 5 4490, 4492, and 4494 may be used to evaluate differential expression of nucleic acid sequences 
in cancerous cells or tissue relative to the expression of the same sequences in normal cells or 
tissue as described above. Preferably, such sequences are differentially expressed by at least 3 
fold in cancerous cells or tissue relative to normal cells or tissue. More specifically, the present 
invention provides the frail length sequences of SEQ ID Nos. 4472, 4474. 4476, 4478, 4480, 

20 4482, 4484, 4486, 4488, 4490, 4492, and 4494 which are differentially expressed in cancerous 
colonic cells/tissue by at least 3 fold relative to normal patient samples. Thus, the sequences of 
SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, as 
well as the encoded polypeptides (SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 
4485, 4487, 4489, 4491, and 4493, respectively) serve as valuable diagnostic markers for 

25 identifying and screening for colon cancer in a patient. 

D. Use of Nucleic Acids, and Encoded Polvpeptides to Screen for Peptide Analogs 
apd Ant^godsts 

Polypeptides encoded by the instant nucleic acids, e.g., SEQ ID Nos. 1-4470, 4472, 4474, 
4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, preferably SEQ ID Nos. 1- 
30 1103, even more preferably SEQ ID Nos. 1-503, and most preferably SEQ ID Nos. 4472, 4474, 
4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494, or a sequence complementary 
thereto, and corresponding full length genes can be used to screen peptide libraries to identify 
binding partners, such as receptors, from among the encoded polypeptides. Preferably, the 
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polypeptides of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 
4491, and 4493 may be used screen for binding partners. 

A library of peptides may be synthesized following the methods disclosed in U.S. Pat. 
No. 5,010,175, and in PCT WO 91/17823. As described below in brief, one prepares a mixture of 
5 peptides, which is then screened to identify the peptides exhibiting the desired signal 

transduction and receptor binding activity. In the ' 175 method, a suitable peptide synthesis 
support (e.g., a resin) is coupled to a mixture of appropriately protected, activated amino acids. 
The concentration of each amino acid in the reaction mixture is balanced or adjusted in inverse 
proportion to its coupling reaction rate so that the product is an equimolar mixture of amino acids 

10 coupled to the starting resin. The bound amino acids are then deprotected, and reacted with 
another balanced amino acid mixture to form an equimolar mixture of all possible dipeptides. 
This process is repeated until a mixture of peptides of the desired length (e.g., hexamers) is 
formed. Note that one need not include all amino acids in each step: one may include only one or 
two amino acids in some steps (e.g., where it is known that a particular amino acid is essential in 

15 a given position), thus reducing the complexity of the mixture. After the synthesis of the peptide 
library is completed, the mixture of peptides is screened for binding to the selected polypeptide. 
The peptides are then tested for their ability to inhibit or enhance activity. Peptides exhibiting the 
desired activity are then isolated and sequenced. 

The method described in WO 91/17823 is similar. However, instead of reacting the 
20 synthesis resin with a mixture of activated amino acids, the resin is divided into twenty equal 

portions (or into a number of portions corresponding to the number of different amino acids to be 
added in that step), and each amino acid is coupled individually to its portion of resin. The resin 
portions are then combined, mixed, and again divided into a number of equal portions for 
reaction with the second amino acid. In this manner, each reaction may be easily driven to 
25 completion. Additionally, one may maintain separate "subpools" by treating portions in parallel, 
rather than combining all resins at each step. This simplifies the process of determining which 
peptides are responsible for any observed receptor binding or signal transduction activity. 

In such cases, the subpools containing, e.g., 1-2,000 candidates each are exposed to one 
or more polypeptides of the invention. Each subpool that produces a positive result is then 
30 resynthesized as a group of smaller subpools (sub-subpools) containing, e.g., 20-100 candidates, 
and reassayed. Positive sub-subpools may be resynthesized as individual compounds, and 
assayed finally to determine the peptides that exhibit a high binding constant. These peptides can 
be tested for their ability to inhibit or enhance the native activity. The methods described in WO 
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91/7823 and U.S. Patent No. 5,194,392 (herein incorporated by reference) enable the preparation 
of such pools and subpools by automated techniques in parallel, such that all synthesis and 
resynthesis may be performed in a matter of days. 

Peptide agonists or antagonists are screened using any available method, such as signal 
5 transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis assays, etc. The 
methods described herein are presently preferred. The assay conditions ideally should resemble 
the conditions under which the native activity is exhibited in vivo, that is, under physiologic pH, 
temperature, and ionic strength. Suitable agonists or antagonists will exhibit strong inhibition or 
enhancement of the native activity at concentrations that do not cause toxic side effects in the 
10 subject. Agonists or antagonists that compete for binding to the native polypeptide may require 
concentrations equal to or greater than the native concentration, while inhibitors capable of 
binding ineversibly to the polypeptide may be added in concentrations on the order of the native 
concentration. 

The end results of such screening and experimentation will be at least one novel 
1 5 polypeptide binding partner, such as a receptor, encoded by a nucleic acid of the invention, and 
at least one peptide agonist or antagonist of the novel binding partner. Such agonists and 
antagonists can be used to modulate, enhance, or inhibit receptor function in cells to which the 
receptor is native, or in cells that possess the receptor as a result of genetic engineering. Further, 
if the novel receptor shares biologically important characteristics with a known receptor, 
20 information about agonist/antagonist binding may help in developing improved 
agonists/antagonists of the known receptor. 

The practice of the present invention will employ, imless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, 
microbiology, recombinant DNA, and immunology, which are within the skill of tiie art. Such 

25 techniques are explained fully in the literature. See, for example. Molecular Cloning A 
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: 1989); DNA Cloning, Volumes I and II (D.N. Glover ed., 1985); 
Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al U.S. Patent No. 4,683,195; 
Nucleic Acid Hybridization (B.D. Hames & S. J. Higgins eds. 1984); Transcription And 

30 Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. 1. Freshney, 
Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A 
Practical Guide To Molecular Cloning (1984); the treatise, Methods in Enzymology (Academic 
Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M.P. Calos 
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eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et 
al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., 
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. 
Weir and C.C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
5 Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

As mentioned above, the sequences described herein are believed to have particular 
utility in regards to colon cancer. However, they may also be useful with other types of cancers 
and other disease states. 

The present invention will now be illustrated by reference to the following examples 
10 which set forth particularly advantageous embodiments. However, it should be noted that these 
embodiments are illustrative and are not to be construed as restricting the invention m any way. 

XI. Examples 

A. Identification of differentiallv expressed sequences. 

Description of the Libraries 

15 SEQ ID Nos: 1-4470 were derived from libraries designated as 101, 102, 103, 104, 109, 

1 10, 1 1 1 , and 1 12 as described below briefly and in the accompanying table (Table 1). For 
example, the 101 library is a normalized, colon cancer specific, subtracted cDNA library. It is 
specific for sequences expressed in colon cancer [proximal and distal Dukes' B, microsatellite 
instability negative (MSI-)] but not expressed in normal tissues, including normal colon tissue. 

20 The 102 library is a normalized, colon specific, subtracted cDNA library. It is specific for 
sequences expressed in normal colon tissue but not expressed in other normal tissues. 
Characteristics of the remaining libraries are described in Table 1. 



Table 1 Library designation and description 



Library 
Designation 


Description 


101 


Specific for sequences expressed in colon cancer (proximal and distal 
Dukes' B, MSI-) but not expressed in normal tissues'^, including colon^ 


102 


Specific for sequences expressed in normal colon (normal tissue firom 
proximal and distal Dukes' B, MSI-matrix patients)^, but not expressed in 
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other normal tissues^ 


103 


Specific for sequences expressed in proximal Dukes' B, MSI- colon cancer, 
but not expressed in normal colon tissue^ 


104 


Specific for sequences expressed in distal Dukes' B, MSI- colon cancer, but 
not expressed in normal colon tissue^ 


109 


Specific for sequences expressed in proximal Dukes' B, MSI+ colon cancer, 
but not expressed in normal colon tissue^ 


110 


Specific for sequences expressed in proximal Dukes' B, MSI+ colon cancer, 
but not expressed in other normal tissues^, including colon^ 


111 


Specific for sequences expressed in distal, Dukes' D, MSI- colon cancer, but 
not expressed in normal colon tissue^ 


112 


Specific for sequences expressed in distal, Dukes' D, MSI- colon cancer, but 
not expressed in normal tissues'*, including colon^ . 



* cDNA synthesized from SW480 poly A+ RNA obtained form Clontech, Palo Alto, CA 

^ cDNA synthesized from normal colon tissue total RNA obtained from OriGene Technologies, Inc.; RockviUe, 

MD 



^ Corresponding normal colon epithelium from colon cancer patients. 
5 A pool of cDNAs synthesized from the following normal tissue RNAs (poly A+ or total) 
obtained from OriGene Technologies, Inc.: heart, kidney, spleen, Uver, peripheral blood 
lymphocytes, small intestine, skeletal muscle, bmg and prostate. 

Constmction of the n ormalized and subtracted cDNA libraries 

The normalized and subtracted cDNA libraries were constructed according to published 
10 procedures (Daitchenko et al, 1996 PNAS 93:6025-6030, Gurskaya et al, 1996 Analytical 
Biochemistry 240:90-97), Commercially available kits from Clontech Laboratories, Inc., Palo 
Alto, California were utilized (Clontech SMART cDNA synthesis kit, catalog number Kl 052-1, 
and Clontech PCR-Select cDNA Subtraction kit, catalog number K1804-1). For each subtracted 
library, the specific or "tester" cDNA was comprised of amplified cDNA from four similar 
15 sample types that were pooled together. Likewise, the reference or "driver" cDNA was 

comprised of a pool of sample types as illustrated in Table 1. During the subtraction process, the 
genes or transcripts unique to the tester are retained, and the genes or transcripts common to both 
the tester and driver are removed. Thus, in principle, the clones present in the subtracted 
libraries indicate those genes or transcripts that are expressed (or overexpressed) in the tester, but 
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not expressed (or underexpressed) in the driver. Reverse-subtracted libraries were also 
constructed in which the tester and driver materials were reversed. These libraries were only 
utilized to prepare labeled targets (see below). 

To construct the libraries, one microgram of total RNA from each sample was 
5 representatively amplified using the Clontech SMART cDNA synthesis kit. The amplified 
cDNA was purified and pooled to create the individual tester and driver samples that were used 
for the subsequent library construction. To construct the normalized and subtracted libraries, the 
Clontech PCR-Select cDNA Subtraction kit was utilized. A forty-five fold mass excess of driver 
cDNA (450 nanograms) was used for each subtraction experiment. Subtractive hybridization of 

1 0 tester with driver cDNAs was performed twice, each time for about 8-12 hours. Subtracted 
cancer specific cDNA was ligated into the pCR2.1-T0P0 plasmid vector (Invitrogen 
Corporation, Carlsbad CA) and chemically transformed into ultracompetent Epicurian E. coli 
XLIO-Gold cells (Stratagene, La Jolla, CA). The transformed cells were plated onto LB- 
ampicillin plates containing IPTG and X-gal. Individual white colonies, representing those with 

15 cloned inserts, were picked and grown overnight in LB-ampicillin broth. Plasmid DNA was 
purified using QIAprep 96 Turbo kits firom Qiagen (Valencia, CA). 

Segnencing of the clones 

The nucleotide sequence of the inserts firom clones was determined by single-pass 
sequencing fi*om either the T7 or M13 promoter sites using fluorescently labeled 
20 dideoxynucleotides via the Sanger sequencing method. The nucleotide sequences of the 

individual clones were compared to those in public databases (GenBank, dbEST, Geneseq) via 
Blast 2 homology searches according to methods described in the text. 

The sequences derived firom individual clones from the libraries described above 
represents a sequence from a partial mRNA transcript, since the cDNA used for making the 
25 subtracted library was restricted with Rsal, a four base cutter restriction endonuclease that 
generates fragments with an average size of about 600 base pairs. 

The nucleic acids of the invention were assigned a sequence identification number (see 
Figure 1). The nucleic acid sequences are provided in the attached Sequence Listing. 

Validation of difFe rential expression in colon cancer 

30 To validate that the differentially expressed sequences found in this library were specific 

to colon cancer, the inserts from the plasmid DNA were amplified by PCR using vector-specific 
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primers. The amplification products were arrayed onto nylon membranes and hybridized with 
^^P-labeled cDNAs prepared from both the subtracted library cDNA as well as the corresponding 
reverse-subtracted cDNA library. Each membrane array comprises approximately 3,456 clones. 
Four such membranes where generated comprising the clone libraries shown in Table 1 as 
5 indicated below in Table 3 . 



Membrane TD Nximber 


Library Clones 


lOl-l 


Clones from subtracted library 101 


101-2 


Clones from subtracted library 101 and 102 


103104109 


Clones form subtracted libraries 103, 104, and 
109 


110111112 


Clones from subtracted Hbraries 1 10, 1 1 1, and 
112 



The set of foiu: membranes is hybridized, using techinques known to those of skill in the 
art and further described above, with ^^P-labeled target nucleic acid molecules obtained from 
human colon cancer tissue. A second, identical set of membranes is hybridized with ^^P-labeled 
10 target nucleic acid molecules obtained from normal human colon tissue. The signals of the 
hybridization produces on the cancer membrane are subsequently compared to those on the 
normal membrane. A difference in hybridization, indicative of a difference in expression of the 
sequence in colon cancer vs. normal, of at least 3 fold is considered to be indicative of 
differential expression. 

15 Using this validation technique, the full length cDNA sequences of SEQ ID Nos. 

4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486. 4488, 4490, 4492, and 4494 have been 
identified as significantly differentially expressed in colon cancer relative to normal colon tissue. 

Those skilled in the art will recognize, or be able to ascertain, using not more than routine 
experimentation, many equivalents to the specific embodiments of the invention described 
20 herein. Such specific embodiments and equivalents are intended to be encompassed by the 
following claims. 

All patents, published patent applications, and publications cited herein are incorporated 
by reference as if set forth fully herein. 
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What is claimed is: 

1 . A method for detecting cancer in which one or more of SEQ ID Nos. 1-4470, 
4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 are used as probes, 

5 said method comprising: 

(a) collecting a sample of cells from a patient, 

(b) isolating nucleic acid from the cells of the sample, 

(c) contacting the nucleic acid sample with one or more primers which 
specifically hybridize to a nucleic acid sequence of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 

10 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, and 4494 under conditions such that 
hybridization and amplification of the nucleic acid occurs, and 

(d) comparing the presence, absence, or size of an amplification product to the 
amphfication product of a normal cell. 

2. A method of claim 1 in which said cancer is colon cancer. 

15 3 . A method for detecting cancer in a patient sample in which an antibody to a 

protein encoded by SEQ ID Nos. 1-4470 is used to react with proteins in said sample. 

4. A method for detecting cancer in a patient sample in which an antibody to a 
protein encoded by one or more of SEQ ID Nos, 4472, 4474, 4476, 4478, 4480, 4482, 4484, 
4486, 4488, 4490, 4492, or 4494 is used to react with proteins m said sample. 

20 5, A method for detecting cancer in a patient sample in which an antibody to a 

protem having the sequence of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 
4487, 4489, 4491, or 4493 is used to react with proteins in said sample. 

6. A method for identifying an agent which alters the level of expression in a cell of 
a nucleic acid which hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470 /or a 
25 sequence complementary thereto, comprising 

(a) providing a cell; 

(b) treating the cell with a test agent; 
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(c) determining the level of expression in the cell of a nucleic acid which 
hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470 or a sequence 
complementary thereto; and 

(d) comparing the level of expression of the nucleic acid in the treated cell 

5 with the level of expression of the nucleic acid in an untreated cell, wherein a change in the level 
of expression of the nucleic acid in the treated cell relative to the level of expression of the 
nucleic acid in the untreated cell is indicative of an agent which alters the level of expression of 
the nucleic acid in a cell. 

7. A method for identifymg an agent which alters the level of expression in a cell of 
10 a nucleic acid which hybridizes under stringent conditions to one of SEQ ID Nos. 4472, 4474, 

4476^ 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, or 4494 or a sequence complementary 
thereto, comprising 

(a) providing a cell; 

(b) treating the cell with a test agent; 

15 (c) determining the level of expression in the cell of a nucleic acid which 

hybridizes under stringent conditions to one of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 
4482, 4484, 4486, 4488, 4490, 4492, or 4494 or a sequence complementary thereto; and 

(d) comparing the level of expression of the nucleic acid in the treated cell 
with the level of expression of the nucleic acid in an untreated cell, wherein a change in the level 
20 of expression of the nucleic acid in the treated cell relative to the level of expression of the 

nucleic acid in the imtreated cell is indicative of an agent which alters the level of expression of 
the nucleic acid in a cell. 

8. A method for identifymg an agent which alters the level of expression in a cell of 
a polypeptide of one or more of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 

25 4487, 4489, 449 1 , or 4493 comprising 

(a) providing a cell; 

(b) treating the cell with a test agent; 

(c) determining the level of expression of one or more polypeptides of SEQ 
ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, or 4493 in said cell 
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by reacting said cell with an antibody specific for one or more of the polypeptides of SEQ ED 
Nos. 4471, 4473, 4475, 4477. 4479, 4481, 4483, 4485, 4487, 4489, 4491, or 4493; and 

(d) comparing the level of expression of said one or more polypeptides in the 
treated cell with the level of expression of said one or more polypeptides in an untreated cell, 
5 wherein a change in the level of expression of the nucleic acid in the treated cell relative to the 
level of expression of the nucleic acid in the untreated cell is indicative of an agent which alters 
the level of expression of the polypeptide in a cell. 

9. A pharmaceutical composition comprising an agent identified by the method of 
claim 29, 30, or 31. 

10 1 0. A pharmaceutical composition comprising a nucleic acid which includes a 

nucleotide sequence which hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470 
or a sequence complementary thereto. 

11. A pharmaceutical composition comprising a polypeptide encoded by a nucleic 
acid which includes a nucleotide sequence that hybridizes under stringent conditions to one of 

15 SEQ ID Nos. 1-4470 or a sequence complementary thereto. 

12. A pharmaceutical composition comprising a polypeptide having the sequence of 
one of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481, 4483, 4485, 4487, 4489, 4491, or 
4493. 

13. A pharmaceutical composition comprising an antibody which binds to one or 
20 more polypeptides having the sequence of SEQ ID Nos, 447 1 , 4473, 4475, 4477, 4479, 448 1, 

4483, 4485, 4487, 4489, 4491, or 4493. 

14. A method of determining the phenotype of a cell, comprising detecting the 
differential expression, relative to a normal cell, of at least one nucleic acid which hybridizes 
under stringent conditions to one of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 4482, 

25 4484, 4486, 4488, 4490, 4492, and 4494, wherein the nucleic acid is differentially expressed by 
at least a factor of two. 

15. A method for determining the phenotype of cells in a sample of cells firom a 
patient, comprising: 
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(a) providing a nucleic acid probe comprising a nucleotide sequence having at 
least 12 consecutive nucleotides of any of SEQ ID Nos. 1-4470, 4472, 4474, 4476, 4478, 4480, 
4482, 4484, 4486, 4488, 4490, 4492, and 4494; 

(b) obtaining a sample of cells from a patient; 

5 (c) providing a second sample of cells substantially all of which are non- 

cancerous; 

(d) contacting the nucleic acid probe imder stringent conditions with mRNA 
of each of said first and second cell samples; and comparing (a) the amount of hybridization of 
the probe with mRNA of the first cell sample, with (b) the amount of hybridization of the probe 
1 0 with mRNA of the second cell sample, wherein a difference of at least a factor of two in the 
amount of hybridization with the mRNA of the first cell sample as compared to the amount of 
hybridization with the mRNA of the second cell sample is indicative of the phenotype of cells in 
the first cell sample. 

1 6. A method of determining the phenotype of cell, comprising detecting the 

15 differential expression, relative to a normal cell, of at least one polypeptide encoded by a nucleic 
acid which hybridizes under stringent conditions to one of SEQ ID Nos. 1-4470, wherein the 
polypetide is differentially expressed by at least a factor of two. 

1 7. A method of determining the phenotype of cell, comprising detecting the 
differential expression, relative to a normal cell, of at least one polypeptide encoded by a nucleic 

20 acid which hybridizes under stringent conditions to a sequence selected from the group 

consisting of SEQ ID Nos. 4472, 4474, 4476, 4478, 4480, 4482, 4484, 4486, 4488, 4490, 4492, 
and 4494, wherein the polypeptide is differentially expressed by at least a factor of two. 

18. A method of determining the phenotype of cell, comprising detecting the 
differential expression, relative to a normal cell, of at least one polypeptide selected from the 

25 group of polypeptides of SEQ ID Nos. 4471, 4473, 4475, 4477, 4479, 4481. 4483, 4485, 4487, 
4489, 4491, and 4493, wherein the polypeptide is differentially expressed by at least a factor of 
two. 

19. The method of claim 16, 17, or 18, wherein the level of said polypetide is detected 
in an immunoassay. 
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20. A method for detecting a mutation in a test nucleic acid which hybridizes under 
stringent conditions to a nucleic acid of SEQ ID Nos. 1-4470 or a sequence complementaiy 
thereto, comprising 

(a) collecting a sample of cells from a patient, 

5 (b) isolating nucleic acid from the cells of the sample, 

(c) contacting the nucleic acid sample with one or more primers which 
specifically hybridize to a nucleic acid sequence of SEQ ID Nos, 1-4470 under conditions such 
that hybridization and amplification of the nucleic acid occxurs, and 

(d) comparing the presence, absence, or size of an amplification product to the 
1 0 amplification product of a normal cell. 

21. An isolated nucleic acid comprising a portion of a nucleotide sequence of SEQ ID 
Nos, 504-1 103 or a sequence complementary thereto. 

22. A gene which hybridizes to one of SEQ ID Nos. 1-503. 

23 . An isolated nucleic acid comprising a nucleotide sequence which hybridizes 

1 5 under stringent conditions to a sequence of SEQ ID Nos. 1 -503 or a sequence complementary 
thereto. 

24. An isolated nucleic acid comprising a nucleotide sequence at least 80% identical 
to a sequence corresponding to at least about 15 consecutive nucleotides of one of SEQ ID Nos. 
1-503 or a sequence complementary thereto. 

20 25. An isolated nucleic acid comprising a nucleotide sequence of SEQ ID Nos. 1-503 

or a sequence complementary thereto. 

26. A nucleic acid according to claim 25, fiirther comprising a transcriptional 
regulatory sequence operably linked to said nucleotide sequence so as to render said nucleotide 
sequence suitable for use as an expression vector. 

25 27. An expression vector, capable of replicating in at least one of a prokary otic cell 

and eukaryotic cell, comprising the nucleic acid of claim 26. 

28. A host cell transfected with the expression vector of claim 27. 
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29. A transgenic animal having a transgene of the nucleic acid of claim 25 
incorporated in cells thereof, which transgene modifies the level of expression of the nucleic 
acid, die stability of an mRNA transcript of the nucleic acid, or the activity of the encoded 
product of the nucleic acid., 

5 30. A substantially pure nucleic acid which hybridizes under stringent conditions to a 

nucleic acid probe corresponding to at least 12 consecutive nucleotides of one of SEQ ID Nos. 1- 
1 103 or a sequence complementary thereto. 

31. A polypeptide including an amino acid sequence encoded by a nucleic acid of 
claim 25 or a fragment comprising at least 25 amino acids thereof. 

10 32. A probe/primer comprising a substantially purified oligonucleotide, said 

oligonucleotide containing a region of nucleotide sequence which hybridizes under stringent 
conditions to at least 12 consecutive nucleotides of sense or antisense sequence selected from 
SEQ ID Nos. 1-1103. 

33. An array including at least 10 different probes of claim 32 attached to a solid 
15 support. 

34. The probe/primer of claim 32, further comprising a label group attached thereto 
and able to be detected, 

35. The probe/primer of claim 34, wherein said label group being selected from 
radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

20 36. An antibody immunoreactive with a polypeptide of claim 3 1 . 

37. A method for determining the presence or absence of a nucleic acid which 
hybridizes under stringent conditions to one of SEQ ED Nos. 1-11 03 in a cell, comprising 
contacting the cell with a probe of claim 32. 

38. A method for determining the presence of absence of a polypeptide encoded by a 
25 nucleic acid which hybridizes under stringent conditions to one of SEQ ED Nos, 1-503 in a cell, 

comprising contacting the cell with an antibody of claim 36. 

39. An antisense oligonucleotide analog which hybridizes under stringent conditions 
to at least 12 consecutive nucleotides of one of SEQ ID Nos. 1-503 or a sequence 
complementary thereto, and which - resistant to cleavage by a nuclease. 
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40. A test kit for determining the phenotype of transformed cells, comprising the 
probe/primer of claim 34, for measuring a level of a nucleic acid which hybridizes imder 
stringent conditions to a nucleic acid of SEQ ID Nos. 1-4470 in a sample of cells isolated from a 
patient 

5 41. A test kit for determining the phenotype of transformed cells, comprising an 

antibody specific for a protein encoded by a nucleic acid which hybridizes under stringent 
conditions to any one of SEQ Nos. 1-4470. 
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Figxire 1 

SEQ ID NO: 1 . GGTACATTGAATTACAAAAGGATCCAAGAATATTGAAATAGTTACCAAAAAA 
ATTTGATATAGAAATATATATGGTTTArrAATGAATAAGATCTAGCAGtGGCOTACTATAAm 
AACACAGTAGCGTGGTGAGCATAAATATTTCAA,CTTATCTGCAACAATT!(?^ 
ATAATCAGTITAATTGTGACAGTCACATTTACTGCn:AAGTACTGTGATTGATTGGCT^ 
rrTATTAGAAGTTATTGAAAGTAGATGCCAGGCATGOfGQCTCATGCCTGTAATC 
GGAGGCCAAAGTGG<k:AGATCATTTGAGCCCACGAGTTCGAGACCAGCCTGGCCAACATGATOAA 
GCCCTGTCTCTACAANANAAANTACNANTTNGNANANAAATGTTO 
CNAAAGGGCGAAATTCCAAO^ACACTTGGCGGGGCGTTTNTAANTGGTTNCGA^ 
AGCTTTGGCGTNAATCATGGNTNATN 

SEQ ID NO: 2 GGTACTCAGTATCTTGACTGACTTGTTGGAAATACACT TTAG A TTATTTA C^^ 
• GGAATATATTCAGTTAATTGAGCAAAAGTATATGTATTGCTGTGGAGATTATTTTGCm 

ATTATATITATAACTTAGTGAAAAGGTAGAGAAAGTA1TACT€}AAAAAATTACATAT^^ 

GACTTGTATTTGACTTTTGCAITTCTAATTTAATTCTTGm 

TTGCCATCCTCTTTATCTCTGTTT(m'ATACAAACATTTTC^^ 

AAAAQAAGCATAAGTAATTAAAGGAGCAAAATTCTGGGCCTA ATAT TATTGCTCCTT^ 

ATAATCTITTTATTTAAATTTTTOTGNGGCCCTGGATrm 

TANNNTATCATTNCCCCNGGCTKCNTNAAGGTTCNCCCNGATNAAAAAGGTh^ 

NCANAANCTNTTAANATTCCCCCT^^S^TNGGNN^^S^^^ 

TNNCTNNNC 

SEQ ID NO : 3 GGTACAATTGTCTTTTCTGGGACACTCACTTCTGAAAAGAAGGCA GGAA TTG 
GAAGGGCTGAAAAAGGCATCGTGATGAAATCCACGTCCTGCCAAGTrGTACTGTAAAGrnTAGT 
CCCGGCAGTCAAGGCCACAATAGACAAAAATAAAGTCAGGAGGGAAGCCCACTGGATCCAGTTT 
AGACGCCTCITCAGCACTATCCTGAATAGAAGAGCTGTTGTTATAATGCTAAAATTTGAGAAG 
ACAGCCATGGCTGGTTGAAGATAGGACAGGACATAGAAGACAATCAAGTTATCCAGGAAATAAA 
GAAAGGCAGGAATGGACCACTTCATGAAATCAGAGAArrCCTTCCAGGAAGCATAmCAAAm 
CTACTTTGATGATCTTTCTTTATTACNCAGAATGGCCCCAGCCCCCNGNAAACT 
TTTTGACCCCCCNTTNCAGTAGrrGGAANAAAACAAACCTGGTTTCTTCAT^^ 
N 

SEQ ID NO: 4 ACTT ll -i'inil"14" i4 " l " l l 11 i Tri' ll ' lll r il ' l ' l TT^^ 

TOTGTCNCCCAGGCTGGAGTGCAGNGGNGTGATCTCAGCTCACTGCAACTTNTGCCTCCTGGGTO 
AAGCAATTNTCATGCCCCANCCTCCCAAGTAGCTGGGAATACAGTTGNGTGCCNCCACACTGGCT 
TATTTTrrGNATTTTTTTANNAAANATGGGGTT^ 

TCCTCAAGNGATCTGCCTGCCTTGGCCTCCCAAAGNGCTGGGATTACAGGCATGAGCCACCGCAC 
CTGGCCTACTTATCCrGTTAATGAAAATATTTGATTGGACAAAACATCrrATTCCTT^ 
TGNGCTTTTGGAAAATGGTTrrACCTACTGGANAANTT^ 
TGAANAGTTCCCCCCNTTCTNAAANGGGANAAAAAAAAATTTTNTTTAATA 

SEQ ID NO: 5 ACGCGGGCCCGGTGATGCCATCCTCACAGTGTTAAGTGACTTGGAAGATGAG 

accatgaaaagaacaagtgctggtggcactcccaggcaaagatccccaggctggaggagttggc 

gcctgaagaaaaccagagcaaaacctcaagggtcagagggcctgggcat cggtg cagggctcac 

ttgagactgaacacgttccccagggaagatctgtatgcttctaaagaacactitrggcca^ 

gtgaatcatgcctgtaatcccagcacmgagaggctgangcaggtggaatgcttgagctcagga 

gttcgagaccagcctgggcaacatagcaaaaccrrgtctctactaaaaatcaaaaaaattagccc 

agatgtggtgggtgcctggctgtagtcccagctacttangaaactgangtaggaagatcactttg 

agtctgggaaaatgaagcttcaatgaaccttgantacacccgnttgacttcancctgagtgacng 

aaggagaccttgtttgaaaaaaaaaannnaaaaaaaaagtcccttggccggaacaccc 

ngaattccncccactgggggccgtctaangganccacttggaccaacttgggnaaaatgggataa 

TTN 

SEQ ID NO: 6 GGTACTGCGATTACAGGCGIGAGCCACCGCGCCCGGCCTGACTTTTGATTTTC 
TCACTGTGTTCriTTGGTATTGTAAAAATAGTAAATGrrAAAAAAAAAAAAAAAAG 
AAAGAAAATOTACTTTACAAAAAGTAANTTACAAAGGACTACTACTTSTATCTTTATGTTCA^ 
TAACAAAATGGGAAACNCANTTAGGATGCrrCCATAGCCAGGGAAAGGCAATGGTNANAArmC 
CAGCTGAGGTTTTGATCTGCTAriTTCCACATTAAAAGTNAAAAAANTCAAAACAATGTCT^^ 
NAANCAANAAAATANTCCOTCATTCNTTTTGATTATATNNANAAGNGATTAACCNAAAG^^^ 
ANTITANTAAATAATNAAAAACATTTTNAANAGNNGGAACCRrrCTTAAT^ 
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GCANAAOTATGANATATTTTNTCTTATTNGAANTAAACCCACAA 

SEQ ID NO: 7 GGTAC r il'r i 'i'rilU'riTil l 'i'il- i 'J'ri U ll i'rrCGGGGTTCTGAGGACTCACAA 

mACTGNGGCATACAGCAATTTTTATCCATTGTCCTAAGCNCTANATAACCTO 
AACTCAAGTTTTTCAAGCATGGAAAACTTTTTGAAGNAATTC^ 
TGTTTTCTCTTTTTGAATATAATATAGANAATTGAGATGTCAAA^ 
TAGTTTCrrCCCATTTGANATGGATGCTTAAACANATCACTGGGGNCTTNAGTC^ 
GAAAATACNCATTAAAGGCCATAAATTACCTTCCAAGAACCTTAGCTATCATCTTATATAT^ 
NACATAGTATTTATTCTTGGCAACTTGTTCAAAACTAGTTTCCATTTCT^ 
ATCTNCTTTGAAAATTGAGTTAGTAGGCCATATTTCTTATTTTATTT^ 
GGGAANATNNGTGGATGNGCANGGTTTTGTTATATTAAAAAAAACAGTGNNCCCTO 

SEQ ID NO: 8 GGTACGCGGGTTTTCTTCAGTTAATAATAAAAAGATGGCATTGACATGGTTA 
AATTCCCAGGATCCACAATTATTTAGAAATGGACTAAATGGCTGCTITCATTGC^ 
GTTGAATrrGATGGAGTTTACATTGAGAAACAAAGTTTATATTTm 
CACAAACrnTrGAAGTTCCTTCATATTTTGAATATTAACTAOT 
irrrCAATTCTCTGGGTTGTCTCCTCATTTTGTTAATTGTTrrCOT 
TTGATGTAAGTGCCACn'GTCTAGTTrTGCTTCCATTGCCTGTGATT^ 
TTATTGCCCAAACCAATGTCAAAGAAGCATTTTTCCCTATGTTTTTC^^ 
CAGGTOTACATTTAAAGTCTTTATTTTCAGTTGATTTGTGTAA^ 
CCCTGTGTGGATATCCAGNTGNCCCACNCC 

SEQ ID NO: 9 TTCCAGCACACTGGCGGCCGTTACTAGTGGATCCGAGCTCGGNACCAAGCTT 
GGCNAAATCATGGTCATAACTGTCCCNGNOTGANATTGNAATCCGCGNCNCAATTC 
TACGAACCGGAANCATAAAGGTTAAAGCCTG 

SEQ ID NO: 10 AClUT17'i"ll'rr r ri n "17'r i lU'i"llTm'lTrCNATTCTTCATNAATATTNA 
GCNCCTATTATGTGCAAGGCACTACACTAGGOsfCTGGGGAANATNCAAANATAAATNTGAC^ 
TGCCCTCAAANAGCTTACAGTNTAGTNTAGGANCATACAGTCT^^^GGAAAAAAT^ 
AACTAACCTCCCCCATCCCACCCCCACAAAAAAAAAAAAAAAAAACCTACT AAAC TNG 
CCATTTACTTTTAGTTTANCAGCTTCACGTAAAAAGCATAAATNTGAAAGTNT^ 
CnriTACTGGTAAANAAAATTCATrrrCNTTAAAAAAAATGCTGAATN^^ 
AATITTNAGCCGGGCACGGGGGCTCACACCrGTTNTCCTAACACTTTGGGAC^^ 
GATCACTTGAGGTCAGGANTTTGAAACCACCCTGNCCAACATAACAAAACCCCGTTTKTACT^ 
AATACAAAAATTACCCGGACGTGGNGGCGGGCCCCTGTAATCCCANNTACTCCGGANGCTOGNGC 
AGGAAAATCNCTTTGAACCrGGGGGGGGGGAGGGTTGCATAAGCCAAAAANCGCNCCNCTT^ 
TTACCTGGGCGTAAAAACAAATTTCCTTmANAAAAAAAAAAAi^^ 

SEQ ID NO: 1 1 ACri'ril-ri'i-l^rrTTri' rri 'iUUlil U TGGGGAGAAATAAAATTAGCGAGAT^ 
TGAATAGGACAACTGAATTGCTCTATrmAATTTCTCTTTAAAGGGGT^ 
AATATTATCAGTAAGGACrn'GTITCCTTTGGCCATTGGGAGrrAANANC^ 
ATTCTTNTGGGATACTCCCCTCTGACCrCCACANAAGTAAATTTTNTCCTTGAGGAGGCTACTT^ 
AAATTCAGCTAATGCANAACAl'GGGGTTCAmGmAAAGGOTGCAGCCCTAGGGCACA 
AACGTATCTGTTCTTCCAATTAATTTCTGACAAGCTCAGGTGATGAC^ 
ATGGTATAGGCCTTGGTrACCTTANAAACCATCTCTTTTCITAGG^ 
CCCACAGGCAGTTCTGCAGAATATTTTCAAAAACTGAATTTGGAAATGGAGGACCCTGTO 
AAAAAAAACACAGNCTCAATTTCTTCTATCAATTCATTTAGCTATCATCCT^ 
ATGGGTANAATCCCTATCAAAGGGTITNCTCAACTGCANGTAAAATTCTA^ 
CCCACGTACCTTGGGCCGGGNACCNCNCTTANGGGGGAAATrCCAACCNACCTGGGGGGG 

SEQ ID NO: 12 GGTACCNAGCCTGNAACCamrCGCTCCAAGNTAGCNGNAGCAANCCrGGG 
NGGNGTGGACCATANCCNCATCAAACCNGGGGCTAATAGTTCTCTACCTATGGGGCTGCC^ 
CACCACGGGTCTNCCV^TGTCmGACATAGCTGGTCCNGATGGNNTAAACTATGC^ 

SEQ ID NO: 1 3 GGTACTrTTTCTTTCTTTCTTTTC^ 

TCCTCTCTCCCTCCCTTGCCCCCTTCCNTTCTCTGNCCTTTCTTr^^ 
CTCTTTCCTTTCTTTTCTTTCTTTCm 

TTTCTCTCTTCCrrCCTTCCTCTTTCCCTCCCTCCCTCCATTCCm 
TTCCTTCCTITITGAGACAGGOTCTTGCTCTGTTGCCCAGOCT^ 

TCACAGCAGCCTCCAACTCCTGTGCTCCAGCAATrCTCCCGCCTCAGCCTCCGGAGTAGCTGGGAC 
TACAAGGCATGCACCAACATGTCTGGCTATTTTITrTTT^ 

CTCTGGTGCTCANGCTGGNCTTCACCTTTGGCCCCCACTGGATTCTTCCCCTTTCGGCCTO 
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ANGGCTGGGGATTTANACGAAGTGGAGCCACTGGGCCC^IACCCAACArrTTrGm 
GGGGAAACAAATAAACCACTTTT 

SEQ ID NO: 14 ACTTTTTTnT r r ri l'J'r i ' rJ 'l I i gggtttgtgtgtgtgtgtgtgtgtgtgtgtg 
tgtgtgtgtgtgggacaaggtctcactgctgcccanactgagtgctgtggcacagtcacagctcac 
tgtancatcaacctcctgggctcaagcgatcctcctacctcagcctcctgagtagctgggaccaca 
ggtatgcaccantatgctcggctaatatttttatattitgtaganacagggctt^ 
aaactggtcttgaattcctggacacaagtaatcttcctgtcttggcctcccaaagt^ 
caggtgtgagccaccgtgcccagccagaattggagatttttaaatacanaaattctcaagtgct^ 

GCCCAANAAGGGCTACATTTTGTCAACTTITCTGGCTGCTGGAGCAGGTCA^ 
CCTGTCAAGAGGTGGTAAATNrCCGCGTACCrCGGGCGCACCACGC 

SEQ ID NO : 1 5 GGT ACATTTCTATTCCCTTCTG AAGTTT ATTAACTTC AGTTGT/^ 

TTTTACTTCTAAGAAGCCCAAGATAGTAAGATITGTCTAAAATTTGCCTCT^ 
CAGTTTTCATTCCAGTCAAACCTTGTTGTGTTACAGGGCGATGGGCAACmGAm 
GAACTCCACCAGTCCCCATGGGACAACCACGANAACCTGGTCTTGCTGTCCCAGGTGGCATTGCA 
GTTGCCACTCGAATATTTCCTGATAGGGGTCGTATCCCAGAAGGAGGCCTTCCTGTTAAC CCAA CT 
CCACCTCTTGAAACAGGGCGAGCTGCTGAAGATTTGTGATTGCTGGCCATTGTrrCTTO 
TCGCTCCGCCCGGCGCCGGAAGGAGGGCATTNNCCCTNACTCGCGGGTGCCTTACCTmm 
GCCCACGCTCAANTTTITCTCGGCnTrCAGGCCrCCCCGNGTACCTGCCCC 
AAAGGGGCGAATTCCACNACACTTGTCNGGNCGTTACTTA 

SEQ ID NO: 1 6 ACGCGGGGGGACGAGGATATTTCATGCA AGArn TCATTGCCCAAGATCCAA 
GCAGTCACGAAAGTGACTATTCTTTTCATCCTCCTGTGAAGGCTTTTTGCTGTTGCT^ 
ATGTITITCArrGCATGGGTGTGACAACATAACCTTTTCCCATCrGAACAACAC^^ 
CTATGGGTCriTrCGrrGTXjCrAGGAAGATCACGAAAGCAGGTGTCAGCAGGAATCCTGCAG 
CATCAGTATCTTTCATGGGACCATAAATACATTTAATGGGTTTGAAAAACTCAAGTAATA^^ 
GGAAAATGGATTGATTCCCTAmGTCACCATITGTTTATGTATnATTGATGTCAAGGAA^ 

SEQ ID NO : 1 7 ACTNNACANCNCATTCAAANGGTTTAATTNTTTNAA^^ 
AACrrrAAATAAATrrAACrnCTOCCATTGAmmGTCAGGACAA^^ 
GCTAGTCCTACA^^^GAGCTATGCCCTGANTGACANACACCATATTNACAGGCAAAAT 

SEQ ID NO: 18 GGTACi"llll"n'ITTTTTTTTTTTTTTTT^ 

TGGGCCGTTTCCACACCTGCCCmATTGGTCTNTTITAACAAANGGGOT 

TTTTAAACACCACCCATNAGGGNTTANGAAGGGGCCATNATTTTTNGAAG 

AAATTTTNGAGCCCAAATTNAAT 

SEQ ID NO: 19 GGTACrCGGGTTGATTCCATTCCATTCCATTCCAATCCATGCCATTCCACTCGT 
GTTGATTCX^ATTCTnTCCATTTCATrCAAGTTGAATCCATTCCTT^ 

CCTGCAGTCGGGTTGTTTCCATTCCATTCCATTCCATTCCCCTGCTGTCGGGTTGTTTC^^ 

CCATTCCTTTCCATTCCArrCCATTCCATTCCATTCCATTCCGTTCCATTCCATTCCAT^^ 

TTCGGGTTAATTCCATTCCATTCCATTCCATTCCATTCCATTCCATTCCAATCGAGTTGATTCC^ 

CArrCCArrCCATTCCATTCCACTCCATrCCAGTCCTTTCCATTCCATTCCACTCGGGrrGATO 

ATGTATTCCTTTCCATTCCATTCCNTTCCATTG^^^ATTAGAANTNGATTCCATO 

ATTCCAATTCATTCCATTCCGGATGAATGCCATTCNCNTTGNATTCCATTCCCATTTCCATT 

SEQ ID NO: 20 ACCACCAGGCACACCTCAGTCTTCTTGACCCAGAGCCTGAAAAC TGTTT TCAC 
TGGGTTCCACCAGTCCCAGCAAAATCCTCnTmTATTTATTTrGCTAAGT^^ 
ACATCTCATGATTGATATAATACCAAAGrrCTATAGCCTTCTCTTGCAGTATTTGGATTT 
ACCGGGAAAACTGTTCCCATTAGGCTTGTTAATGTCAGAGTGACACTATTATGAATCXIT 
CTTTCCTCTGCCTGTTrCTTCTCTCTTTCrCCTTCAAACTTGCT 

CmCCCTGAGGCTTTGGGGTCAGAGTATATGTTGTTTGGAGAAAGAGGGCAATC AGGA CTCTTCT 

GGGACCCAGATGAGTrCTTCACTAGCCCTTCTGAACCCCTTGCrCCATAAJTC 

CTCTGAATGACCCTGCAGGTCATCATGGNTrrCTTTTTTTATTGNT^ 

CTCACTCTGTCACCCANGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGNAACXn'CTGCCTCC 

GATrrAAGCGATTCrrCTGCCTCAGCCTCCCGAGTAGCTGGGGACTACAGGTNGTGCCCCACNCCT 

GGCTGATTTTTGGATTTTTAAGAANAAAATGGGGTTTNACCA^ 

SEQ ID NO: 2 1 A Crn T r TTT n ' m run 1 1 1 1 1 1 i n T T ITCCCNNAANCAACAAGNGnTAT 

TGATCACCTACTGNGTGCCTGGCACTGTTACAAATAGTNTGGGGGATACAAANAGGTNTAGGATA 
TGGCCCCCNCCCNCCGAAGGGTTTACAATNTACTTGNGANATCGGACNCNCNCNCNCA/^ 
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ATCAATCAAAAATTGNGAATGCTAA 

SEQ ID NO: 22 AATTCGcccrrAGCGTGGTCGCGGCCGAGGTAcrri-iTi"ri"rri"riTrrri-rri'i' 

TTNTTTTTNAAANAGGGGAAACATGTTTATTTTTAT^^ 

AAACTNrnGAANATTAACAAGGCAAACTCACCANANAAAANAGTGTCCNGNT^^ 

GAAAGATGGNCCGGCnrrATTmGACNCCCGTTKGTATTGCTGNATC 

NTNCAACTTGCCOTCTTTTTTTTO 

SEQ ID NO: 23 ANTTCGCCCTTAGCGTGGCCNCGGGCGAAGTACAAATCACTTGNTAGGCCTC 
AGTTTCTGCNACTOTGAANATCACTAGATTGCACTAGCTNGTCTCTAAA^ 
TACTTNGCACTGAACAGAATCTAGGGTGTATGATATCTGTTTCANCTAAAGGCT^^ 
CTA 

SEQ ID NO: 24 GGTACGCGGGACACCAGCTCCGAAATCACCACCAAGGACTTAAAGGAGAAG 
AAGGAAGTTGTGGAAGANGCANAAAATGGAANAGACGCCCCTGCTAACGGGAATGCTAATGACG 
AAAATGGGGAGNCACGATGCNTGACNATGAGNGTTTACCAAA 

SEQ ED NO: 25 CACCCTTACCGTGGTCNCNGCC^IAGGTACTGGATGAAGCTGACCAAATGTTA 
ANCCXSNGGATTCAAGGACCANATCTATGACNTATTCCAAAAGCTCAACAGNAACACCCANGTAGT 
TTTGCTGTCNCCCACACbn^CCTTCTGATGTGCTGGAGGGGNNCAAGAATNAGCTGACTGGACCCC 
ATTGCGATTCTTCTCACTGAACCGANmCAGACCCTNAACGGTGTCCTGCCAGC^^ 
AATCGAACCAACGGCGACGTGGAAGAGGNGGGCACNCmTATTGANGCmTCTTANAC^^ 
ATCACTNGACTCAGTlWrCTTCGGTCAACNCCCGTGAGGAAGAGCGGNCATGCACrr^^ 
AGAGCNGTNGCTNCCGGGATTTCCACNGTATTCAG 

SEQ ID NO: 26 ACGGAATAGAATGGAATGGAACGAATTGTAATGGAATGGAATGGAATGCAA 
TGGAATGGAATGGAATCAACGTGAGTGCAGGGGAATGTAATGGAACXjGAATGCAATGGAATGGA 

atcatccggattggaatggaatggaatggaatggaatggaatggaatggaatggaatcaacccg 

agtgcaatggaatggagtggaatggaatggaacaaccanaatggaatgaaatgtaatggagagt 

aagggagttgaatagaatcaatcggaatgtaatggaatggaatgcaacggaatgoaatggaatg 

gaatggaatggaatggaatggaatggaatggaatgatacggaatagaatggaatggaacgaaat 

ggaatggaatggaatggaatggaatggaatggaatggaatcgttccgagtggaataggagggaa 

tgtattccantgnaattggaaaggaatggaatcaacccanagtggaatggaagggaatgggaat 

ggaatggaacctaatagaatanaatcncccnacaggaatttaattggaaangactggatgga^ 

tggaatgggaatggaatcaaactccattggaattggaaatggattccncntccctngggcc^^^ 

seq id no: 27 acttititttttttttr^^ 

caggctggagtgcancaaagtgatctcagcmaccgcaacctntgcctcccgggttcaag^ 

nttotgcctcancctnttgagtagcrgggactacaggojcnccccccatgcctg 

atttttagtaaanaaggagttrcaccatnttagccaggatgatctcnatcrcct^ 

cacctgcctccccggncaatggcatgatcactgcttactgcaacctgaaacctcctg 

atcctttcncgantggcmanactacaggagcncacccancnggcccagttaatt^ 

TATAA 

SEQ ID NO: 28 AATTCG(XCrTAGCGTGGTCGCGGCCGAGGTACTGGAGGATTTCATATATCA 
GCTATTGACTAATTCCAATCAGTGAATTAGCTTAGCCTAGGTAGATCCAATTAGCCAGTTATCT 
ATAGCAACTGCTGATTCATCAAGGCTGTTCCAAAATAGTGGATGTGAGCAGCACATGCOTACACC 
AAGAAGGAGAGCAGGGCCCTGAAATACTAACCTGGACAGAAAGCATCACGTGGCTTTACCATCAG 
CTTAAACTCTCCACTATCAGGGTTTAGTTTTGCCTGCTTCTAAGNCCTAA^ 
ACTCAGATCANC>rmACTTTTTANAGGTNmGAGTGGAGCNNM^ 
TA 

SEQ ID NO: 29 CNGGGCGATTGGGCCCTCTATATGCATGCTCGAGCGGGCGCCAGTGTGATGG 
ATATCTGCAGAATTCGCCCrrGTGGTCGCGGCCGAGGTTCTTTGGCCTCTOTGGGATAN 
TCAGNAGGCACACAAACAGAGGCNTTTNCANANTTNAACTGCrcA 

AGACANGATGGTGNGCCACNGTNCGNATTNATTTCCACCnTGGTNCCTTGNCACGANC^ 
GTTTGTATAANCTNATCATCGCTTTAATAANCCCT 

SEQ ED NO: 30 NGCATGCTCGACGGCCGCCAGGGNGANGGGTGATGGATATCTGCAGAATTCG 

cccttagcgtggtcgcggccgaggtacttcattataagtaaggtgtctctaaaagggacagatc^^ 
ctagacccctccttaaccaagtanccagtcctgatatcattaatgggtgatggacaaactaata^^ 
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TTCTCTGCCCGCAGATGGGCTGAGGNTGGAAACTCNCACCArrGTCTTCm'GCANGTGGTCCCCGG 
CCAAACGTTTAAGGCTGGATTTTTAATCCATNGGAAGACATTTTTCAGAC/^ 

SEQ ID NO: 3 1 ACriU"riTlUlU"l-lU"lUlU'riTITrTNGANACGGAGTCACCTAGGCTGGCATGC 
AGGGATTTGATACTGGCTCACTGTAACCTCAGCCTCCTAGGCTCAAGCGATTCTCCTGCTTCAGCC 
TCTTGAATAGCTGGGATTTCAGGCATCTGCCACCACTCGTGGCTAAATTTTGGATm 
ACCGGArrrCTCCATCATGGCCAGGCTGGTCTCAAACTCCTGATCTCTGGNGATCTGCCCACTTCG 
GCCTCCCAAAGTGCTGGGATTACAGGCATGAGTCACCGCCCCCGGCCTCATTGAAAAAmATm 
TNAATACCNAACTGGATTGGTTNTTTTGGGGCAAAAANCGTTT^ 
AAGTATTTTTTTTNAATTCANANAANNTTTAA^ 
TAGTTNNAATA 

SEQ ID NO: 32 ACGCGGGACTCAAAGAGACTAACAGTATTGTAAATTCTAAGCTCTGTAAAGA 
AATTCCAAGTTAGTTTAACTACAGAGCTACAAAAATGTCACAGAAAATTGTTCCTAGTGGCA^^ 
TAAAGAAATAAAAATTATTAGGCCAGGCACGGTGGCrCACACCTGTAATCCTTGCAC^ 
GCTGAGGTGGGCGGATCACCTGTGGTCAGGAGTTCAAACCCAGTCTCTACTAAAATACAAAAATT 
ANCCAGGTGTGGTGGCCATGCCTGTAATCCAGCTACTTGAGGGGCTGAAGTATGGANAATCNNfTT 
TGAACCCANGANGlGGAGGCmG^AmAAGCCCAAAATGr^GGCCm^GTATT 

SEQ ID NO: 33 AATTGNGCCCTCTAGATGCCANGCTCGAGNCGGGGCGAATTCGAGCTCGGTA 
CCCGGGGATCCTCTANAGTCNACCTGCAGGCATGCAAGCTOACNTNTTCNATTGAGAAGCCCAAA 
CAGCKITGNNGGNCATCATNGACCTGGCTNCCTCCTGCACTGAAAAA 

SEQ ID NO: 34 ACGCGGGGGCNGTGCTGTTGGGAGTTGCTTGGAGGTNGGCGGCGCNGGGCTN 
AANGCTAGCAAACCGAGCGATCATGTNGCACAAACAAATTTACTATTCGGACAAATACAACTACN 
AGGAGTTTGNGTATCNACATGTNATGCTGCCCNAGGACATATCCA 

SEQ ID NO: 3 5 ACAGCACTCCATrTACACAGAGTAACCCCACTCTTGATTAATCTGTTCTAAAG 
TGCCAGTATTATTTACACITITriTrrTT^ 

GATTGTCATTCCANCTCTATTATCATTNACATTNANCAAGGGAAATTCmATAATO^ 

TCCCTGGTCCCNGAAGGTITACNTNGNCATrGGCANCNCTAAAKrGGNGAAC™ 

GGANCNTGAAAGNGGAGNCNAGGTANTGGCTGTTCAAAGG 

SEQ ID NO: 36 ACCAmrTATTTAGTGTTGTAGGAAATGTTGGGTTACTTCTTAAAAACGA^ 
CAAAGAAATTCAAAAGTCCCAAAGAAAGAAAACAGGAAATAATAATTCTATAATCCAAAAAC^^ 
TGGGCGATCCTTCAATNGGAGGAANANGGCGTCANTTAANTAGCTCACACTGTANATNTGGANAC 
ACCATATGGANATACGGAGTTAAGNTNGGGTGGATACTAGGAATTAANTTCTCCCCCTAANGC^ 
TAAATNTTTCAGNCTTGANAGATNANTNGTAGTTCTAGAAAAANANATAAAGTT^ 
NGTGGGAGGGAAGGACGGCNTGGC 

SEQ ID NO: 37 ACTlU"lUUU"i"l'lU"l"l"l"lUU"l"illU"l"l"lU"ll'NGANACANAGTCTTGCTCAGTTGCT 
CGGTTGCCCAGGCTGGAATGCAGNGGCACAATCGCAGCTCACTGCAGCCTTAACCTNTGCGGCT^ 
AAACGATCCTCCCATCTGTTTTTATTCTGTAAANATGGTGTCTCACCATGTTGCCTGGGCTGATCT^ 
AAACTCCNGGGCTCAAGTAATCCTTCCTCCTTGGCTrCCCTAAATGGTTGGATTACAGGTC 
CACTCTGCCTGCCCTGNCAAGTCnTrmCCATNAAAAACTTTTTATC^ 

SEQ ID NO; 38 TGAGAGGAAGTTCCATCGCCTAGGTTCTGGGAGAAGCAATACGTCACAATCC 

ccactaaggagagggctcaggcaaaagaggagagtgacattgcctagggcatgggcccagagtt 
atatcacaatgaacatgtggacagggccaacgcagaagtgitaaatgacctgtgtgctgggccca 
gagatatgtcacagttacntaatgggcaaagcccacatcagataaaagaggcatctcaaatatg 
ttgtaagttgatgagccctgagatatgtcacaatgtcccccccgaacagattcaaggcaggagat 
tcacatcacctcggtgctgagcccaatgaaatggcacagtgtctcctgagtgcagggccaaggca 
aaagagcaacaccaacttggtattgaggccaaccatatgccacaatccactctgggagcagttgt 
caggcaggagaaaagagtcacaacacctgggttatggcccaatacatatgttacaatcttgcc^^ 
tgggcaaagcccaggttgagacaggagaatcacattnaa 

seq id no: 39 acn-cagcctggtgacagagggagagtccatctcaaaaaagaaaaaaaaaa 

ATGAGTAAGGCCTTGTCAGGAGCTCAATTCTTATCCTTTTTCCCCTAATGGT^^ 

GTGCTCATTACCATCTTCCTACTCAAl'ini'rin'rTiTCTCATCCATAGTOAAAT OT 

ATCATGAACCTAGTTTrrCX:ACACTGGCCAGCCCATCTTTGCCr^ 

TGTAAACAATTAGTCTGGCTATGGTCTAGmAAGGAAGTAATTTGAACACGAATCCTCCAAAGTG 
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GCTACATTTGTTCrn'CTACTGTCACATATAAGAGTGATGAAGTGCAT^^ 

AAATTTGGGGGCATAGTTCX^ATGTCAGTAAAAAAAAATCCATGACTAAGTAGATTAAAATTATGA 

ATAAGAAAAATAGCAACACCGTATTTTAAAATCTACrrCCTATGCCTCATTGCTCATT^ 

AAATTATCAGTAGATATrTATAAGTGAGGNATCTTATATTGGTTACTTA 

SEQ ID NO: 40 ACGCGGGCAACTACGCTAAAGAATnTGAGAACACCAGTGTGTCTACATTCA 
GTTTGCTGTCGTGGTTAGGATTCTTTTCAGCAATAATTCTGCCTTGACTAGCCAGTCTACCTTTC^ 
GTGCTTGCAGTCTTAATGCTTTGTTTTTTTGTTm 

ATCCCTACCAAAGTTCATCTTATGCCrCTTCAAATCTACATAGCTTATGTCTCCAAC^ 
CCATTCAACTAATTAACTATTAAATAAGCATTTTGTTTCCCTTTAGCT 

GGCCACAACAACAGGTCTCTGACAATCTCCAAATATCCTTGGGTTTATCACATCATCCTATATm 
CAACAGGGA(nTGGGCTCTACCAAGTATTCAGTATAAAT(nTrGTAAAGTAAAACATGGCCGGGT 
GCAGNGGCTCATGCCTTGTAATCCCAGCATTTTNGGAGGCTGATATGGGCAGATCGCTTGAGCCCA 
CNGAGTCAAGACCAGCCTGGGCAAAATGGGTGAAACCCCATT 

SEQ ID NO: 41 ACTAAGGTTACAGCTGTTCTGTTGGTCCTAGGCTCTGAGTAGACAGAGCCAA 
GATACTGCAGTCACTGGGATGGAAAGATGGAGTGCCTCCTTGGCAGTTTGTTTCCATGGGGTTAGA 
AGTTGTAGCTGCTCAGCTGCAGAATGGTGTGCTGCCATTAGTAGGTGTGGTGTAGTGGCAGTGAA 
GCCTAGGGTATGGGAAGATACAGTGGCTATAGACCCCCAAATGAGAAGGCACCCTAGCAGTGGCT 
TCAGTCTCAAGATGCCATTACACAGCAGCAGCTTGGATAATAGGGCAGGAGGAGACACAATGTGG 
GCTCCTTGTTTGGAGTAACATAGTCATGTGAACTCCAGGCAACCCCTCAGGCTGGGCCTAAGG^^ 
TGTGAGGACTACAGTGATCTCCATGAGCCAAAGATTGTGGGTGTCCACATTITAATmGAT^ 
TGAAAGGCCTTCCTGCATACCTTTTCTNTTGTAAGGAGAGTCCGCCTTGGCTCTTGACC^ 
ACTAGGANAGAC^JAGATGGTNAAAGCAAATTGTTCATTCCCrmTTAT 

SEQ ID NO: 42 GAATTGGGCCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATC 
TAGCATAATTTCCCCTTAGCGTGGNCNCGGCCGATGTACTGAAOTATACTNGTCCNATGCT^ 
AATTCTTTGGAATTTTATTACTATGNTTNTTCTAAGAAGAGGTATGNACC^ 

SEQ ID NO: 43 ACGCGGGGGACTGAGAACAGGGACAGGCGACCCGACCCCCAGGGCCCGGTG 
CTCAGGACAGAGTAAAAGGCCAAGCTATGATAGCAACTGGTGGAGTGATAACTGGCCTGGCCGCC 
TTGAAAAGGCAAGACTCTGCCAGATCACAGCAGCATGTCAACCTCAGCCCGTCTCCTGCTACCCA 
AGAGAAGAAGCCCATCAGGCGCCGGCCCCGGGCAGATGTTGTGGTTGTTCGTGGCAAAATCCGGC 
mATTCCCCATCTGGTTTTTTTCTTATTTTAGGAGTGCT^ 

GTTCTTGGATATTGGCCCAAAAAGAACATmATTGATGCTGAAACAACACTGTNAACAAATGAA 
ACTCANGTCATTCGGAATGAAGCQGTGTGGTGGTTNCTTTTTTANATATT^ 

SEQ ID NO: 44 ACTGGTGTGGAGTGAAGCAGGGCCACTTCTATGGAGAGACTGCAGCCGTCTA 
TGTGGCAGTTGAAGAGAGGAAGGCAGCGGGTGAGTCTCCAAGGACAGGGCCTGCACCCCTCAGA 
CCCAGAGGCAGGACTTCCTGAAGGCCCCTGCCTGAGAGCTTCTCAATCAGTGCTGGCCCCTATGTT 
TGGCTGTAAGAGGCTGAAAGTGGAGAGTGGGAAGGGAGGGGACATITAGGTCCTATATAGCCTCG 
TTGAGCCCTTCAAAGGGACATCTCATATAAACATACCAACTAATTAAAAATAGTGGGTTTGCTATT 
TAACCTCGGCATTNAGACAACTC(XACTGAATGAGTGGCTCACACCTGTATCCCATCACm 
GGCCGANGTAGGCANATCACTTAGGCCNGGANTTCAACACATCCTGNTGACATGGAGAAAACT 

SEQ ID NO: 45 ACTri"il-ri-lTTlTi'n"lTl''mTTTTGGNTAGTTTCTATGACT ATGT CT 
rnAGTAATCXriTCTrCTGCAATGCCrrAATTGCCTTT^ 
TGTTGTTGTAGATGAAAAAN 

SEQ ID NO: 46 ACGCGGGATATGCITGCAAATTCAATTTAGTTAAATTAACACAGTCITr 
TCTCTAAATATTGATCTCCAGGTTTGAAAACTTTCTCTAAATAATATT^ 
GCCAArrCATAGTCATGATATCTTCAAATATCCTAACCTTAT CANC AAAATCTTTTGr^ 
TTTAGCTACTTATTTAAAACTACATAATGTCTTTTTmCC 
NTTTTTTTNCTTGNGCTACNAAOATGTTTNCCT^ 
GGGTGGNTTA 

SEQ ID NO: 47 ATGNGGATTGGGCCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGA 
TATCTGNTGGAATATCCGCCriTCGAGCGGNCGCCCGGGCAGGTACAAAAGGAACnTGC^^ 
CNCAAAAAGCATTTCACTTGAACCTCTGCTTAAAAAACTGNTTCC^ 

AACCANTAAATGTTAATNAAANCTACAAGATTTATGGCTCTGAGAGAAATATACTGAhrrGAT^ 
TCNTAANTATCCACAAATACCKATTAAATTGNAATGTTTAATACTATATNAT 
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SEQ ID NO: 48 ACAAGTAGAGAATGTCTTTACTITrTCCAACCACTGCCATCTCTTACTCT^ 
GGATTTTTCTTCCCAGCCCTCAGATGGAAATGCAGAAATCACCTGTCTTCT 

GGAGCTGTAGACCGGAGCTGTTCCTGTTCGGCCATCTTGGCTCCTCCCTGCCAGATAGCTCTTAm 

TGAGCTCTGAAGGCTGAGAACTTCAAGATCAAGGCACCAGTAGACTCAGTGTCTTATGAGTGCTA 

CTATTTGOTCAAAGACGATGCCTTCTTGCTAAATCCTCTGCAGAGAGTAACAGT^ 

TAAACAGCTCCTTGCCTATCTGGCTGCCCAGCCAAGCTGGGTCCTACACAAGTTGCTATAAGA^ 

TTCTACAACAAGATCTGACTACTGAAAGATTCAGATCACTTGACTTTTAGCTTGA^ 

TATTAGTGGATAGCGTTGTTGGAATTCAAAAACAACATATTGCATAGAAATGNGTGAAGATTGCA 

TTACCTTATGGATGATAACAAAATGCAAATCTCAAGGNATTTTAAACAGTGTNAATCCA/^ 

CTGAACTCAGGNGTTTCTAATGTTCTATGGATTACITGCGCAGTGAAGGAGG 

SEQ ID NO: 49 ACil7"l'l'll"lU"rrri7"l'l"lUin-ri"ril'i-l'l-i-l'GCCAATATTTAAAAATATAATTG 
ATATITGTATGTTGACCTTGCATCCTGCATNTTTGCTAAACTCACTTTAGTTCCAATAGTT^ 
ANATTTTAGGGGGATTTTCTATGCAAACAATCATGTTGTCTGTGAATANACATGGCTTCA 
TCAATCTGTATGTCirrTATTrCTTTTGTCTTACTGCAANGGCT^ 

TATGAGNGGGGAAAGNGGACATCCTCAGTrrGrrCTTAATCrrACAAGGGGAGCATTTAGTCATTC 
ACCTTTGAATATAAAGTTAGCTGCAGCTTTTTTGNGGATCCCATTATGAGG^^ 
TCTAAATTNGTAAGAGTTTTGATCATCAACAAATGTAAATTCTATCAAATGGTm 
ANAGANCAAATGCTTTTCTTCTTATCTGN 

SEQ ID NO: 50 ACATTTACATTCTGTAAGAGATTGAGCCTGAACTCTCTTAGTCATAAAAACAT 
CAAATGGCCACATGTCCACrA(XAAGCTTCrrCTATGTTAAAAAAATAATAATAA^ 
CCTGAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 5 1 ACGCGGGGTATGGGGTTTCTTTrTGAGGTGATGGAAATGTTCTGGAATTAGAT 
AATGGTAATGGTTGTGCTArrTCATGATTATACTAAAAACTGACTTTTAATAATCCACTAA^ 
ATTGTATACTTTAAAATGTTGAATTTTATGCTATGTGAATTATATCTCA^ 

CAACATCAGAATCCTAGAATTGGAAGAGNCCCCTGAGAATTGTGTGGCCCAAACTTCAAGTCTTG 

GCAGTTGAGAAATTTTAAGGGTATTGCCAAGAAATNrrGTCnTAAAAATi^^ 

CTCTCTTGCrrAOAACCCAAAGAGACTTGAGAAGGACAGCTGGCNTTAAAA^ 

AANACAGGGTGGCCANACCATGCANCAGT^S^GCCTGTCAACAT^^'GAGACCCrrTCTTCATAAAT^ 

AAAGAANAATACCTGACATGGNNCAACTGCCTATATNCCGCTCT 

SEQ ID NO: 52 ACGCGGACACAGGCAGTCACTAAAGGGATGGCAAAGACAGAAAGAAATCTT 
ACTGTTTTATGTAACTAAATGGTTACAATCCATTACATCCATATTTTCAAGATAAA^ 
TTTTTCACATTmATGACTAGAGGTAGGTTTTACATTTTGGAGTCAGGTGACAGA^^ 
TCCTCCTAGGAAACCAGGCGATTGAGTGTTATTTCCCATTTCAGAGATGGCTTTCAGGTCC^ 
AAACATAGCCAAGTTATAAGACTAGAGACTTAAATTTATATACATTGAAAGGGAGGGAGAAATAA 
AATCTGGAAAAGAAAAGGGGAAAGGGACCTCTTCCCTTTTATTTTCAGCA^ 
CTTATTTrrAATrrGATTTGCCrrACXjTCAACAGCAGTArrr^ 
GATGTAGCGTGATTGCAGTAAAATATGCCTTGATCTCACATACACTTTGTCTAGO 

SEQ ID NO: 53 ACTNTGTGATCTTGCTGAAGACTACAGGCAGCCAANTGGTTCCAGATACTTC 
AGCTTTGTGTATCrTCTNAACTTNATATTAATATAAGNTTCTNAAGAA^ 
TNTTGATTTAAGGANAAAAAATAATCANAATGAATTTNTTGCATAAACNT^ 

SEQ ID NO: 54 ACGCGGGGAGGCTAGCCAGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTT 
TGGGAGGCCAGGGTGGGTGGATCACTTGAGGTCAGGAGrTTGAGACCAGCCTGGCTAACAAGGTG 
TAACCCCATGTCTACAAAAAAAAAAAAATTAGCCAGQTGTGGTGGTGGNCACCTGTAATCCNAGN 
TACTGANAAAGCTGAGGCAAGAGAATTGThTOGAACTGGGGAGGNGGCTm 
NANGTTGAA 

seq ed no: 55 accactttgoaaatgcactgactctttaaaagccacataaatgttcagcca^ 
aaattcaaaattnttatgncttaataaaaatgattccctcccatctcaaatcat^^ 
ctotcatt^cngnaacctgntncncaaanccotacnttxsnttatg^ 
tttttatcttcattcitggtoggaaaanaaaaatctattattg 
ac^mtttoga^ititccttatccctttactttcanaat^ 

gatnataccaccccttgggaagggggggngckttggccancc^raaancaaccncatggggnang 

GGCAGNCTA 
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SEQ IDNO: 56 accagcggatcgcancgttgccatgaaatctgtgcatttcaaggcccgtttg 

GGATGTGGGTTAACTCAGTGGTTCTCAGGTGGGGGAGGTATCATGATTTACCCTCCAGGGAATACT 

TGGCCATGTTTGGAGACCTTTTTTGGTTGTCACAGCTAGGGGAGGGGTGCTATTGGCATCTG 

GTCAAGGCTAGGGATGCCGNTGAATATTCTACANTCCACAGGACCACCCCGCCNCAAAGAAGGNT 

CCCANCTGATATGTNAGGCTTGCCACNGGNGGGGAAACCCTNATTAAAGTATTAAAAAm 

CTACAmmmTANTNCCNCGGGG<nTITGTTm 

SEQ ID NO: 57 ACAACCACTATGGGGAATAGTTTGGAGGTTTCTCAAAAAACTAAAAATAGAG 
CTACCATAAGATCCAGCAATCCCACTGCTGGGTATACACCCAAAAGAAAGGAAATCAGTTTATCT 
AAGAGATATCTGCACTCCCATGmATTGCCAACACTATTCACAATAGCCAAGATATGGAAGCAAC 
CTAAGTGTCCATCAACACATGAACAGATAAAGGATATGTGGTATATATACGCAATGAAGTATTAT 
TCAACCATAAAAAAGAATGAGATCCTGACATGAGGTCATTATGTTAAGCAAAATAAGCCAGGCAC 
AGAAAGACAAACATCACATGCTCTCACTTATCTGTGGGAGCTAAAATTAAACAATTGAAATCATG 
GAAATAGAGAGTAGAAGGATAGTTCTAGAGGTTAAAAGTNCTCGGCCGGACCCNTTAAGGCNATT 
CCACNCCTGCGCCGTACTATGGTCCACTCGGACAANTGGGGNATATGGATACTGTTCCTGGGAAT 
GTNTCCGTC 

SEQ ID NO: 58 ACGGCCAGGGCTATTTNTTGAATGAGTAGGCTGATGGrrT CGATA ATAACTA 
GTATGGGGATAAGGGGTGTAGGTGTCCTTGTGGNAAANAAGTGGNCTAGGGCATTTTAAATCTTN 
NANCGGAAAGCOTATAOTCACTTGCNCCCGCTCATAAGGGNTTGNCCriTGGC^ 
AGTOGGGGGGTTGCGTGTAATTNAATGA 

SEQ ID NO: 59 ACCTAGAAGAGAGGCGNTTCAAAGAAGTAGTGAAGAAGCATTCTCAGNNCA 
TANGCTATCCCATCNCCCTTNNTTNGGAGAAGGAACGAGANAAGGAAATTANNGATGATC 
GAGGAANAGAAAGGTGAGAAAAATGAGGTAAATCmTATTGATTGATGAAAANCCAAAAA^^ 

SEQ ID NO: 60 CGCGGCGAGCTATCNTTTGAATANTGAGACAGAAATNAATCAATATAGAGGC 
TGTGCACGGTGGATCACGCCTGTAATCrCAGCACTTTTGGGAGGCCANAGGCAGGTGGATCGAGA 
CCATNCTGGCTAACATGGTNAAACCCGGTCmACTAAAAAATACAAAAAT^ 
TACCGGNCACCNGTAT 

SEQ ID NO: 6 1 ACGCGGGATATCAATAATGGGTCTGATATAGACTGAGGATTCATATTAACTC 
CACATGCCTCCAAAAAGGCAACCTAGAGTCATGACTAATACATGGAAATTGGTGCCTCCACCCGC 
AGCTGACCCTTTGGTCTCnTAAGAAAAGAAACTAGAACTTTTTAAGGTCT 
TTTTTTTGTTTAGTAAGTATTTAGCAAATATTTTTG/^ 
GCTACATGTTTGAGTAAGGATGTAATTGTAGCTTCCACTTGCCIWCAAC^^ 
TTTTACTTACAGGGTOAAAAACATGTATACAGTCATCCCTCTGTATCTGNGAAGG^^ 
GGACCCTTCACGGATACCAAAATCTGNANATGCTNAAAGTCnTrGACATAAAATTGGCA^ 
TNNCATATbnsrACrTATGC>mNTCTCCTATATTAChW 
AAANNCAATGTAAATAGTTCTTTTAACTGGCATTTGGNTTAAGGGGAAC^ 

SEQ ID NO: 62 ACGGGGGAGACTGTGGAGCANTTATTCAAAACTCGGAGGGAGTCGGCATGG 
GAGGATCCATATAATITCACGCTAAATTGTGa^CGTCTGTITGTGAAATGTGAAGGNGCACAT^ 
TTTTCCTGGAAGGCAAATTTCATTTlsrrTAT^^ 

TGCTGTTGTTANAAACAANGACAATCATTITGANGCAANAAATGATGGTTCCAACNAGGGAGGGA 

GTAACCATGGATATTGCTGAAATGCAGTTGGTGCCAGGGATTTATTANGACATGATTAGTTCTGNA 

ATCATCCCTAANGTAGCGATGAAGTCTCNCTATGTTGCCCAGNCTGATCTCAAACITCCGGCTCTA 

AGTGATCCTCCCACCTCATCACTCCCAAANGTGCrGGGAATTAAAGGCCTGANCX:CATITGTO 

CAAACCTNAC 

SEQ ID NO: 63 ggggactgagaacagggacaggcgacccgacccccagggcccggtgctcag 
gacagagtaaaaggccaagctatgatagcaactggtggagtgataactggcctggccgccttgaa 
aaggcaagactctgccagatcacagcagcatgtcaacctcaqcccgtctcctgctaccc aaga ga 
agaagcccatcaggcgccggccccgggcagatgttgtggttgttcgtggcaaaatccggcmatt 
ccccatctggtttitttcttattrragoagtgctcatctccattatagg 
ggatattggccccaaaaagaacattttattgatgctgaaacaacactgtcaacaaatga^ 
ggtcattcggaatgaangcggtgtggtgggttcgcn'ctttgagcacattrgcat^ 
gaaaatgcttgcccattcaccatggggattggcattttcattttcatttgtgct 
tgaaaaccgtgacaaagagaccaaaatcataccatgagggatatctttcccagtcattgacat^^ 
acacgctaagaataaggagcaaagcaaatg 
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SEQ ID NO: 64 ACGCGGGGCGGTCGAAAAAGAGATAAAGTTGAAGGAAATAAAATTGGCACG 
GAGTCTGGGAAAGTAGTTTCCCTAAAGGAGTCTTAGAAATAGGGTTGGTCTGGAACCTAAGGGGC 
GGAGCCGACGCGTAGAGCCGCTTTGCGCGTGCGCATCACCTAGGCGGTTAGATTTGAATACTTCAC 
TGAGGCGAGCCGGGCGTTGNGAGCGGACTGCTAGAGGCGGCTGTCTGTTTCCGCTCTAAGGAAAC 
TCAGAGCGTGTGGACCCCAAACAAm'CTGCGCAAAATTTGTCGAGGAGGTTTGCCNCGGCAGAA^ 
AGATTTCTTCAAAAATGGATGGGGTNGCCTTCAGANGCTTATTAAGAANNTN 
CCCGTTGAAGACGGCTrrNTTCAOTATTNANCCCNAAGGAGTTCTCTCATG^^ 
TGAACANTTCANGAACCNATGCTTGAGAAAAAGAAAANTCTNTCGAGCAGCTITra^ 
GNTTCCAACGAGTNCTTATAAATAC 

SEQ ED NO: 65 ACGCGGGCACCACGATGAAAGGGCACTGGCAATGGGAATGGCATCTATAGT 
GTTGGCATTCTATATTTACnTTCArrATCATTTAGAAGGATCCTTCTAATCAAm 
ATTTGCTGAACACATAGTAGATACAAGGTATTGTGCTGTGGGGGfrGTGAGGGTAACAGCGTGTCT 
TCrrCCTTAATAACAAGCrrAATANCATTAATAGNGTGAATTACTATTTAGAAGGA 
AACNCAATGGGAGGCTGAGACACAANAATCACTTGAACCCAGCANACGGAGGTGGCAGTGAGCT 
GAGATCTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCTGTCTCCAAAAAAAAAAAAAA^^^ 
GTCCTGCCGGGCGGCCCGCTCGAAAGGGCGAAATTCCATCCACACTGGCGGGCCGNTACTANTGG 
AKTCCAAGCTTCNGTACCAAACNTTTGGNTANATANATGGNCCATAGCTNGTTTCCCTGTGGNN 
AAITNGTATCCNCC>WAAATrTCCTCACAACANTACTAGCCCGGAA 

SEQ ID NO: 66 GGGACTTTTTTTTTTTTTT^^ 

GCCCAGGCTANAGTGCAGNGGCGTGGTTTKTGCTCACTGNAACCTCCATNTCCCAGGTTCAAGCA 

ATTCTCCTGCCTCAGCXTCCCAAGTAGCTGANATTACAGGCACCrGCCACCACGCCTGGCCi^ 

TTGTNTTTTTAGTANANACAGCGTITCACTATNTTGGCCACNCTGGTCATGi^ 

GNGATCTGCCCNCCTNANCCTCCCAATGNGCTGGGATTATAGGCGTGAGCCACCGCACCTGGCCC 

ANrTAACTTCTAAAAATGATAATGATCATGGCTCAATTTGGGGTGATACTTO^^ 

GACATTT^n>^GGTAACNCANGCTTTGNTGTGACAGCCTGGAAATGATACT^ 

CAGGAAAATATTTNCTATTTTNCACAAAANACTTTTGTTTGm 

GCmrrATTGGTCITGACTGGGAQTGCAAGGGGNAATGATCTNAAN^ 

SEQ ID NO: 67 ACTGAAGTCAAAAACAGCACATGGGCCTTGCAGCATCTGGGGTGCAGCAAGC 

agggtgaagggggagactgccttagaatggagggtggcagctccaaggaatgggaaagcttctc 

aatccctttggtctaagtgggaaaacaaaagagctggcagcaatctggggtgagttagggcccag 

gagctaaaaagcaggccggtttcatgataacgagcitcttttrrgtgtagga 

ttatctgcaagacccacagggacctcagcaatgacacattgtgggatagattgggaagccagatt 

aggtaatcacagagctggagcacctcrctaagaaccagctggggctcrgcaggggtgtgacaagt 

gctcatccggatgagggactcacagagacaactggcacattaaacagattgcactgtcatcttcct 

gacagcacgcccacaaaggaccatgctcagctgtcatcttcaaagtgtgggagcagcttcccccc 

aaccctggcangaggccacaagatccaagtngggacccagcctcnacaagng 

SEQ ID NO: 68 ACl"rilU"111ni'i"il'l"1114"il-i"lUTrTNGGGACGGAGTCTCACCGTGTTGCCCA 
GGCTGGAGTGCAGTGACACGATCTCAGCTNACTCCAACCTCCGCCTCCCGGTTCAAGCGA^ 

tgcctcancctcccgagtagctgggaccacaggcgcacatnaccatgcccggctaatttttgtnt^ 
ttcgtananacggggattcaccgtgttanccaggatggtcttgaactcctgacatnacgcagtcca 
cccgccttgggctcccaaagtgctgggattacaggctggagccnctgcacttggccatttt^^ 

TrrTTrCCTGAAACAAAGTNTNACCCTrm'GCCC>n^GC^ 
TANTGCAACCTTTGGChnSfCTGGGTTNAAANCGANTTTTCCTGCCT^ 
ATTACANGCTCAATGCCNCCACCCCCAGNTAACTTTTGTTTT^ 
CTGGTNTTTNAACTCCTNGCCCTGGAANTTNCCCCACCmGG 

SEQ ID NO: 69 ACTTAAAGTAATGGTGATCCnTATTCCAGGGCTTCGCCGCCAGGATTTOT^ 
TGCTGTTTCTTGTeGTATTTGTANATTTCATCGATACTCTGAGCTTCCTGCATGTO 
CCCANGGTTANTCTGGGCCATACCGNTNCACNGACGCCGTCTTCGATCGAGGAAACNGA 

SEQ ID NO: 70 ACCACCTTCTTAACACAAATGATTTAATTTAACCATTAAGTCAAGTCTGCAAT 
ATCAATATCATCTTGCTCATmACAAGTCACAAAAATGAAGTTCAGCAACCTTAGTm 
GGTATTAAAACCTGAGGCTCGTCAACGTTAAGTCTGTGTTCCTGCCCAACAGGCAGTATAGTCTTA 
CAAAGACATTCTTTCTCTTCCAGTCTCTGAATATTCTGAATTATCACAAGTTAAGT^ 
GGTCTTCTCTCCCTTCCTCCTTTCATCCATCTATCCATCrmCATO 
. GCACTGCAGTGAAAATAAAAATGACTTTTACTCCCACCATACTGCAGGGAGCTATGACT^ 
TTAGAAACTGATCTTCCTGGGACnTITCCACCTGTATATATTTCTC^ 
TAAGGAACCAAGGTTTTTTGGCTTTACTATTGAACATrTTAAATCATAi^ 
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TAATGGATTCTAATTATTTAAATAAAAGCCTGGGCCGATTATTTGGATGAAAACATAm 

ATTmCCCAAGGAATGGNAAAGAATATACTGGAATAAAAAACAAAAATCCCCTAAGGGA^^ 

AGCAGGGTAAATTTTATTGGAGGGGACATGCCGAANAAGGTGTTArrrGGGCTTCCA 

SEQ ID NO: 71 GGCATGCTCNAGCGGCCGCCANAGTGATGGATATCTGCANAATTCGCCCTTT 
CNAGCGGCCGCCCGGNCAGGTACGCGGGGCCAGGTCTCTNACTTCTGGCTTGTTCGCTGGTGGCG 
GTCNNANCCGAGCCGGACTGGTCANAGATGATCACGGACGTrCAGTTCGCCATCnTNNCCANCAN 
GCTGGGNAGTGTGNTNTTCTTGCTTGNCGTTCTCTATC 

SEQ ID NO: 72 AcnriTmrv n m u rn t gaaaanatgaggttttgccatattgcccagg 

CTGGTCTCAAACTCCCGAGCTCAAGTGATCAGCrrGCCTTGGCCTCCCAAAGTGCTAGCATTATAG 
GAATGACCATGGCACCCAGCGGTAAATGTTTCTTTTCANACTTTTAAAGGTC^^ 
GCNGTGGTGGCTCATGCCTAGTAATCCCANCAKmGGGAGGCANANGTGGGCANATCTGNTCCC 
CTTNCTCCTTCCAAGTTTTT 

SEQ ID NO: 73 ATCTTTATATTATTTNCTTAAATTGATTGGGCCCTCTAGATGCATGCTCGAGC 
GGCCGCCAGTGTGATGGATATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCACGANGTACGCGG 
GGGTAGGGAGGGGGACCAGTGGCAGAGGGACCTTAGGTGATCCTTANAAATAAAGGCTAGTTTCT 
GTTCGACCTTGGAGTANGGCGGAANAGGTGTANACAGGTCTGGAGAANCGAGGTAAAACCTGAG 
TAAAAGCAAGAAGTTGGAGAATATGAGATACATCTCATCrCrrTAAATACTTAAATGACTTCC 
CCTCCCGGAGTNTATCACAAmCGGNGATNNANNTGACNGACGTANGTGAANACNCTG^^ 
ACTTACANACTAAACTTG 

SEQ ID NO: 74 ACCAGCCCAGAGAGGCTCTCTGCTACCTGACTTTCACTACTCTATGGTAATGT 

gcaatttctcccgcaactgaactacaacagaagtttaaatgtctagccrac^^^ 

catgcctaaactccacaggaatgagttgtcttttactatgtgagaagtcaaatgtaatgttggcaa 

ataccagtgtgagagctacaagtttcaacaaaagcagcacactgtatattggcagtcaaaatcac 

ttcgctctattctgcattaggtcagccaaacgcttcctgtcaagactcagtaactctcaccatacat 

ttttgtctatrcattggtctatgggcaggaatgtcttataaagcacacattactaagtgc^ 

acatcaccctattccttaaaaaaattctataagaagtarrattntcatitagagattaaa^ 

agctcccaggccttaccaattttcaactaataactaattaacagggctggg 

ACCrmCTTTAAATNTGGTrTTTTCATTGGTTAGG 

SEQ ID NO: 75 ACTNTTTTTTITTTITITI^^ 

ArrcNCTTTTGATGCATAATCATTTTCAATTTTGAGGAAGTCCAATATATATACTT^ 

CTTGNGCrmGGNGTCATATCTAAAAAACCATTGTCTATTCCAAGGNCATGAANAT^ 

TGrmCTTCGGAGTmATACrmAGCTTACATTTAAGTT^ 

CACATGATANAAAGTAGGGATTCAAGTTTmrCTTTNGCAAATGAATATC^ 

ATTTGGNGAAAACCATNaSfTTTTTCATrrGAGTGAGCGGGGCNC^ 

ATATATCTATGGGGTAATTTCTTGGACrmGCITTATTCCANTGATCTACNGGGCT^ 

AAAATNNCNO^GNTTTNTNCTGGAKCTTACTAAGm 

TTTTCAATTAAAAGAAAATNTGG 

SEQ ID NO: 76 TTTCATTTTTGTANGCATrGGGCCCTCTAAAGCATGCTCGAGCGGCCGCCAGC 
GGGATOGATATCTGCAGAAATTCGCCCTTAGCGTGGTCNCGGCCGAGGTACTCAArrTCrGTGGAA 
CTNTGNTATCCrGNGGAACACACAACATTACAGTGGAANCrrCTGTGAAAGATGCCAAT^ 
GTATGGAAGACTGGTCCCAGAACTACCATATTTGTAAAATCCCTGGAAGACCTTATTOT 
ATTCNNTTACAACAGCTOGTAANCCCANCATGNCATTACANNGAGCANCAGCTACGGCCNNTGCC 
TACACACGGTTTTAACCANNGCAATGAATGCNCTTGCTGNACnTCTTA^ 
OCCNTAAAACAA>rmGNNATTTThnsrrATGGAAATAAGGTm 

SEQ ID NO: 77 ACTTAAAATGTAGTAGATTCTATGCCTATGCATATTTCCCAAAATTGTAAGTG 
AGAATTGGAAATGCAACTCCCAAATCTTCAAATCTGAAAAATTATTCAATCTCAA^ 
ATACTATATATAATAACTTATACCTCTrCCCTCCTGCGACmCACAT^^ 

AGAGGAGACTAAAGCACCCTCCAGGCACAATTTACCACAAGTTCCCAAATAAACCTTTCACCAAC 

TCACTATGACAAAGCATAGATGAGAGAAATAACAGCATGCCAGGTTCAACCGAGTCCATAGGTGA 

GTGTAGCrGCTCANTAAGTGTTGGTTGATTAAAATTAAGGGTGACTNAATCCATGCCCAAACT^ 

TCCTTACNAAATGCCCCATAAAATTTAAATTrTAGAAGAGrrAGTAAAAGACTTCT^ 

CTGCATGGAGATACTACACAAAACCAGTCATNTTAATTCCTACANCCTTCANAACAAAAGA^^^ 

AACTGAAAGTAAGATCCCTGTNATTATGTTGATCCTGGCT 
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SEQ ID NO: 78 ACCTITITmCTTATTTAAAGCACAAGAGGCCCATAAATOT 
AATTCTITITmGATACAAGTTTTCAGAGCAAOAGAATAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAGTNCNCGGGGTCITTGAAATAAAACT 
TTTCTGAATGTATTGGTGATTCACCCCCAAGCTAATTTTITAAAGTCAT^^ 
CTCATGAGAGATGTrGAANTATGTNTTTAATCAAGAGTCATGATTTCAAACTAGTT^ 
AGCAGmNGGGNTCTAATTTNATGGGNAAATGGGGGGTTTGGTNAAANNTCTGAi^^ 
CCTTAANGATATNATAANTNATTTGCCNNTAATATGTCTTAAAGCT^n^^ 
TTGAGCCNAAAACANTTNATTNArrAAAAACAGTCCTGNCCGGGCGGNCGTTOT 

SEQ ID NO: 79 AClTi"lUU"lllUUU"ri"lUll"rriTTTNGANACATGGTCCCGTTCTGTCACCCAGG 
CTGGAGCATAGTGGCACAACCACGGTTCACTGTAACCITAACCCTCCCAGGCTCAGGNGATCCTCC 
CACCTCANCCTCCCAAGTAGCrGGGACTACAGGNGCNCACCACCATGCCTGGCTAACCTTTTCAA 
ACCCCTGGCCTCAAGNGACCCACTCACCTTGGCCrCCCAAAGTGCTGGGATTACAGGCGTC 
ATGGCGCCCAGTCAAGAACJl"J"iUll'AAACAAGCCATTTAAGAGTGCCTGCTGCTTAAAACAAAAA 
AAAAAGGGCNTACAATTTCAATTATGANCAAATTTTGAGNCCTAAGTAAAANA^ 
rrTTTTTAAACCAGCTTACArrGTTThnTrAGAACAATGAOT 
AAAACmGAGANANTAGAAAAAATNTTTTTTAAAGGAT^^ 
AAAAGTTCNCTTTGTOTNAAACTTTGAGGAATGGATTATAGGGGAGAA 

SEQ ID NO: 80 ACGCGGGGATTCCTGAAGCTGGCAGCATTCGGGCCGAGATGTCTCGCTCCGT 
GGCCTTAGCTGTGCTCGCGCTACTCTCTCTTTCTGGCCTGGAGGCTATCCAGCGTACT^ 
TCCATTCCAATCCATTCCAATCCATGCCATTCCACTCGTGTTGArrCCATTCTTTCCATO 
GTTGAATTCTTTCCArrGCATTCCATTCCATTCCATTCCCCTGCAGTCGGGTTGATTCCATTCCATTC 
CATTCCATTCCATTCCATTCCATTCCATTCCATTCCTTTCCGTCCGTTTCATTCCATTCCATTCCAT^ 
CTATTCGGGrrAAATCCATTCATTCCATrCCATTCCATTCCATTTCCAATTCCAT^^ 
NTTCCATTCTATTCCGAACCTCGGCGCGACCACGCTAAGG 

SEQ ID NO: 8 1 ACAGTTCCCATCACGTATGTCAGTTTTGTTGTATGCAGCAGAAATGATACCTA 
AACTAAAAACAAGGACACATANAACAGGAGGTGCTGACCAAAGTCTTNAACANGGAGAGGGAAG 
TAAAAAAGGGAAAGGAAAGAAAAAGAAGTAACCTATTATCAGCATCAAAGTATGTGGTACCTGC 
CCGGGCGGNCGNTCCAAAGGGCAN 

SEQ ID NO: 82 ACGCGGGGACATAAAATNTNCTTTAACGCATTTAAATAAACAGAAATCATAC 
NAAGTATGTTTTCAGACAGTAGCGTTAAATTAGAAATAAGTAACAAAAATATAGCTGGAAAACT^ 
CAAATATTTGGAAATAAATATATCTAAATAACTCATGGGCCAAAGAGGGAGCTAGATGTAAAAAC 
AGAAAATCTTTTGAACTGAATGAAAATAAGTGAAACTTANATTATACCTTC^ 
GAAGAGAAAATTTAAGCCTAAAGCAANGCAGAAGAAGGGGAATAATAAAAATTAGAGCAGAAAT 
AAATGGAAATTGNNAACAGAAAAAATAGAGAAAATAANATGGAAATTAAANGrrAGGTATTTAT 
AAAGATCANCAGACTGGTAAACCTGTAGCCAGGCNCCAAAAAAAAAAAAAAAAAAAAAG 

SEQ ID NO: 83 ACAAGGAAAACTACAAAATATGTATGAAAGAAATTGGAGATGACACAAACA 
AATGGAAAGACATCrrATGCTCACAGATCAGAATAATTAACATTGTTAAAATTGTCATAATACCCA 
AAGAAAmGCCGATTAAATGCTATTCCTATCAAAATATCAACAGCATCTTTCCCA 
AAATGATACTAAAArrCCTGTGGAACCAAAAAGGAGCCCAAATAGCCAAAGCAATT^ 
AAGAACAAAACNAGAGGCATTACTTTCCTTATTTCAAATTn^ 

CGTGGTTTGGATTTAAAAACAGACNCn'CCCATGGAACAGAATAGATACCCAGAAATO 
ATTrCAGCCAAACTGATTTTCACAAAGGCATGAAGACATACATTAGGGAAAAGACCCCTCTT 
TAAACGGTGTTAGGAAACTAGATITCCTTATCCAGAAGAATGAACTGAACTCCTATCT^^ 
TACAAAAGCCAACTCAAGATGAGATTAAANGCTTAT 

SEQ ID NO: 84 ACQGCCGGGCANGTACNCGGGACCATACCATATCCCACCAGAGAGTGACTCC 
TGATTGCCTCCTCAAGTCGCANACACTATGCTGCCTCCCATGGNCCTGCCCATGAGTNXm^ 
GCTGCTTTC 

SEQ ID NO: 85 ACGCGGGAGGCGTGAGCCACCGAGCCCGGCCACAATGTGTTTATATACACAA 
AGGAATATTATTCTGCCTTAAAAAAATAAAGAAATCCTGACATATGTGACAATATGGATGAACCC 
GGAGGACATTATGTTAAGTGAAATAAAGAAGGCACAGAAATACAAATACTGCATTATTTCATTTA 

caccgtagaatctaaaatagtcaaaatcatagaanaaagtancatggtgamccctgcgtgtta 
attaccaatcttcccatataggtaattccatatggcacaactgcccttacttgnagagccaccnca 
ccctgtgncatgaactgnaggtgtncttccatggttgcttgcccaotcagggcctgtatct 
acactaccttaca^^itcncaactgngcmgntttntttg^^ 
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AAANG 

SEQ ID NO: 86 ACGCGGGGGAAAATGGGGAGCAGGAGGCTGACAATGAGOTAGACGAAGAA 
GAGGAAGAAGGTGGGGAGGAAGAGGAGGAGGAAGAAGAAGGTGATGGTGAGGAAGAGGATGGA 
GATGAAGATGAGGAAGCTGANTCAGCTACGGGCAAGCGGGCAGCTGAAGATGATOAGGATGACG 
ATGTCGATACCAATAAGCAGANNACCGNCGAGGATGACTAGACAGCAAAAAAGGAAAAGTTANA 
CTAAAAAAAAAAAGGCCGCCGTGAOn'ATTCCCCTCCACTTCCCGTCTCATAT OT 
CCTTCNANTAGAGANGCCCCCCNCCCCGTGGCTGTCCCCC^^^^NTTTANAC^^ 
CCAANCnTGATATTTCNCAGGGGANGAANANACCAAAKmCAGGNCTIOTT^^ 
GNCNNANN(>ICTNTGC>JATTTCTNCNTTGCXjCmTh^^ 
TGTTATTTTNTTTTTTT 

SEQ ID NO: 87 ACTCTGTATACACACATGAGAATGACAGTGACAAAGGCAAATAATGTCTTAG 
TATTAAAACCTGATCACTCAACATAATTTATTTTGACTT^^ 
CTATTGCAACTGTTCTGACCACTATArmAGAGGTTTTGGGATATTAGGCTTATA^ 
ACAGGGAAATATTTATTTTTCATAAAGTATTAGATGGTAGCTTTAGAAAGGGGTGTT^ 
AGGGTAAGGCAGAGTAGGTAGTTGCTCTATTATGACTTTTCCTTGGTTCAAGCAAAATAAA^ 
AAATGTTTATTTAAGNTGTTTTTTGAAGAGAATCATAACCT^ 
GGAGAAGNTT^^^ITGTGCAAAGGAATTTNNAACNTAGNC 

SEQ ID NO: 88 AC n " I ' i - J T lT i 1 i-l i U i l i-rrri"i' J rTi l " i TGGGTTGmTGANACANAGTCTCA 

CThrrGTTGCCTAGGCTGGAGTGCANTAGCATGATCTCGGCCCACTGCAACCTCCGCCT^ 
NAAGTGATTCCCCTGCCTCAGCCTCCCGAATAGGNGGGATTACAGGTGAGCATNATCATGCCCAG 
CTGATTTTTGT^rIT^TTAAGTAAAAACAGGGTT^ 

ACCTNAGGNGATCCGCCTGCTCGGCCTCCCAAAGGCn'GGGATTACAGGCATGCNCCCCATGC^^ 

GTTATTTGTrTGNAATANAAANACTTTGGNGCCTCC™TCCGAATCCANTN 

GGGCTCCATGCTCTNAATAANACCTTNmCCTCCCrATTITAACCT^^ 

TTATTNGGGCCATTTCAAANGGAC(>rNGACCTCAANGAANGAATNCriTN 

TGTCCCCCTTTCTTTCTTGGTTTTTTTAAT 

SEQ ID NO: 89 ACri-l UU n- n ^ l Ul'l l - i 'i lH l'rillU- l TrimGAANANATGGGGTTTNTCCATGTT 
GGTCAGGCTGGTCT^WAACTCCCTACCTOAGGTTATC(:::ACCCGCC^ 

ATTAAAGGCGTNANCCACANTGCCCGGCClUll'rilUTriTAAAAAAAANAAACNGA(>IAATAm 

TACAAGGGAAACAAAATTNNAATrCCATGNNACCNTNAGNATACCCTNCACAGGGAGGA^ 

CAAANTNTNTAGGATCTGGACCTGAAANCCAA 

SEQ ID NO: 90 ACGCGGGAAAGGGACATTTCAAAGCCTATTGATGCTTGTAGTAAAAACCCTA 
ATATCCCATGATAAACACTAGAAACCAGCTCTCAGTGAAAATGC TTTGCAA TGTGTGGGm 
CACAGAGTTTACITCTTCTTTTGATTCANAATGTTGGAAATACTT 
GGAAATTTCGGAGACCATTQAGGCCTATAGTAAAAAACTGAGTATCTTGCAATAAAAA 
CATGCGATCTGTGAAGATACATCnrCACAGATTCATCTGTGGATTCATCrACAGAGTAAAAGCT^ 
GTTTNATTCAGCTGGTTGGAAACACTCTTGTTGAAGAATATATGAGCAGACATT^ 
GANGCCnrANNAAAAACAGGATTTCCTGAAAAAAAAAAAAAAAAAAGGNCAANAN^ 
GGANCNTCTGCT^^^CCCAT^mCNGNTAATGGTG^^TmGAA^ 

SEQ ID NO: 9 1 GAGTCTAGCTCCCACCAGCAGTCGGGCCTGCCTCCTGCAAAAGAGGGGAAAG 
AGAAGAANGAGAAACCCANGANCAAAACAGCCCATCATATTGCCAAANACATGGAACG CTGG GC 
TAATATGTTTGAATAATCACAAAAGAAAACTTTAAAAATAGNCTNrr 
TGGAANAANAAAAGGANAAGAATCTNTTTGNA 

SEQ ED NO: 92 CGTGGCGCGGCCGAGGTACTTTTGACCAAAAATTGACCAAAGTAAGAAAAAT 
GCAAGTTCTAAAAATAGACTAAGGATGCCTTTGCAGAACACCAAAGCTTCCCA^ 
GGGAAAGTGGCCCCCTGTCTCCTGGAAGTGGNAANAAGCCCTGCTCCCTGGCCTTTGGGm 
GGGGGCCACAATAAAATAANTmTTGGCCCCCCOTITTCCAGGGNCAAAAAA 

SEQ ID NO: 93 ACTr il 'lU-rT i - iTll - l - riH -i U l l - lT TCAANATGGAGTCTTGCTCTGTCACCTAGG 
CTGGANTTGGANTGCAATGGTGCAATCTGACTTACTGCAACCTTTOTCT^ 
CTCCTGCCTNAGTTCCTGAGTAGCTGGGATTACAGGTGTGCACCACCACTCCTGGCATATm 
TTirrGATAAANACACTGTTrNACCATGTrGGCCAGGCTGGTCTNAAACTCCT^ 
CCACCCGCCTCGGCCTCCCAAAGTGCTGGGArrACAGGCATGAGCCACCATGCCTGGCCAAAGTT 
TACTCTTTATACmTnmTATTGQNGACATTTrCTCCACGA^^ 
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TCTGTATANATNAAAAAATGGAAACTGANACCCGNGTACCTTGGNCGGGACCACCC 

SEQ ED NO: 94 ACTCCGAGGCmANATTNATTTTGGGTCTTTGGGGGGGACC™ 
GCCTATNATCANTCCGAAAGTNATCATTTNTTGAACCTCANCNACCAACATCC^^ 
GGCTITATAAGCGGCCCCGCTTCCTTTGAATNNATNATCCTrGCAAANACCCAT^ 
TmGCCACTrGClTO^CCX}GGGAAATCATAACCNTTTATNATAGTC^ 

TAl^GAATCCCGGCCACCATTTNAAACCATNTCCCGGGGGNGGGCCAOTCCCGATACCCATCC^^ 
AATCCNATCCC 

SEQ ID NO: 95 GCCGTGGCGCGGCCGAGGTACTmGATATTAAAAG CTAATT m 
TTCTAATAAATTCATTCAGGTTTAACGAGCTTGGCCACACATGAAGGTNCCI^ 
CCGGCTATACCTTCTAATNTCAATTrAAGTTGGGCCTNACTCTTCCT>r^ 
AATCCTTTAATTTAGGCAAAAAATAA 

SEQ ID NO: 96 ACnT N lTi l I ' l - il l - i 11 nil IN GGACAATTGTTATTTAGTTTTTATTTCATAA 

TCATAAACTTAACTCTTGCAATCCAGCTAGGCATGGGGAGGGGAACAAGGGAAAAACATGGAACC 
CAAAGGGAACTGCAGCGAAAACCCAAAAATrmAGGAAACCTGNCAACAAATGGGGGGGNGGG 
GGTOCTCTCCTGAACCTCAAAAAGGATTAACTGGNGGNTAAAAANAAACCCAAOT 
CGAGTTGCCCCANGTCAACAATGGGGATCTTTTTGNTGGGCTTGCCATTCCTGGACCCAAAACC^ 
CCATGGCTCACAATNTTChnSIGCCTrrmCANTTTTCCAAA 

SEQ ID NO: 97 ACAATCTTACTATCTTTCTTTTCAGTTTGTGCCm 

TGGTTCTTCrAGTTGCAATTCCAGAGAAATGAAATGTCTGACTTGATGTCTCA 
GCTTCTGGATGAAAATCTTGGCTTTGGAATCTGTCAAAAGATTAACCCTTAC 
TGGTAGTTTTTTCTGGGGGAAACCCAATTCTAAAAATTNCCTGA^ 
NCCTIT^^^^^CAAGGTAAAAACTCTTTG^^^GGANNTA^^ 

GGGGAGCATGCAATTAAAAACTGCCTTOTCCTAGTANCCTTAAAATGGNATCATG 

CCANAATAATTTGACTOTGCCTTAAAAGTCTTNCCATCCGNAAATNGG 

TGACCTGGAATTCCGCTCTmCTTTTTCrAACTTTAAGi^ 

ATGGGCTGNGATTrAACAATANTrCNGGCArrAAA>mCAATACCrrGGCCCGGGCNGG 
AAGGGCCAAATTCNACCACACTTGGCG 

SEQ ID NO: 98 ACiTT N ^rl ^ l ^ rrIlU T^ l ^ r^ ^^TTTTTN^ 

GAGGTGATGTTTTTGGGTAAACAGGCGGGGGTAAGATITGNCCGAGTTCCCTTTTACr^ 

CCCTTTCCTTTATGAACCATGCCTTGTGTTTGGGmTGNCAGTTGAGGGGAAAT^ 

GGGTTGAATTGGAAAATATTTGGGCCTGNTAAATTGCAANTCCAGTGNTTTNAATCCT^ 

CTTATTGCGGANGGAAAATGNTTrrCATNGTTACTTATACTAAANATTAOT 

GNGAATANATTGGNCCCAATTGGGGTGGGANGGAA 

SEQ E) NO: 99 ACGTGTCTAAGTTCTAGAGCCTCCTGACGTGAGCATGGCTGAGAGTGAGGGA 
CCGCrCCCTGAAGGATCGTTCTGGGTAGGGGAAAACTGGGAAAGTGGGAAAAAGTGCANAAGCG 
AAACACCATTCTTTGGAAGAAGGAAAATCTTTTGATTTCTTAAA^^ 
CAAAAACTTGTAAAAAACCATTCCCGGGAATTGNANGGGAAAAANCTTmGm 
CANGGGTTTTTGAAACCAAGGGAAACCTGGANACCACCITGAAGGAAATTAAGCGGTTO 
TTCTNCTGGCCAAGGCCCCATGCTATTGGCCTAATTCTGCTGCTGGGCCCCTTCACATAGGA^ 
ANAAAACCGGTGCATTGATCAAGGGThrrhrmGGGAANTNAOTCATGAACCA^ 
ANTCNCAAAAAAAANTGGAAGGGCA 

SEQ ID NO: 1 00 ACTACCTGGGGGGGTTTGCTTTCCTGCCTTTTCTCTGGTTGGTCAACATOT 
GGrrCTTCCGAGAGGCCTTCCnTGTCCCACCTACACAGAACAGAGCCCANATCAAAGGGCTATGTC 
TGGNCGCTCACTTGTNGGGCTTCCTTTrTGGGGTGGATGGGTGCTCACOT 
TTCAANANTTTACCGGNCCCCGCTGGGGGTGCCCCITGGGNGACTANNCTC^ 
C^OTGGGNACCCCANTGAACAACTTTCTTCACATACTGGGGGCCCn™ 
ATCCNTCT 

SEQ ID NO: 101 ACGAGTGGTGGACAACAGTGCCCTGGGGAACAGCCCATACCATCGGGCTCCT 

cgctgcatccatgtctataagaaagaatggagtgggcaaggtggggcnaaccagatactactggc 

cttcaagggacagaagaaaaaggcgctcatttggggggcactggcntgcctgccccccnatgaac 

cccaaaattcgactccacaaccgtggtcttatttgagacaacgggaacccttgtggggacncna 

taaaacnccattcccnaccaccntncccagccggaaagncnatattccaangngcttg 

ttaaaactttonntganttgaacccangccttttgttgcaa^^ 

aaaccaccttttgcttanggacttnggaaccacattgctgctcccm 
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atatcctgngaagaaaataaattntnttcattcaaaaaaaaaaaaaaaaaaaa;^^ 
ccgggcgggccntttaaaanggcnaatttcaana 

SEQ ID NO: 1 02 acacccgcacgaggagcggggacggcgggcgcagaagtgggccaccatatc 

TGGAAACTACAGTCTATGCTTTGAAGCGCAAAAGGGAATAAACATTTAAAGACTCCCCCGGGGAC 

CTGGAGGATGGACTTTrCCATGGTGGCCGGACAGCAGCTTACAATGAAAAATCAGA 

TCTTGGAGAAAACTATAGTTGGCAAATTCCCNTTAACCACANTGACTTCAAAATm 

TGANCGTCAGCTGTGTGAAGTCCTCCAAATTAAGTTGGGNTGTrrCTCTACCTTGTC^^ 

AAGGAAGGCAACANCAATITmGCAGGTGTNAAAAAAATGCTGCTCCTAGGGATA^ 

NCTGGGAAAGATGACCrCACACAATTGCTGTTGTTOCTGNGGTNAATGCANCCAATGAAA^ 

TTTGCTTGGGGAAGGCCTGGCCCTGGCCCTTGTTAAAAGCTGGNGGATT^ 

AGCAAOSfAGTTTGTTGCCANATTTGGNAAAGNGTCACTTGTGAAANTANC^ 

ANGCTTNCCTTGAAAAAAATATTCATTCCT^rITGGGCCTO 

GGTT 

SEQ ID NO: 1 03 TNGATTGGGCCCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATAT 
CTGCAGAATTCCCCCCTTGAANCGGCGCCGGGCAGGTACAGGAAGCATGGCTGGGGAGGCCTCAG 
GAAACTGGCAATCACGGCAGAAGGCGAAGGGGAAGCAGGCACCGTTTCTTCITAATATTCCTT^ 
GAAATTCTTTATGGTGCACANGTAGCCGTAAAAAATAACTGCTTCACACTGACTTGTCATN^ 
TGGGGTGGGGGTANGGGGT 

SEQ ID NO: 104 ACNCGGGCACTCACAGACATGACACACTCACAGACATGACACGCTCACAGAC 
ATGACACGCCCAGACATGATGAACTCACAGACGTGACACACAGACNTGACGCACTCAGACATGAC 
ACGCTCACAGACATGATGCACTCACAGACAGGACACGCCCANATGCTACGCACTCACAGACGTGA 
TGCACTCACAGACAGGACATGTGTGGCTCCACTCAGCACCCATACTTAGTCACCTGTGCCCAGNA 
GCACGCATGTCTACACAGATCACATTNACAGACACTGTGACACAAAGTTACACAGTCATGTGCAC 

atgcncacacacacttggtcnmgcmgactgcctgngcangacac^ 
aaaaaaaaaaaaaaaagtnctttggccgaac 

SEQ ID NO: 1 05 ACTTnwrTTTTmTTTT^ 

GNGATAATGGTTrTGNGGCTAACrCNAAAAAANGAACGGCCCCAATCTTNAAAAGTCT^ 
AATATTTACAATCA(>fTATTTAACAGGTTrCTAAAANNATNACCATAm 

SEQ ID NO: 1 06 ACTTTCTATGANAAGCGTATGGCCACAGAANTTGCTGCTGACTCTCTGGGTGA 
ANAATGGANGGGrrATGTGGTCCGANTCANGGGTGGGAACTACAAACmGGNTTCCCCATGA^ 
NAGGGTGNCTTGCCCATGGCCGAGTCTCCTGTTACTGATTAAGGGGNACT 

SEQ ED NO: 107 ACAAACAATGNTTATTTGTTTGTAAAGTGCCAGGTTTATATTTANNTA/^ 
TAANATNTGCGTmAAGCAGTAAGGCCNCAT^^^TTTANCTTGGCTGTGCNGGN^ 
TAATCTN 

SEQ ID NO: 108 ACGCGGGGCCTCTTTTTCCGGCTGGAACCATGGAGGGTGTAGAAGAGAAGAN 
NAAGGAGGTTCCTGCTGTGCCATAAACCCnT^GAAAAAGCAGATGGAATTTCGNANAGCTGA^^ 
GATCAACACGCCTGAGANAGATGTTTG 

SEQ ID NO: 109 GCGTTGGGCCCTCTATNGCATNAATCGAGCGGCCGCCAGTGTGATGGATATC 
TGCAGAArrCGCCCTTAGCTNATCCGGCCGAGGTACAANACNCTACGGGAACAGNTTGCCTCCCT 
NCCAGCCTCAACCACAATTCnTCCATGCrrGGGGCTGATGTGGGCTAGTAANACTCCAGTO 
GGCGCTGNAGTNTTTTTTTTTT 

SEQ ID NO: 1 1 0 TGGGCCCTCTANAGCATGNTCGAGCGGCCGCCAGNGTGATGGATATCTGNNN 
AATTCGCCCTTACCNTTTGCGCGGCCGACGTACTCGNNATGACCCCAATACACAAAATTAACCCN 
NTAANAAAATTNATTNACCACTCACTNATTCGACCACCCCTCCCCNTCCAAC^ 

SEQ ID NO: 1 1 1 ACCCTCCAGAAATTGGTGACTTTOCTTTTGTGACTGACAACACT^ 

CACCAAATCAGACAGATGGAAATGAAGAnCTTAAGAGCTTTAAACTTTGGTCTGGGGTCGGC^^ 
CTACCTTrGA>n>rrTCCnTCGGAGAAGCrrCTTAANATTG 
hmKGGCCAATTACTTGANGGACCTAhnTATGTNGGANCrmAACAAN^^ 
TAANATNGNACAAAGGACCTTTTNNCTAACOSriTO 
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SEQ ID NO: 1 12 ACTTTTTGGGTTTTTTTTTTTT^^ 

GArrrAATTTTTAAATGrrTAAATCTCCTTTACAAAAAGTATA 

ggaactggcactgtgaccttncattnagtttrctagaggatgngatcnaatot 
tatnaaaaaggaaattngcttaacgacc 

seq id no: 1 1 3 acttgccccttccccagaaaagcgggacttgctgctaagggtgaaag 
aggcagtttgtccctgntggtctgaccccttgaaaacgtgggtgtataaatnagaaaagca^^ 
ttcaatgattaaannccaannggaaggcmgcttnccaattcmto 
ttngttcnacgattaaaancngttttnttttggcctttccan;^ 
aatnanaangtcacntattttaaattttat 

seq id no: 1 14 acgcgggcagtcaagctggttgctctgaaagtaacccagcttgttgctctaa 
aatacctcagtagcctgagtgttatactagagatcraaaggggitaacagggataggggtgggaa 
aggttagagactcctagaaaatctctgggtaccgtgatotcggcctcattctaatacctgtc^ 

GAACAGCrrrrTTCNTTNGNGCTCTCTTNGCCTTAGC^ 

CTTAATAAGNGATGGGAATGGGTmGANAATCCGNAATTTATTTAAAA^ 

AAATACCTTTTTTTCAAATAAAAATTAAGCCCNA 

SEQ ID NO: 11 5 ACITTITTriTrTTTT^^ 

CATTTGTTTTTTAACCAGrrACTATTAGGGCAGAAAAAAACAGGCC^ 
ATAC^^S^C(^T^TGGAAANNAANGGGCCAAAAATTTTGC^ 

AAAGGTACCAANTTTTANCTAAAATAAGGGTTGGAATTATCNAAAAAGGTCCAGAAT^^ 
AGCmAACAGATTTTGCNATTAAGCCCAAACAGATTGGTTTANCCACT 

CCAATGGAATAATGATTGGATGGGAAAATTGGTGAGCATTAGNGAATACCTATGGTCACTTATGG 
GCCCGGCirACCTTCATTTCrrGGTCTTTTTCCGGCTTACTGNCC^ 

GANGGGGCTTCCAAAAAGCCTGGGCANTAANGGGCTGGTGGAAAATGTGANGGGCAGGATAGGG 
GAACCCANGGTTTTCTTTmiTrGNCTAACNTACT^ 

GTTCCTTTTCCCCAATCCmGGGGGGCCTAACTCCCCTAGGGANCCCCC^ 
CCCTTN 

SEQ ID NO: 1 16 ACTTCTITGTGTTAAGTATTCAGCCACTGTTTTTAGATCTAGT^ 

ATTTAATTTGCTCAACATTTACTGGAATGGGTGGAGTGAAAAAAACTGATGCATACTGC^ 

ATCTACCATTTTTTAAAGATAATGGTTAATTAGGAAAAGANCCCTTTTA^ 

NTGAAATGGAATCCAGTTTTTCAGAATNCACCAACCTGGTGGATGGGTNTTAAAANGT^ 

AAGGTAAGNATTCCNGATTATATTAATGGATAACTGGGTTGGTTTTANAATATGTAAGCCGGTT^ 

AATCTACNAGGTTCACATATCONCGGTTOCTCTGNNATAATAGAAAGGTTAGGGGArc 

TGGGGGGTTGTGGATAAAGGCCCTGNGTTTGGTTCCCTCCTCTGGATNCCAATAGG GGGNA TGCT^ 

CirrTTTNGGGGACCTNATTCCTTGGAAGATTGGGCTTG^ 

AAANNAANNAACCTCCTGNCTGCTNCCATNTTAAAGCTNAAAATT^^ 

NCCTAArrCAGGNAOTCCAAATTTTTGGGGGNAAAANCNTCTGGGANAATNTAT^ 

NANGTTGGG 

SEQ ID NO: 1 17 A CinH l■ 1411 " ll ' lU ■ i ^ l ^ ll ■ ^ ' ^ ^^ l " r ll' J ■ llU ' N GCATCAAAAAGCm 

TGGTCCAAGGCnTGTTAGGATAGTTAAAAAAGCTGCCTATTGGCTGGAGGGAGAGGCTTAGGCAA 

AANCCCTATTACTTTGCAAGGGGCCCTTAAAAGTCGOTGGGCTCAAAAGGNCrm 

TAAAANTTANGCCTTTCGNAANAAATTCTNNGCCAANCAANGCTTO 

SEQ ID NO: 1 1 8 ACCACTTGAAGCCAGAATAGTTNGNTTATGTGGAAACCACGGGACCNGGAAA 
ATITCATCnTNATNGAAGArrCGANGGTTTGGANATITAAATT^ 
GCTACGGTCTTTATCCNCAGAGCCGGTGGCTAAAATAAANAC 

SEQ ED NO: 119 ACACITGATTCAGATTCCACCrGGGATrCGACAAATTTrTC^ 
CTATTTTCTTTGGGATOTCTCTTGTTTTCGTCTTTAA^ 
AATTCrrn'GTTTCrmACTTCATCTTTT^ 

GAAATTCTmCTTTGTCCITAAATCAAAAACTAAACTTTCCAAGGAGCT 

ACrrGTCTTTTAGCTNCCCGGCCTTTGC^^ 

TGCCTCAATTTTTCirmCTTCTTTNGGGGAAGATC^ 

ANAGTTCGCCTAAAATATGTCGTAmAAGGATAKGCCTCTGAATATCCTTCCTGACTGC^ 

TTrGTrCTCTGCAATTrCTCCCTAAArrCAANAAGCNCTTCm 

GGGCCTCCAAGGCCCNGCGTCCTGGGCCCGGACCACNCTTAAGGG 
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SEQ ID NO: 120 GTACCTCAAGGTTCTCAGGACCTCCmCCCCAGATCTTAGGGTCCTGCCCT^ 
TGGGTCTCCTGTGTCCAGGGGAGAGGATCTGGGGAGTAAAAATTGTGAAGTGGCAAATCCCTGTA 
CCCGGTTTTmTGGATTITCCAATGAAGGGCTGTCTCACATCCCACCTO 
AGGA 

SEQ ID NO: 121 ACmTrTTTTTriTIT^ 
CTTCTTCCAGTTTCITGTATriTrTAAGGNNTTNA 

ATTTGCTACGGAAAATTGAAATCCGCNCACTGAAATATCCTTTTATTGCAAC^ 

ACCAAAAAATCAAATTATTTATGCTCTANCCAAATTATGNAANGGTITCTTm 

CCAAGCTGGANTGTANTGGAACAATCANANTTCACTGGANCATNNGCCTCCNNGGCT^ 

NTTANTTGCTrAACCCCCCCGAGTNGCITGGAANTANCNGGTGAACAAAACGAC^ 

TrrTTTTATGTTITrGNTAAAACCGGGGCTTTGCT 

SEQ ID NO: 1 22 ACGCGGGTGCCATTCCCTCCrrCTTCTGGA'iTl"n'l"l"l'Cl"rTGACCATATCAAGC 
TGAAGAGATGAACnTGTTTTCTCAAACCrrrGCATCAAATTAAGAGTAAAA^ 
ATCAAATGAGGCAGGCCAACTAAAACAAGAAATAGGTTAGAGAATATTGACTCTCCGCCGGGCAT 
GGTGGCrCATGCCTGTAATCCCAACAGTTTAGGAGGCTGAGATGGGCAGATCACTTGAGGNCAGG 
AGTTCAAGACCATGCCTGGCCAACAKTTGGTrGAAACCCITGTCrCTACrAAAAAATACAAA^ 
AGCCAGGCGTGGTANTGGGCCCTGNAATCCNAGCTACTTNGGANGCTGANGCAGOA 

SEQ ID NO: 123 GTACTCTT^mTATAC^OTAATCTGGNGGATANCTATTT^ 

AATrCCATATCTTa^GTTGNTTCNCAAAAAACTGANTTTACTACNANGTATATATTT 
ATTAATTATAANTTTTNGNATTTAA 

SEQ ID NO : 1 24 A Ci i ' i ' i 'l i i i 1 1 1 U li r i nN TGGGGAAAAATCCrm CTTTA CAAACTTCCAT 
CAGTTTAGGAGTCAGTCTGTATGCCTITAGTGAGAGAGATCCTTGGGCAAGTTTTTATTG^ 
TAAATGANAAACGACAGATTCTTCAATGGGCOTGCTGGTNACTAAAACTGGAGAOTC^ 
GCCCGGNTTNAOSfAATGAGCCATANTATGNNGGACTGAATACOSrACCCCACGTGAAAGAN^ 
ATGTTTANNTTGGCGNAANGCTCCCNATTATTTTCCATCTNAATTT 
NNTANTCC 

SEQ ID NO: 1 25 ACTTTTTnTTTTTTTTm 

CATTTTAATCCAATTTGTTAATCTTTGATTTCTAATTTGATGCTCT^ 
CAATTGAATATGGGTTNGGNCTTAAACGTTACTATTTGGTTTN^^ 
CATnTTCCrirrmAAAGCCTTATTTCGAATTGN^^ 
NAGGGTTNGGTACCAAAAAT 

SEQ ID NO: 1 26 ACCXTTGCCTTTCTCACATCATNAGATCAAGTCACTCTTTGTGCATCCC^ 
GGCTGAGCGCATCATTTCCATGTTGAACTACrrCCTGCAACACCTGGTTGGCCCCAAGATGGG 
CTAAAAAGTANAANGACTTCAGCATATNTGAmTCAACCNAATCTGTANGTATCAGATAT^ 
CTTA 

SEQ ID NO: 127 ACAGTGTGGCTCATGCCTGTAATCCCAGCACTTCGGGAGGCTGAGGTGGGAC 

aatracttgagtccaggagtttgagaccaggttgggcaatgtgatgaaaccctgtctctacaa^ 
aatacaagaattagctgagtgtggtggcacatgcctgtantagccacagctacttgggaggctaa 
ggagggagaaccacttgagcccaggaggtcaaggctgtagtgtgctgtgatcgcgccantgcatt 
ccagcctgngtgacagagcaagaccctotctttaatacantnaaaaatnnaaaaaanat^^ 

SEQ ID NO: 128 acacggcagtcttagagaagcaaatggctcagatgatgataattaagagtag 

CCAACATTAAAGTTAATTmAAAAATACAGTTAGGTGTTTATATTATTTAG^^ 

AATCCTCTTGCTCAGGAAGTGTATACAAaTITrTAAAAATTA 

CTTCTATGACAACTCTAGTGCAATATTAGAGTTTCATTTATTCCACAATATATT^ 

CTATTTATCTGGCTTTACTAGTAAGTTCTTTTAGATTAACATCCAGTTCAT^^ 

TGGAATTTAGTCCATTTCCTAATGAAAAACAGCrCAAATTTTAGGGTAACT 

AACATTATATTTGGATCCAATTTTCCTGANATATTGAAATACTGA^ 

ATTATrrTACTTTCAAATGTATANTrAATTGCTAACATGTrmATTTTCATATA^^ 

SEQ ID NO: 129 acagtagaaacaagcagagctactgatactctcacagctcatttcagtttgtc 

TTCCTTATCrGTATGAAAGGGGACCATAGAGAGAGGTrGAATTAGTTTCAATACAGCCCTAAGCAC 

TTCTTATGGTGTTTTITGAATTACTGCTCAACAGTrCAGTCCAGTTAAAGTAA^ 

ACGTAAGAGAATGAACTTGGCAACACCTTAGATTATAATTCTGATTTTAACi^ 
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AATGCACATAATATTGATACTGCGTTGTAAACATTTAGGCCCGTAAAATTACrGAAGGACTGTAGA 

ATGAAAGAGAATCNCANAATAAACTTAAGGTTANAGAACACTTAATGTTTCCTGCCTN^ 

ATGCTATACTTGCAAAATNTTATTGNGAGGNAGAAATATTATNT^TTTO 

TCATT 

SEQ ID NO: 130 ACri"lini"rilUUlU-ll-rrriU'JU-riGGGANANACAGGGTCTTGCCACTTTGCCC 
AGGTTGGTCTCAAACTCCTTGGCTCAANANATTTCCTGCCTTGTGAATCACT^ 
ATGCATTCTTCTAATTrrTTTTAATTTATT^^ 

ANANATTCTTTTrrrrriTnTr^^ 

GGAATGCAGNGGCAAAATCTCGGCTCACTGCAAT^rmGCCTCCCGGGTTNAAAAC^ 

CrAAGCCTCCTGANTAGCTGGGATTACAGGNGNGCNCCACCACACCCAGCGAATTTTTGTNr^ 

TTTAGTAAAGTCAGGGTTTCCCCATNTTGACCAGGCTGGTCrCAAACTCCTGACCroWGAT^ 

cccccotattotcccaaagtctgggattacaggcgtnagccaccgncctggccaatat^ 
nccctnccgggcggncgtttaaaagggcaanttccancacactgggnggncnttn™ 

CNANCTNGGTNCCAAACTTGNGGTAANNATGGGGNATANCmGTTCCTGTGNGAAAT^ 
C 

SEQ ID NO: 13 1 ACri"ll"iU"lUl"ll"l-ri-ll"lUTrGGGAATGCAACAACTTTATTGAAAGGAAAGTG 
CAATGAAATTTGTTGAAACCTTAAAAGGGGAAAaTAAACACCCCCNCTCANG^^ 
TGCAAAATGGACTCTTTNTGGATGTTGTANNANACNTGGTGCANTCNT^ 
AAATTAAATCAACCTNTGCTGATNAAGAGGGATNCNTTCATATATTTANNATNm 
TTTTAAATGGTNATNATT 

SEQ ID NO: 132 ACCTACATCAGATCTAACCrrGATCrCAGCAATGTGGATTCCCTCTTCTACGC 
TGCCCAGGCCAGCCAGGCCCTCTCj^GGATGTAGAGATCTCTATTNCAAATGAGACCAANGATCTG 

cn'ctggcagctngtcantgaangactcatoctgttncccngatctaccat^^ 
aaggggcttnggcctttonnanctatctcantaancnct 

CTANGA 

SEQ ID NO: 133 acaaatgttttttattcaaangtncaaaataaattatctgtaggcatggaca 
atgacagcagtaaaccaitatatatttngtcaactgaaaccagtnactgatggttatagtgatt^ 
agccgcctttttcntattttntccaactgacttctctgai^ 
ggccttcctgcananntcattaaotaaagnaaanccctagcnangagkitaa^ 

CATTACAACTTCCArrrCNACCANTGGNNCCATTCCAAGGGCCCThrTTNT^^ 
NNAC 

SEQ ID NO: 134 AC lU - lll ' l ' ril4 - rrriU - iU ' i - ri - lU - l - ri ' l ' l - ri GGGGATTTANTAAAATAAATGTAT 
TTTTAAAATNTTITTAGGAAGCTGCAANACTGGCAACGTGATTGCrrOT 
TTAAGTTCTATATAGGCTCCCTrCTGTGCrrCTGTTTTTGCA/^^ 

CAGCACCTGATCCTTGCCCCn'GACCCACrGGCCATTTTGGATNGGACCACGTGATACANGTTGGA 

CCAAGTrGGTrGCATTAATOTATTAAATCTCTCrGAATCTGGCAATCANCAACCATCAOT 

TTTTTTACAAAAATACCACCCCATTTCCNCCAAATTTCTTN^^ 

ACCGTTTATNCTATTTCnTAATAGGCCTTCTTGATGAAACCCTr^ 

NCrCAAACTNGGGCAAAAACAAAAAATrGAGGGCANTNAACATGCCCTGNGCNrTG^ 

TNCCTAAAAAATTTCTCCCOTAAAAAANANTNGGNGGCCAAAATTT^ 

ATTAAATTCNGTTATAT 

SEQ ID NO: 135 ACATAGTAAACTGTGGGTATTCAGGGAGATAAAAGTTTTTTGTTTGm 
GTTTGTTTTAAAATGAGGAGTCAATAATTGTTTTCAGATAATTATCCTTGGCTACA^ 
ACAGGGGTAATACCAGTCTGAGGTTGGACAGTTGTTGGGCAGTTGTTGGGCAGACGTCCT CACA G 
AAGTATTGTGTGCCTATATGTGTGCGTATAAAGTTGCAATGGCCTTTrrGCAAGGTTGTGG^^ 
CACCTTTTGTGCATGAGAGCCCTCCCTTCCTGTCCTTCTCAGTTCTAOT 
AACTCCATTITGATTCTGACAACTrTCAGAAGCCTTAGGAAACTAGACAGTGGAAT^^ 
GCTCTTTAGCCAGTTGCATTTTATTTGAACTGGCnTrAGCAATGAGCACTAAT^ 
TGCTGCTGGGTGCAGTGGCTCATGCCTATGATCCCACACTTTGGATCACCTGAGTCAGGATTTCAA 
CCACCTGCCACATGGNAAACCCCNCTCTACTAAAAATAAAAAATTACCCGGCATGGGNGGTGGAC 
CCTATAATAGCCACTACTGG 

SEQ ID NO: 136 AC anUl ' ll ' l i^ ' ill - ri ' i ri ' l 1 I N GGTAAACANGGCGGGGTAAAGATTTGCCGA 
GTTCCTITTACTTTTrrrAACCTTTCCTrATNAGCATGCC^ 

ATGACTTGTTGGTTGATTGGAAATATNGGGCTGNAATTGNCAATCCCAGTGTTTATAATCT^ 
CANNCnTATTCCGNAGGANAATTCTNNNATTGhrrACTT^ 
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CATCATAAAATCGNCCCA ' 

SEQ ID NO: 137 ACrTGACCCCACAGCCGTCNGGGATGAGCCGCTTCTCAGCCACCATGTCTTCA 
AATTCATCATCATTGNAC>rrGGTGAANCCCCACTTCTTNGNNAANNGGATCm^ 
TGAANACITGTC 

SEQ ID NO: 1 38 ACACAGCTGTCAGGGAAAGTCCTGATGGCCACAGTGAAAAANGTCATGGTTN 
GACAAAANATTCATNGCAN(nTAGCATGGNTCANACCAGT^^^CNTACOTAAA 
ATCTAGTTTTNACCACATCTGCTTCAAAAATAGCACACCAAACTC 

ACTTGCACCATCGTGGAAGAAATGGAAGAACAAGGATGGATTTT GGCT GGCTGGAAGTCACATCT 

TGGGGAAGCTGGCCANGTrrGGCATrCCACAAGGOmXjTTCTTATTrr 

TTCCnTKGnCCCAAAGTTTCTCCCAGNTTOTTmAAAT^^ 

SEQ ID NO: 139 ACGCGGGGGCGAAGGCGGGGTCGGCGCTGCCGGGTGAAATCGTAGGACAGT 
GAAGATGCTGCTGGAATTGTCCGAGGAGCATAAGGAACACCTGGCCTTCCTGCCTCAANTGGACA 
GCGCGGTGGCTCCAAhrmGGGCGGATTGCNGTGGANATTTNTGAAACrCTGGChn^ 
ANTNTANTGAAGGMNCNTCCTGAAAATC 

SEQ ID NO: 140 AC iTi ' rrrrn Ti' i 'i'i i 'i i n u n TCCGGi rrri-i i rriTrnn ri 1 1 1 1 1 iaaa 

AAAGGAAAACCCGGTAOTGATOTCGGGGTTGAGGGATAGGAGGAAAATGGGGGATAGGNNT^ 

AACOTGAGGGNGTTTTCTCGNGTNAATGAGGGTTTTATGTTGTAAATGNGGGGGGTGAGGG 

CCNTTGTNTGTGGTNANTTTTrTAAGGNAATTmTGGG^^ 

SEQ ID NO: 141 ACT r r rn 'i T rnTrn-l l l l l 1 1 rT i GGANAAGGAANAGGrmTATTCGGCCG 
GGAGCCNTCNGCNNACTCCCGTCTNAAGAGC(>fANCTCCCCNAAAAANAAATTCCT 
AAGAGCTTACAACTTTAAGGGGTCCACGTGAAAGGGTNATANTAGATCAAGTAANCGTGAGGAAC 
NTNAOTGGGGGCTACACACNTrGGCCrnrrGGACAAAAANTT^ 
CTTGGGTTT 

SEQ ID NO: 142 AC i n' i n' il 'lUl'riU-lU U l'inT riUn ' l ' rri ' r GGCTGGATTTGCCTTTATAGGANAG 
TGAAGGGAAACCTCATGTTTCTAGGATCAANCTGNGGTTTNAAANATTCATCCAANTAm 
ATATTCCTNTTTCAAAAATCAAGTTTCCAAAACTTCTGCTCTGTTTT 
CCTTCCAGTANTTGAATAAAAAAAGGCACTNTThrrCACTGGGAAAAAA^ 
AAATTACCTTTCAGGATTOTGOTGCCAATGArrAAAACACANAAGGAATlsrrGCCC^^ 
NCTGCTGCGGNAAOCTAATNCTTG^mTGNGTNANATTTGNATCCAATTGGNGAm 
GAAAGCNGGATTTTTOTCasrACTCCGNNTTANCTCNT^ 
ANGTNTGTATCAAANCTTTCTTTGANNANATTGTTNNAAA 
AATACTT 

SEQ ID NO: 143 ACCGAGTGTGGCACCTAGGACAGCAGGCAGTAGTGCAGATAAGGTGTGACTC 
TTTCTAGCATAGCCAGGGGGCATGGCTACCCrCATATATCCCCAGGCCITCCCTAGACTCTAATC 
TTCCAAAAGCCACAGAAGCAGTTrATGTAGmAGTAAACACAGTATCTGACCTTCCTAAOT 
CCAAATGTTrAAATTTTGAAGACATTTTTArmACCAATAATC^ 
AAGATTAAAGTCATGGGAACTAAAAGGCATTAAAATTTCTACmCCTGAGAAATAm 
TATTTTTCrrTAAACCAATTAATTAGAGATCnTrrATATAAACA^^ 
ATGCACACAGATACACAGATAGAAGAGCTTTTrnrnTn^ 

CCGTCCCTAGGCTGGAGNGGGAGTGCCATGATCTTGGCTCATACAGCCmAACCTTCCANGGTTCA 

AGCAATTTCTTGNGNTTNAACTTCCCGAm' AGCTT GGGACTACAGGCCCCCGCCCCCACGTO 

TAATTTrTTTATTTTTAGNAAAAAATGGGGGTTTTGC 

SEQ ID NO: 144 CTGAAGNAACTANCNTCAANAAGTAGCCTCTGTATGGGAATAGAGCTAAGGA 
GGATGCTAAGGCTCAGGCGAACTGACCCCrTGACAGCCTAACATGGAGGCTTTTATCTTTGAA 
AACACATTTGTCAACTTGACAGGGAGCTTGGGCTGCAGATTCCTGCCCTTGTGAGACTCTGAGGCC 
CGGCAGAAAGAGCCCAGGCATGGGAGTCAGACTCATGGGAGGGTGTGGGGGTAAATCCTGGCCA 
TGCAAACTTCCTGCACTAGGATCrrrACITCCCCTGGCTAGAATNCTATNACTATA^ 
GAATAAATGACTGGNTCANCGGGGGTmAAAAGAAAATTTSIAAANGGGTTAAATT^ 
ACAGGGATTGTGGNGAATNAAACNGANGGAATTCAAGCCAAAACCATGGGGGGAAAAAAAAACC 
CCCTT 

SEQ ID NO: 145 ACTGTTCAATAAAATrrAATTCACTATAAATCAATTTTITAAJ^TTAA 
AAAATATTAAGCArrTTTTAATTCAACATATATATATATACTATGCTTAA^ 
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AGGTGATATTCACCATArrATATCTACrAATGACACAATGAACTTrmGAGGGATACAr 

AATGArrCTGTAAATGACTAAAGTCTTTGGTAATAAAAGCTGGAGATTGGAGGTTAC^^ 

TAAAGAATTTGAAGGAGAGAATGTAGAAATGTGAAGCAGAAACTTAGCCATGAGATGAAGACTA 

AATTGAGAACGGAATGTTCATATAAGGTAAAAAAAATATATATACACCGTITAATAATTGm 

AGGGGGAAGGGAGGGCAGGGAGAGAAGGAGACAAGGGTTGAAAAACCNGCTGNTGAGTAGCAT 

GCTCACTCCTGGGGGGNNANATCATTTGTCCNCGGCCGNAGCCANGCTAAGGGCAAAm 

SEQ ID NO: 146 AC n - iTi - i 'l I'l I ' l l i i U i 1 1 i i n 1 T ATrGGCTTTTAAAATAACCTrTAi l l l l r 
AAAANTTANTNTrGTGCNTTATAGGAAATNGAAAAACACAANCAAAAAACA^ 
ACAANNATTTNANANTTTGACATATTCTTCNAAATCCTGTGTGTAGGCAC^^ 
GGACTGAAACTG 

SEQ ID NO: 147 ACTTTTTTTTrrTTTTrT^ 

GGTAGCNGCTTAATACTTATCTTTNGAAATCTATTGCTNATGCT^ 

ACCAGAAAAANTANTAAAGGCTGCCnTmCCTTTTTNAAAG 

CAATG>™TTNACrAATACATTANTACrrNAAAGGTGCTTAAAAAT 

SEQ ID NO: 148 ACTTCCXSGTGCTAAGGGNTNTCCGATTTGTAGAAGGCACAAATATTAATAGG 

atcacrntagctcttgggaatgtcatcaatgccitagcanattcaaagagaangaat^^ 
cccttacaggaaatagtaagcttactcgcttgttaaaggattctcttggaggaaactgtca^ 
taatnatagctgctgttantccttnctctgtattctactatnacacatn^ 
aaccngncaaaagnanattaaantctctttgaaattgna 

SEQ ID NO • 1 49 acgcggggccataccagcctaggtgtggagcaagaggtagggaggccctcg 
tggatatacacaaacaccccanatacaaaatggagcattgtggtagtggttagggtgttttatgn 
aaacantttaaattanatanttctattcattga 

SEQ ID NO: 1 50 gtacgcngggacgacgaagatgatgaanatgatgatgatgaagatgatgag 
gaggaggaagaagaggaggaggaagaggtggggtgggacgacagtgaaatctagagtaaaacc 

AAGCTGGCCCAAGGTGTCCTGCAGGCTGTAATGCAGTTTAATCANAGTGCCAi 1 1 1 1 1 1 i 1 1 iGTTC 

AAATGATTTTAATTATTGNAATGCACAATTITITrAATOT 

ACNCAAAAATTNAAANAATTNTAAAmrmAATNTNCr^ 

TTTCCATCANNCTGCTTTmrmTrrTGTATCTO 

AG 

SEQ ID NO: 151 ACiTiTriTrmTTANTiTGrrm 

CTGGAATATACCTGACCCACCATTITNANAANGACCCATNTNAKGTCTGACCATO 
CCATGTTNACACTGACCTAATGCAAAOTATGGAACCATTGGGCTGGTTATACAm 
AATTATNNTCCAACTNTGA 

SEQ ID NO: 152 ACGCGGGAGACTGAAAAACTGCCTCATGCATGTGTTCTATTTATTGATATATG 
CACATATGGCTGTGATTCACGTAAATrCATTTITAAATGTTACTGAATTCACAGTATC 
CCATGTATTTTCTCAATAAATGAATACTAAGATGCANTTTTGAAAGTATAAAAATAAGGAGCT^ 
AGAAAAACTAAGTTCTGCTTTTTTGTTTTT^^ 

naatatggttnnaacancttattgcttttgtatca 
SEQ ID NO: 153 Ac rr rrnTr rrn ' n - iTi 'i n i u i GGAGACAGNcrcACTCTATTGcrGAGGCT 

GGAGTGCATTGGCNCNATCTTGGCTCACTGCAACTTCTNCCGCCTGGGTTrAAGCGAT^^ 
CTCAACCT 

SEQ ID NO: 154 acgcggggagaaaggaacacagtaaactgaattgatccgtttagaagtttac 

AATGAAGTTTCrrCTAATACTGCTCCTGCAGGCCACTGCTTTGGAGCTC™ 

acatgcctggaaaataataattgtctattnggtgaaagatacitanaanaaat^ 
tananttcattotatnaanaganatntttnaatattngtatat^^ 

SEQ ID NO: 155 ACITATGTCCAriTCAGmCCCCACCrATAAACAAGAGCCAATTrCTCTT 
TCCCTGCTCTCCCCAGGTTGAAAAGGTCGTGGCCCCTTGGAAAGATTGTATTGACTGTG^^ 
TCTGGTGCCACCTGNTGNATGCCACAAGAAAGGCCTCTCCTGACTCCCAAGTTGTAACCCGTrrCC 
ACCAAATCGACTTCCAAATAATATTTATCAAATCATCATCTGTGCrmCr^ 
TTTTAAGGTGGAAAAAGGCAAAGAAGGCTTATATGTATTTTCTTCC^^ 
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AAGTTOCTTCGGTGAAATTNTTGACCACNTTATGTTTNGGGGGACTCCCTATNGGATCA 

SEQ ID NO: 156 ACCACTGNATTGATTAGNGGTGTATNTAAACANGGCTCCCTTCATTGCATCTG 
AGGACrrGTTTTCTTTTTCTTTATTTTTA^ 

ACTACCCAGTTTGNGGTTTTTTGGGANAAATGTAACn'GNACAGrrACCTT^ 
OTACCCCAAAANAAAAAAATTAAAAAANAANTNCCGTCCNCTTT^ 

SEQ E) NO: 1 57 ACCTGGAGGCTCAACGGTTTAAGCTTCACCACAAAAGCNAAATGGGCACACC 
ACAGGGAGAAAACrGGTTGTCCTGGATGTTCGATAAKrrGGTCGTNGTCTTGGlSfNTTNATN 
TNCTNAT 

SEQ ID NO: 1 58 ACTGTAAAAGTTCTGACACAAGACAGTGGCAGTGGTTACTTn-CATCGACTTr 
AGCATGTGATCTCAGGGACTCAAGACATACGCTAAGTTCTATTCTGAGTTrrGGGCAACAGAAGC 
AGTGACAGATATTTCTGAATGAANAAATTTTAAGGTGTTITCAAGCATm 
ACACACTNTTNGTTGTTGCTCACAAGTCACNTTTGNTGCCCAAAGAATTTAAAGAACCm 
TCTACAACNT^m'ACTTTACCAAAGAANTAACT^^TANTTNGAAGGGG^^ 
AAAAAATTCACACTTANCTTTTT 

SEQ ID NO: 1 59 ACAACTATGATACATAAATTAAAAATACAAAAAAAAAAGGAGGGGGCAGGC 
ATGGTGGCrrATGCCTGTAATCCCAGCACTTTGGGAGGCCAANGCANG GTGA TTGCTTGAACCCA 
GGAGTTGAGACCACCTGGCANCATATTAANACTCCITCTCTACAAAANTTTrAAAA^ 
CAGGTNTGGGNGGOTCAACTCNriTrANCCCTA>«^TATTTTNTNATACCm 
TTAANGNTTTNNTATNArrAAATATC 

SEQ ID NO: 1 60 ACCCCCTTCTCCCACGTAGCCACGGCTTCCCCTACTATCAACATCCTGCACTA 
GAATGGACATTTGTTACAATTATGAACCTACATTGACACATCATTATCACCCAAAGTCTGTAGTTA 
ACTGTTAGGGTTCACTCrrGGGGGTTGGNAAGTTCTGGGGGCTTNGATAAANGCGTAAGNGGTTC 
CNGTTTTTAAGGGCTCAAAANGTCTTGCITATOAGNGATTATTGCmGTAGG;^ 
NAACTTATCTNACTGCCCCAGCATAAATGTCTTATACCATTATCAGCTTTAAAATACCC^^ 
GTATCATTGANAATAAGACCTTTTTTTTTTTTTT^^ 

NCCTTAGGCCGNTTGANAAAmGAANAAACTGGAATGACAGCGGGGNGGGANGAACCNGAAN^ 

AANGATNhRNlATNGGGGGGGNCAGGTGGGGGGAATTAACNNAA>rr^ 

CTNACCTGGCNTNATGCCATGAATGAGTNGCACAC 

SEQ ID NO: 161 ACGCGGGAGTGAAGAAAAAGAAATTCTGATACGGGACAAAAATGCTCTTCA 
AAACATCATTCTTTTATCACCTGACCCAGGAAGTTTTCArrGGGAAAAG 
TTACTAACCATTTTAAAAGACCACCACCAAGGGAAACCAAATCTTTTCTTG 
GAATACCTTTTGGGTNAATGGAATTGAAATNCAAAANAATTCTG^mC^ 
NTGTAATTNATTGTTGTAGATAAACCTCCTOTTATNCANCNAAACCCCCCTGTTGG^ 
AACTNCTTGAAATACTTTATTAANTAATCCAATTNCTTCCNAANT 

SEQ ID NO: 1 62 ACAAATTTTGGGATTAAGCTGCrCCCAAGACAGTCTTCATCACCTTTGTGA^^ 
TGGAAACACCAAATAGTCAAGGTCTGAATTTCCATTGTGTGTGGCTAAGACCAGTCGCATAGGTTO 
ATAAATGTAT 

SEQ ID NO; 163 ACATATTGGCATrTCATCCTCAAAGGAATCATCAAAAGAAAATTCACTGAGT 
AATCrrTTTACCATGACTGTTGAAGTGAAGGGTCCCTATGAATACCTCACACTTGAAGACT^ 
TTGATGATTTTTTTCATGGTGATGTGTATTGTATATGTCCTGTn'GGTGTTCTG^ 
CTGCCTGCTACTGGAGAGATCTCCTGAGAATTCAGTTITGGATTGGTGCTGTCATCTTCCT^ 
TGCTTGAGAAAGCTGTCTTCTATGCGGAATTTCAGAATATCCGATACAAAGGAGAATCTGTCCAGG 
GTGCTTTGATCCTTGCAGAGCTGCTTTCAGCAGTGAAACGCTCACTGGCTCGAACCCTGGTCATCA 
TAGTCAGTCTGGGATATGGCATCGTCAAGCCACGCCrTGGAGTCACTCTTCATAAGGTTGTAGTAG 
CAGGAGCCCTCTATCrmGTTCTCTGGCATGGAAGGGGTCCTCANAGTTACTGGGTATT^ 
TCCCTTGACTCTGATAGTAAACCTGCCCTCTCANCAATTGACGCCTGGGTTATTTTATGGATAm 
TTAGCCTGACTCAAACAATGAANCTNTTAAAACTTNGGANGAACArrGNAAAACTC^ 
GCA 

SEQ ID NO: 1 64 ACATATTGGCATTTCATCCTCAAAGGAATCATCAAAAGAAAATTCACTG AGT 
AATCTTTTTACCATGACTGTTGAAGTGAAGGGTCCCTATGAATACCTNACAOT 
TTGATGATTTTTTTNATGGNGATGNGNATTGNATATGTCCNGNTAGGhfNTNAAA^ 
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NNCAACNNTTNCTGGNGATGAACCCTGNATTATCGCAC 

SEQ ID NO: 1 65 GGTACTTTTTTTTTITmTT^^ 

ACCAGGCIGGAGTGCANAGGCACAATCrCGGCTCATTGCAACCTCTGCCTCCCGGG^^ 

TTCTCCTGCCTCAGCCTCCTAAGTAGCCAANATTATAGGTGCCCGCCACCACACCCAGTTAATTTT 

GGCAOTGTAGTAAANATGGGGTTTCACCATGTTGTCCAGGATGGTCTCGACCTCCT^ 

ATCCACTAGTCANAGTTTGTTTTTAAATGACTACCCGCGTACGCTTGGGAGTCCTCAGCAGGGGG^ 

TCATTCACAGTGAGGACAGACACAGGTGAACCTATGGGTCGTGGAACAAAAGTTATCCTACACCT 

GAAAGAANACCAAACTGAGTACCTGCCCGGCGGNCGTTCNAAAGGGCGAAATTCANCCACTGGC 

GGCCGTTCTAATGGATCCCAACCTCGGACCAAACTTGGNGNAAACATGGCATAACTGGTTNCTGG 

GGNGNAAA 

SEQ ID NO: 1 66 AClUU']"17U"JlTl"lTl"J'rri''lU"ri"rTTGGGTATATCCTGGCAATGGGATTGCTGG 
GTCAAATGGTAACTCTGriTrAAGTTCTTCGAGAAATCTCCAAACTGCm 
TAATTTACATTCCCACCAACTGTGTATAAGCATTCCCTTTTCTCAACATCCTCTACAGCT^ 
TTTGTTTGTTTGTTTGTTTTTITGACTTCTTAGTA^ 

GTTGTGGTTTTGATTTGCATTTCTCTCATGATTACAGATGATGAGCCTGGCCATI^ 
ATTCCTTGCTCATTTTTAAATCCTTCTTTTAAAAACTATTT)^ 

AATTCCAACACCTGATGTCCCTTGGGGTGCCAAGAAOTGTCTCAATCTACTGGTTGGTCTTT^ 

TTACTOTGCTTATACTGACTTGGTrcrmGANGGGTGGATAArrGAATGGNGAACCTCATGTG 

TGAAATGACTGCCGGAATCTGAGGAGCrGATTCTNGNTGGGTCCTTAGAAGGATTTGGTTGGTCAT 

TTGATNCNNGOTGTACNNCCTGGANCCTAAATACTTTAACTNGGANTO^ 

AANATCTANACCTGGCCGACCCTANGGN 

SEQ ID NO: 1 67 GGTACirrriTnTTTTriTn^^ 

CCGGGCCGGAGTGCAGTAGCATGATCTCGGCTCACTGCAACCTCCGCCTCCCAGGTT CAAGCA ATT 

CCCTGCCTCAGCCTTCCGCATACCTGGGATTACAGGTGCCCGCCACTACGCCCAGCTATTITmTG 

TATTTTTAGTANAGACGGGTTTTATCATGTTGGCCAGGCTGGTCTCAAACTACTGACCT^ 

ACCCACCTGGGCCTCCCAATGTGGTGGGATTAGAGGCGTGAGCCACCGTGCCCGGCCAGTGATTC 

TTGTTAGAAGTGAAACTTCAGAACATCCATCCACATGAGTGGAACATCATGAAGCAAGATGCTGG 

TTCCTATCAAAGGAATCTTACATAGCGCAGCATTCAACATGTNATGAGAATAANAACTCANACTC 

CCCCATCCnKAAAAGTTGGAAATTATTAAAGCCCTGTATGGGTGAAATOTGm 

CCGGAAAGNTTAATNCCNGGAACCCCTTTTTGGANGGGGGNAAAAACCCCTTT^ 

TTTNAAAAGGGCTTTCCCAAAAAAAGG 

SEQ ID NO: 168 ggtacatccctgtttatcccattccatccaccgaggcccaacagcatggatga 

TCTGTTTGCAGGGAAGCCTCCCTGCTCCCGTGACAGCTATCTCACCAGCTGACACTTTACCATATC 
TGGCAACAAACTGTTTGCrrCTCTTCTTGGATTTCAAATCCACCAGCrTT^ 

GGCCTCCCCCATGCAGAAGATCTTCATTGGCTGCATTCACCACAGCATCAACAGCATGTGTGGTGA 
GGTCATCTrrCCAGACTGATAACTCrATCCTAGGAGTCAGCATTTTCTGAACACTTGCAGAGATTT 
GCTGNTGCCTTCCTGAACTGGANAGACCAGGGTAGAGATCAGCCAAACTTATTCTGGAGGACrrN 
ACACAGCTGACCTCATTATTTTTTAAAATITrcAAGTa^TTGNGGGT^^ 

TAGGTTTCTTCAAGAACANCCATCTTTGAimTCNTTGNAACTGNTGTrCGGNCNCCATGGAA^ 
TCATCTC^fNGmCCCGGGGGAGCTTNAANGGOTATNCCCTTTNGCT^ 
AAATNGGGGCCCTTTOGGNGCGNT^TNTNGGGAAAANNAANGGCCNN^^ 
AANGGGATNNTNNANTGGGNGTTTN 

SEQ ID NO: 169 GGTACACACrCACATCTGGACCTGTGAGAACAAAAGGAGTCTGCCAGGATCr 
AAAATAAAGGCCAGGGAGAAGGTGCAGTTTCAGATACAGTGCATGGGCGCCACTGTGGGCCTGG 
GTCAATGAGTGTATTTGGCAGTAACATGTATGTAAGAACTTAATCCACAGCTTGATATAAGGCAA 
AGGCTGATAAAGTCAGAACCGCAATCAGAAAAATCATAAAAGACCTGACTAGCCTGGGCAATATA 
GCGAGACCCCGTCTCTACAAAAAACAGAAACAAAATCCCCCATACATTAGCTGGGCATGGTGACG 
CATGTCTGTGGTCCCAGCTACTTGGGAGGCTGAGGTGGGAGGATCGCTTGTGCTTGGGAGGTTGA 
GGCTGCAGTAAAGTGGTGATCACATCGCGGNACTTCCACCTGGGTGACCTTGTGAGACCCGTCTTA 
AAAAAATGTCTGGCGNCGGGGCTCACACCTTTAATNCCACACTTTGGGANGGTTGA GGGGGGNG G 
TTCAThrrGAAGTCAGGAGTTTGAAACCATTCTTGGNCCACATTGGGGNAAACCCATNl-l^ 
NAAAN>n^NNNNl^TNNN>n^l^^ 

SEQ ID NO: 170 CGAGGTACACTTTTCAACCAAATCAAAAAAACAACTCT AAAAG ATTCTATTA 
TGTAAATTCAGTTTACATAAGTATATTTriTAAAATTTTGTCCTCTCAA^ 
AATGTTTATTAAATATGTTTCACACAGATGAGGTTATTTCCrrTTAATGTT^^ 
ATGGGGGTATTTGTACACAAOTAATTCGGTTTCTCCACATCCTTGCCAACATTTGGTTTCATGACTA 
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TTTTTTACnTrAACCATCCTGATAGGCGTGTGCTGGTATCTCACTGTGTT^^ 
CTGACCACTGATGATGTTGAACATCTTTTCACGTGCTTATTTGCCATCTGT^^ 
AATGTCTGCCCGCGTACCTGCCN 

SEQ ID NO: 1 7 1 AC ri T rn ' n " i ' i - J "l ri U N i"l U 'J" rilU GGGGGAAGTCCTATTTATCATnTAAAG 
AACAGATTGNGCCnTrGGTGTCATATNTAAGAAATACTGTCTAACCCAAAGCCAAAATAG^ 
AGTTTrGTGCCTrAAATTTAGGTCTAGGATCCATTITAGACAATTACATAC^ 
TGTGTTTTTAAAAATATGAATAACCAATTATTTAAGCACTATrrGTTGAAAA^^ 
AAATCAGTTGTCCAAATOTATGTAGGTCTGTGTCTAGATTTGGTTACACTAATNrrAT<^^ 
TTACTATATTCTCTTGATTATTGCAGGGTrrrrrAAAAAGCCTTGAAAT^ 
CTAACTTTGTCCTTTACCAAGGGTTAGACTATCTNGNCCCITAGTATTTAAT^ 
NTTTCTACAAAAAAGGTATOTTGGAATTTGATGGACATTTGGCTGAATCTN^^ 
NAACTGGTATTNAANATCTGGAATTCrrmCCTGCAATCCGGTTm 
GGNAATTNACCCCCTGNNGGCG^^rANTNATNGGATNCANCNNGGACCNAC^ 
NCTNTTCCNGNGGNAANNTCCCNCCNG 

SEQ ID NO: 1 72 CGAGGTACTTTTTTTTTTTTTrTTT^^ 

ATCATCAGAGGAAGGACCTGGCAACGTTGACACTTTGCTAATTACACCAAGTCTGGCTAGCT^ 
CACCCGCGT 

SEQ ID NO: 1 73 GGTACTTTITrill"rilll71"i"rril"17U"rGGGTAGAGACAATGTTTCACTATGT 
TTGCCAGGCTGGTCTCGAACTCCTGACCTCAGGCGATCCACCTGCCTCAGCCTCCCAAAGTGGTGG 
GATTACAGGCATGAGCCACCATGCCTGGCCCAACTACTGAGATCTTATCCGGAAGTTGCTGATTAC 
CAGCTTCAGGTGTTTCTGTTTATTGGGAGACTGTTCCTGCTGCTGGCTGTGACCAATTATTATm 
AAAAGACAGTTAACAACTGCCGGACCATCATCTGATGGTTGCCTGACATCTGACATTCCTGTTGTG 
TGTGTTTTGAGGNGAGGGAGCCCTCTCCTGCCCTGTTCTTGTCTGACTAGCTACCTACTGTAACj^ 
AACTATATTTGGATTCCATAACGTGATACTCAAATGAAATTTCAAAATCTTTCAAGACATTTATGA 
ATCATCAACTTrCGGGGTTGGTCTGCTAGGCrCACmAATGCAAGTTCTANCCAGGGCACAC^^ 
AACTATTTTGAATTTCGGNGNGCCTmTAANAAGGATTTNTTNACGTTC^^ 
CmTGAAAAAAAAATTTCm'AGGTAGGGGTTCATCKrCCAAATACTGCACAT^ 
ATTTCAT^TNTTCTAACCCCCTG^fTAT 

SEQ ID NO: 1 74 ACCCGGGGGCANNCNNGTGGTCCCATAGCACAAGCTGTGAGGGGATTCACTT 
GTGTGCNGAACTCCTCGGAACC^^^GGTGTCCCTAAACAT^r^'CCTGGGAACAGCCNT^WCT 
CCCTGATGACTANNGAGCTANCTAAGATCAGCTGANTTA 

SEQ ID NO: 1 75 AC ll ' ri ' rnTi - l -i" i "i l - l 'i I ' l r iT N NTGAGTGAGGCAGGAGTCCAANGAGGNTAT 
TTGNGGCANTAAAATTGATTAAGGATOCTNGTTTANGANATCAGGTACGTCCTTTAGNGTNGCGT 
ATGGNTATCANTCGAATTGAGGTTA 

SEQ ID NO: 176 ACTGGGATTACAGGCATGAGCCACTGCGCCTGGCCCANAAATCTCrnTGAA 
CAhrmTTCAAAAAATACAGCTAGCCTCAGTGGrrCATGCCTGTAATCCTAGCACTTTGGGAGACCA 
AGGCAGGCTGATGGTTTGAGGGCAGGAGTTTGAGACCAGCCTGGGCAACATGGCAAAACCCCATT 
TCTATTANAAAAAAAAANAAAAAAAAAAACCTrGGCATGGTTGCACGTGTCrGTAGTTCCANCA^ 
CTTGGGAGGCTGAGGTGAGAGGATCACCTGANCCCAGGAGGTGGAGGCTGAACTGANCTGTGATT 
CCGCACTGCGCTCCAGCCTGGGCAATANACAAGACCCTGCCTCACCAACCCCCCAAAATACCATT 
TATAATAACTTn'AAAAAAAGANTCGCTrAGGTGAAATCTAAGGACTCATATAGTGAAAGC^^ 
AAACCCTGATGGAAAACATTAAAGGAGACCTAATTAAAAGGAGAGACACACTGCArrTGNGGATT 
GGAAGACTGGACGGAGTAAAGACCTCNTTCTCTCCATATTGATATGTAAACTTANTGCAATTCCTA 
TTCAAANCTCCAGCAAGACTTTTCAGATATAGACAAGATTNTTCTACANTTNACTGAACGCTAA^ 
GAACTANAATAA 

SEQ ID NO: 177 TATACCACTCACTATGGGCGAATTCGAGCTCGTACCCGGGGATCCTCTA AGTC 
ACCTGCAGCATGCAAGCTTGAGTATTCTATATGTCACCTAAATNCNCCGGNGAAGAAGGCNGTTTT 
GCGTTATTGGGCGCTTCTTTCCNNCTrACCTCGATCAACTTGACTCNNNTTG 
NGCCTGCGGCNGAAGCCGGTATACANGNTCACTCAAAANGCG 

SEQ ID NO: 1 78 ACi l l-i-n n T i Tr n n' i l l -ri'iU-i G CTTTTTTTTT^^ 

CTGGNTAAAAAAATTTGGGTTTNATTGNTTTNGCTAAATAATACTAAAAAAA^ 
AGGCAGGGCTTGAATTmrrAATTNGATCCArrrNTTTAATTAA^ 
NATCATGGCCAAAAAAATTGT^^S^ITAACCCCCNCCCCCCCCCAAANGTTTTNGC^^ 
CCATNNCCCNTTCCNCNCAAGGGCCTNA>mTCGGATGGGGGNAACCTTNCCCCAANAAACTGCC 
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TNAAGGNCGGGGCAATGGGTGCCAATirrACCTNTCAGCAGGTTAGTCAACCANACAAACNGGGG 
GGCTAAAGTCCAAAAATTCTTTCCAGGTTTTOSfTGCTTATT^ 

TAANCCTGTAAAATTTAAGGGGAGTTGGGGTGGGGGCGTAANAGCAAANGGACAGCCGGANAAN 

AGAAAT^n^^CGGGTCCCCCAAGTTTTITCCTGGGNTAGNGGCTTTO 

GGGCCAGAGTAAATGGCACNasrCNNGTTTTTTTATNAAAGAAAAAT^ 

NTNAA 

SEQ ID NO: 1 79 acaggcttgaacagaaattggagaatgccttgaaagacaatagaaagtgcc 

CACCCACCAGACAGACCAACTTGAANGGAGCTTTATTGGCCAAGTGGTATNCCCGTTGGGAACCT 

TTTATGAATGCTTAmCAGTGGANTANAGATCTNCITCCGAAACNTCCCAAA^ 

AGGGGAAGGAAAACCAACCCTTTCANNATTGNTTGGAACTNAAAACAATTTNGGAAA AAANGG G 

TTTCCANANAAACCTTGGGCCTCCANAAAGGGCACATTNNACCTTGGGTTCAA^ 

CCTTTNTTTGGGCCAAGGCCCCTTTAACCCNGCCCNT 

SEQ ID NO: 1 80 CCCCCCGCGGGCACCTGGAGCAGAGGGGTAATGACCACTGGAAACACTTGCG 
GTATTCCAGGAGAACTTCCCCACACATCGCGGGCAACAGGGCACATGCTGAAAACTAATGCCCAA 
AATCTCTCGAAGAGCACCTCCAAACGCTAATCAGATTCCCAAGTGAATGAGGAGACATGTGATGG 
CACCGAGAGAGAACTGGTCGGCTTACCCTCAAGGAAGCCATCAGACTAACAGC CGAT CTCTCTGG 
AAGAACCCTACAACAGAAAGAGTGGGGCCAATATTCAACTTCTTAAAAAAGAATTTTCAACCCAG 
AATTrCATATCAGCCGACTAAGCTTATAGGAAGGAGATATGTCCTTTTCAACAGCAAATGCTGAGA 
GATTTTTGTCCCCAGGCCTGCCCTAAAAAGAGCTCTGAAAGGAACACTCCTTGG 
CCTTGCCNTCACACTGGNCGGGCCGTTNCITAGNGGNTCCNGANCTCNGTACCANGCT^ 
AATGATT 

SEQ ID NO: 1 8 1 ACTmTITITITITriTTTT^^ 

CTTTTGAAGGGTTTTCGNGTCTCTATCTTCTTCAGTTCAGCTCTGAT^ 

GCrANCTTAGGGATTTGTTTGCTCCTGGTTCTCAAGTTCCTTTAGTTGTGATTTT^^ 

TGANATCTTTCTAACTTTTTCATGTGGCCATTTAGTGCTATA 

CTGTGTTCCAAAGATTCTGGNATGTTGTATCnTITmCTGATTAATTTCAAANAAC^ 

TGCCTTAATTrCATTATTTACCCNAAAGTCATTCAAGANCAGGTTATTCAATTTTCA 

GCATGTGGAATATTTNGGAACAACTCCAAAAATGCTCCTTTTGTGTGGGGTGT^ 

TTTTATTATGAGAAAATATACATGACATAAAATATANCATTTAAAAATTACTAAATATATAm 

ATGGCATTTAAGTAAAATTCACAATGTTGNGCAACCATCACCATTATATATTTCCANACTTTTAC^ 

TCATTCCCAAACANAAACTTTTGNCCT 

SEQ ID NO: 182 acgcggggagaagttaggggctgcagcggcgctggctttaggtgaacgacgt 

GAAAATTACTTTTCCCACTGAAACACACCCAAGTATATGCCCAGCCTTCATG^^ 

AAACGAAGCGCCmATGTGGGTGGCCTTAGCCAGGACATTTCTGAGGCAGACCTACAAAATC^ 

TTCAGCAGATTTGGAGAAGTTTCGGATGTGGAGATCATCACACGGAAAGATGACCAAGGAAACCC 

ACAGAAAGTTTTTGCATATATCAACATCAGTGTAGCAGAAGCGGACCTGAAAAAATGTATGTCTG 

TmAAATAAAACAAAATGGAAAGGTGGAACATTACAAATTCAACTAGCAAAAGAAAGC^ 

CACAGATTGGCCCAAGAGAGAGAAGCTGCAAAAGCTAAGAAAGAAGAATCAACAACAGGTAACG 

CCAACTTGTTAGAAAAGACAGGAGGAGTGGATTTCCATATGAAAGCTGTGCCAGGGACAGAAGTT 

CCATGGCATAAGATTTGGGTTTGTGAGCNAATTTGGGAAGAGTCITACCTGTTrOT 

AATCAACATANNCGTAAAAATCATCTNATATGGATCCCTCAAAATTCTGCCCCAACCTG 

SEQ ID NO: 183 ACGCGGGTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAACT 

tagctgggcgtggtggtgtatgcctgtaatcccagctacttgggaggctgaggcagaagaatcag 

ctgaatccatgaagtggaggttgcagtgagctaagatcacaccactgcactccagcctgggtgac 

agagcaagattccacctcaaaaacaaaaacaaaaaaaaacaaacccaaaaaataaaataagtaa 

ataaataaataaaggtggagtgactatattaacaccaaaggtctttccaggacangtatcaccag 

anataaagagggtnatttcatanaggcaaagaggtcaagtgatccagaagacaccaatcctaag 

tgtataagtaactaatancagatcttcaaaatacgtgatntaaangctaatagaactgcaaggat 

AATTAGACAAATCCAAAATTAAAGTTAGATATTTCAATACCCCTTTCTOAA^ 
GTAATCTGAATATTTANNATTTA 

SEQ ID NO: 1 84 ACTTTTTTTTTTTrmTTTTTT^ 
TTirrriTGGCAGTTTCTAAGNCATTACTTTmAT^ 
GGGANAGTTTGTATGATTAATAAAAANCAGCTTTTTNATNAAATGCTNG^ 

cagcctgngakatccgaccatcccattaactttgaagnttntctngattaataaaaaaa^^ 

ggngggngaaaaaaaggnggaacatgctaaaaacctaaatgacaatcatccaaatgngaggaaa 

naanaaccgattnaccaactncctttttkmtttna^ 
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GCCTTCCTGGCTNAAAAAGCCTGCAGNCCCANAGAACCCNTGAAAANAGCCATGGOT 
GAANTAGGA 

SEQ ID NO: 1 85 ACAGTATTNTGAATGTGAGATGATTTGTCAGGACTAACTGTCTTmAACA^ 
ACATTTTCAGTNTITrTAAATAAAATTTTGNAAAGNAATGTGAAT^ 
AATTCATTCACTATTGNGTANGAANATGCTGTTAANACATAGGAAGGG 

SEQ ID NO: 1 86 AC lH - rill 'r il ' riUUCil - ril TGTTrmGGTGATGTGGCTrAA^ 
TCTTTTTTGGGACATATTTCTGCCAATTAAAGACTAGAAGGGCACAACI^^ 
GAGAANATACATTAAAAAAAATCTTCTGATGTTTTGTA^ 

CACTATTGTGAAAAGGAGCAACGNAGTTTTGGGTTTTTTGTTGTTNGTTTGCT^ 

AAGAGATTAAAANGTTTCTGGATAAGGGATTAGCTTCTCGAAGTGTCCATCATrCTGNGTA^ 

NCTTAAATATGNAATGTACCAAACTCCANAATTAAAAAANCTCTCATGTTGTTATNCm 

AGCAATGATAACNGCNTATAACACTGGCATITSrCATGGCAAArrGCTGCTACCrrTOG 

CAATNTTCAANCAAAANGACTTGCTOTAANGTGNTmAAANANGCAA 

TTANAATGANAGTTTTATTGNACTNCCCTTTITCNANNTGGTCTNATTTO 

CCTGNNNGGCGTT^n^GANNGGCTAATTCATN^n^A(:OT 

SEQ ID NO: 1 8 7 ACTTTrnTTTmrnTmr^^ 

TCAAANAAAGCGCANCAGAGAGGCATTGCTTGCTGGAACACTTGACTCCAGTCATGTGTCAAATC 

ACCIOTACATOTAGCTTTACCAANAATGNATGCTAATGTATCCTC^^ 

NGAGN^rmTGGCTAATATTAACCNAACATGNAACCAACTAAAATGTAACACCATCNCCA^ 

AAAACATCACTACCAAATTCNANTATCCTAAlSrmCANGTNCCGGCCCNGGCGGCCTTTC^ 

GGCNAATTTAGCAC>mTGCTGCCNATCANTAGGGCCTAGNTA 

SEQ ID NO: 1 88 ACCACTTCACTCCAGCCTGGCGACAGAGTGGAACTCCGTCTCAAAAAATAAA 
ATAAANTAAANTAAAGCNAAAATNTAAANTGTTAAAAAAAACAAAAAAAGGGAAAAAGGANG 
TGATTGCCTTGGTGAGTCAACACTGGGTATTTTCTGACCACTATTTGAAACAAAA^ 
TGATATTCTATGCAAAGATCrmTCCTGGANGGCACnmGCGGNNACACCAGTGNGNAC^ 
NANCCCTTCATTGAnTGAAT 

SEQ ID NO: 1 89 ACCTCCAAAGTGGTTAAATAAATTAAATTACCACTGGAAGAGAATAAAAATT 
TTAGTTGATCCACATTCTCAATGACACCTGAATTTCTGTTTTGTTT^ 

ATAGCATGGTATTTCATTGTCTGAATTTATATTTATCTGAATACCAGAGTAGATGGGGATCTCTCAT 

ACTTTITTACATTTGGCTTCTTTTTCTGAGAATAACn^ 

ATTTTTTTTTAGTTAAAGGGCrrTTTAAT^^ 

ACAGTTCGGGGCTGTTATTITAAATTCCriTCnTrGCACTTTTC^ 

TGCTCACCCGGTNAAAGGAAAGTGTAmTAANCAATTACTCTNTACTTTC^ 

gtnggaggtnagaacnaagctttnattctg 

seq id no: 1 90 accctaccactgttggaccagtggagagcagtggattgagatctcgctaccg 
ttcttcacctaccgtctacaactcacctactgacaaagaagactacatgaccgacctacgaacttt 
ggatacttttctcagaagtgaagaggagaaacagcatagggttaagctggggagcccagattcta 
cctctccttccagcagtcctacmctggaactatagtcgttctatgggggattatgcacaaa^^ 
aaagaagtttcagtatcagctrgcctgtaggtctcaggccccatgtgctaacaaagatg^^ 
atctcagctctaaacaagccgcagaagaggtctgggcaagagtggctatgaatagacaacttctt 
gatcatatggattcatggacagctaarrtagaaattggatcaatgagacaatatagtgccttgttc 
aaanatgaatctgcacaccanatgaaacaatggntggccaacctaagatggaaaggtngttrct^ 
ctgaaaagctgcctggtaagiwcttttttcantt 

seq id no: 1 9 1 acacaatccntrataaaagtngtatatatttttttctgtcaa^ 
atttataaatacaagatttacacagcaccccatcaaaaaaaaattaaaacccm 
catatatttcataccctataaactttcaaagggggtgctctggtnaaggncnccctagttaa tg 
ccatttactgggngcagcaaaatacatattaatcngnaaagttttttmggccataa^ 
nanaaaatnagtcnaganactttttcggggnaaacnca>™agnctnat/^^ 
anaaagggaacttaaatggggcgatttcccctgccgaatcaaaaagttaaaccagatttgg^ 
aaggttgattgggaaccaaatggctctncnaaaccatttaanggcagggantgngaatgggto^ 

GAAAAAACCTTGGGAAAGCGGANATOGGAANANCTGGGGANGAAACCAATTT^A^^ 
TTGNGTTTGAACTGGGNTTAAACTTCCCNCGTTTCAAGNTACTAANN^^^ 
TTTCCAAATTGNNAATNAACNTTTTTTAAAACCANNCTC^ 
TGGAANNGGNTITITnTTT 
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SEQ ID NO: 1 92 ACTCTGGCTTGTGCITAATACrrGTGGTTAAGAGAATCACCTATATTGTTGCAT 
ACAGTTTATTCATTTTCATTGCTGTGAAGTATTATATCCAAATATTATAATTAATTGATCCA 
TTACAGATGATCATTTGGGTTATTTCTAGTTTGGGGCTATAATTTGTGCTGCTGTGA^^ 
ACATGTTTTTrAAAAAATATATTATGCATTTCTGTTTGGCATTATGCCT^ 
TCATAGAGTAGCATTTATTTCTAATTCTAATGCTCTCANAGGGAATACNNGGAAATm 
CAAAGGNCThmTrAAAATCTGTTAAGGAAAACTGAAm 

ACATGGCAATAAACATTTAAGCACTTTTCNATATTAAGACAGAAAAAGCAATGAATAGC AGA 

GAAAATGGAATTTCCCCCAATATTTAAGTAAAAAGAAGTm'GATGAAAGAAAAGTTG 

AGNGGTTTTATTOTAGGAATTmAAAAATATAGGATATTGGAACATT^ 

ATTTTTirmTTCCNGNAATTTTNGGGACCCCGANN^ 

GGNCAATTAGAAAAAGGG 

SEQ E) NO: 1 93 GGTACAAAATAAACTTTGAGGCAAAAGGCATTGCTGCAGATAAAAAACATGC 
CTATATAATGACAAAAGGCTAAATITACCAGAAAGCTATATCAATTCCAACGTGATTATACCTGAT 

aacacagctttaagaaatatgaagaaacatgacagaattaaggagaaataaacaaatccactgt 

catagtttgaaaatttaactaaccmctcagaaccrtataa^ 

caatatcatcrcgatttataatcacaataagmccccttttggaattaaatagcaarrtam 

ttatatggataaataaatacccacaaattggttttgaaaaactattta aaaa aaggagaactgag 

aagaaatacitgcrrgtagcagatactgaaacaacattctcraaggccattttactga^ 

aataactttatgcgatcagtgtaatatagtagaagttccaaaattaaatatgaaaaat^^ 

AAAAATATGAGAGrrCAAriTAAAAGGAAATAGTGACTAGTTAATAAATGATG 

SEQ ID NO: 194 GGTACr rj -i"lTn i ' i " n 'l'l'l 1 1 U 1 1 1 i 1 iGACAGGGTCTCACTCTATCACCCAG 
GCTAAAGTGCAATGGCGTGATCACGGCTTACTACAGCCTCGGCCTCCTGGGCTCAA GCAATC CTCC 
TGCCTCAGCCTCCCATGTAGCTGGGACCACAAGCATGCNCCACCATGCTCAGCTAA'1"1'1"1"1'1'AACT 
TTTTGTAGCAACAGGGTCTCACmGTTGCCCAGGCTGATCTCGAATTCCTAGGCT 
CCCACCTTGACCTCCCANAGNGCTGGGATTATAGGCATGAGCCATTGCACTGAGCTCACTAGCCA 
NAATTCTTAAAAATOTCThrrCAGGAGACTATANATAATGNCCTA CTTOT ^ 
CAAATAAATGGGCTGNGGGTAANCACATTTCTTTTCCTTTAATCTAm 
TTCTATGTGGGAACTTAAAANA^^^TATTTCITAAATATAAA^ 
TGGGGGGGNGGGAATTTTTTNTTTAAATCACCGGCTTTNG 

SEQ ID NO: 1 95 ACITGATNNGATTCTCAGCTTGGTTGCTGTTGGTGTATAGCANAACTAC^^ 
TTGTGTAGGTNAATCTAGTATCCTGATACnOTGCTGAATOCATTTACCAGTTC^^ 
GAGGANTCGTTCGGGTTTTCTANATNTACNCCGAAGGAGGGAGGNAGGACAT 

SEQ ID NO: 196 ACTGCGGGGTCCTTGATGGACCCTAAAAGGGGTTGGAGAGACCGATTCACAG 
AGAATGATTCCATCAGGGGAAATOCTCCATGACrGGCTTTGAANATGGTGGGGGCCATGTGACAA 
GGAGTGANGATGGCCIOTAGNAGCrGANAGTGGCCCCCAGNCNACAGCCTGNNT^^ 
NCNCATTATTTCTCTGAGCCTCCAGAAAGGAGGAAAGCCTAGCCAACNCCTTAATTTTAGCOT 
AANATGCTGAGCAGAAAACCCAGAAACTNCTGGrrAACGTGGNGAAACCCCNTCTTTA 
ATACAAAANATTTTGCCTNGNCTCNGGTGG>raGTGCGCCrTGTANTTCCCATCT^ 
CTTGACGGNAATGATAAATGGCNTTAACCC 

SEQ ID NO: 1 97 ACrrrTTTTTTTTTT^^ 

AAATGAAACCCCANATTTAATTAAAAATTTCCCCATATTCTGGCCTACTCTG TAATT^ 

TGCCTGAAAGGNATTATGTAGTTACTTAATAAGANAGAGGAAGGGGAAGTATTTTTAAT^ 

GGATGAGAGGAAAGGAAAACCCTTTGGGTTTTTNTTTTTNTGCG^ 

AGCCTGCGTATTTCTTTAGCAGGTITITCTTAGAGACAACnrAACAA 

TCCTAACTCTACAATGT 

SEQ ID NO: 1 98 GGCTGC ANAACAAATCAAGCACATCCTTGCTAATTTCAAAAACTACCAGTTC 
TTTArrGGTGAAAACATGAATNCAGATGGCATGGGTGNTNTATTGGACrrACCTGTGANGATG 
NTGANCCNATTTAA'TNATTATTT 

SEQ ID NO: 1 99 CTAATTTATATGTTGCTCTGCTTATT AAAT AATCAGCTTAAGGATAATGGGGT 
ATTATTCTACTCnTGTGATCCACGCGGGTAATGThrrNTNTTTTCNAAAC;^ 
CCAAGATGANTTAATTThrrANNAGTCCTTATTTACACAATAAACAN^ 

GGCCGAACACGGNGGCTCATGCCrATAANCCCAGCACTTTGGGAGGCCGTAAANGGGTGGATCAN 
GANGTNAGGAGATCCNAGACCATCCNGGCCAACATGGGTGAAACCCTGTGCTCA 
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SEQ ID NO: 200 ACTTJ"ril'l"ri"l"l"rrJ"ri 1 ri"l"riTl"l'lACAAGGGTAGCAAAAAATATNTGTAA 
NGCAAACATTAATCAGTAAAAACTAGGAGNGGCTNTNTCAATACAAGACAAGATANACGTCAGT 
GGAAAAAAAATTACAAGGCNCATTACATAATGACAAAAGGACTAATCTGTTAAAAANATAAAGC 
AATTCTGAATTCCAAACAGCAAACAGCAGAACTTCAAAATACATGAAATGAAAACCAGTAAAGCT 
GAAGTAANAATAGAOSfCNTCTITATTCATAGrrGAGCATGTTTACAATCCACTGNCAGGCj^ 
TANAACTGNTTGGCATATAAACAAGAAAGGGTTTNNAAGAACTGGNCAACm 
GNNTGTNNTAAAAmATGGGATACATCmCTNCAAANAAAAANT^^ 
AGAAANATTTTNCTAGATGAGAACANGATGACCNTGNGAATmAAANAATTGC^ 
TTTGATNTTTGCT 

SEQ ID NO: 201 CGTACCAGATCCCACCrAGGGGCGCNACTTGCrTGCATAACTCCTAAAANAC 
CTGGNCACCCm>nWANCCNTAGGACNCTGACTNNNNGCT 

NNNAGAGTCAAACTATAAATTACNTNCCAAGGTTAGGTTCTACCTATGCCCAGNAATGAACAAGG 
ACAGCTTAATAGGTTATAANCAAGATGGAGTCNTTTNGGGTCTGATCTCnTrCAC^ 
CCTCAGTTACAATTTTTGTAAAGGTGGNTTCAAATGCTTTGCTGACCTCCCATTAAC/^ 
CCGATTGGAACTTOTNTTTTGC 

SEQ ID NO: 202 ACTTGCTAGGTATCCTGGGTCAGTGGCGGTGCAAACTGGTTTCCTCAGCTGCC 
TGCCATGGGGCTGAGTCGTCAGGGACTGGTAGTGGCTTTGGAGCTNATAGCGGAGATTGACTGAA 
AATAACTGCAATTTACTGNCAAGCCTTCCCCTGAATGTTACAAGCCTTCAATANACTCCAGAGTTC 
aiAAATAATTACATCAGACAAATTCTGCCAGNGTAATTTTTGTCTGGGTGGGGTAAAGA^ 
GTGCTTTCTACTCCAGCAT 

SEQ ID NO: 203 ACAGTCATITTAATGATGTTGATTCTTCCAAACAATGATCATGGGAT Arrm 
CCACTTACTTCTGTCATGTAGTATTTCTTTCAGCAGTGTTTTATAGTTCTTGTTGTGGACATC 
CCTCATrGGTCrCCTrrGTTAAATATATTTCTAGATATTTTATTTm 
TGAGATTTTTATTTGGTTCTCAGTTTGAGTGITGTCGATCTATATAAATGAAACT^ 
TTTATTTrGTATCCTGAAATCTTACTGAAGTAGTTTATCAGGTAAAAGAATCT^ 
TCAAGGTTGTCTAGGTGTAAGATTGTCATTAGCAAACAGAGATAATTTGACTTCCTCTTTTTCi^ 
ATGGAAGACATTTATTTATTTTCCITGCCTGATTGCTGAGAGTGGCCATCCT^ 
CCATTCAGTATGATATTAGTTGTGGGTTTGTCATAGACGGCTCTTATTATTTTGCAGTATGlllll'l 
CAATGCCTAGTITGGTGAGGGTTTTTATCATACAGACATATTGGATATTATTGAATGm 
AGGTATTGGAGATGATCATATGGCTCTGTTTTTAAAATTGNTrCTGNGGGGAATCACATT^ 
TTTGC 

SEQ ID NO: 204 ACGTGACAGAGCCAGGCTTAAACGCAGATCATCTGGCTTCAGACTTTCATCA 
CTTTATTAAAATAOCTCATAAGAATACTATGAGGCTCAAATGAGGCTGGCGGAAACCACAACATA 
TGATATTAGTITCAAAAGAAGTCATAACAGAAATAACGAAAACCATGAGGATGAAAAGAAAAGC 
CnTGTTTCmCCACTGTTGAGTTTTTCAAAAGCATm 

TGTTGAAATGGAGGCCAGAAATTTGAAGAGTTGAAGGCTGGTGCAATCACnTTGGAAAATATC 

AGTATTATACACATTCTAATTATATTTTTTGTGCAGTCGTAAGATTAGACAACAATTT^ 

CACACCGCTCATATTAGATAGATGTCTGTAGGGGGAATACTCCTTCCCCTGACAAGACCACATCGC 

CAGTAACGTCACTCrACACACACAGTTGGCCTCTGTGTGTCTCCGTCTrrAAGAGTAATTCAGGACT 

ACTAGCCAAGTGGTTGGGATTTAGGAATAGAGTGGAATTCAGCTTACCTTGTAAAAACTAGGA 

AGATAAAGCCTTTTCTAGCATATAGGCATTGNTGGCATAATCCAGCTCACTACAGNCTNACCCCCC 

GGCTTANGTGATCCTCCACCT 

SEQ ID NO: 205 ACl"lUHUU"l"illTlM"lTTriUl"rrri"rGGGATOGAGGGCCGCTCTGTTGTCCAGG 
CTGAAGTGCAGTGGCATGATCCCAGCCCATTGCAACCTGTGCCTCCCGGGCTCAAGCCATTCTCCT 
GCCrCAGCCTCCCAAGTAGCTGGGATTACAGGTGCCTGACACCATGACTGGATAATTTTGTAT^ 
TAGTANAGATGGGGTTTCACCATGTTGGCCAGGCTGGGCTTGACTCTTANCTCAATGNAGGAAATT 
GGTTGNAATGCAANCTTTAAAACTTAAACCCAATAAANGAATT 

SEQ ID NO: 206 ACCGNATGCTGGCNNGGAGGTGGCATATAGCTCACTGGNACTGANGGGCTGG 
GCACCCAACCCTNTTCCACCTGTGCTAATCGCCTGGATCTATCATNANTGCAAAAANCTOCTm 
TTGTACTGG 

SEQ ID NO: 207 ACAGAGAAACCCTTAGGCCAAACTTAAAATATGTAAGGAGGCAGCTTTAGGC 
TAAACCTGATTTAACAAGGTGAACAAACAGAAGACrCTGGAGAACrATTrCAA^ 
GAAGCAGGAGTAGATTTAATCCTGTATrrCTCCTTTCAGAAGATGTAAGTAGTGCCAATAGGCAGG 
AATTAAAAGAAATTAGATTTTTCrCAATTAGAATTrCCTAAAAAT^ 

TTAACTTGAGTGAGTTCCCTGTCCCTGGAGATAACCAAACTAAGACTGATGCCTACTTTTAAAAAG 
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AAGTCTCTGAAGATCTGTGTATTCAGTAGAGACTTGAATTAGGTACrrcrAAGGCTO 

ATTCTATGAAATTACGATTCCGGGAGAACCCmCCAAAAAAAAAAAAAAAAAA/^^ 

GCCGGACNCNCTANGNCNATTCCACNNOTGNGGCGTNATATGGTa^ACrrCGGACAA>^ 

ATGGANACTGTCTTGGGAATNGTNTCOrCAAATCCCCANATCCNACCGANCTAAGGNAAN 

GGCTAAGNAGNCTACTACATTATGGTTGGCTNTGC 

SEO ID NO: 208 ACTGNNTGCAACAACTCATGGANTTTGATGGGGAAGACCTGGTCTCAAATAC 
CAAGGGGGGTCTGGAGCTNCCTGTGGNTTAGGNGGNGAAAAAAAANATNGGGAGANAGCCNGGC 
CAAGTmAGAACCTCTGCAAGCTCATGAAAAAAATCTTATATAAAAAGGTGAGAAAGTGACA^^ 
CTNCATAAACTTGGGTCTTTCNCCTTGCTGCANTTTGACCACCACCT^ 
KTTGGAGGCNGGTTCATTGAAANCCCOINGCACTTTTGGGACAAOT 
TGGGCa^AAAAANCCCCCrGNAGAATCAACCCTTGCCCCCCCCTTm 
AAAGGTTTATGGNCCACAAAAAATNATNNNGCCATTTAANGACCTNGTGGGTC 
AACCNCCCTG>mTTTNTT>rrGGCITm 

NCCAOTGAAANAAACCTOGGT^^ANGTTTTTGAAAAAAAAN^^AAT^ 

CCATGCTrCATTTCCTNGAAGAAAACCX:CCNTITNGANGGCAAANAAGNATO^ 

AAAAANCCCTTTTTGTTTANG 

SEO ID NO- 209 ACGCGGGGGAAACGGAAGTGAGCGGCGGGGTCGACTGACGGTAACGGGGCA 
GAGAGGCTGrrCGCAGAGCTGCGGAAGATGAATGCCANAGGACTTGGATCTGAGCTAAAGGACA 
GTATTCCATTACTGACNTTTAACCAAGGTGGACCTTTGGAAAGTCATGATa^TCTCGG 
TCTTGGGTGAAAAATGACCITTGCCTAGCCTTCCCTTGGATTATAAAAAAAAATm 
CAAGANAAAATGAArrmCCCACTGIWAAACATTCANGGTCTATTrGCT^ 
GAATTGAAGGCNTGCNCAGGTrCACGGCTTCATTTTTTAAAGCTCAAAT^ 
GGNAATGATAAAACTTrrGGTTTAGGGAATTrTTANNGGCCTCCAAACGAATC^ 
CCTTTGGGGGGGGAAANAAACCTGGGTTACCNGNAAAANGGGCCGNNNAAGNAAACCCNNGGGC 

GGNTrnOTTTTAAAmATTTTTTNCCra^ 

ATGNGGNGGCTTTTNTGGGGGANCNANCCTNGG>n^AAAGTGGGGGGAAAAANGGGGGANAAA 
AGNTTNTCCCXjGGGGAAAArTTTT 

SEO ID NO* 210 ACTCGGGGGAAACGGAGGTGNNCNGGCGGGGTCNACTGACGGTNACGGGGC 
ATATAGGCTGTTCNNANAGCTGCGGATGATGAATGCCANNAGGACTTGGATCTGAGCTAAAGGAC 
AGTATNCCANTTACTGAACTTTCACCAAGTGGACCTTNTGAAAGCATGATCTTC^ 
rmCTTGTGGTNAAAAATGAACNTTTNGCTTANCATCCCCTAGAAm 
CCANCTCAACCAAGATAANATTGAA 

SEO ID NO* 2 11 ACAAAATGAAATTTAGGACCAGAGAAAATGCAAATTAAACTGAAAGTTTAA 
GACAGGGAGAAAGTTAGAATGCAAATGCATAGAACATAATATGTTCTACCCAGATATTATATTAA 
AATGGCTAATTTTATTGACTTTCCTGGTAGAAAAACAAAGGAGGTAAGCTATCTATC^ 
CTCAGCTAGTGCATGTGGAAATGTGTGTGGGCAGTTTGGGTGGTCACAATGACTGAATGCCTAGCT 
GGCATTAATGTCTGGAAGCCAGGGATTCCAAATGGCTATCCTGGACAGGGGACTGGGTTGAGGGG 
GCCAArGGGGAGGCACTCCCATATGCAAAGAATrGTrCTGCCCAAAATGCCATAACACCCTGCTG 
AGAAAGGCTGAGTGAAATGTTTGTCCTTAAGTAAAAAAAAAAAAAAAAAAAAAAAAANGG^ 

SEQ ID NO: 2 1 2 ACGCGGGCAGGGGTAGAATGGAAGGAGAGGCGGCTGGAGAGGACAGGTGGT 
GGAGGGCCTTGGCITCTGCTAAGTGAGATGGGAACCACTGGAGGGTTTGAACAGAGGAGTGCC^ 
GATTGATrrATATTTTGCAAGGGTCATTCTAGCTGCCATATTGTGAAAAACTTTAGTGGAC/^^ 
CAGAAGGAAGAGGGAAGACCTGTTAGGAAGCTACTGCAAGGTTCCAGGCTTGGGCCTGGGCCAC 
AGCAACAGCAGTGGTCAAATATCTAGATTTATmGAAAAGAGCC AATAGG ATTTGCTGAGAGm 
GAATGTGGAGTGTAAGAGAAGGAAGAGTTAATGATGACATTAAGGTITrrGGCCTGAATAGCAGG 
AAAGATGGAGTTACCAGTTACTGAAATAGGGAAGGATGGGCTGGGTAAGTAAGGAATTTGGTGCA 
AAGCAGCTGTCTGTGGTTGGAATGGGAGGTTCTGCTTGCAAATCAAAGTGGAGAGTTCTCTC^^ 
CAGGTCTGCANCAAAGCTCGAGACAGGGATCTGAATGCACITGGTITATTGTTGGGGGTGCTCTC^ 
NAAGGAACCTGTGAAAGCCTTTATCAGTCAmATTGCTGTGANAAGTTCTCTTGGAATO 
CCTCGCCCCGACCACCCTAA 

SEO ID NO: 213 acttttaggagagatgggatttcaccatgttggctaggatggtctcgatctct 

TAACCTCGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGTATTACAGGCATGTGCCACTGCTCT 
CGGCCAATTAATTTTTTTTTTATGQAQATATGGGGTCCCm 

GCCATCTTCCCACCTTGACCTCCTAAAATACTGAGATTACAGGTATGAGTCACTGTGTCT^ 

CATGGAAATTAAAOTACATGCTrCTGAATGACCAAAGAGTTATGAAGAANTTAAGAAGAAA^ 

AANCAAATTCTCAAGACATGAAAATAGAACNACACCATACCAAACCCCTGTAATACTGAAA^ 



27 



wo 02/29086 



PCTAJSOl/30732 



atgtaaggggggaagtcataagattaatggcctacttgaaattagaattttcaaataaaga^ 
tctttncrrgtcagnaactngattaaaaggagnaaccaaaatcaaarrgt^^ 
ttgngatcaaccgaattaaccaaattaagacitaaaaatcca™gaccatngaaccgaaa^^ 
ttttgtaaanataaaaaaagttgaaaaccntctgttcnctaaggaaaaaganggaag 

seq id no: 2 1 4 acttttttttttttttrtt^^ 

TATGGTATTGAAAGTATAAAAATTAAAGGCTTCrGAAAAACTCAAGAAGGGCAAAAGGCTATACA 

AGCAAGTAGTATACAGATTGmCCTTAAAAGATAAATTTrATATCCAGAANTTAAATCC^^ 

ATTTTTTTTmriTCACTTTCCCAAACTTGGGAGCOTGAATGCCT^ 

ACATTTT 

SEQ ID NO: 215 ACNCGGGGTCTTTCCCATCTTGCAAGATGGCGGGTGAAAAAGTTGAGAAGCC 
AGATACTTAAATAGAAGANACNCGNACCATAGNANGTTG>rmCITNGTANGCATGGG 
NTTAACCTTAAAT 

SEQ ID NO: 2 1 6 ACCTGTAGTCCCATCTACTAGGGTAGCTAAGGCAGGAGGATCGCTTGAGCCC 
ANAAGGTTCAGGCTGCAGTGAGCTATGATCATGCCGCTGTAATCCAGCCTANGTGACACAGTGAG 
ACCATGTCTCTAAAAAAACTAAAAAATATTTTTAAAAAATTTTAAATAGAC^^ 
ACCTTTAAAATATGCTATGGGGCCCGGATGCATTGGCrCATGCCTGTAATCCCANCACm 
GCCCANGTAAGAAGGATCGCTTGAGGCTAANAAAGTTCAAANAACAGCCTGAGTTGACAAGCAA 
TAGCTTATCTCTANAAAGCCTATTrAAAhrrNNAANAAATTNAN>^ 

TGGTCAAGAGCCTOTATTTOCATCTTNTCAAGAGGGCTGANGGGGGAGGATCACTTGNATCCAAG 

AAATCCAGCCrTTTAGGCTCATrGAACCTAGCrCACCCAACTGNACCCCAACTTGGGCAAACA^ 

GTAAGATCCCATTTCT 

SEQ ID NO: 2 1 7 ACGCGGGGAGTTCCAAGTAGGTAATCCTTCTGAGAAGTCCCACCTTTCTGAG 
CAGCTGTGTTTGAAGAAAGCTAGTGGGAAAAGTTCCAGGATTACATGTCAGGAAACTACAAGAGG 
TAANAAACATTTTGNTGATTTACCAGTGTTNTTAACNTTCCTNCTGGGCT^ 
GTNAAAAATG 

SEQ ID NO: 218 Acrn"iTiTnu'iuu'i"iui"i'iiTiTiGGGii"rin-iuiu'riu-rrii'ri'i"i"ri"i"rri'rri' 

TCNCCANCCCACCAGGGGTTTAAmnNTGAATCAAAANATCAGTTCAAANAGGACCCCTGNTTT 

GTCCTCATGCAGGGGTNAAAAT^m4CAAACCNCCCTGGGAATGTCCAAGCCCAAAAAANCCAGG 

GGCCANTCCCTGAGCAAGNGGAAAATTGGGTCCTGGAGTNOTAGGCTGCCTCCTCCirr^ 

CTCCTAANTITrATGANACTGNNGGGGNTrGGGGTAACAAACNGGNCAAAATAATTT^^ 

GGACCTCNTTCCTGNAAACTGGGGCTCAACTGGGANTTCTGGOTGAANTTGGCNNGNC 

TGGGTTT 

SEQ ID NO: 219 ACAGGTTTTGCCCAGTCTCCTATAGCATGGTATAGTGATAACTGATTTTTTAT 
AACAATGACTCANAGGCATTGAAGATCCATAACTATCTTCTGAATTATCACAGAAAGNAAGAAAG 
TTAGATNAGTTTATGTTAANTlSrrTTAATAAATCATNrrAACNATTGTTGTA^^ 

SEQ ID NO: 220 ACTTTTTTTTTTTTTTTT^^ 

TCAATTrrTATTrrGGTTTrCTTACAAAGGTTGACATm 

AAAAATTCAAATTTTTGGGGGAGCGAGGGGAAGGAKITAATGAAACTGTATTGCAC^ 
ATCAATCCTTC lU l U TCT r CTn'GCCCCACA>nrTTAAGCAAGTANATGTGCCNA AAAAAT ^ 
ATTCAACTTTTCAGTTAAAAAAAGAAGAANGAAGAATTNGCCAAAGANAAAGT^^ 
CTTTNTrrrrrAAATTTAAAATGAAGTTCATTTTAm 

ATTCCCIGGTCAAGACCNCCGATNTCCAAAGOTGCCATTTAAANGAAGGGCAGGOWGATNGCCT 
ATTTTITTrGNATTCAAGATTGCTTTCCCCATCATTTGTCCT^ 

TGTAAGGTGNACCCTCTGTTGNCCTCACNAACAATmrCNACANTCATTAGAACCCTGTAAAAATG 
ACACCCTTTrCAGGTTGCNAATCX:CNCrrCCATIvriTm 

SEQ ID NO: 221 acactctatgtctgcatttgattattaccttaaaacagacitattgggtcaca 

GCATATAAATGTTTCCATCCTTGAAGGACACTGTCAGGTGGCTTTTAAGATAAAGCTTC Am 
AAAAATTCTAAATGCATTTTTAAATGACTATGCAAGAAAGTTCATTAAAATATACT^ 

gaggccgaggcangccggatcaccctgaagncaggagtttcnagaccangcctgaacaacattg 
ggagaaacccccgtcttctactnaaaaatacaaaaattagnccgggccgtaagtggcanaatgcc 
cggaattccaacmctanggangcot^ancaaggaaaaactgcrrtgaaccct^ 
gttgcggatagcccntatnotancttt 
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SEQ ED NO: 222 ACGCGGGTAAGAATCOTGAAATAAATGTTTTTTTAAAAATCTCTGGGAAGA 
GTAAACTTGATTATTTATTTCTAAGTGAACTATAATTATCTTTAATTT^ 
CTAAGAAATTATCAAATGCAACTCAAATGCAATrTTTAAAAAGNAATCCANTTAAAGCm 
GAAACAANTTTAAGTTTTATGGnATGATTTGAATAAGCCNAAATTATCT^ 
AATTTTTCATGCrCTGCAAATTGNTTTCTGTrrCTATGTAAA^ 
ATTATA AAAGA TNCTGATTTGAnANCATTCCCTTTNCCTGAANCACCNTK 
TAAATCCTTTT 

SEQ ED NO: 223 ACGCGGGACnxnTGAAATGGTrATCTTTGTGGATGATTTTTT^^ 
AAOTACCTCOTGAATAACTNGTTrAAAGTGGTGGGOGNTTAAATm 
CAAAAATTTAAACCTACTCTTTTGATTrCNACTTTTCAOT 
GTAAGTTANTTTTAAA 

SEQ ID NO: 224 ACTATATTTGATTTTTAGTCrAGTAAAATGTTAGTAACTTGTTAAATGC^ 
AACAAGGGAAGGCAGGTATTGGGTAGAAATACATGTTTTGACTGTATCAGCCATGGGTGACTTCT 
GCAAATTTACTATCTTCCAAAGAACGATACACTGTCATTGTAGAGACTGCACAATCTGTCCAT^^ 
AGCACAACCTCITAATAGATTACCAGTTCTATCCCAATATACAATGATTTTGATGCTTCAAAA^ 
TTTTAAATACAGTATAAAATCACTTAACTAAATATTGCATATTAACTCTATGG^^ 
ATTAGAAGCCTCAGAGAAAGAGrrAAGAAGTATATTTTATGCAATACTATATGCCCTGCAATAGG 
TAAAATACrGAAGTTAAATCTTTACTCTGATCACATTAAATATCCCAATATTTAA^ 
CCATAAANGGAAGCTTCATATTCCACAGTTTATTGCTCCCATGAGGTATCTGCAAGCCTACAT/^^ 
ATAATGArrCCTTGGTTTGTTTCCAAACAAATGGATTTAATTrANTANTTCC^ 
NAACAACATGACACTGGNTT 

SEQ m NO: 225 GGTACTTTTTTTTITrTm^^ 

CCCCGCCATNTTATGGNGGAAGCAGTCGCTATGATGATTATAGCAGCTCACGTGATGGATATGGT 

GGAAGTCGAAACAGTTACTCAAGCAGCCNAAGTGATCTCTACTCAAGTTGNGACAGGGTTGGCAA 

ACAAAAAANAGGGCTTCCCCCTTCTGTAAAAAGGGGGT 

SEQ ID NO: 226 GGTACTCAGAGGAATTTTTTTTGTTTTGTTTTGTCT^ 

AGGATGAAAAAAATAAACAGAAAACTCAGCTCAGGCACAATTGTCACCAAGGAGTTAAAAGCrr 
mCTTCAATAGAGGAATTGTTCTGGGGGTCCTGGAGACTTACCATTGAGCCATGC^ 
GCACAGGAATAAGTAGACACTTTGAAAATGGATTTGAATGTTCTCATCCCITITGC^ 
TTGGCTCTCTTATGTCCTTGGOTGCTCCTCTATTCTACCTCTCTITCrC 

GAAGACATGTATCCATAAGAAGGAGTGCTCTTCATCAACTAATAGAGCACCTACCACAGTGTCAT 

ACCTGGTAGAGGTGAGCAATTCATATTCAAAGGTTGCAAAGTGTTTGTAATATATTCATGAGGCTG 

GAAGTAAGAAAGAATTAAAAATITGCCTAATTACAATGGAGAACCATTCTAGGNAGTGATCTTGG 

ACCCACATGAATAACTTTCTTGAAGGGCAACCCAAATCCATTITATTTCTGNCTGGC^ 

NTGGAAAGGTT 

SEQ ID NO: 227 ACAAACAACCACTTCTCAGTAGAAAGTTAAGAATAACATTTAAAAACATATT 
CATGTTTTAGAGAACGAATGTGCCATCGTTGTATATTAAATAAAAATAAAAGATTAACCAGCTATA 
AGAACACTACAATTACAACrAGAGTGGCAGTGTTTTTTAACTAATAAAAGTATACATGm 
TGCAGCATACCTGAAATCTTGATGTTTGTCAATACTTATGGTTGCTTCAAAGATAAATTTATGTGAT 
TATTTTTGAAAGATGTGTATTAATTTGAATAATACCCAGAAAAATTATAACTTAAA^ 
TTCAATATGAGAATCATTTATGTGTGTAAATACTCAACTAAGAAAGATCAAAAGTGTGGTATAATA 
rrACAAGAAAAAATATTCAAAATGGAAAGTCCATTTATGAATGTATTAATATTAAAATCC^^ 
TATGTTTirrTATAATGNCrACATTATAATGKrrACAAANGCCATAA^ 
CATNCTNCAGATATGGCCCATAAACTTCATTITCTANAAAAAAGAAGAAATGTTTTAT^ 
GAT 

SEQ ID NO; 228 GCGTGGTCCGCGGCCGAGGTACTAOTCTCAAGGAGGATTCATGGTCTGTCCT 
TTGCrCACTACAGATTTCTCCTCTTCTCTGGGAAAAAATGGTGAATGCTTCTGC^ 
ACTAGATCCCrrGATTATTACTACATTATTGGKrATTCCCCNCGTrGACACCTNOT^ 
ATCGGGNGATGTGTGTTTNATTAGGCA 

SEQ ID NO: 229 GCGAGCGGNCGCCCGGGCANGTACAGNGGCCCCCCGTGAAAGACAGAATTG 
TGGTGAATCCTGGTTGTCACGCCCTCCCAGTGTGCANATAAGGGCTGCTGCTTGNGACGACACCGT 
GTCGGGGGGTCCCGNGGAGCTTACTATCCTAAT 
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SEQ ID NO: 230 CNCNGGACACTGCGCCATTTCCTGTCCAAAGCTGGGCGAATCAGGGATNCCG 
GTTCACAATGGATGCTGATAAAGAGAAAGATTTGCACAAATTTCTTAAAAATGTGGATGAAATCT 
CCAATTTAATTCATGAGATGAATTCTNATGACCCAGTTGTGCAACATGAAAGCTGCCCTGGAGACA 
GAAANGAGACTACNNGTTATGGAGGAAGACCATGNAGGAGGATGAATGCATGACCACCTTGAAC 
AANACTATTATCAGTCCn'CACACAACTGNTCTNAANAGTGCAGNAATGAAATAAACTaSfGAGGC 
CTTCTTGGNATCTGTGGAGAAGGATGCNAAGGANCTAGCCAGGTGAAGAAGGGAAAACAAAGTC 
TTGGOvlGATGCCCTNNAmAAAAAGGGAATGAACCATTTGGTGAANGGGAATNATTNA^ 
CTATCCTGCGCTCACTGAGGGTTTNGAGANTCTNNACGACATGANNGGNCTGTN^ 
CNCNCNNAGGGTTAATTCATNrrCACTTCTTTCTTACANGTGGGACCGA>^ 

SEQ ID NO: 23 1 TAAAGATCCAGCGTTTCCCCTGGAACTTCCTCGTGCCTNTCTGTTCCAACCTT 
GCCGNTACCGGATCCTGTCCGCNTOrmCCTTNGGAACGNGGNGCNTTmATAh^ 

SEQ ID NO: 232 TONAGAGAGAAAGACAGGGCTCTGAAAATACTGCCATAGGCTCAAGTTCCAA 
AGTGCTGGAGTTACAAAAGTATAAAGGACAACGCAAAGGACTTTTTAGCCAAAGAAGAACCAGA 
AAGGAAGAGTCCTAGGGAATrGGAGACGTCGCGAAGGAATGTTGTAAGGAGAATTCAAGCCTAA 
AACGGTTCTGAGCACTGATTTCATCAAGCCGATGGAGGCAACAACATATTCCTTAGGGAAAATTA 
AGATGCACAGTATGCTTCAAGATGATGGCCAAGAACAACCAACTCAACGAGCCTTCAGGAATTCA 
TGATTTGAATATCATGTTCAAGGCTGCCTTCTAGATAAAGATCAAACCAAGAAAGTCCATGTGTCA 
CTGCTCCGTGAAGGCTCCGGGTGGAACTCCCTGCCCCACrCCACCTTTAAAATTTNCAGCCAAGTG 
TCTTCACCTTCTTCAGCGCCCCTGGTTACATGTCCATCTCTCTNTTTAGAACTACAGACATCAGC^ 
CTCGAGAAAAGAAGGTTGNTCTGGGAGAATCTAATGTGACaSfGTTATTGNAAATATATNTC 
TCAAGTTirrGGCTTNGGGCTGCTAAATATAACCTTTTOACTGGTT 

SEQ ID NO: 233 GGTACAACCTTCAAACATTCCAGTITrTATAAAAAAAGGGGCACACAATCGT 
GGTTTTGATCCCCTTTTGTTTTTGGACAAATGTTTCTACAAATACA 

TGCAAATTTACTTGCACAGTAATCTGCCAGCCCATTTACTCCACTTAATCCAGCTGAAOT 

GCAAACCAAATGTCCATGGTCATTAGCAATCATAGCAGGTAGAAAGGCTTTATAAGTC CATAA AT 

GTGCTTTGAAATTCACATCAAATGACrmCCATAAGCTCATCTGGACAGTCAAGG 

CTGTTACGArrCCGGCATTGTTGATTAGGATGGAAACATCGCCGACTTCTTTTT^ 

TACrCTATACACTCCTTCCTTTTGGCTGCAATCGCAGGTATAGGCGTGCACTCTTGGTGGCTCCAGC 

TrCCCGAACCATCTTACATGTTTCCTCATTCCCCTCCTTATTGATATCCCAGAGAACAAGAACAGAT 

CCAACCCGGCNAACTGCAAGGTTAAACCTTTCGAGTCACmCACCCCTGTGATGAAGACTATTT 

CCACAACGTCTTCCGNGmTTNGGAGTANGCAAAATATANCCTCN 

SEQ ID NO: 234 GGTACGCGGGGGATGTGTCAGCTCCGCAGGGGTTTGGGGAAACGGCCGCTGA 
GTGAGGCATCGGCTGTGTTTCTCACCGCGGTCTTTTCCTCCCACTCTTGGCTGGTTGGACCCCACTA 
TGGAAAAGTTGGCCCCTGAGCCAGAGCTCCAGCAGCCrTGTTAGGGCGTGGCCTGAGGCTTGGAT 
AAGTGGGATGTAAAACGAAGATCAGGAGCAGATTTGAAGAATTACAAAGTGAATTGGTGCCAGTC 
AGCATGTCAGAGACAGACCACATAAGCCTCTACTTCCTCTGATAAAAATGTTGGGAAAACACCTG 
AATTAAAGGAAGACTCATGCAACTTGGTTTCTGGCAATGAAAGCCACAAATTAAAAAATGAGT^^ 
CAACTATTGNCATTAAACACTGATNAAACTTTATTGTCAACCTANTGACATTATAATCGAAT^^ 
CCCAAGAAAATTAT^m'CCAATCTGGGGGGNGGGANGGTT^^TGGGCCAA^ 
AAAANTTrrGAACAAATACTTANTTTTCTNAGGGGAAATTTO 
CGACCGAAAGTCCCCAATTTGGGGAATAAGGClll'lTT'lCCNANTANTNTGAAA 

SEQ ID NO: 235 GGTACTCTGGACAAGGACAATCAGCATCTTCTCCCAAGGCAGCTCAGGGGCT 
GGCATGAGGCAAATACAACAGACTATCCCGAACTCCTTATTAGACCACAAAATACAAACAGCArr 
TTCTTTTTTTCTCTTTTTTTTTm 

GAGAGACTCCGTCTCAAAAAACAAAAACAAACAAAAAAACAAAACAAAAACAAAACCTGT^ 
TGGTAACAAATATTGAATCTTCTTTATTTTTAGTATCCACTCTCTTCCCA^^ 
AGAGCACrrrCCACGAATTCTATTGAGATAAAGGNGGAGGAAAATGCAACCCCCGCGTACCTGCC 
GGGCGGNCGCTCGA 

SEQ ID NO: 236 GGTAC n i Tn "i l " l - ilTi - i 11 ITI 11 1 11 1 1 TAAGACTA NGGT AACTAGAGGTGT 
GGGTGAGATGAAAATINGGTAATAGGGACAAATGAAATAAGCCAAACTGTTTTGCCACNAAGACT 
TCCANTCTTNAGGrrANTTCTGGTNTGTTAAAGGNGGTTNTGCAGAGCTCAGNTCTGGGAATG^ 
CCCrmCAACTTCrTGATAAAGGCGTGACITCCAAACTriTCTG^^ 

CCATTTTGAGACAAAAAGTAGAGGCTCTGNCAAGNCAATNCTGCATTGCATGCTTGGNCCACTGN 
ATAANCCACGCCTGAGATACAAANGATGCACTACNCTTGACCCGCTTTATGTNCTCTTCCT^ 
CTTNTITNTNATNACmATTAGGGTAAAACACCNCATACAGGCTTT^ 
TGGGGGTTGGGTAAAATTTTTGCCCCCATAAACCAAACTTTGTGGCTATGCThrm 
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ACCmGGC^^^^ATTCCAAA^ITTTAAAGGNTNGGGGAAACm 
AATTTCANTTAANATTTCrOAACCTTAAAAGGAANATGGGCCNCAAC^^ 

SEQ ID NO: 237 GGTACAAGTAAAGCCTGTGGTGGGATCAAGGAGCTATCAATATCAAGTTCAA 
GGATTTCCGTCTTACriTITGCAGTCTGATATGTCACCACCAAATCAAAA^ 
TTCTTAAAGAGTGTGAGGTCTCCAGTGATGATGTTAATAAATTTTTAACATGGGTAA^ 
CAAACTACAAAAACCTAAACTTTGAAAATCTTAGGGAAACACTAAGAACTTTCCACA^ 
GGAAGGAAAGATCAAAAGCAGCCTACACAGAATGGNCAGGAAGAGTTGNTCCTAAACAATGAGA 
TGAGTCTTCCTCTGGAAAACACAAGTAAGGNGTGATTTrrATCATCAAGTTrCAGTGNA^ 
TACTTAATAACTNCAGGGNTTAA^^TGGTACCACTNNCNTNGTATTCT^^ 
GGAAAAATGNGNrrATTGNTGCTTTTTGGAGGATGCCCITGGCATm 
OTGAANGGGTTTATNANCNGACTTTTAAAACNTTCCANTCCriTCCCC^ 
CCraTTTACCTTTGGNTNCCACGAATNCCTGATITTTTATO 

SEQ ID NO: 238 CGAGGTACCCTGAGCCAGGCGGGGACAAAAACTGACAAACTGCAGGATGTG 
GCCATACTGGGAGGAAAAGTCTCCACAAATGGCCTTGCCCCCCCGACACCCCCGCACACACACAC 
AAAACCCCrGCAGACCTACTTATACCCCTTAGCCTGTAAGCCCGGTGNCTGCCrCCTCAGAm 
GTAGAGCAGCCCAGCAGGTTAATAAATTTGCTTGCCGACTTTGGGTCTTCrrGTCCTTTCTC^^ 
TAACCTTATAAGCCCATGTGCATAACTCCCnrCTAGGTCAGCGGTCCCCCAACCTATTTGGCATNAN 
GGACCANGmrCGTGGAANGTAAriTrTCCACAAAAGGGTTGGCCNGGGGTTGGAAGGNGNGTG^ 
CCTTNAAAATTTTGGGATArrGGNGGAAAAANGAANTNTTTGCCCNAC^JAANG^ 
GGTTTmTTGGATCTGNTGGGTGNGGNCCTNTNTGNTNACrAC>OTCANA/^ 
AAATTTCTTNAANAANANAAAGGNNCTCNAArrGGGGCNATGGTTGGAATGGAAGNGNT^^ 
GGNGGG>mmOTACNTCTAAANTNGGNTTNCTTriTNATANN>mGAC^ 
TANNATATTT 

SEQ ID NO: 239 ACTTTTrTTTTTITTTT^^ 

CAGGCTGGAGTGCGGTGGCACAATTGCAGCTTCCACCTCCAGGGTTCAAGCAA TTCTC CTGCCTCA 

GTCTCCTGAGTAACTGGGATTACAGTCATGGGCCACCATGCCCGGTTACAAGTCrmATTANACT 

TGTATTGTGCAAATATTTn:CTTTTAGTCTTTGGCTTACCT^ 

GAGAAGAAAGTTTTAATCTACTAAAGTTTATTAGTTTTrCCTTTC^ 

TATCAAAAATCAAATCACCAAACCCAAGGGCATGTAGATTTTCTCCTGGGNTTTCnTCT 

TTATAAGTTTGCTTITACATTTAAAATrATGAACCAGCrrGAGTTAAT^^ 

GTCTGGGTCTGGGGTCAAATTTTGGACNTTTTATTGGTCCNACCCATGTTAOTAAGGGGTAA^ 

CTTAAGGATCGCnTTATNTCCrGGGAGAAATATCCTTTTTTTTNTANGGA^ 

ATAAmTCTTACTTGAAAGCTNNTTAANCCNAAATAAAAAAGGT^ 

SEQ ID NO: 240 CGAGGTACAGCAACATGGCGGCGCCCATGGACTCTTAGAAAAGGAGAAAGC 
TTTTTCTCTGTGGACTGGAAGGGGCATTTTTCATGATCACTATTTAGATGGGTGCT GT^ 
GAGAGTCTGGGAAGGCGOCGTCCGCTTTrCTGACAAGGGAAGAGGCrACTTTGTCCnT^ 
TCAATGACTTCCTGACTTGGAGGATGTGGACCTAGTGGCTAGACCCAAGGACCAAAG CAAGA AGT 
CGTGGGGGGCCCAGGAAGACAGGAGGATCACATTGGGATTCCAGACATAAGATCAGGTTTTAACC 
CCCTTTGGCCAAATTTTGGCTGAAAATGTTGAATTATCAACTCTGAAATTAAAAAGAAAGm 
TTAAAACATTGCAATTrrCCTTANAATTTCTGTATATATTAACATCATGAATC 
ATGTGCATGTCAAGGTTTTGTACCTGCCCGGCNGGCGCTCGAAAAGGCGAATTTCA 

SEQ ID NO: 241 CGAGGTACCTGGGGGCCAGAACGTAAGTTTTGACTCCTCTGCTAGGAGTGAG 
CTCAAAAATGGATATGATTCAAATACATAGATGCCTGTGGCCAATATTCCGGATCTTCACAGTCCT 
CGGAATGCCCTGGCAGGGCTAAACTCCTTTrAGTCCAGTCCTCCTCAAGCTCAGACCTGCAAACT^ 
TTCATTCTTACTGTGTATTGAGGGTTCCCTGAACTGAAGGAAGAAAGTGTCTGGAGGGTGGGAGA 
GACCGTGTGTGGCAGAGTTAGAAACATCAGTCTATCTCAGGGTCCTAGACAAGTGATCTCCACAT 
AATCAGCAACAGATGGTCAGGCAGCACCTCCAAATATTTGTATCCTTATCAATCATATTTATGTGT 
GACTGCCAATCCATATGTCATGTGTATTAAGCrrrATTTTAGGTTTT^ 

SEQ ID NO: 242 ACTTTTTm' lTri r il UT T i iTlU T lUl-rNGNT ni T ^ 

CAAACTGCAAATTGCCCCAAKrmATTTGTAGTCCNTACAAANGGGAAAAAAANTTAAGGT^^ 

TAACNCCACCTACTTGGGGANATGGGGAAATGGNACTGTCCCCCTCACCATCANCTAAAACNT^^ 

TNGGTCAGCAGGGACTTGNATACATACCAACTGACTGTCCCAANAGGANCTCAGTCT 

SEQ ID NO: 243 ACTAGCATTTTCAATTACCAAAAAACTATrAGCAGTAACATTGCCATTGGTAG 
TATTTCAGTCAGTCCACTATCACTAAGCTAATGGGAAATCAATAGTAATAAAGTGGTAGATTATAA 
TGTAGCCATGAAAAATGTTGTITAAATCAAGTTCCACAGTTCTTTCAGAAGAGAGGGAGATTGm 
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TGTATGTTCrrGATCmCTGAAGTTAAAGTTCATGTATTTrTAGGAT^ 

ACmTAAAAAGGTGCCAAACrGTTACACACTCATTAAGAAAACACTGCTAGATGAATTATT^^ 
CCATGTGAATGCAGNTrATTANCCAANCAATGAAmCWGCATAANCACGTGTTATCm 

CTGAAGTAAA 

SEO ID NO- 244 CGTACATOAAANACACGTCCACATCACANTTQCCCCCAAACTGCCTGTGCTC 
CTCGATGGTGTCTCTCCCTNCATAAAACGCATGCTTATTGACCTTGGTm 
TCNGTGANGATGAT 

SEO ID NO- 245 acagtcacactgatgaaagacagggaggccatgtggaaccccaggggccac 

AGAGCCACAGTGAAGCACCAGGCTAAGGCCATGAGTTAACACCACAAACCAGAAAGGC^^ 

CACrrGGGCnTrrGGTTAAGTAGGTTGGGCTrGCTCATTAAGGACCTTTrC^ 

ATAATITArrCATITCACAAACACTTCAGTAGACriTCACAAACACTAATATAG^^ 

GTAACrCCTTTAATCCrAACAACTCTATGAGGGAAGGGAACTArrA^^^ 

GGAACCAGGCNCAGGGGAGATTAACCATTTGCCCAAGATCAGGATGTCTGAAATGTrGATCT^^ 

TCAACNCITCAAGOAATAATGACCCTTTGAAAGATGCCCCTTmCOT^ 

CCTAGGAATTCITAGGAGAGAATTTGTCCAGAAAATCTAGGACCTGCTGATAAAAAGGAAAAGTA 
CACAGATTNTGNCCCTTGCCATACACTGC 

SEO ID NO: 246 ACTTTNTTITITITrrr^^ 

TOAAACTCNTGCCCTGGAACCCCCGCCTCOWGAGGGCCCJ^AGGGCAGGCNAACCGGCCT^ 
ACANTGGCTCCCCCGCTTACCTNGGCCGANACCANNCTAANGGCGAhrrrCCTCACACTGGCGGG^ 

GGTANCTANTG 

SEO ID NO' 247 gtacgcggggttgaaaaatggcgactgtggcagagttgaaggctgttttaaa 

GGACACCTTGGAAAAAAAGGGGGTArrAGGGCATITAAAAGCAAGGATCCGAGCTGAA^^ 

ATGnrrTA a ATGATNACCGATAACCC>nrANCCCGTGGNCNCCATGGTAGG CACGGCAACT ACCOT 

CAAAAGTTGATAGGGCAAAC>m'CAANTGGGTCGTCCCCNCCCCCGCGljJ£i 

TITrrAGGGCCrrrTCAATNTTTlArmAAATGCCNTGA^ 

AGCAGCCACATCCNTGGNCTGCNn^ATNNTATTTTAAAANCATNGATCNGCT^ 

TTCCAACTTTATCNTCTTNAACATACCCANTGTTTTTO 

CrrNTmAAAANACCCCCAAACTACCCGTTTNCTNNAATGCm 

SEO ID NO- 248 acgcaggggaatggaatggaatggaatgcaatggaatggattcatccggaa 

TGGAATCGAATGGAATGGAATGCAAAGCAATGGAATCAACTCGATTGCAATGGAATGG^^^ 

atggaaaggaatacattggaatcaacccgagtggaatggaatggaaaggactggaatggagtgg 

AATGGAATGGAATGCAATGGAATGGAATGGAATGGAATCAACTTGATTGG^^^^ 

gaatggaatggaatggaatggaatggaatcaacccgactgcagggg/^tgg^^^^ 

GCAATGGAATGGATTCAACTTGAATGGAATGGAAAGAATGGAATCAACACGAGTGGAATGGC^^^ 

GGATTGGAATGGAATX}GAATGGAATCAACCCGAATACAGGGGAATOTAA^^ 

TGGAATGGAATCATCCGTAATGGAATGGAAAGGAATGGAATGGAATGGAATGGAATGGAA^^^^ 

ATGGAATGGAATAGAATCAACTCGATTGCAATCGAATGGAATGGAATGGAATT^ 

ATGGAATGGAATGGAATGGACCCGGACGGAATGGATGGAATGGAATGGAATGGAATGGACCGAA 

AGGAATGGAATGGAATGGAATGGAA 
SEO ID NO- 249 aGGCGGGGGACGCGCGTCTGTGGAGAAGCGGCTTGGTCGGGGGTGGTCTCGT 
GGGGTOT^ 

TCACCATTGCAGCCTGTAAATGAAAATATGCAAGTCAGCANAAAAAAAAAAAAAAAAAAAAAAN 
GTT 

SEO ID NO: 250 ACTCAGGGGAGGCCAGGANGGCCTTGANCTTGGGCCGGGCACTGAGGOTCC 
CCACATATGCrGAGAGCAGGGGGAACGCATCCAGNCTGCC^^ 
ANCNANTNCAGCAGGrrGTATTCAGCATAAGGATATNTGGTTTCCACNA^ 
CCTGNTCTGGGACACAGNGGTCTAAAAAGGCTTAATATTNCCCGGACAGGGNCCTTCACATANTC 

ATTNCTTTGCCCACCTCTTTOTTT 

SEO ID NO* 25 1 GAAACANATTAACCACATNCTCCTTCrrrGGGCTAGCAAGGTCCAGGGCT^^ 
CTGGAGTCTGNCTCTACCATCAGGATANAGNAATCTTCCTGCTNGGATATAAAGa^^ 
S?STGTrrrCATACrrmAGAACCATGTGGCCCGNTACC^^ 

tgcttgttggaatgnanaaaagtgctgggcatgaatggta^ 

AGCCAATTATGAGGTNTGTNATTGCCCATTGNAAGATGCTOANTGGNTAAACAANGTCTGGCCTT 
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NACCTTNCAAGANGGCNTANCCCCATTGNAAGAGCTCCCANTGCCAAAATATA 

SEQ ID NO: 252 aCTTTTTTTTTTNTTTTI^^ 

TTTTNCGNANANACAGGGTTTCNCCKrGCTGCCCAGGCTGGTCTCAAAGTCCT^ 

CCACCCACCTTGGCCTCCCAAANNGCTGGGATTACNGGCATGAGCCACCNCTCCTGACCANAGTN 

ACTTTTGNAANGGGCACTAATNCCACACNTGAGGCTCCACCTTNAO^ 

GCTNAACNTCCTGCCCavrrCCCCAATGAGNCCAGTGGCTTAACCTTTTCTNTNATTNTT^^ 

ATNATTGCTCATTNGGGNAAAACCTCCAAATGCTCCGGTNC 

SEQ ID NO: 253 ACCCAACAGGAGGTTTTTTCAACTmGCTCTTCTCCOT 
CATCCCCAGAGCAAATTCAATGTAANTATACANTTTCTTCmCTTm 
ACTTACATGAGGTCCTrGAAAATTGGTCTAATATTTCGCTTNTAAAANCTAGATACA^ 
TGTGGGTGGCTCACGCTTTGNAATCACAGCACTTTTGGGAGGCCAANACTGGTGGATCACN 
CAGGAGATCGANACCATNCTGCCTAATACCGGNGAAANACTGTCTATACTAATAATTCAAAAATA 
TTANC 

SEQ ID NO: 254 AGCGGCGAGGTACTi-rri-riUH"14"l-l-ll lTITT TTTTTm 
AGGAGCATTAACCTTGACTATGTCTTTANCTNCAGCCACCTTTTTAAGAOT 
NGGGGGAGGGCTANTCANGNAACGAAACTGTAAGCCGGACNATNTGTGAGGAGGGGAGGTTAT 

SEQ ED NO: 255 ACTTACATGTGTGAACACATATAAAGTGTC^GGTTTACAGACCCTGGCTCAA 
GGACAGTCTANGATGGGAAAGGAGGTANGGCGAGAAGAATCACATATTANACTCCCNGGTGCTT 
NAGCCTCACCCTATNCAAGGGACATGACNTATGGGGT^n^NNTTANTCCATNC^^ 
TCANGACTTGAAGTTCCTAATTNGTATGNnsfGGAACCNCAANANCACGTTAACT 
GNT 

SEQ ID NO: 256 acgcggggaggccccagccatctcaggctacnctatcccaggatcagcatgg 

CCGCCTCCAGTGGATAATCNCCCTGGCCTTGGCTGNCCTCCTTGTTTGTGGACANGGNAGNGCCTT 
AGGCNTCANGAAAGCTCCCTTTOTCATGAATGCCCANTNNGTGANC^^ 

SEQ ID NO: 257 CGCGGGTTTGAAAGTCTTTGGCAATGANATTAAACTANAGAAACCAAAAGGA 
ANAGACAGTNANANAGANCGAGATGCGANAACACTTTTGGCTAAAAATCrCCCTTTCAAAGTCAC 
TCGCGATGAATCGAAAGAAGTGTrTGAAGATGCTGCGGAGATCANATTAGCANNAAGGATGGNA 
AAAGTNATAGGGATTGCAT 

SEQ ID NO: 258 TCGCGGCGAGGTACACCAAGCTTCATTITrGTTTTTTGCNGGCTGAAGTCATG 
GCATGCAATTTTTGCATTTACGArrCTCITGGGCATGCCCTGTGA 
CNTAGCCA>rnGTTGATCOTACTNTCCANATTGACTTCTTC(>ITGGNCTTTCC^ 
GTTGACTG 

SEQ ID NO: 259 ACGCC^^^TCCGGCCAACANATGATATGCAAACCATTGTTGCTGTGGCCGAA 
CAAANTAACAAAACCTCCTACCGACACTGCAACA^TGAAAC^^^TGGANGTGCCNATAAG^ 
TTCTGACCTCACCTANT^rmTGCAANAGTGCANAANCCCCCACAATNTCTNTGCT^ 
TGGAANATGGGGAANTGGTNGNCCCTGArrCCACANCCACCCATNTOATGTCCTGGGTTTOT 
CNAAGGATCAAGCTNNGNCTTAGNGGCCTNTTAANCAGCCTGTTTCAAANTGCCATA^ 
NTGCATATGGGGAACATTGGGACTCTNAG 

SEQ ID NO: 260 ACTTirrTTTTTITTTTr^^ 

tttatttatcaanaggaactatttnttanccx:acatattcatgtgtcatagttcagg ^ 

cagggacaaacttctaggnaattcaacccgaaaaaattntttatnttccaana™ 

tgaaanatccagccitcctnatntcctnaaaatctttnatgacotcggtan^ 

SEQ ID NO: 261 CGCGGCGAGGTAcmnTrri-iriiirnTrrr riii J iroANAGAAGGANccA 

ATGCAATGGCTANTCTTTACTGGAAANAAGAAACmACAGCACATTTTGAGTAA™ 

GGTGCTCA^r^GTGCCAGCTGNTGGGGCACCATCTTGTAATGGGCTCCACANCAGGGGGCATCNCT 

GGGCCTCGCCTTTGNGCACCAAAACCATACAACGCTGGTTTTGTCCTCTTANAGAT 

SEQ ID NO: 262 ACCCTGGCATTGCTGACAGGATGCAGAAGGAGATCACAGCCCTG GTCCCC AG 
CACCATGAAGATCAAGATTATTGCTCCCCCAGAGCGGANGTACTTTA'rrri l 1 VI ITTAi 1 1 I INCA 
AGGGTATAANCATTTAATTTNAATTGANGGTAGNACCAATNCAAAOTANGTTTGGN 
GTrANAGACAATGAANNNTCCCNCCTTATGrn-AAAAATTTTAAAA 
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GATGGGTTTTArTAAAAGNGGAATTTTNCrTGACACCNTCT^ 

NGATTACCCNATACCTNTT^^'AAAT^WTCTNmGTTTTAAA^ 

AAATACG 

SEQ ID NO : 263 ACNCGGGATGTGGAACTATCTTTGCATGTTCTGTCCAGCAAGTTCTCTCTCCA 
TGAAGACCATAOIACCTAANGTGACCCCATCTCAGTCTGTCTGTTGGTAGACCTCCTGCNATC^^ 
TGAAGGCCTGAATNANACAAAAAACTGGCTTCCTCCCATNCAANAANTGAANATTTGACAT^ 

ccaagactggaactgagtgagctgcattaggaagctgtatgttttgnggaatttaaarit^ 
naaattgaaangtnnctagantnttaggacgttataccqtntgaac 

SEQ ID NO: 264 acaacccttctcaccctgtgggttggagccgagtcaggccactatggggaag 

CAGTTGCCCCACAAAATGTGGGTTTGCTGACCrATTCTAACTGTTGAATATGCTGCCCATTTC 

AATGAAAAAATGACITTGGGGAACCAAACCTOGCCCTTTGCCCAACTTG^ 

C^mmNNT^^NTTTTG^^mGTNGTACNACCA™ 

taatgcnatgngaacnatnaantggtnagaaaaccaaactgnttaaaacccctttct 

ATTTTATTNTNATTTGTAAAAACACATATTAACNAATTCNTTTGCCTO 

ANTITrTTmCTrATTAANAAAATNAAATCCCrrACACCC^ 

CCCTTTNGGCOTTCCNAATTGAAANGGGGNTTCTTCCTT^ 

AANGCCm'CATCCNGAAAGGGGCNTNGGGCTGTTAAAACNNGGCTGGCC^^ 

CCCAA 

SEQ ID NO: 265 ACTGTTAAAATGTITCCATTGTTTATTCATCCACTGGC^TTTAGGTATACT^ 
GGTCATGAATGAGOTITATCATAAAGTGAAGGCTAATTmGTATTACG TATCA GGGGTO 
CACTGTCTTCACTCCATACCTACTCCCCCATTGGCAGTTTTCCATGCAATG TTTTC rC 

GACCACTGTACITI^sr^TTT^^^TTr^^ 

AAAATAATNAAAANCCATCTCAAATTATTATNCACNTACAAAAATAGGGTACCTNGGNCGGNACC 
CCCXTNAGGGGNAAATTCCNCCA 

SEQ ID NO: 266 ACGAAGAAGTCCTGGCAAAAATCAGCTCCACATCCACAGATCGGCTCACAGT 

tctcaagaccaag(xacagtctatacaaagggatatcattactgtcrrgcaaccgaccctracac^ 
tggcccagcagctgactcatatagagctggagaggctcaattatattgggccagaagaatttgtt 
caggcgtrcgtgcagaangaccctttggataatgacaagagttgctacagtgaacggaagaaaac 

ACCGAAACTTAGAAGCTTACGTTGGAATGGTITAATCGCCTCAGCTCTTGGTTGCTACAGA^ 

GTATGCCTGTTAAGAAAAAACACCGAGCAAGAATGATTGAGTATTTTCATTGACXjTAGCTCGGGA 

GTGGTTTTACATTGGCAACTNCAACTCCNTGATGGCGATATCTCTGGTATGAATATG 

NTCNACTAAAAAAAACTTGGGCCNAAAGTGAAGACTGNAAAAATTTGACATTCCT^ 

ATGGACCCTNCAANCAATmCTTTAATTTTCGAAACACTCTTCGTGGGGNAAC^ 

TTACCn3GTCATANTAGTAGAAAAAANGATTGGNNATNCCN'i"J"l'CJ"lNAATTOTT^ 

TATTTNTTTCCTNAAT 

SEQ ID NO: 267 GTACGCGGGGGGGATACGCCGCNGCGCACGGCANTTAGTGGGTAGGCCTGA 
ATAGCCGAGGAAAACTGAGCCGTGGGCCTCANAAAGAAGTTAANGCACCCGCAAGCCGGGCAAC 
TGCCCTCCTTCCGCGCCGGCXjGAGCGATTNAAGTGAAGAAACAATGGCCAGCAATCACAAATCTT 
CAGCTCTCGCCCTGTTTCAAGAGGTGGAGTTGGGTTAACANGAAGGCCTCCTTCTGGGATACAA 
CCTATCAAGGAAAATATT(>[AGTTGGCAACTGCNATTGCCACCTTGGGACAGGCAAGACCAAGGT 
TCTCNTGGGTTNGNCCCATAGGGGACTGGTGGGANTTCTGCCTTCTNAAAAT^^ 
NTCGNCCTGAAACACAACAAhn^fGTTTNTNTGGAATTGAAANCTTGGGACNATAAGGTCCCCATC^ 
GGCANAATTTTAGANAAATCmACTTATCTNGGGGCTTTTTAA^ 

SEQ ID NO: 268 ACGCGGGGCTATTGCCTAAGGACTGCrrCCCCTCTrCAACAGTGAAGCTGCA 
GGCCAACCACATGGAAAGGAAAAAGAGACATGAAGGGGAAGCAAATGTGTGTTCCAATACTTGA 
AGCACAGTTTATGCTTTTCACTrrGGCATATATCCTATCTGCCAAACCCTATGTATATGCCC^^ 
TAAATTTCAGGGAAGCTGCAAATGTGTTCTTTAACTTGACTAACATGTACATTTGTC 
AATTAGCCAGGCGTGGTGGCGGGCATCTGTAGTCCCAGCTACTCGGGAGGCCGAGGCAGGAGAAT 
GCGTGAACTCGGAAGGTGGAGCTrACAGTAAGCCGAGATCACGCCACTGCTCTTTAACCTGGGTG 
ACAGAGCAAGAATCCATCTCAAAAAAAAAAANGhnSfAAAANANNCANTCCTAAAAAAm 
TGGCATTGTTTTACAAAATTrrCTAAhfNGGANGAGNGAATAT™ 
TTTATTAANTGANGAAGGCAAAGNAACTGATGCNATATTITAAAAAAAAAANCAGA 

SEQ ED NO: 269 ACl l "l UlU ' l l lUllU " riU ' l - i " i - ri GGGATGTTNGGNGGTmAACm GTTAT GTC 
CATAAGGTCCTTCCAGAATCTACATGACCTGTCrCCrrACTGATCATTTCTTGTCACT^ 
TATACTCTATCTAGGCCATACTGACCTTCTTGCAACTTCCCAAATAAGCCTAGTGTGTTTTG 
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AGGACrTTACTTTNCAGCTCCCCTTGCTGGAATGCITCCCN^^ 
GAANAGTACGGNACCACAGTTTATTACCAATTCTGACGGAAGTCCNCCTTC 

SEO ID NO- 270 AC i 'r ri"il ' n 1 1 1 U l L N ' l - l ' i -i" Il - ilOT CCCAAAATGTGTTTNTTGANATGGTTT 
CCCACTCATCTTGAO^CANANTGCrmAGTGCTTGCriTCCTCCT^ 
CCTTGCTrrrCCNCCTGTANGCTGNCNNANGACANTGGANCANCCAACNCACA^ 
GTGCATGGNTAAAAACCNGGGTGATTrrATACa^TCCCTGGGCATTTCACAT 

SEO ID NO- 271 ACTTGCACAGGAAGTGTTGGCCGCrrGTTGCATTCCGTTGCTGCTCCAAATTA 
AAAAAGTTGGTTATTTGGGACCTCAATCTCAACACAAGTGCCTmGTCCCACC^^ 
CGITCCAAGAAAAANGGGTTGTCCCGTGTGCTAAANAAAACTTGGGTGTGACTGGTTCGATAA^ 
TCCAAAGCrrGTTCCCATCATACCCGTAATGGCrrrCTTITrrCC^ 
CATTTCAACTTCANGANAATGGAAAAANGGCCTCTTTTTTGCCCCTTTAC^ 
TTATTCTGGGGGNGTmXJGGGGAAACNTTGTTAAAATrmGTNT^JAA^ 
NATmAANTAANNTGGNTNCCCNNATATAOTAAAAAGNGGGTNAAAGA^ 
TTNTrnOTCTTAANAAGGGGNNTTAAAATNATTCC^ 
CCTNAAAGC 

SEO ID NO- 272 acatttggcatgatctgggcctatgcggtcttacaatccctgtataaaactag 

ACAATGAAAAACAGAAAACAAAACAAACAAACAAAAAAACAAGAACGAAGCACCTACCACATG 

CCAGCTACTGAGGCTATGAAGGTATTCTCCCGCCrrAGAAAGCCCAGGATTAATGCAGGATTGCG 

ACATrTAAACAGAACATTTCCATACAGCATGAGTATAAATGACriTCCCAAOTrrACACTGAGAGT 

AACTGACACAGCAACCCCAGCAAAGTCTGAGCTGAGTCCTGAATAAITGTATAAAAAGGGGAGAG 

AAACAGAGTGAAGAAAGGGTTTCCCAGACTCTGTCCCAGGAAAGAAAATGAGCTCGTGGAGAGG 

AATAGACTTTCTCTATGAAAACAGAGGGAACAAAGAGGAAGATGTCTGGGAACCGAGGAGTAAT 

AGAGACCTGAGTrrACATCACTACTCTGCCCTCCCTANGTACAAAAGTGGATACAATACAATGGA 

AAAATGCATAGAAAAGCAGaAAAGATTTTGTCAACTAAAAAAACAAAArrATGTTGOT^ 

CAACTGGGATGATGCTTCGCACAANAGCTTGACAATCAAAGAAAAAAGCAATACTTAATATTC^ 

GCCAGATGTGA 

SEO ID NO- 273 ACATGAAGTCCTATACGGTATAATGAGGCAATCAGAGATATAAGGATrGGAA 
AGGAAGAAACAAAATTGTGATTATCTACAGATTATTTGTAGATTATGTGATTTCT^ 
AAAGAGTAAACCTTCACATTArrAGACACAACATGGGAATTTAGCAAGGTAACTGCATATAAGAT 
TAATATACAAGATTCATTGTGTTTTCTCmAAATATAGGTmTAATGGOT^ 
GAAACACACCATTTTAAATTCTCTTTCCAATCCCGATTAANCT^ 
TTGCAATTTAATATNTTCTrrCTTITGTCTGGCI^^ 

AACITCANCATGTGATATGTTGTNTGATATCATNmTAAAGACAAAAAGTTNA^ 

SEO ID NO- 274 ACCTTAGTGAGGCTCAAAAGGATTCTTTTGAGTCTATm 
lu^ATTCACCACrCGGGTGAGAAAGGTGACATTGTAGTCnTrCTGGCCTGTGAA^^ 
AAAGTCTGTGAAACTGTCTATCAAGGATCTAACCTAAACCCAGATCTTGGAGAACTGGTGGTTGTT 
CCmGTNTCCAAAAAAAAAAAAAAAAAAAANAANNGTACTGCGGGGAAACGGAAGTGAG 
GGGGTCNACCTGACGGTANO^GGGCAAANAGGCTGTTNCGCAAANCTGCGGAAGATAAATGCXrN 
NAGGACTTGGATCTGAGCTAAAGGCAATTTNCCAmACTGAACTTTCATCAAGCG^^ 
AAAGTNKmAKTCrrGTTCGGAAATGGATmrCTTGNTN^ 
CCTTNATmCCANNAAAAAANATTTNCCN(>fNATCCAANANTCAAT^^ 
AACATTCNGGGNNTTTTTTGCrCCAN^rAAAATNACNTATC^ 

SEO ID NO- 275 acgcgngggataactaccgatttgcacatacgaatgttgagtctctggtcga 

ACGAGTATGATGATAATGGACAGGGNATCNTCmATTrCCGGC>miNACATGTGACTANCANG^ 

tgaggacnagactggggcttat 

SEO ID NO* 276 ACTTTNTTTrnT ri -r i -i-i 1 U 1 1 U 1 1 INGGCCATTCAArrAATAnTATNGAN 
AGGCm-ANCGNGCNCATOTCACTCAAGGNAAACAGTTGAAAANCTTArrAm 
GACAAATACrrGAGTAATAGATAATATAArrACATATAAACAGGTCACATGTNNAATCTCCCANA 

GCAAACCITCTTTTCTCGAGTTTGCTGATGTCTGNACCTTCTAA^ 

ACGGGCCGCTTGAAGGAGGTGAANACACTGTGCTGGAANTCTTAAAGGTGQANNGGGGCATGGA 
GmCNCCCGGANCCTTCa^GNCATNGACACATGGACTTTTCTNGGTTNGANC^^ 
CAACCITGAAAATGAThnrNAAAATNATTGAATNCCrNAAACC^^ 
CATCNhn^rrGAANCNTACAGNACATNCTTATATTTACCTNAGGAAT 
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SEQ ID NO: 277 GTACCATCOTCATGTTATTTAAAAAATNTAAGNGCANAAGTAAAAAAACCAA 
GGAGCTCIX3NAATAAAGTAACATAAAACACCACTGTAAGGATCACTTAAAAAACGATTCCCT 
GGGAAATNNGGGCACTGAGTGTGTAAACATGTNAGCTGTACCTNCATAATGAGTAANCANTNTGG 
TAATGAANTATTTCTGGCCAAGCTTATTGCTCAAGCTTGCNTATNTATNm 
TNCAATGTOTGATAANNATCAGACTTTGNNAAACTTCTGGGCTCTNGNNATTA^^ 
TATAGCATGGCATNGNTTTITAANNATNGAAACCCNAAGTGTTGATGTA 

SEQ ID NO: 278 GGTACCTATATGGCTCCCCCAATTAATCACAATTCAGCAGCATCCACTCCCCA 
TATGTGGCTGAGATAACnTACTTTGACTGTCTATTrGGTTATCTCTCAGACAAAGCTTC^ 
TAAAATGGACAGGTTTGTGGACTGTGTAGAGCAGCGGAAAAGTTTCCACATmCTTGGACTGOT 
CACCrTCTGTCAGCGGCTCCAAGGTGTCTGCTCCAGGCATGTCTACACACCACTCT 
AATCTTGT 

SEQ ID NO: 279 GGTACGCGGGGAAGTAACTGTGGTGTGGAAGCAGAGTAGAGAGAAAACTTG 
rrCCTCATTAGAGAGAGAGCCACACTTCTCACTGCTCACAATGAGAGGCCAAAGATTACCCTTGGA 
CATCCAGATTTTCTATTGTGCCAGACCTGACGAAGAGCCTTTTGTGAAGATCATCACTGTTGGAAG 
AAGGCAAAGCCGTATGAAGAGCACATGCAGCTACTATGAAGACGAGGACGAAGAGGTGCTGCCT 
GTCCTACGGCCCCACAGCGTGCTCCTGGAGAATATGCACATCGAGCCACTGGCCCGACGCCTTCCT 
GCAAGGGTGCAAAGGGTATCCATGGAGACTGGCCTATTGCACGTTAGAGCACGGGACCAGCTTAA 
AAGACGCrrCTACCCGGAAATNGGGCATCACTAGACAGTCCTGTCCTATTGGTCNTTCAAAGAATAT 
GGATAATCAGATTTTTGGAGCATATGCCAACTCATCnTrCAAGGTTCAGTGACCACTATm 

SEQ ID NO: 280 ACGCGGGGCCAGGAAGATAGGCAGCTCATCTGTGTCCTGTGTCCAGTCATTG 
GGGCTCACCAGGGCCACCAACTCTCCACCCTAGACGAAGCCTTTGAAGAATTAAGAAGCAAAGAC 
TCAGGTGGACTGAAGGCCGCTATGATCGAATTGGTGGAAAGGTTGAAGTTCAAGAAGCTCAGACC 
CTAAAGTAACTCGGGACCAAATGAAGATGTTTATACAGCAGGAATTTAAGAAAGTTCAGAAAGTG 
ATTGCTGATGAGGAGCAGAAGGCCCTTCATCTAGTGGACATCCAAGAGGCAATGGCCACAGCTCA 
TGTGACTGAGATACTGGCAGACATCCAATCCCACATGGATAGGTTGATGACTCAGATGGCCCAAG 
CCAAGGAACAACTTGATACCTCTAATGAATCAGCTGAGCCAAAGGCAGAGGGCGATGANGAAGG 
ACCCAGTGGTGCCAGTGAAGAAGAAGACACATGAAGGCTTGCT 

SEQ ID NO: 281 ACGCGGGAACAGTCCCrrrCTATTGTCTATrCTCCTCCTCCTTCAGTCTTTACT 
GGATGTTTTATATGAATGTATTGATACAATTTGGGGTCrmGTGACTGGATTCCATCACTT^ 
ATrmGAGGTrCACTCATGTTACAGTGCATATCAGTATTTCATTTCrrm 
CCATTmGGATGTAGAGGACATTrrAAAATTTATTCATTANCTGATGGGACATTTGGGT^ 
ACTTTTTGGCTATTATAAATAATGCTGCTATGAACATCTGTATAAAGGGTTNCT^ 
CNA^^S^^TTA(>rCTTTTTCGGGGA^^'CTTT^^GGTT^ 
ACAAATAGGGGACATTTAAANCCTTCCATACCAATTTTNTTTm 
AAAAGCCTTCCANGCCACGTrrrGAATGTGANANACATTTGGCAAAAAAATTT 

SEQ ID NO: 282 acaatttgtgccattaaaacattatctttcatcacaaaccctaggtgaagtat 

GCTGGGGAAGCCTTGCTCTGTGTTTTTGAAATTGTAGGTGGCAGCCCATTACTGGG TCAT ^^ 
CATGGCCAGCATTTAAAAGAGAAACAGAATAAATTAGGAAATATTATTTAAAAACT^ 

agttctgtgtggaaacatgggaatcataatgatccaaatagatgggaaacagaaacatggccaga 

agaatattaggaaatgagagtaaagtgatactgacagccttattgctctgaaactcaatatgaaa 

tgagaagcaagtaggaacttttacattttgataggtgatgagcagaaagatatctcaagcaatta 

acacacatccaggtcagaagttggctaacttttcccataaagggccagatagtaaatagtttaaa 

ttctgtgggccataaggtgtctgctgaaattatttaactcagagaatggga 

seq id no: 283 ctttgcattctgattcmgatatttgaacrrggccatctgtgggtgcta^ 
cgtaagactaaaaggcacctactaaatacgaa caag gataaccagttctaccaaaaacattacca 

ACCAAACCTCCCCTGTTCTTTTTTCAGACmCCTTTTGCTTT 

GATTTAATGmCCCTATTATTCTGCTTTGACTCACATACTAAAATGACCACATGGCACTCCATTGA 
ATCTrrrCAGTTGTTCACGAATTAAAATAATGCCCATGTTITACAGTT^ 
TGGATCTGTAGTTTCAGGACCTGTTTTTAAAAGATCAAGATACTTTTATGTGTTT^ 
TGTTnrrCATrrTTTGtGGGTTTGGTTTm 

TTTTAAAAGTCACAGTGTTNTGAGCCTGGTGTAAATGCCNNGAATATTATATN^ 

TGCATCTCGGTNAATGNGCCAAGTCCTrCATTAANGGCT>WCCCAGGGGGGTTTrTGAAA^ 

TG 

SEQ ID NO : 284 GGTNCGCGGGAGACGGTITAACAGAAAC AGCGGCAGTGTAGTATGGCCAGG 
GATACCCATTCTCCAAGACTCACCATGCTCCTCTAGGTGGCTCTGGCCTTrGTTGACTCTCAT^^ 
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AGACAGAAAAGAGATATATTAACCATGGGATGGGGTCACTCTGATTAGAGTAACCCTAmGCTG 

GCCTCTTCCAGGGTCCCTGCCTGTCTGCACCCACTTGCAGTGCAACCACAGATGCACAATCA^ 

CCTTCCCAGTGGCAACTGCTATAGCTCCTTCACTGGCAGATCCTGCCTAACAATCAGAGAGOT 

GCAGAAGGGGCCCCTGCCAGCATACACTCAATCACAAGCCTCCTNTCACCAATTTAAAAAC^ 

ACTTGGCTGGTGGTGOTGGTGGGTCACGACTGTCATCTCAACACTTTGGGAACCCAAGGCGGTGG 

ATCACCTGAGGTCANGANGTCGAGTCAATTNTGGCCAACATGGTGNAACCCCATNTTNCTA^ 

AAATTTTAAAATrANCCGGAGGNGGNGGCACATGCCCGTATCCTACTACITNGG 

GANAATTCTTGATNC 

SEO ID NO- 285 ggtacagtggcgtgatcttggctcactgtagcttctgcctcctgggttcaagc 

CATTCTCCTGTCAGCCrCCCGAGTAGCTGGGATTATAGTCACGTGCCACCACGCCCGGCCAAC^ 

TGTAGTmAATTGAGATGGGGTmCTATGTTGGTCAGGCTGGTCTCAAAGTCCT 

GATCCACCCGCCrCGGCCrCCCAAAGTGCTGGGATTATAGGCATGAGCCACTGTGCCCGGCCTGCT 

ATTITAATATTTGAGAGATAATGTTACCAACATGCTCATCCACAATAATTACCAAAATT^^^ 

TTGTArrCAACCTGTTTTTACATATAAAGGGAGAAGTGCTTAATTAAOT 

TCAGTOTAACCATATGACAGAATTCTITAAAATTTTAATAGCTCACAm 

GAAACAAGCCATGATTATTATTTAACCTNGCAAGGCTCTTGGACTNGGTANAAAA^^^ 

TAACirmGAAAAAACATANAANCCTTNTm 

SEO ID NO- 286 tgtactgaggaagacaccattccttgacggtgtctaagaagccaggtggatg 

TGTGTGGTGGCTCCAGTGGGTGTrrCTACTCTGCCAGTGAGAGGCAGCCCCCTAGAAACT 

gcgtaatggaaaatcagctcaaatgagatcaggcccccccagggtccacccacagagcactacag 

AGCCTCTGAAAGACCATAGCACCAAGCGAGCCCCTTCAGATTCCCCCACTGTCCATCGGAAGATG 

ctccagagtggctagagggcatctaagggctccagcatggcatatccatgcccacggtgctgtgt 

CCATGATCTGAGTGATAGCTGCACTGCrGCCTGGGATTGCAGCrNAAGGTGGGAGTGGAAAATGG 
TTCCAGGAAGACAGTrrCACCTCTAAAGGTCCGAAAATGTTNCCTTTACCCTGGAA^^ 
AAGGGGTCATACACCAAAAGGTATTmCCCTCACCAGTCTNAGGCTTNACTGGC 
ATTTCAGCACACCrmm'GNAACCTTANTTGTNANCNAGAAAANGG^ 

SEQ ID NO- 287 accgcgggaaactatatgctatctacaagaaatttacttcacctgtaaggac 

ACAAATAGACTGAAATTGACAGATGGAAAAAATATTCCATACAAATGGAAACCAAAAAGTAGCA 

GGAGTAACTATACTTATATCAAATAAAATGGACnTTAAGTAAAAAAACTATAAAAAGATGGAAAA 

GACCACAGGTCAATAGAGAAAGAGGAAATAATAAATGTAAAAATACATGTATCCAATATTGATGC 

ACCTAAATACATGAAGCAAATGTTAATAGACCAAATGGAGAGCTAGAATGCAATGCAGTGATAGT 

ATGAGACTTCAGCATCCCACTGTTCTGCAATGGATTGATCATTCACACAGAANCTCAACAAAGAA 

ATATAAGAArTTAAAATGGACTCTAGACTAAATGGACCCAACAGGATTTTAAAAAAACATTO 

CAACATCTGCAGGATCCGTGAAAATTAACCAGATAGCrmGAAATACTGGATGGGGCAATCAAT^ 

AAATTTTTAAAATNAATTTAAAAATTTCCT 

SEO ID NO- 288 ggtacaataagtgccttgcacataagagtccaataaatttcttgaatgatga 

TATGCTGATACArrGTTCAATAATTTATTTACACTGAAGTCTACTAAATCCTAAACA 

CAATn-AATNACATrrTGATGAAAATTATACATGGAATGCCTTATTTGAAAAAATAC^^ 

TAATTCTCCATCAATACTTCGAGAGTTCATGGATTTTCTGAAGCTTACCTATGAATCCCAG^^ 

AAACTCTAATAAGAAAATTGTATGGGGGGAAAAAGTCTAACAT^m'GCTCAATA/^ 

TGArrCTGCCCCTCAACTTCAGAAGCTCCTAAAACAGTTCTGACTGANCCAAAAGTOT 

AAAGGATCTGAAGATGAACTCNTGAGTNAAGAAACCGGCITNCCATITNGGa^AGNG 

ATAAATTAGANTGGCCCAAAACCCGCTCTCCATCCATTTNCTAATNCTGGAGCTTCTO 

CTTCCTTAATTCAAAAGGTCTATNCrrGCATTTNTGCCTCrmGAACN^ 

NAATACTACrrrCNATTTAKITTNTTTNTTTCTAATAANCCGGT^^ 

NCTANTNGKTNGTTA 

SEO ID NO- 289 ggtaccgagtgcacctatgtctaatcatgtgtgcatgtgaggaggtgctggc 

TGACTGCATCAGCGGAACCCAGGCATAACGGCAATCTTTTGTTTTAGGTAT^^ 

AAAATAATGGAAAGTTmGCTGATrrAAACAAAAATATTTACTCTTTTCA™ 

TTAAAATATATCTATTTCTCCCCTCTGAACATTTAACTAGGAACACTGGGCAATT^ 

AGTGCTGATTGTTTAATACAATAAATATGGCTGTGGAGAAGCACACAGCAGCAAGTAGCTTACAT 

ACNGTCCACGGATAAOAAATTTAOTCTCATCTGGGTTAGGTGTCXAGCCACCCAAAGG^ 

riTGGCATTCCGTGTGCTTTTTTrmCGGGTANGAATNAATTCTGCTTO 

AGAAAAAATNATTCAGCmGATATCCTTAAANirrrATTGACa^T^^ 

GCCAAAANTGANCOTATGGTCCAAAATGCTNGNTTNGGATCm'GTCTTCAGCTNATm^ 

GCAAGGNNNGCTCmANGGCTGCCCTTNAANCOTGATTGAAGGNTTGGGGANCNGGGGGGGG^ 
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TCmTtGGCAAGCrncrGTrmGmACCCACCCmTTTATTCJA 

SEQ ID NO: 290 CCAGCGmCCCCTGGAAACTCCTTGGGCGCTCTCTGTTCCACCCTTGNCGTT 
ACCGGATCCTGGCCGCTrrCTCCTTTNGGAACGTGGCGCrrrrrATACTAC^ 
GGGNAGNCGTCGTCAACTGGCTGGGGCACAACCCCCGTNACCGACGTG^^SfCTT^^^CCGGACm 

GT 

SEQ ID NO: 29 1 CCGGGCAGGACGCATACAATGACAAAGCCATTTTGGAGCAGAGGAACATGCT 
GACGGAAGAGCTCTATGGGAACACATTTCAGCTTTACAAGTCAGCAGATCACCCAACTCTGGACA 
AAGTGTTAGAGGTACC 

SEQ ID NO : 292 ACTATAAGTAGATCCACGTATAOAGAGAAAATTGATTTTTG ACCAATTTCAGT 
GAAGGAAGGGAATTTTTTAAAAACAGATATTCCTAGAGCAACTGOATATCCATTTGGAGAA^ 
CTATTrTACTTTATTAATATCAGATAAAATTGACriTAAAGTAAAAT^ 

TCACTGAAAGAAGAAATAAGAATTCATCAGGTAGAGACACCAATTCTATATTGGTATGCACCTAA 

TGTATGCCAATATATAGAAAGCAATAATATAAATGAATGTATATAATATTGNATACCAAGCTAAC 

TrrATATATAGAATTCAGAGAGAAATGATTTGGCTGAATCATAGAGGGAGAAACCAAATGGCTTA 

TTAGGCACATGACmGATCTGTGCCCGCTACAGTTTGNGTTGGATGGAGCCAAATCNGGT^^ 

AGATTNNGGACCAACAATCCITCANGATGGTTGGTNCATATTAAACTGGTACCGGGATTGTGC^ 

GATATrCTGGCCTGAATAATCrCTAGNGGAGNTArmAANCTTCTCACTGGANNAA^ 

CCCAAAAATTNGNCNT^^^GNCTTAAGGNTAGGATTTm 

GNlSriTGGATAACAAmTITrNGCCrTGCGGGNGTGG 

SEQ ID NO: 293 Ac r r iT rn n rr i Ti ri i i rnrrrn r n 1 1 n icngaaatgaacaaatattta 
tttatmtrtataacaagtaaggcantgttgcttaaaggaagacaaacaaacat 
ttgacaatgcattttttcatctgttcggcacaatgcrrrrgtcataatgganatgngacagcaa^ 
titccaggacattcagtnttcggcggcagnanttagggcanatgactggccgctcaaatnsitcta^ 
nttgtttcaggacagtggaaaagctgnttkraaatgaggccaaagcacnaggtaggtggaaggtt 
cttggntcgggttgaaccncgacagcgcnccaanagacaacactgaggcaatggggaacaacat 
tgctnttttaantgancnccttgggtgcnagcgtgctgagggttaaaa 
tmtcggccggacnactcttaaggcgaatrrcaacacctggcgggctgtnactaggggtt 
tcgnanccaacttggnctaaatcntgggctagctgtitcttggnggaaatggattncgttana™ 

CCmrAAAAThnsfAACQ^GAAANTNAATTGTTAAANCTGGGGGCCTAA™ 

attgnggtgngctcctngccgtttcaannggaaaantgtgg 

seq id no: 294 accatctcactcaacrcrrgcaagaactctaacgagactggtattattattcc 
tatcttacaaaagaggaaaccggccaggcgcagtggctcacgcctgtaatcccaacacttcgaga 
ggccgaggtgggtggatcacctgaggtcaggagttggagaccagactggccaacatggtgaaac 
cccarrrctactaaaacaaaaatatacaaaattagccagccatggtggtgtgggcctgtagtccca 
gctagttgggaggctgagccaggagggtcacttgaacccaggagacagagattgcagtgagcca 
agatcacgtcactgtacagaagtttttaaaccaaactgaggcataaagcagaaagagcaaaaga 

CACATGAATACCCTTCTTAACAATCTCTTCTACTTATGCCTCACCGAACCTTTGNACC^ 

AATCCCCAACGCGGCTNAAGTCAGACTGGGCTTNAGCTTTTGNCCCAGGCACCCCCACA 

TmNATTTGGNTGGCCACTTKANAAAAACrrCCNANAAANAAT^^ 

SEQ ID NO : 295 ACGCGGGGCCTCCTGTCTTGTCTCAGCGGCTGCCAACAGATCATGAGCCATC 
AGCTCCTCTGGGGCCAGCTATAGGACAACAGAACTCTCACCAAAGGACCAGACACAGTGGGCACC 

atgggacagtgtcggtcagccaacgcagaggatgctcaggaat tcancgat gtggagagggccat 

TGAGACCCTCATCAAGAACTTTCACCAGNACrTmrilU-ril"! 11 11 J 1 1 1 1 1 NTNGAGGGGGGTCT 

TmCTTCTTGOTCTCAAAAAGGNCAAAGGGAGCCCGACNAGGAATAAATANCAATGCCCTG^^ 

TCCAACTGACCriOTACAGAAAAGTGCTTGACTGCCAAGGGGTNTTNCCAATCATTNATGA 

TTGGAAAANTCTCCATACTCCTCTTGGGNGANGGCATTAAGGGTTTTNGNCCA^ 

CTTGTTTAAim^TTTCNGAACAAAGGAATTTTTC^ 

TTACCCGGTNGGAATNTTCCTNNTTGNAACCCCAATTTTT>^ 

SEQ ID NO: 296 CGAGGTACATCATTGAAATCTTTTGGTCTTGTTATTGGAATATTCTTCACGTA 
AGTATATCATAGCTAACTGAATTTATrTCTAAGTATrmACAGTTTTAT^ 
GAATTGGTTTTTTTCTTCTCAriTGTAATTAGCTATTm 
TATTTTTATCrTCTOTCTAATGAGCTTArrTAATCATTGAAATTACAGTA G 
TATTTTGGGTrTTCTAGATATGAAATrATTTCACCTGAACATATAGATCATm 
GTTTATACTGATTACTGGTACATTAGCCTGGATTTTTCAAAACAATGGTGAAGA 
CGTGACTGNCTTGGTCTTAATTTCATGGAAGTAAGAATGCCAAATATTAATAGGGAATAGTATTCC 
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CTATTANGAATGACCATTTACrmGGGTATTAAGNANGGAGCCATTACCATGN^ 
CrmCrrGTTTATAGGGGTAATGCTAAAAGGGGTCTGGAATTTATAAAANGCCm 
TTGGTAAATGGATGA rr il l UlTCTTTAATTGGTGAAGNAAGAATAAAANGGGAGGATTTGANGG 
GAACCAACrGGNnrrGGACAAACTACTTGGGCTTGGGAAAA 

SEO ID NO- 297 GGTACTTTNTTTTTTTTTT^^ 

xvtmtwgtgctaagtaaaaaaggctnttgccaggcgcgggggctcacacct^ 
ctttgggaggccgaggggggnggatcacttgaggncangagttnggaaccagtntggccaacat 

GGGGAAACTCCATTTTACTACTGNTTGAGGAGCCGGTGACANAANCNNTACATCATOT 

rXTATTATGCCCAAGAGGGTNCAACAAAAAGGCACrACTATGATTTGGGGGNAACATATT^ 

ACATGTCATTTGACTGCCATGGATACTGGGGANCAAGGACATTITGGAAANTTTC^^ 

ATCTTCTGGGAAATGGGCATTCCCCCNlTrANTGGAANGATTGNTNTrrTNACCA/^ 

TATTirmAAATAAArrGGGAAGGGNmrCCGGGGNNCCCTGGANCCAGGCAAGGGG 

NAATNCTTGCACTGACCTATGGCAACCTAAATATTITmAAAGGAAAAACAC^ 

ATTGATTGAATTTCCTTCCirGGAAANA^ 

GGGGAAAANAATGGTTNTTTNJ'rrrrri IT 

SEO ID NO- 298 GGTACAACCTTCAAACATTa:AGTTTTTATAAAAAAAGGGGCACACAATCGT 
GGTirrGATCCCCTTTTGTTTITGGACAAATGTITCT^ 

TGCAAATrrACTTGCACAGTAATCTGCCAGCCCATTTACTCCACTTAATCCAGCTGAAOT 

GCAAACCAAATGTCCATGGTCATTAGCAATCATAGCAGGTAGAAAGGCrrTTATAAGT(X^^ 

GTGCTTTGAAATTCACATCAAATGACTTTTCCATAAGCTCATCTGGACAGT^^ 

CTGTTACGATTC(:X}GCATTGTTGATTAGGATGGAAACATCGCCGACTTCTr^ 

TACTCTATACACTCCITCCTmGGCTGCAATCGCAGGTATAGGCGTGCACTCTTGTGG 

TCCCGAGCCATCTTACATGTTTCCTCATTCCCCTCTTATTGATATCCCAGAGAACAAGAACAGATC 

CACCCGGCNAACTGCAAGGNTAAAACCTTCCGAGGTCACTTTCACACCTGTGATGANGGCTAT^ 

ACCACAACGTTNTTCCGGGNnTNGGAGNA 

SEO ID NO • 299 ACTGrrGAATTTGGTTCGCNAATATTTGTTGAAAATTTTTACACCT^ 
TCAGTGATATTAGTCTAAAAATTCATTTTCTCATAGCNTCCITACGTGGCT^ 
AATGCTGGCCCCNGAAAACGTGTTCGGAANAGTTANCNCCC^fNTTCANTNCTGTTA^ 

AAGATT 

SEO ID NO: 300 ACAATTGAAAGCAGAGGCATCCTTGAGCmAAAGCATTGAACAAACTGGA^ 
/uVTGCAACATACCACATAACTGAAGTGAAAAAAGTCTGTGTTrrTGTGirill^ 
TTCAAAAAGTTAAAAAAAAAGACATATAAGGTTGATTAAAGGGAAAAAAGGCTCCAGTTTGTm 
ACAGGrriTAAAGTTCrGCTGTGTGTTCAATTGCCTTGTGTAACCACTTGTCNCCTTANGGCC^^ 
TCCCCTCTCTATCCCtnTTTTTTAAATGTCCATTrTGCT^ 
CANAAATCACAl^AAACTTTTTGGTTTGGTGACATACANAG 

SEO ID NO: 301 acttttataggcaacaccattccagaaattcaggatgaatggggatatgccc 

CATGTCCCCATTACTACTCTTGCGGGGATTGCTAGTCTCACAGACCTCCTGAACCAGCTGCC^ 

CATCTCCnrrACCTGCTACAACTACAAAGAGCCTTCTCTTTAATGCACGAATAGCAGAAGAGGTG^ 

ACTGCCTTTTGGCTTGTAGGGATGACAATTTGGTTrCACAGCITGTCCATAGCCT^ 

CAACAGATCACATAGAGTTGAAAGATAACCTTGGCAGTGATGACCCAGAAGGTGACATACCAGTC 

rrGTTGCAGGCCGTCCTGCAAGGAGTCCTAATGTTTTTCAGGGAGAAAAGCATGCAGAACAGATA 

TGT 

SEO ID NO- 302 ACTTTTTTTNTITriTrTTT^^ 

ACrm'AAACAGGNGTGTGGGTATAAACTGCTGTNTCTANGGGCAGGACCAAGGGGGCAGGGGCA 

ANAAACCCCAACGTGCATGGCCNNCNTTGCACAOTGGATTGCAAAGGTTGCNNGCTATO 

GCTACTAGTANCCCCGNTTrrCCTGGTTTATNNTGTAACATANTTTGGT^ 

GATCCCNGAACAGGATGATTCCCNATGG 

SEO ID NO: 303 ACrrCTTTATACATCTAGTAGACTTNGGCTGTGAATCTGTCTCATCTGATGGA 
TTnrrGGTITGTAAGTTAAATmTrATTACTAAm 
GGTrTAAATTTCTTCTTTATTCAATCTTGGGAAATTATCTGTTTC^^ 
GAATTTTTAGTTrGTGTGCATAGTCATAATAGTCTCATAGGATCTmGTAm 

SEO ID NO: 304 ACTATATTTATAGGAATAAAATATTTCCCTTACAGOTAGTCTTTAT^ 
AAAAACTrrCTGAAAAATOAGATGTATCTGATATCCCCGCGCATTTCACATGTO 
CATTTTACTGCCTCrGGTGCACAAAAAGCCCTTGAGCGCTCACCTAATGCTTCTGTG^ 
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GTTCCGAGCACGCTGACTCCATGTTGCTAGGGAGCCTCTCTAGGAGGCCAGCAGGCCGCTGGGCG 

GCCCCGACTGGGATCTGCTGTGGTATGTGGCCCTCTTTGCTAGTGCAGGAGCCTCCTCCGGTCACT 

TTACCTGCATTCCTTTGAGCCTGGGGCCCAGGCGGGGGACCCTAGCAAGAGGCTTACTTACGTGGT 

CCCCTGATACCAGGCATGGGAATGGTAGCACTTGCTGATTGTGTGGGTCrrCTGGCCACAATGGAA 

ATATTTAACACACAACACCTTAAAGTGGAGGTGGGCAGATTTTTCTCTGAAAAGGACCAGA^ 

AAATATTrGATTAAATTAATTAATCAATTTATTTANAGACAGAGTCTGGCTTT 

GANTGCAGTGGTGCAATCTTGGCTCACTGCAACCTCCACCmCCAACAATTC^^ 

CCAAGTAGTTTG 

SEQ ID NO: 305 ACAAAATGGAAAAGCTGGCCTNAGACAACAAACGCAGCTCCCTCTTCAAGA 
AGAAGAAGGTCAAGGAAGATGAGGATGATGGTGTGGGTGATGGGGATGAGGACACTGACAGTGC 

cntagggagcttccgatattcttcccgcagtaattcccagaaacctgaaacagacacatgctcctc 
cctggctgtctgtgatcactatgcaagtggcagcagagttggcaaaaaagatggatagcagtatt 
aataagtggctcagtggcctcaggacngaggaaaaacctcctttccaaagtgactggtctggaag 
ttccanagggaa 

SEQ ID NO: 306 ACCTACTATGTGTTAGAGACCACTGTAGGGGCTGGGAACAGACATGGAAACA 

gtgggcagaacactcrgctcccaggaactcacactctagagcaggggtcagcaaaccntttctgc 

aaacagccagataataaatgctttaggctctgtaggccacatggcctctgttgcaatttacttgat 

tttgctgtgagagcacagaagcaatcagagacaaaatgcaagttagcaggagtagctgtgctcca 

atataacmatttacaaaaataggtggtgggccagagattcatgatcaagtttaccttgtattag 

tctaatttcatgctgttgataaagacatacccgagactgggtaatctacaaaaacaaagtggttta 

atggactcacagttccacatggctggggaggcctcacaatcacggcagaaggagaaaggcacat 

cttacgtggtggcaggcaagagaggatgagantcaagtgggaagggaaaaccccttatx:aaacc 

ATCAGCT 

seq id no: 307 accatctcactcaactctrgcaagaactctaacgagactggta'ita'rtattcc 
tatotacaaaagaggaaaccggccaggcgcagtggctcacgcctgtaatcccaacacttcgaga 
ggcx;gaggtgggtggatcacctgaggtcaggagttggagaccagactggccaacatggtgaaac 
cccatttctactaaaacaaaaatatacaaaattagccagccatggtggtgtgggcctgtagtccca 
gctagttgggaggctgagccaggagggtcacttgaacccaggagacagagattgcagtgagcca 
agatcacgtcactgtacagaagtttttaaaccaaactgaggcataaag cagaa agagcaaagac 
acatgaatacccttciraacaatctctrctacttatgcctccaccgtaacctttgtaccc^^ 
caatccccaacgcggtctcaagttcaaactgggctccagcttctgtccacagccacccccacattt 
tctttttgtattttgtctgccacttcaaaaanaacttcca 

SEQ ID NO: 308 ACriU'i"lll"iUlM"lll"J"ll"l-lU'14'l lXjGAGTCTNACTTTAGGTAAGTGGAAAGC 
AAAGGTGTTCTGTTAAGGGTGACGGTGGGACGGTCCTTCCAAGCTCCGTCCTTGTGGCC TT^ 
TGTCGCACACACCCCTGAGGCATCWrGGGGAACACCCAGAACAGACATCCrATAAACATTTT^ 
AAGCTGTCirrATATTTTAGCAATCTTGGGAGGAAGTCNTTTGC^^ 
ACAGGTmGTGATAGCAATCOTNTTATATAAAAGCAGTANTTNAAACATGGC>rrACANG 
GAGCCTTOGTGGNTATGANCTTTGCCCATTACANTTCCACACTTTNANAAT^ 
TTCCCATGATTANGGTNACTTGANGAAATTCTTAATGGAGCAAACCTTTNGTTTNCAAG 
TGGCACTAANTTG 

SEQ ID NO: 309 AC nU - in - 14 ' l -lU UU - 14 - ] MU'lU' l' l - l ' l CAGTCC(nTrACCTTTATTAAm 

CTGCANAAAAACTCTATGAGATGGAAAAACGAAGGTrCAAGGAAGTTAAACGACCTGTTCAAATT 
CACACAGCTAAGGAGTGGCACCACCAGCATCCGAACCCAGGCTGTCCCACTTGCAAACAACAACC 
GGAACAATGATGAGAAAACCGACAAGAAAAATAATGACAATGACGAANACCirrGTCGCANACC 
ATGCTCrTTTACTCTACACAAAAACCTTCTGGGANAGCTATGTCATCACTC 

GCACACTGAGACTGAGAGAAGGTTAGGGACAACAGCCAAATGCCCCAGGAGATGGAATTCAGAT 

GGAGATAATGAACAGCCAAGTCCTCCGTAAAGTGAAGTATGTCATACACAGTAAGGTGTTATTAT 

TAAATCCCTAGAATTCATGGAAAGATGTTCTCCCTCAAGTGGTTTCATGTTAATTCACCTACAAAC 

ATTCCAGATGCCCGCGTACAGGTTTCACTATTCAAATATATGATGTTAAACTAACAAACTCATGAC 

CrrCAAAGATGTCTTCGTCCCACGCACACACATTTGGAAATTTGTGTCCATTGCTATTTCCC^ 

CTATAAT 

SEQ ID NO: 3 10 ACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGAT 
GATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCNANGAGAATCNCTAGGGCTA 
raAGCGTTTNCAGAAGATGCNNAACCAAGAhrrGGCTTCCATACTCTT^^ 
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SEQ ID NO: 3 1 1 ACAATCACATTTAAAAATAGAGAGAACAAGCCAGGAACAAAACCCCACTAA 
CTCAGTATATCTAGTTCAGATGTGGCTAATGTCCCTATTTAGCATCTrCAAGTTGCTCACATTT^ 
TCAGCAAACTAAGAATGCGTCTCTAGCACTGCGGAGCCTGCCAAGTCCAAGGACATGGGAATGTG 
AATTATCAAACAAGCTGGGGATGAAAGGTGCTGAAATCATClTITrAGAATATO 
GCTTGAGTTTTTAGGGTGCTAATTGCTGAAATATGTTTTAACTTCC^ 

CCCACTAACCAAAAAGTAAGAAGGATGTTGTCAAAAAAAAAAAAAAAAAAAAANAATGTACGCG 
GGGGCrcrCGCTCGGTCTTTCTGCCGCCATCTTGGTTCCGCGTTCCCTGCACAAAATGCCCGGCGA 
AGCCCAGAAACCGTCCCTGCTACANACAGGAGTTGCCGCANCCCCAGGCTGANACAGGGTCTGGA 
ACAGAATCTGACAGTGATGAATCAGT 

SEQ ID NO: 3 12 ACGCGGGGAGGCTTGAGGGAAGCATGGAGGTCCATGGCAAGCCCCAGGCTA 
GCCCGANTTGTTCGTCNCCCACCCGGGATTCCTCANGANTCCNAGTGTCCAAGGAGCTGCTTGACG 
GNGGGAAACGCCGCCCCNGATGTThrrGGNACAGGTTTNTCATCANCrCCNAACCTAAGTNCAGAA 
ANACTTCANTCTTCAA 

SEQ ID NO: 3 1 3 AClU"ri"rrrrrJ"ril-l"lUl'lUl'J'illll'GCGGAGGCAGGATCGCCTTATATTGCT 
CAGGCTGGTTTrrCITCTTTAANAAGCATTTGAAATTGTCATATTTATG^^ 
ACAAGCAAATACACAAGCACAAGTITCAATTAAGGGGTAGAGTCCATTCTTTAGTTTTAGm 
AAAAGTGTTTAATACACTTTTAATACrmATATAGACCAATTAAAGAGATC^ 
TGTATATAATAATTTTGTAATTGCTAGGCCTACATGGGAGTGTGTTAGTCCATTTTGTGTTGCCATA 
AAGGAATACCTGAGCTGGGTAATTTATAAAGAGGTITATTTTGCTCACGGTTCTGCAGG 
AGAACATGNTCCAACATCTGCTCCTGCTAGNCITAAGGAAGCnTCNATCT^ 
GGGCCAAAT 

SEQ ID NO: 3 14 ACACACrATATTTACATCACCCACCCTGAAAACAGCAGGTTCTGGCTTTTCCG 
TGAACCCCCAGATGAATATAAATTGGGAGCCTTGAAACAGTTTCTTTCCCAAAACCGGGAAGCGG 
TTGCTTTTGGGCCCTTTCCGCTTTCGTATNNGCTCGTCCCCTTGAC 

SEQ ID NO: 3 1 5 ACGCGGGG AGGGCTTACGTGGTCTCATGTTGCTCCCATTTTTCACTACTTGT^ 
CAAACAACAGTGACAGAACACCTGGTTCCCATGGGCAGGGAGTTCTGCTGGGCACTAGGAACGCT 
GAGATCAAAGGGACCCTGCGTTGTCCTGAAGGGGCTGACAGTCANAGTAGGGGAGGCANACACC 
TAAACNGGTGTTACNCTTNATANCACrGAGCACCACAGGGCTCTTAGAGAATGCTNCANGCTNGN 
ACTAANAACTGATGCCTGATGANCAANGGACNTTTGGGATGC 

SEQ ID NO: 3 1 6 ACGCGGGAAACCAGGAAGATACAGAATCTCTGAACAGACCAATAACAAGCA 
ACAAGACTGAAATGGTAATAAAAAGAAAATGCCAACAAAAAAAAGTCCAGGACCAGATGGATTC 
ACAGCTGAATTCTATTAGACATTCAAAGAATTGGCACCAATCCTAATGACACTATTCCAAAAGATA 
GAGAAAOAGGGAATCCTCCTAAATCCTTCTGTGAAGTCACTATCACCCTAATCCAAAACCAGAGA 
AGGACTTACCACCACAGCAAAAAAAAAAAAAAAAAAAAAAAAGTTCGCGGGGATGTCATCCATT 
CCGANACTGATGCCTGGGCTTCAAGGACCNTTAGGAAACATANACTTCTCCCAAATGGAGGTTTG 
GATCACCTGCTGCCCAAAACATA^^^GGGAGATTTCAGT^^^TTTAATGT^ 
CTTGTCCTGCCCGGCGGCCGTTCNAAGGGC 

SEQ ID NO: 3 17 ACTAAAGAGAAAATGGGACTACAGATCCACCAGAATGACTAAGCCAGGTGC 
AGTGGCATGTGCCTGTAGTCCTGGTCATTCAGGAGGCTGAGGCAGGAGGATCACCCGAGCCCTGG 
AGTTCAAGGCTGTAGTATACTAATGGCTGTGTCIXjTGAATAGTCATTGCACITCAGCCT^ 
ACAGTGAGACCCTGTCTCTCAAAAAAAGAAAGACTGTCCCCAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 3 1 8 ACGCGGGGGGGTTATACCTGTCTTGCAGCCATCCGAGATCACGCTrCTGTCTT 
GTCTTTCrCGAGAAGCAGGTGGAAACATGAGCATTCAGTTTCrrGGTACGCGGGTAITAT^^ 
TAGACGGAGTCTCGCTCTGTCGCCCAGGCrGGAGTGCAGTGGCGCTATCTCGGCTCACTTCACTGC 
CACCTCCGCCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGCATTACAGAT 
GCTCCCCCACGCCCAGCTAATTTTTGTATTCTrAGTANAGACAAGGTTCCT^ 
TGGTTTCCGACTCCTGACCTCAAGTGATCCACCCGCCnTGCCTCCAAAGTGCTGGGATTAACAAGG 
CGTGAGCCACCCTGCAGGCCGATGCrrrCAGCTTATTAAGAGCTCACCTTGTTGGATGACTGCi^ 
ACATTTTCTTGGTAATGTAAACCAAAAGCTTCTGTATTTCCCAAATANCCCCATTAATCAACAm 
CAAATAATTGTAAAAATAATTTCAAGAAAATGAGAAAGNTGACCCCCAGACTGGGGAAAAACTAT 
TTACAAAAAACTTATTTAATAAATGACTGGTTTTCAAAATATTTAAAGAAC 
AATTNGAAAANTTNCCCNGATTAAATNATGGGGCAAAANATCNGAACAACTCCCCAAA 



41 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 319 ACriTTTIT i l'r i l'iini'l''ll'i'l'il'ri'l'nTrnTANANAA 

TCATGTGTCCrmTAAAGCATGCATTGCAANACAGCATATTTGGTATGTTGGATATCTTGNGTT^ 

TCTCTAACTTCACTTACCACTCTCACCACCCTGCTNTGGAAATTAGGACTNTGATCTACCCAAGCA 

GQCTTCCnOTGACTTCTGGTTGAGTTGGTCAATGGGGANCACCACAGGAGATCAAAAG GAGG AGG 

GAGAGAGAANACAGATTATTTATTGCCCANGCCCTTTTCTTGCAGGTGGGGGANAACAT^ 

GTGCTGCAACCOTGACCCAAAGTCACAAATACTGTCAAAAAAAANATAAAAACAAGTAGTGGGA 

AAAAAAATGGATTCAACTCTATGTCATTTAAAATGCTGCTTGCATTT^ 

SEQ ID NO: 320 ACTCAACAGGCTCCCTAATAGATACAGAATCTGTATGTTCAATTGC ATTTCAA 
TrCAAGTCCTCCTTATAACrTCACACATTGTCTAGCGCATACTCCTCAAACTCT^ 
CCTTGAGCTGCGACAAGCCAGAACATGTGTTGCTTCATTCCTAATTGCTGTTTACAGATTT^ 

ctggtatttttaaaaattacactgtcattaacccgaggcagaggtccagttggagcaacagggtg 
aggaatcttgtgcagttgtctcaatggatttctttgctccacagttgaatgccaattccagat 

TCTTTATCCACACCACTCGTCnTGCATTCTTCGTTTCCCAAATCCATTTTATTTOT 

aaaagaatccaaatcctttnttacccaaaaaaaaaaaaaaaaaaaaa 

SEQ ID NO: 32 1 ACTTITITriTITrTTTTTT^ 

ctcaaaacattaaaaaaaaaaatcanaactgancattgccagnaaaggtcaaacttgccatagg 
ataaactttctgggtctcatatgaagccirracanacanaagcgtgtcctatgttcatggcot 

TGGATGNAAACTGGA 

SEQ ID NO: 322 ACTirrTTTTTTTrrTTT^^ 

TTCTGTGATCTAAAAAAAACTGGTGCTOTCCANCCTGCTGAAAAATATTTNAOT 

AATATATTATTAGGCTTACAGGCTTAGTGNTAATCTGTGTTAAAATTCTGAAAATACAATNATTAT 

TCCTTTGCTNTAAGGAAA 

SEQ ID NO: 323 ACGCGGGGGGTTATACCrGTCrrGCAGCCATCCGAGATCACGCTTCTGTCTTG 
TCTTTCTCGAGAAGCAGGTGGAAACATGAGCATTCAGTTTCTTGGTACTTACCATT^ 
CAGCAACTGAAAAGTGGGTTACATGATACAATGAGTGTAGTTGCCTTTCTCCAGACCTGTTCTGTT 
TCCTTGCCCTCTCrGGCCTCTTTCTGGCTCCATTTACCrmCCACTAGT^ 
ACTTTCn'GGCATTTTCCACTTACCATCTGTGTCCAGCATCCTGGTAAAATCCAATAAACm 
ATCCAGCATCCCATTAACTATATCCTTGTATATAGCAAATTCACTGGGAGTGGATAGTCATATGAT 
GGCCTCAAGGCAAAGGGCTCTACCACTTTTTNGAATACCCACTTGTGTG(XAA\GTTCTGTG 
TTTGCATCAACTAAANTCATCTCACTTTAAAGCTAAANTATTAAAAGATCCTA^^^ 
ACAGAAAGATTANAATATTTTCAATTATTAATTCAGAATAAATATATCTTTTTT 
NAATAACTAATTGAATTGCATTGGTTTNAATTAAATGCNGTCATGTGTATATATAGAATTAAAATC 



SEQ ID NO: 324 ACTTTTTITITriTrTTTTTT^^ 

CCAGGCTGGAGTGCAGTGACACAATCATAGCTCACTGCAGCCTCAACTTCCTGGGCTCAAGCGAT 

CCTCCTGCCTCAGCCTCCCCAGTAGCTGGGATTACAGGTGCTCACCACCACACCTGGCTAATTrTA 

AATTTTTTTGCAAANACAGCTATGTTGCCGAGGGTGGTGTCGACCTCTTGGCCTC^^ 

CCACCCACCrrGGCCTCCCAGAGCACTGGGATTACAGGCATGAGCCACCAGGCTGGTCCTGGACT 

GTGTTTCTATAAAAAGCCGGAAGCACACCCAGGGGTGCTCCCTTTGATGACTTTGCATO 

CCAGATCCACCCACCTACCCCATTAGGGCTTGTTAGATACAAGGACTTAACACAAACACAAGCTG 

GGGACAGCCTGTGAGTCTTGATTCCTGCAGGCACCCAAGGAAAGTATGGGAGCTTTCTGCTGTCTC 

CTCCAATCATTCCTNTANCCCnrCATGGCCTGGTAACACTTT 

SEQ ID NO : 325 ACTCGGGAGGCTGAGGCACGAGAATCACTTGAACCCAGGAGGCGGAGGTTG 
CAGTGAACCGAGATTGTGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAGAAA 
AAAAAAAAAGAAAAAGCATCAGAACCAGGTCAGGAGGAGAAGGCAAAGAGTTGCCACTGCTCTC 
CTTCTGGGATCTTCTCACCCAGCTCCAGAAAGGCAAGGGGCCTTGGATGCITTGGAGCCACTGOT 
AGCCCCGGGAACCACATCACCTGCTCCTCGGGTTTrrrCATANAAAGAGACACATNCTGG^ 
TGTTG 

SEQ ID NO: 326 ACTITrCl-ril-JUU'ri-l"I-l-i-lGCAATTITACCTTCTTTA^ 
CrrrCCTCAAAGGGTATGGTCATCTGTTGTTAAATTATGTTCTTAACTGTAACC^ 
TATCTCTTTAATClTJ-l 1 11 ATTATTAAAAGCAAGTTrCTTTGTATTCCTCACCCTAGATTTGTATAA 
ATGCClWrrGTCCATCCCrri'lU'l'CTTTGTTGTTmGTTGAAAACAi^ 
TTTTGTATAAATGANAGATTGCAAATGTAGTGTATCACTGAGTCATTTGCAGTGTTTTCT^ 
NACCTTTGGGCTGCCTTATATTTGNGTGTGTGTGTGGGTGTGTGT 
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SEO ID NO- 327 acctagggagtggcagagtagtgatgtaaactcaggtctctattactcctcg 

GTTCCCANACATCTrCCTCTTTGTCCCTCTGTTTTCATAAAGAAAGTCTATO 
ATmCTTTCCTGGQACACAAACTGGGAAACNCCrrCTTrACT 

AATTATTCAAGAGACTCATCNNAAACTTNGCNGGGGTGGCTGTGACATANCTCOWAGTGA^ 

TTGGAAAAGACATNTATNCTCATNCTGTATGNAAATGTTCTGTTAAAATGTCGCAATCCTGNA 

ATCCTGGGCirrCTAAGGCGGNANAATACCTTCATAGCCTCAGTAGCTGGCATGTGGTAGG 

CGTTCTNGTTITTTNGTTTGTTTGTTTTGGm 

AAAACCGCATAGGCCCAAATCATGCCAAATGT 

SEO ID NO- 328 acgcgggctactcctctgaancaaganggaattaacaaaagacaggcaaag 

AGACAiO^TATTGANCAGGTGTTGCAAANAAAAGTTAANGGCATCNTAANNGGC^^ 

AAGNAAAArrCNNTTCCATTrATTAATNTITATTGAGACTCACTGTGNTNCC^^ 

CCTGAANNAATTrCAATTTTGGCNAAGATTGTANTTrrCTGTTCN^ 

AAAAATAATATCATAOTGAATAAAGGGNNGAAAAAATAAATTITAACCACTN^ 

CAAAAGGGGCrCTACNAAAAATAGGCATTTACATOCCATrm-GAAAACTCAAAA^ 

AGGATCANCTNGTATTTGNGAAGAAAAACCCNCTTACCCCAAAAAACTGACATGGGTNGCNTGGA 

CCTAATAATAATTTTTTTAAAAANAACCCT 

SEO ID NO- 329 ACNAAATTTAAAATrAAAGCNACTTTCITCCGNATTAAAANAGATTOT 
TOCCTAAATTAGAAACCNGAAGGGGTAACNNTNCTATTTAAGGATCATAC^ 
NAGGC(>fCCAAATTAATCTNTTAACCATTTTTCCTAACNCTTTNC^^ 
CCCAAACCT^n^AC^TGTITGGCTTGGTNTTCTAAANCTAACAATA^l■ 

SEO ID NO- 330 AC rn - i 'l l 'n i l 1 1 1 i i 11 1 1 1 i 1 ITl^TNGGCAAGGGCAATTCACATTTATTT 
CCTGAGCATGCTGACrAAACATACTTCAAACACAGCAGAAGGTAA^^ 

AGCCGnTCTTGCCAACAGCGAATGGTGGTCCATGAAAAGTACrrrrri-l 1 1 1 1 U 1 U 1 11 1 i HNA 

AACAAGGACAATNGTTATTrANTTTTTAmCANAATCANAAAAC^ 

GChrrGGGAGGNAACAAGGAAAACNTGGGAACCCAAAGGGAACTTGC 

SEO ID NO- 331 ACTTAAATGAAGCATATTCATGTAATGTGCTTTTTTT^^ 
m™GCAAATAGArrGTCTGAATTAGTCACAGAATAATm 
GCAACTACCCTTTCITTTTTNATATATT^ 
CTTAATGTTTTCATTAATCT 

SEO ID NO- 332 acaagatataattgataaacctgaaaatttaaagacactccgagtgaagaaa 

ACTGAAGTITArrGGTCAATGGAGACAAACACAAAATGCTATTACAAATTCAGAAAGGTCCCAA^ 

AGTCACTAAAGATTATTTTTGTGAAAACAGGTTATACAGCAATGAGAAAACAATTTAATAC^ 

rrCCCTCAAAATGAGATCCCCTCCAAACTCCCTGCATCACTATITITCT^ 

ttgtgagagaagattgaaaatanaacttcagttnaaagctctttaaagaancaca 

SEO ID NO- 333 gtacaacactagttggaaaatgacttggtaaagcaattaaatgtnactttca 
ctaataaaagaaagaattcttataaataaaacatngcaaanaaataaaggaa 

CATrCTGGCTATCTTACAGCTTTmCCAGTCNTATATTTTACACrc^ 
T 

SEO ID NO- 334 acatattactgagaccttatctaacatgtaattgtatctaattaccagacaca 

ACCAACCCCAACTGTAGAAACTrGGGCAAAACAATTAAGCACACAACrrCTTC^^^ 
CnriTACGrrcmTITATAAAGAAAAAAC^^ 

AAAArmATGAAACCAATGGGCCTGACTGATTTAATGTCATCTCATAAAmTCANGAATT^ 
TGTOTCACAAArrAGArrGTATATTrrGTTAGTTAAAACTCGG 
TGTCrAGGAAATAATAACTGGGAATTCAACAATATmAAAAAGGTC^^ 
AAAATAACTGAGCCCCAAATTCAAATCACTGGCTAACCTACCTACCCCCAATGGGTAGATAC™ 

CAAAATTTGGNATATTrrAATTTNCAAGATCATTTTAGATGTTT 

SEO ID NO: 335 ACCGATCCTGAGACOTCGTGCAGGCAATCTCTGATGCCCGCTGT^^ 
CATGGGGGCTGAGGTTGGrrrcAGCATGTATNTGCrrGATATTGGCGGTGGCriTCOT 

angatgtta 

SEO ID NO: 336 acaaatgagacaaaggcacagaggttagttcacatagctatgaggcac^^ 

CAGAATTCAAACACAGGCAGTTrGGCrrCAGAGACCATC^^ 

TCCAAAAAAGTATAAACATGAGCAGGGrrAATTGTAGCAGCTACTTGGTTTITACGTCAAGA^^ 
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TAAACCACAAGAGGAAACATGAAGTTTTTGTTrTTTACT^ 

TCAGGCTGGAATGCAGTGGCCCTATCTCAGCTCACrGCAACCTCACCTCCAGGGTTCA^ 
TCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGTGTCACACCTGGCTAATT^ 
rrAAGTANAAACAGGGTTTCACCATATTGGTCAGACTGGTCn'GACTCCTGACCTCT^ 

CTC 

SEO ID NO- 337 ACmr rn l n 1 1 1 1 1 1 n I rTTrTGGNCAATCAACAAGTGnTATTGATCACC 
TACTGTGTGCCTGGCACTGTTACANATAGTCTGGGGGATACAGANAGGTCTAGGATATGGCCCCC 
ACCCACCGAAGGGTTTACAATATACTTGTGAGATCGGACACACACACACAAATAACGATCAATCA 
AAAATTGTGAATGCTAAGCATCAAAAAGCAATTTATACATTGAGGGTTGGGGGAGGGAGGGGT 

SEO ID NO- 338 ACCTAGGGAGTGGCAGAGTAGTGATGTAAACTCAGGTCTCTATTACTCCTCG 
GTTCCCAGACATCTTCCTCrrTGTTCCCTCTGTmCATAGAGAAAGTCTATTCCTCTC^^ 
CATTTTCTTTCCTGGGACAGAGTCTGGGAAACCCTITCTTCACTCrGTTTCrCTCCCCT^^ 
ArrATTCAGGACTCAGCTCAGACTTTGCTGGGGTTGCTGTGTCAGTTACTCTCANTGTAAACTTGG 
GAAAGTCATTATACrCATGCTGTATGGAAATGTCTGTTAAAATGTCCAATCCrGCATTAATCCrGG 
GCmCTAAGGCGGGAAAATACCirCATAGCCrCAGTANCTGGCATGTGGTAGGTGCTTCGNTCIT 
GGTTrriTGNTTGGTTGGTTTGGTTTCTGGTTTTCATTGGCTAGT^ 
GCATAGGCCCAGATCATGCCAAATGT 

SEO ID NO- 339 ACATTTGGCATGATCTGGGCCTATGCGGTCTTACAATCCCTGTATAAAACTAG 
ACAATGAAAAACAGAAAACAAAACAAACAAACAAAAAAACAAGAACGAAGCACCTACCACATG 
CCAGCTACTGAGGCTATGAAGGTATTCrcCGGCCTTAGAAAGCCCAGGATTAATGCAGGATTGCG 
ATATTTAAACAGAACATTrCCATACAGCATGAGTATAAATGACITrCCCAAGTTTACACTGAGAGT 
AACTGACACAGCAACCCCAGCAAAGTCTGAGCTGAGTCCTGAATAATTGTATAAAAAGGGGAGAG 
AAACAGAGTGAAGAAAGGGTTTCCCAGACTCTGTCCCAGGAAAGAAAATGAGCTCGTGGAGAGG 
AATAGACTITCTCTATGAAAACAGAGGGAACAAAGAGGAAGATGTCTGGGAACCGAGGAGTAAT 
AGAGACCTGAGTTTACATCACTACTCTGCCACTCCCTAGGT 

SEO ID NO- 340 ACACAGAAAGGGAGGTGTCAACAAAAGAAGATAAGCCCATACAGTGCACAC 
CTCAGAAAGCCAAGCCAATGCGGGCAGCTGCTGACCTGGGGAGGGAGAAGATCCTCAGGCCACC 
AGTAGAAAAATGGAAGAGACAGGATGACAAAGACrrAAGAGAAAAACGTTGTTTTATTTGTGGA 
AGAGAAGGGCACATTAAAAAGGAATGCCCACAGTTTAAAGACTCTTCAGGAATGTCTAAATCAGA 
TTGTATGTTTGGATCGCCCTCACCTGTCCCATTGAAACCAACTGGTTTATTTGTTCAGGCAGTGOT 
TTACATGAAGACAAAAAGAAACAAAAAACAACAATATTTTTGAGTCCCCAGTCAGGTAGCCm 
CAGTAAATATATGACTCAGGGAAAAGCCTCAGCGAAGAGGACCCAGCAGGAATCATGAGGGAAG 
nAAAATGCAGCACTCTAAATGGCCACTCAGGCGTTCCTATTCACrCGGAAAATTAGGrrCATTTCA 
CAGGACACAGCAGTGTAGATCAGGCTTCAACTTAACATTTAAGGGAAATGTCAGAliliiiiilAA 
TrrAATGAAATTGTTAATGAGGAAAAATTTITAATATAGTCTTATCTACCAACATCCCC 
AAGGATTTTAATA 

SEO ID NO* 341 TCATACAGACCATGGAATACTNTGCAGCCATGAAAAGGAACANGATCACGTT 
CriTGCANAGAGATGGATGGAGCrGGAGGCCATTATCCTTAGCAAACTAATGCANGAAAAGAAAA 
CCAAATTCCACATGTNCTCACTTATAAGTGGGAGCTAAATGATGANAACACNTTGACACATGTTGC 

SEO ID NO- 342 ACl T r n i ll 1 1 U l i l r T TTTTTTTrGGTAGANACAGGGTCTCACTATGCTGCC 
CAGTCTGGTCTTGAGCCTCTGGACrCAAGCAAACCrCCTGCrrrGGCTTTCCAAAGTG^^ 
ATAGACATGAGCCCCCATCCTTGGCCAATrmAAATATCATATATAAAAATTCGGACrmGT^ 
AGCTTTCTGTTTTTAAATTTCTCTTCAAATTAAGGAGCAGTTGAGTTTGAT^ 
GTTGGGACATGGAAGCCAACTCTTCCATCCTCAAATTGTGTCTCATATAGTTTATGAAACTACCAC 
AGACTCAACATNATAATATGGAAAATGTAAATACAAATACATACTATCTATAAAATNATTAGCCA 
TATAAAGAAAAACTTCCACTCCCTCTCCITCTTCCTTCTGCTCAAA 

SEO ID NO- 343 acctggngggtctgttcgaatntnccaacctgtgtgccatccacgctaagag 

AGTCACCATCATGCCCAAAGACATCCANTTGGCTCGCCGGATACGGGGAGAGAGAGCTTAAGTGA 
AGGCAGmTATGGCGTTTTGTA 

SEO ID NO- 344 ACrri-i'lTl-i 1 1 1 1 lCNGGG^r^TTTTTTAAGTANNGGGNGTNGAGCCCGAACG 
CTirCTrAArrGGNGGCTGa^TTTAGGCCTACrATGGGNGTTAAANGNTTTACTC^ 
NTNTNTTCrANAGCCCAAAGAGCmriTWrCTrAGGACTANCA'ITT^ 

AAAGGGNTATGTGGmCNAATNTATAGTTCANCGTANTTATTCrAT>m^GNTNAACCANCNW 
NANCAAAACACrGGTATNGTTATGNAGCCATCTACCTTTAAAATNTTCTCrChnT^^^ 
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TATTACAGATTGNGCITmrNTANGNCTGGGCCTTTNAANAAAN>^^ 
TGATTAACTCTTNGNANCrmCTTNTACANANCTANTmC^ 
AAANTGANGNTNATN^f^^mANT^rITATNAATNNATGG 
NTTTTNNAAANCITITrTAATTNCCACNCT 

SEQ ID NO: 345 CGAGGNCNCGGGGGGGACCnTGGGCTGCAGTCmCTATTGTCAATGGCCTAT 
ATGACCTATGAGGTTAGTTAATCNTTTAGAATCTTAGTTTCTTCACTT^^ 
ATATGTGATTTCTAAAGCTCTTTITAACrCTAAAACGTGGTTCTGTCATANG^^ 
AGGAAATACACATTAGAATATGAATGTCTTCTCCCACATTATTCTGTAAATCCTATrrACOT^ 
TATCCANAGTAAAATATGGGATTNCAATGAAGTOTATTAGTTACCTAACACTTTAAGCCT^ 
GGCTTGATTITAGAAACCACTTTGAATTTTCATGCA>OTATGGTAT^^ 
TCTCGCTNGATTAGCTCCTNTGTGC^.CTNGATTNTNCT 
GNNTNTTAGCCTGTNGTGCCNTCTTTCTTTNCNANNAAAACCGT^ 
ATACGTGN^^^^ANTT^m^GNATTTNTCNNGTTGNNN^ 
GGNGNTC^mCTT^r^^^'AGGNTGANTCTTNTTTGTT^^N 

SEQ ID NO: 346 AOTrrTTTTTTTTTT^ 

TTTTTTTTTTTCTTNACGGNTTCAANGGACNCTTTO 

TNACTACCTANAAATGGAATTNCATCrGGGTTCCATGCTGANTTGANA^^ 

TChrTAATAACNTANATTNAAANNANAACTAANCTAACNCNGCTNATm 

ATTTAANTm•ATCC^ITTCATAAAATGCNNAATTGGTTTANT^^ 

TTGAANAATNTNTTGCAANTNCNNGGAGCTTTGAANATTNA>n^ 

TAAAGGGT^rrmANNCATATGNNTACM^r^CTGGCCANNNTCTAAATT^ 

CTTNThnTNATTTNCNTTTATGNTNAAGATNTGAC^ 

NNTNhnrANCTACTTTNACOTATA>riTCN^ 

ATKNATTTTCTTAATTATGTNTTNATATCCANTATNTNTCC^^ 

SEQ ID NO: 347 ACCGGGGUTAGTGNCTTATrGCAGATAATTTTNAGCNTAGGGNCTGGGGGNT 
ANGACGNNTCTCTCNTTTTNAGTCGGAGACCTCTG(>rGNATACm 
AGGCCATGAAGCTTCCCAAC^T^^rTCCNCCT^^TTNTANTTA^ 
ANNOTGCATNTCTTGCNTTGNCATNTGATTACTCCAGATNTATTAC 

SEQ ID NO: 348 aATNCGCCCTTAGCNCGGNCCNGGCCGACGNACANCGGTACCGCANCATGG 
GCCANAATGTTGCATATTACATGCTCTACTNAGTGGAAGAAGATGAANATGCCNACAAGAAACAG 
OTCGCTNAAGTCTNTGTCGAACANNANGAGCCCCCGAACGTGACTGN 

SEQ ID NO: 349 ACrNrrTrTTrTTTTTTTAT^^ 

GGTTGGAATACANGGGCATGATCTCAGGTCACrGCAACNTCTNCNNCCAAGGTNCA^ 

CTGGCCTTAKACTCCNAAAAAGmGGGATTACNGGC^^IGTTCCNCa^C^^ 

TArrTNTATAGAAATGAGGNrrTACCATATTGGCCCGGCTGGCCTNAAATTCCTAGNCT^ 

TATA^^^CNACCNTGNCWCACCAAAGNGANGGAG^^^ACNGGCAT^TAACCANTGCCNCNGNCC 

TGAANANCNGTTTTNTGATTTCAANNCTTmNTNG 

NNTTTNCNCCNAAAACGAAATAAAAAACC 

SEQ ID NO: 350 TCGAGNGGCCGNCCNGGCANGTACATTCA AAAAN CNTNAGGAAATATTNTG 
ANTGCCCANGOTGATGAAAACTGGGGTGAATTAACTCCACACATTTTATTTCAAGNNT 
AGNTTTAGNGGNGCCAACGCAATGGTCNCTATGCATGCNNAGAAGNCAAAAGAGNTNCTCC 

SEQ ID NO: 35 1 ACTGCATAGATTAAAGAAATCNACTGCNGNNANNCCNCTCGTANGGAANGA 

acgccattgccaatgataagncnttgcacntaggnnntganngcaaacaantatagtgngcrrcc 

naacaggtnataaccancctgataaacaccattanannncgatgccaagcatgtnncncatntc 

tgtgaccaannactattcatantgaacaaaagttgtgtatanatncatncnaagggcaaact^ 

tccnatgataataccaggctaagggcttccttagaacngotgttatggaatntn^^ 

ctgttggotcngaactgggngotcnatggtnrrcntggtaaaaccnaaato^ 

CA 

SEQ ID NO: 352 acggttcttcctgtgtcagctoaatagcttgctgctttttaagaacca^ 

CNGNNNAACTTTGCGGCAGCTTGTTTTCTGTTNNTATTACT^ 
TCTANTCCACCTCCGAGCATNGGCTC 
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SEQ ID NO* 353 aCGCGGGGGCTCAAAGNNGGCGCCATCCGGGACCGGCGGTTGTCTGTGGCCG 
GAGGCTGATCAGGATTATTG™TCGAGCTTTTAGNGTCTT^r^^ATTO 
TGNTNCANCAAAAATCAAATNCTAAAATrNCCTTTCCAGGTOGTTTNACAA]^ 
ATCCNTTAACCAANCCACCCGCCTTGAGGAAAGTNACCCCCTTGNAKTGAGGCAANCACi^^ 
NCGGCmAAACCTGANCTAGNAATTCXrrTNGAAAAAGGTCCTCCAAAAAA>^ 
CNAATTCCCTTAAANNCrAANTCTNNTGTNNThrrGGGGGG^^ 
AGAAAA>rmACCrTTTANTCAAANCCAANGAAATAAACTTTGNT^^ 

CirrTAAAAAACNCCATTTGAANTTAAATACCTTGGTTAANNm i 1 1 AAGGGG 

GAAATTAAATNT 

SEO ID NO: 354 ACAGAAATTTCACAAGATGTCAAACACAGTGATGCCATTTGCTATGTTTNATr 
rrGCTAGTANCTTATTAANCATANCATGCAANTAATCAAAGANAAN™TCCNTGACTrA^ 
AAAATAATTCrANAAAAGTTTCACTAGGTAAAGTm'GCAAATCWTTAT^^ 
GCCATGAACCTTCATGT>mTGCAANATT>m'GGCCTCAArmTCCCCCCir 
NT^mGA^fNGTNTNAANANAATTTGNGGCCAAGAAANTACATCCC^^mGT^^ 
GAGGAACNrTANATTTTAGGGTITATAAACTTNGGCTGATTTCCCNGCC^^ 
CATCTITGCCAGTAAAATAAAANNANCTNNTTTNCAATAANC^ 
TGGGNGGCCOTGGNAAAAAAGGCCCCCTCCCTTTN 

SEO ID NO- 355 ACrrTGCCTACGGCAGCAACCTGCTGACAAAGAGGATCCACCTCCGAAACCC 
CTCGGNGGCGTTCTTNTGTGTGGCCCGCCTGCAGGATrrTAANCTTGAC^ 
CAAANCAAGTNAANCTNGGCNTGGAGGGATANCCNCCNTTTTTKAAANNCCTGGC/^ 
NGGGGAGTANTOTGGAAAATGANCAAAANCANTTTAANTTmOTGNN™^ 
AANGTGGANTGNTGTNGTAATANAAGTNAAAGTTGCAACTCAANAAGGAAAANAAATACNT^^ 
AAN^^ITT^WTNACAAATNCAAAGGGCTCCCCTCCaWCAGOT 

aaaaaaangttnccttgnanttcaagaaagttaaagcattaaaccat ngct ™ 

AAAAT^GNAAACTTTNAAANGGGGAACNCAC^rrT^TAANATACA.^ 

tttaaaatttttaccttgaaccggnat 

SEQ ID NO- 356 ACTCmGTTTTGGCACACTTTTCCTGACAAACAGC 

AANTNCTAGTCCACNTTANCANCANTANCKIOTGAAACCGCrCTCCGTA^ 
TNCAAATGGACTGGAAh™CCTGGNAGGGTTTNACAAAATTAAGACAAAGGNCAAAGGAACn^ 
GCCAAAGGAAATGGAAAGCANTTOTrrAAAAATAGTGGGAGGNAGGACCAAANACCTANTAATT 
CCATCCCTTTTAAATTGGANCCCTTIWC^JTCNCCCNNa^ 

CCNACCAGACACa^ANTTrAAGTNGTTGCTTTCAAAGNTTAAAAGCC>m'AGGGT^ 

GTCNCCCNTCTATGNAAAATNGGGCATGGNCCTTTGACNGCCTCmCTAAATNG^^ 

ACTCITATTTCTCTCCAATCACNATAAGAGAAAG 

SEQ ID NO- 357 attcgcccttaccggnggccngnccgncgggcnccttgggtntgaaggggtc 

NCTCTNGOslCTCTGCNANGGGTGCAGCNGNAACNGNTGNGAAGmTNCAGGCTCTACTA 

NTGAGCATGGNCNTTAACTGACCAANACTGGNANGANANNGT^^S^GNTC^™TGACN^^^ 

GGGCNCCCCAAAANGNANCNCTGNGCTTGGGACACACAAATTNAANACCATCACNGGGAAT^ 

GCTGCTGrrATTACCCCATTCAAGNTGACAACTGATGCAACAGCANACTCCAGTCTCGCAATAATG 

AAACCAGTG>rrrGATCrmAGCAGCAAAimTGTNTNANTCCCNTTNAN^ 

GGAATGCirANACCATGGGGNCNGTCTAAANAAAATACrTATTTAAATCANCATGTC^ 

TAACCTTCNACNTTAAAACTTANGGNNCAACCCCATCTCCAGAC 

SEO ID NO: 3 5 8 GCGAGGTACTTNTTCTTCNCTTTCl"rJ"l-l'll 1 IGTTTCAANGTNGCNNCNATAC 
TrrNNCCACmTGNAANNNGTACCAGTTAATTNOT^ 

AGGGNNAAGTIOTTTTGATATNAGTATAACTGTGAAATNGNAAATGGAAATNT^ 

ACAACACAATGNTGTNCTGCNTAAATCNGNCOTGATTAAAATCTGTGCTGTIWAAGGGC^ 

ACTN>n^GCNGNTCACTGTGTCIOTATGCTQ^ArrCAGNCNTGTN^^ 

GGCANGTTTNCCCNANCATGTNCTNTNNTTGTrACATAATANGC(>lGNTCN^ 

CCCTNrrTNNAATTNTCNGTTCTAGNTCCCAATTNCTTCN^ 

TITTNGNCGNCTGGNNATAGAACTCTGNCGTGCCAAA 

SEQ ID NO: 359 ACnTN Tl - l - i 1 1 1 i 1 1 1 i l i 1 1 1 1 1 U i i Jl sTrTGAGCTGGANACTNNCTNTGTTG 
CCAGNCTGGAGGGCNATGGCm'GATCTTGGCTCACrGCAACCTTTGCCTCCCGGGTTNAAAC^^ 
TirCCGCCmANCCTNCCNAGTNACrGGGATTOTAGGCNCCCNCCACC^ 
TATTTTTTTTTANTAGANACAGGGTTTCACNTTATTGGCt^ 

GGNGAACCACCCTCCTNGGCCTGNCAAAANTNNTANGGTTAAAAGGCGGGANCNNTTG 
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GNTTGGGATA 

SEO ID NO- 360 AC lTmiUl TlUl i'l'i riri nT TTGGNGCT'riTn^NTrAAAAGAACmAATO^ 
TOAGGNCOsIAAATNCNTCAATTmCAAATGAAAACCCTTCAj^ 
AAACTTNTTNCAAATTACNGAATAATTTAACTTm 
TAAAATTTNTCCCCAAATAAATGKTTTCTTAATTrTAANGAANTNT^^ 
GGGACCACCCTNAGGGNN 

SEQ ID NO: 361 TCATTrCTACCGAAGAaTNCCNCCGAACNTGTCrrGCCAATGAGATAAA>^ 
CT^r^AGGTAAGAACTGATGAA^n^^A^^rGACCCAATANGATGGNCCATTT^m 
ACAANCNANGAANNACTGGNGCmrrrCATNTNGACCTCANATTTCTTGCAAAAC^^ 
NGANATAAGGCCTTTTrrCrmAAATClTNATCCNNGTO 
GCCCTTTTGGGCNNCNTAAATATGGGAAATTAANTTTTCTTCCANTCC^ 
rrCCArrTTANNAAAGTOn^CATNATTNGTCNTACCGGTTAAAA^ 

SEQ ID NO- 362 CCCTnWCGTAACGGCGCCCGGGCAGGTACTATTANCCATGGNCAACCCCAC 
CGTGTTCITNGACArrGCCGTeCAajGNGANCCCTTGNGCCGCGTTTCCm 
CNNGGTCCCAAANACAGTATAAAArrrGNNNGCTCTGANNACTGNl^AGAi^ 
TAAGGGTTACTGGTCT 

SEO ID NO- 363 ACTCTTGGmGTCAATGGGACrrACCAGCNOTCCACCCANNANNTT^ 
CCCAACATNACTGNGAATAATAGTAGGATCCTATANGGAGCCAAACCCNGNANTNATACACTGNC 
>rmGNNANGACCACANGANCAANAATNrmCCNGNCTATGCTGAGCGGCCCAAGC^ 

SEO ID NO- 364 TNCGNATGCTACTTGNNCAOTGATGGTAAAAGGGTAGCITNCrGGTO 
CCGATrCAGGTTATAATOAGGAGGTCTGCGGCTAGGAGTCAATAATTTCGATTNGGCTTATC^ 
CTAAATAT^AGACTGGAOTCGTTTGCATCCTACTGNCGATTCGTCGATCCAT^mCANGAl^ 
TATNAAGATACITGTNNCCCANTCGTNTCrTGNGGNCCTATTGGCTCTOT 
ANNGGACCCCCTGTGAATTAGTAAANTTGGCTTGGKrGGAGAACTAGCNAmTrACri^ 
AACCAAAANCTTCNCCCTGTAGGGTTTTTANTTCAAAAOTGTT^ 
AAGNTrGATNA>OTAANACCCTCGCACNCNNGGAGCTGAATTTATTCAACATGT^ 

SEQ ID NO- 365 ACGCGGGACAGNCCNGNCCACAOANNGANGGCATANTAAACITATTCATTNC 
CANGAAO^CNNGNGGNAAGTATm'GNGGGATCTGGCTCAGGCCCACATACANCANCTNATCGAA 
GCTGNCGNTAAAGATGATCNCTTmAArrAGNGCCTATGNANNATTATAAAAArr^ 
NGTGGAAAACAAAAANAATGrrCAAATCCATCAAATCTTTm-AGAACT^ 
GGACNTGACNTATTITGANGATAACAAGNrrANTAGATTGAAGAT T™ 
NGAAATGTAANATAACTGANTNATNTrrCTTTNAGTACCNGNAANT^ 
TGCITCCAATATTTCNATTTGTTTGGGATAACNAATTGGNATNTT^ 
ATNATmANTAGTNTTNTT>mCCATAAANAAAGTGGTCTTTAi^ 
NCCGNAANACTTTANGGTANmrCAACACCTGTNGCGTTATATGGAT 

SEQ ID NO: 366 ACTirrnTTiTiTiTrT^^ 

GTAAAGGCTTTTGTCGTTCTAAATCCTGATTACATGTCACNTGATCAANAACAACT/^^ 

NrrCAGGACCOTKmAAAAAAANTNCAGCNCNTTANAAKTITCC AAAA^ 

-mTAACAANTITITTGNTNANGGTNGGGAANANTTNAAAGN^ 

TTTTTTANANNTNCTGNACNAmTrn'CCAAANTTNGA^ 

TTTTNGAAATAAAATNAAAACO^TNACCNThrmGGTO 

AAATCCTTAATTTNAAAANAACTTTC^rrAANAATACCT^^ 

CNATTTCCAAANACTTTTTTATTTCTTCNCCTAAATTT 

SEQ JD NO: 367 ACATCACAACATGCTTTTTAANNTCATTATGCATTGTGCTCACATTCCC^ 
ATGTTGTTTCCAAAGGTGCTCACTCTCTANCCCAGCTGGATTCTCCGGGNAANAGGCAAAGACAGT 
TTTGGCTAANAANAAACACATGGNAANGATGGNGGNNGGGGAAAGGAAAAAAGCATTNC^ 
TTmAANGATCAAGNO^TNANTTAAAAGNGTNATATTTCCQ^TAGNCTTTNC^^ 
>m'GGNNTbOTANGAATANAANATCNACTCbrrCTTNGAi^ 

SEQ ID NO: 368 ACGCGGGGCAGTTCCGCCATGGCCTCCTTGGAAGTCAAGTCGTAGTCCTCGC 
AGGTCTCGGCGGGAACTGGAAATGCCCAATCCACGACAGAACAAATNTTCGGGGCriTrT 

ctcaacgaaccccaaaacctgccgctcatngngnggctgctngtgaaaagcttntccgaa^ 

AATCAACTNTNAAATTh™ATCATNAATNANGGAAGCCCAAATNGAACAAGGGATO 
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AATTGGAAAAAANCT>rmGGTCAAANAAAATTCTTNTN 
TNGAACTNnSfATNTTTTCATGGAATNAMCNTGCC^ 

CTCTCCACCATNAANCAAAAGGAGGGNAATTTTGATATTGGCTCTGGAACCCGCTACAAAAGAAA 

TGGAGGNGTOTrTGGNTNGGATTTGAAAANAAAAATTATTAOTGATGGAATTTTGCCATO 

CAGCTNGGCACmTSICmGGCTNAAGTGATCCAACCNTT^ 

GGCAATTTTTTACTTAAATNrrGGTGANAACCAAGANCCTTTGAT^^ 

TTTCCNAAAA 

SEQ ID NO: 369 ACGCGGGTGGGAATGACAACTTCGGTCGTGGAGGAAACTTCAGTGGGTCGTG 
GGTGGCTTTGGTGGGCAGCCGTGGGTGGGTGGGTGGGATATGGTGGCCAGTGGGGAATGGCTATA 
ATGGGAATTTGTAATGAATGGAAANCArnTTGAAGGGTGGNNGAAACTACAATGANTrra 
TTAACACAATCANTCTTTAAATTTTTGGACCCATGAANGGAAGAAATTm 
GCCCCTATNGCGGNGGANGGCCATACTTTNNCAAACCACNAAACCAANGNGGGTmTG 
ACAACACCATTONCTTTGOCAATTGCAAAWvIATTTTTNATT^ 
GGAAANCCNAAAAAATTGACANGGGAACCTCNfNGGTACNACANATTTTTNAACG^ 
NCA^^mGTNG^^S^AGGGNCrmACTTC^rAAAAAAAANANNT^T^ 
TGGCCAAAAAAACCCGAAGNACTGT^ITNGNGA^fNAAATT^T^AAA^ 
TrnsIGGGAAANTm-AANCCTTCCAAAAAAGGGTmAANGNAAAAT^^ 
TNNNGNTTGC 

SEQ ID NO: 370 ACTGCTGGGCGGCTTCTTCGCGCTCGTGGGGTTGGCCAAACTCTCGGAANGA 
AATCTCGGCTCCAGTTTCCGGAACCGGATGAATGCCCTTGTTCGTGCAATTTGCTTGANGGGTTCC 
CCCrrAANGGNTTTNGCrACCNACCCAATNCCCTKAACTNNCAAATO 
ACr>fi^TNGNTNGGTTGmwrNNGTCATNGGCCCANC^^ 

TNA^r^CTGNTCAr^^AAGGGGGC^mTNTNANCTTGGNANCTTNTTAAAAAN^ 
TTCCCNACCCTTNCTNCCTTGGGTTT(>rrCTGGTTNTTA^ 
AAAGGGGKmANACCCCnTTTNAAAAAAAAATTTTAATNCCW 
GNNAATTTCANNACANTGGCGGNCN^r^^'ATTA'mGGATNCCAAC 

SEQ ID NO: 37 1 acaattcatctaacttccggaaagcactttcagtccaaatgcataaaccgtcc 

CACATGCCCNCCAGAACCANCTTNAAAANGTCAA>nrTNGCTAACTTTAACC^^ 

G^^^GTTT^^^NAAGGCCT^nwmATNC^l^^^^^ 

NNTNCGGNAACGGTTTTTANTTTNGCNTT 

SEQ ID NO: 372 ACGTCGACCACTACAGATCCCTGGAGGAGGACCANGAACCCATTGTTTCACA 
CCANAAACCTGGGAAAGGCCACANCAATTCCTTTCAAGANAAACTTCGGGGCCAACCAANAACN 
ACNCCTNGGNGAACCCCATNGGAAAGGGGTTNGGANTCAAAANAAGGANCACNAATITrCCCAN 
NANGGCAAACCCCrCGNGGATNCCNANAATGATNAAAAGGCCNTTNNGGTGANCNAAAANA^ 
NCNCNANGGCCTTTTCAAANANGNAAANCNAAGGCCCCCTTAAGGGANAATGm 
CGCCCl"l"ill''iGGQ>rmAAANAAAGANAAGGGCCGAAAGGGCCCCNCCm 
CCmCCTTAAAGGCGNTTTNNNAACCCCI^ 

AATTTGGCAAAANCNATCCANGGT^m'GAAAANCTTTTACNANNGNNAANGGNC 

TGNCCNAANGNTAAAAANAAAGGTTTTACCCCTNTAAACTCCNAAAGGGAAAGGCA^ 

GNTATAAAA 

SEQ ID NO: 373 ACCTTGGCCAGGTCTCCACCAGGCACCACAGTGGGAGGCTGGTAGTTGATGC 
CAACCTTGAACCANTGGGGCACCAATCCACAAACTGGATGCTTNCCTTGGTTITGATGGNG 
GGCAACATTGANATNmGGGAACCANCTNACCACNGGTTTACNNGCNACAAGNCNTN^^ 
CATNGNGANGGTCANATTTCANCATTTNGTTGGNTNGCmAAANCAAGC^ 
T^mANAAG(n^s^NTCGTX^GNAAGmT^^™AACA^ 

AATGGATTCCGGGGTOAGGTACCAAGTTTGTCTNGAATTh™GNAAGGNAACATTTAAGGNT^ 
TTNAATNTTAAGGNACNATGNmGGAGAACAAAmGGTAATAANGGGGTAAAGGTTATGTbWG 
TGGCCCTT^^'ATTAGNGGTTTATAAANAAATGOTATAANGGGCNTTIT<TGTNAC^^^ 
ATNNAGNGGNCAAGNGGGGTGGGTGGGTAA 

SEQ ID NO: 374 CCGCGGCGAGGTACnTrTTTTTTITrTT^^ 

TTTATAGCCATGATTCCACAGATGTNGGTGAGTAAACCAACAGCCATTCCTAAATAAAATACAAA 

AATTCCCATCATGCATAGTTAACAGG>™G>n^AAAAANCCGGCCTTTTANGGGGCrrA^ 

AAAATTNrmCAACAANCATNGAAANmGGGCTITITIT^ 

CCCTAhn^GATAAAACNTNTNCANTTTNTACOTGCAGGNGGANTAANCTTGGC^ 
CCCCCATTCAAAANCTTNAGNANTACCnWCCCCCCTNT AACNAN AAAAAAACC^ i'NGAA 
TTCNAAANATTTNATTTTTNGGCTNAGNTTI^AGCCN^ 
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AAAAAAKGGAAAAATAAAN(nTITGGGAACCTTTTCCNTTl^^ 
OTCCANTTTCCCCCTTTAANATNCAATTTNOTAAAAANAA^^ 
ANCCNCACTTrmCCCX:CAACCTNmAN>NCCTTNNAm 
CNAAAATGGGGGCGGNTNCTT 

SEQ ID NO: 375 ACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAANGCA 
GTTCCAGATTTCAACTGCTCATCAGATGGCGGGGAAGATGAAAGACAGNTGGTGNNACCACAlSm 
TCGTr^GATTTCCACamGGTCCCTTGGCCGAACGGTCCACGGGAACACT^^ 
CTOTAATAAAACTGCCCACAATCTTNAGCCTGCATGCTGTTTGATGGTNAAAAN^ 
CCAAAACCCGNTTGGCACTGGAA>rrCGGTCAAGGGGACCCCNCGNATTCCCGGGTAAGATTGCCC 
NNNAAAATNAATAGTTTNAGGGNGGCTTGNCCCTGGrrrCrrGGTGATNCC^ 
(>ITTAmT>nsrrGGAGGGNGGNTAAAAANACTTTGGNNTNGAACC^^ 
CNTCTTGACCANAAAACCAAGTCAGGGGGTTTTNGGA 

SEQ ID NO: 376 CGCGCGAGGTACTGCCCCATGTGCAACATTAAGATCCACGAGACACAGCCNC 
TGCTCAACCTCAAAACTGGACCGGGTCATTGCNNGGAACATCGTGTATAAGCTGGGGNCCTN 
TTGGAAAGACAGTGAAANAGAAACCGGGTTTCGGGGAATTITTAC(XAA>rrCa:GN^ 
AACCGGGT^n^IACCCAACCCCNNTTGGGGAANGANCCNANTACTTGNCCAACCTT 
TrrCANNAAAGNrrTGANCACTNNTAAGGCCCANTACTTTTGGCTTATGATC 
GTNCCCNTGGAACCGG 

SEQ ID NO: 377 CGTNCCTAAANTGAGTATCAACrGNT>OTGCCATANCACTGTGNNAA^ 
ATCCTNnrAAAAGGCGAANGTGTOACTGAGGAGCTTGNCAAAGTGAAGC>riTCATCCANCATA 
AANKTGTCNGCNATATTANTCACNANGTTANTGGCTGCTNCAANCTTNATNACANTANOT 
ANCCTNNGACrGCTTCCTNANGAAAmCCTGCNNNGNCATNT(n^ 
TCGACTTTATCTCCTG 

SEQ ID NO: 378 ACAGm'GGACCTNn^GNNATTAGAGGCNOTNNNW^ 

GCCCCACTGGTCTATTTGGNCCTCCTGGACCTCCAGGTGTAATNCGGTNAACTNCTNGACrNAAGC 
AATCATGGACKITATNCTGGm'GTTACCAGGGGAATANCCAACTCITrTACTGTATGC 
ATGAAAAAAAAAAATTATNCCTTTNATGACTGGNNAAAGGNTTAAAANNTGTATCNT^ 
NAANGNTTANNTAGGNGGhrrTTNACTNATTCAAATANTTTAA 

SEQ ID NO: 379 CATGCCTCGAGCGGCCCGCCATTGTCGATTGATTTTNGCATT^^mCM^ 
TCTrCAGCITCANCCCTAACAATGTACCOTATTC™CATNTAAAA)^ 
T^CNTCTNNGCTCTCNGGTCCGTCTCTCCTCTCTTTTCGATN 

SEQ ID NO: 3 80 GTACTCITGGTITGTCAATGGGACTrTCCAGCAATCCACCCAAGANCrrCTTTA 
TCCCCAACATCACTGTOAAATAAATAGNGNGATCCTATACCGTGCCAAGCCCANTNACTCCAGAC 
ACTGGGCCTCANTANGACCACCAGTCCNATACNATCACATOTCTATTCANTAAGCCCNCCCTAAA 
CCACTTNCNTCATCATNCAAACANCITmAACCCC>OTGTGAGANATO^ 

TAACTGGTAANCTTGANATTNAATAACACAACCCNACCrGTGGGNGGNTNAAATNAATNANA^ 

NTCCCANTTAANTNCCAANNTTTCTATmTWCCANATTGANNA>^ 

ATTGTACTOAAG>rrCGGAOTATTGACCCTNCGTANATTTNANTNTA 

SEQ ID NO: 381 ACTCCTTGGCCGCCTCACTAGCACTCTCCGCCTGCrrTTTAAAGGC^ 

GAGCCAGCAGCGTGGCCTGCTGCNAAATGAGAGTCACCAGNCGTTTAANCAGGAAGGACAGCNN 

CANGGAAAANCCACCANTGTAAAANTNChrmGGGCCCGGAAAACCTTATT^ 

TGCCCCCGGA^T^G^^^CTNGAAG^^mAACrIT^TTCGGNAAAAAT^^ 

CCCGGATTrGATTANCAACCTCCCNANGGTTTANCATTNANAACCACNAAAAAAGGNGm 

TTNGANACTTNANAAACTTCANCANCCCGGGACnTGAAAATTTT^ 

ATGAANGGGAArmCCGAAAAGCCACNCAACAAAAACCTTTCCCATNAAAGAAAGGTNCAACT^ 
ANATCCAATNCAAAAAT 

SEQ ID NO; 382 , ACCGTTGCACTCCAGCCTGNNCTAGATANTNAGATNCTGTCTCAAANNAATT 
AAT>mTNArrrAKrAATCATAAAANTCTrGATCTANGCCTACNATN^ 
TmT4TANANNAATATTAAATTAA™ATCTNTGNTA(>IAN^ 
TAATTNAATCTCTATNTNTNTTTTNATATTTNCAATO 
TATNAATTTCNTTAAATOTAANATNTAATATThTTATANCNAAAOT 
TANTC 
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SEQ ID NO: 383 CGTACTACCG/lNATGCCCGTTCTTACAACCGGTNTCAAATCGNACACTGTCA 
CCGAAAAAGGTGTTGAAATNGAGGGACCATTGTCTACNTNAAACCAACTGGGAT^^^CGNCCA^^^ 
NAACANTGGNCTNTNNAATGAAATACAGATGCOTCTTTCAGAACTGANAGTNGCCCTAC^^ 
CTATNAACTNGGCT 

SEQ ID NO: 3 84 ACGCGGGGAATGTCTGAAAGTCCATGAGCTGTCTTTAATAGCGGATTATNAA 
ATTGCNNCCAAAGNAACAAGANTTTTrCOTTGAACTTNATNCC^^ 
TGCANATTGGNAACATTATATCANAGTTGTG>rrTKIWrAATTATANATATAT^ 
TTAANGTTATNTTCNTATAATNAANT^^SfTAAT^^^IT^ATC^ 
ATTTCNTNTT 

SEQ ID NO: 385 ACGCGGGGGACrGGAGACACTGAAGAAGGCAGAGGCCCTTATAGTCTTGGTT 
GCCAAACAGATTTGCAANATCAANGANAACCCAOTGAGTrrCANANAACCGCTAANTANGrrATA 
GANArrCTAGTNCTATCATACATATTTAAGTAT>WANTAATTNNTTN^ 

SEQ ID NO: 386 ACTTTTTTTTTTrrTTTI^^ 

ANATTTrTCTTATTTTATAj\NGCNATTACNACAAriTANGNAACNA^ 

^OTAAATNGTTTTTTTTAAAAAATAACTTG^^^G^^ITGCA^ 

CCAAATNTNATTTTArrCTTTGCNCTAAACCAAAATANCnTATNGAAAAl^^ 

AACNCNAAAAACCTTCCGCTTTTAATNA^n^TNAAATOCNNm 

AAANCNAACrrmATNCNATTTNAATNAACTTCAT^ 

SEQ ID NO : 387 ACTGCNGGGACTTCTCCTTGCTGCTGCCATGTGAAGAAGGATGTGTTTGCTTC 
TTArmTNCNATAATTGaWANNTAGGCTNCm 
NACTTATANNCCANTNTTTAANACAl^AGWrTTTAAGTA 

SEQ ID NO: 388 TCCNCGAACCNGCCCGCCAGTGTTGATGGATATCCTGCANAATTCCAGTCTCT 
ACTGGCNGCCGTTACTACn^^GATCa^AGNGGGNTNNCATTTTNTTC^^ 
TANCTATTChfNNCTAThnsrrATTTNGANCNNCNTy^ 
CNATGTATTTNTrAGTTANATTTTNACCTTTAATATTTNTNTANNT^ 
NNTNAATTTNAACNTCTNTNATTTATmrATAATOT 

ANTATAATACNANCrrATAAAANCNNCTAAATNATAATNNAA>mCANAT^ 
CTNTGAAT 

SEQ ID NO: 389 ACTTTGTGGATAAGAAAATGGAGGAACACATCmATGGANAGTGGGCATTTG 
ACATTNTGGAACAGGTAACCANCATGTATAATNAAAT TTATA AGTTTCTTTTT AAT^^ 
CTCnSTNCAGATATNTITmATTNANTTNT^ 

TNTAATNNATTTCTTGTAATTANNTTmATTTNTTTTAATTATATTANN^ 

TTNATTTT^mANTAATATANTTTNTTNTTANATT^ 

NATNNGAGNATTTATTTNTTTNNTTANTTAOT^ 

TTATANNAATA^^TTCTTT^^IT^TATATTTTTA^^^ 

^TANTATNATATTTTATNAATT^^T^A^^mTANTATAN^^ 

ATTTTTANNNhnTTAATANOTAATANANNTATTAAT^ 

TTmNANTAGTATTTTTNATTAATrTATT™ 

SEQ ID NO: 390 ACCNGGGhn^C^n^CTATATCGCNAGNCnm-CCmCCN^^ 

TGANANCCAATGGGAAGGAGCCTNANCTGCTGNAACCTATNCCNTATNAATTCATGGCATAATAG 

GTGTTAAAANAAAAAANTAAAGGACCTNTGGGCTAAAAANNAAANATAANTW 

NGGAATC^mNTNTAANN^ITITNAAAANCAAATTGNCTGCTO 

CACTGNAGGTTTGTGTTGNACCNCTNGNNCTTATTGGCNNCATTAGTTGTGT>rmGNAJ^ 

ACNNTNTNGACTCGATAANCATTTACAAAAATTCTTGGNNTTGAGCCATATCNGNAAAA^ 

TTCATNTGNATGTANAAGATTATCTITAAGNANTrCTAATTTTTGCCAGTGAACT 

NTG 

SEQ ID NO: 391 CAANT^^^AAAGACCCTNAGGAGTTCATGGANCACATATATGTTCCCANAGGA 
CAGTATm*ANCACCTGCGGCGCCGGCTNNGTCACTTGNGGATGGTGTGAAAAATGAGAATNNTTA 
TATACAGACTNCTGAACANTGTNATGTmNmAAAGGANGATGNOTCCA^ 
AAGAACATAANTTTCTAGGAGAAACTAAGGTTGATNAAGTNTACATGCGGGAAAACAAATTNA 
TGCrrGTGCATCATGGNCACAAAAACTNNAGNANAGGCCACCTGTGAGAGCACATGTCCAT GATG 
GAGGTTCAACATCTCAACATGGNAATGACCTTTNGTCAGAATNCNCCATTGGAAAAATNTGATm 
GCTGOTGNTNNACATCKmGTGCrriTTNCX^AGGGA^ 
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NTGATTCANGAACTGGATTGCCTTANTGGAAACmGANGAAANGGNTATNCACCANTCmTACCT 
GANACTATANAGAAAACGTTCNTTTNATTGWITCCCAGATGTGTCA 

SEQ ID NO: 392 rmCCGAAGCCGGNCCGCCCCGGGGCAAGGGTACACCTTGGTTTGGGTGTTT 
AATAATGGGGNGAATGGGGGGTNCTCC^IGGNAACTTOTGCTTTATTAT^ 
CCCAANCATTANATOANTCCATAAACTACCNCAim'ANAANNCNTCCCCCCTACT^ 
CCmCTCGGOTGCNAGCCATTCCACACCNCTACCANTCCACAANCTCKNCCATO 
CTACCTNCCCATCANCCCCCTCTCA 

SEQ ID NO: 393 ACGCGGGGAGrrCCGTCGCAGCCGGGATTTGGGTCGCAAGTTCTTGGTTGTG 
GATTGCTGGGAATCGTACmGACAAATGCCAAACTTTCGGTGAAAAACTCTTGACTGGGTAAAG 
AANCCATTNCCCCTTGANGNTTGAACCCCANNTGGANCANCCATTCGGANGAATTGTTCA^ 
CAAAAGATTCCAAANNNTAAGGGAAAGGGCATTCCasriTCCTNGAAC 
CTTTTmTTGAAAAAraANCTGGGAANAANTGGCCGCCCCCCT^ 
AAAAANAAGTNCNCCCTTNGCANCTTNGNTGNNCTCCGTCTNT/^^ 

SEQ ID NO: 394 CGTGCACTCTTGNTTTGTCAATGGGNCTTTNCANCAATCCACCCmGANCT 
TTATCCCCGCATbn^CTGNGAATAATTCTGGANCCNANNCmGCCNNGCCCAT^ 
GNCTNATTNNGACCACCmCACGNNCNTTGACAOTCCnTGAGC^ 
ACATCTCCAACCCCTTGGNGGATG 

SEQ ID NO: 395 ACGCGGGTTGAAAAAGAAACAAAGGAATACTTTGAGAGTTGGGGAGAAAGT 
GGAGAAAAAAATGTGTTTGAAGCTCTTTCTGAGCTCATAATTTTAACAGCT^ 
GGAAAGGANOTCANAAGTCAACTCNATGAAAAGGT(>IACATGCTGTTTGCANATTTGCGATO 
GTTCCCCCCCNCCNCTNCa^ACNCNCTGNGNCCATTGAO^GGTGCTTCrC^ 
CNAGANGATTCATACNCCTGNANTTATNAAAGGCTCANCGAGAACWANNCTOTC^ 
NGGAAGCNAANAAGGCTNACCCANGCm'CTAAANAANACNGCANTGGCTGCCGCrTAAG^ 
TACTAGCGCCCTCCTANO^AANANANGGGNhnslCCTGGGAAAGTTCCATCTCCCGNNTGGTGGAAA 
CCGCrrAACTTGCCTATTCTNTTCTATTTAT 

SEQ ID NO: 396 ATNCNACCCTTCAANTTCATANGGTGGGGGAAAACCCGCNGGA(nTTTANAT 
ANTCCNGGCGGTTTNCCCTGGGAAACOTCCITGTGCNCITOT 
CGGATACCTNTCCCCCCTTT 

SEQ ID NO: 397 AC iTm -iT mririri ' iTnrvmm AGCAcrr^ 

CAGCAmAi^AAAACAAACnCAmmAAAGGAGAACCAANAGCAGGGGGATTGGCAGCCAACTA 

TTACCAAGGCNAANCTTGTTACAAAAGGAACCTTTGGGNNATACAAAClSn^AGGGCT 

NNACTTANTAAAAGGGNTGTGTNGGGAANTTNAATTITNTNAT^ 

TTTTArNTNATGAT 

SEQ ID NO: 398 ACGGGGTCCXTCACCAGACATTGAATCTGCCAGTTCCTTGATCTTGAACITCT 
CAGGCTCCANNACTGTGAGAAATATGTTTm'ATGATATTTGAAANCCNCCC^^ 
NTATNNTCANNTNAl^ANGACTNNGATAACTATO 

SEQ ID NO: 399 Acn- iT iu-i- iUTrn - iT rr i - rrri -i 1 i rrrrri i NGGGGACAGTGCAANAANANA 

GGGGTGACCTGTGAATTGGTGCTGGGGANCTGCTNAGGCCCAATGTGAGGCANCACTAAANANAT 

GANTAAATrrAGGGNGATCnrrANCCTOTCCTACCC^GGCAANAAGGGTTGGGGAOCG^ 

CKCAANTTGGCrmCANNGTATATTCAAGCCATTGGGCTNNAAANANGC^ 

ANATTTACNNNGNTrGCTCTGGGGCCANAAAACCTCAaNfAAACCChn^ 

AGGCCCCTTAAAACTTNTGTTANCCANCANANAGNTCCATATNANCANAACCTTTA^ 

NAAA 

SEQ ID NO: 400 ACTTCACACANGATCCCAACCCCCACNNANNTTCAATGTCG ACCm'C TGATC 
ANTCAGCTTCATTGNCTGCAAACAGCmCTCTCAGGANATTCATNAT ANATA NTCrriTTGTAANC 
CATAANGANAOCrATTAThriTNCTTNNCATNA>fNACT^ 

SEQ m NO: 40 1 ACGCGGGGAGCACGGTTCGrrTTTC(nTrANrCAGGAAGGACNTTGGT^^ 
GGTTTACANACTTTTCAAmTAATNANTTATITmATANTGT TATGAT ^ 
TCNC^^mATT^^^GTTTAAlmTGT^r^GTTTANW 
NATTATAAATTTT 
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SEQ ED NO: 402 ACTGCA rn - lTn 1 1 1 i l I TTmATAAGGCrTATAACTATGGCTGGATCTTTTG 
CTCTANTOTCTAAGAAGGGCNATTmATTTT^ 
ANTT^GNANAT^^SrTTANGCATrANGTANAl^rr^^TTATA 

SEQ ID NO: 403 acggatgctacttgtccaatgatggtaaaagggtagcttactggttgtcctcc 

GATTCANOTTANAATGAGGAGGTCnWOTCThrm^ATAm 

TTCTrnrnTTTTAATACAAATArmmOTACT^ 

NGCrrNTTAATNrrATNXm'ATrATNAATCTrrATNATTAAm 

TANNNGGTNTNCANTIOTATOAhrmGTAhrrATAmATTNAAA 

TCTAr^T^KGTTKTTTTATAATAT^^TITAATATA^^ 

TTANTTATATTTrmriTrnmTNKrA 

ATNrrrNTATNhnOTGATNTTANTNTNTATN^ 

SEQ ID NO: 404 ACTTCAAGATTAGGANNGTTGGGTTTNACATAAATGTATTCTCTGGTNANGGT 
GGCTGGNATATACCTTGACCCACCATCTTCAGAANGACCCATGTCACGTCTTGACCATTTGGANCA 
AAGCCATGTTCACACTGACCTAATNCAAGAGTNTGGAAGCATTGGGCTGGTTATACATTCTATTTC 
r^AAATTNATCCTNCCCCTNT^mAGGCATNGANAACC^TmATCANATCAm 
NATTn^COTCAANATANTTATNGGNCTNnsiTC^ 
CTTTCCNrrAATGGGGGAmCTNATTTTOGCNATTCCTTC^ 
ATTTNNCACTATT 

SEQ ID NO: 405 acncggggggtgaagngtacaagctcctcctgttcccaccctgaattaaccc 
ctcaaacacanaaccncctccgntgc^tgtttacaagnggtotcaaggcatcaataacctgca^^ 
acttgtngaaaactantaaaagcaaatncaatgtcatgctgnaatcccactttgaaaana 

ATAANATCTACTrGACTCTGCnTCAACANGCrrGCTCGCNa;AT>rmT^^ 

TNCATNAACANCCAAAAACAACNTNNTAATAAACTATNATAACATNAACa^TO 

T^mGATCATTANAAAGCCTTNCT^^T^ACCTNATTCATCOT 

TNNTmTATTGGGATNTCCACCANCCCAGNAAAhrmcrm^N 

CTATTMNNTTCCAAAA 

SEQ ID NO: 406 ACNCGGGTGATCCTAATGTGGKTAOTACTNNTmXjGANANNACTCCCT^ 
GGCTAAC^fNNCTCNANCANQACANANNATAANGAACAAAA^iATAT^CCTCCTATGT^^ 
NGAAGGGO^TACAGGATCrACGAACOTGGAAGAAAANCATGGNGCTCAGGAATTCATCNTCTAA 
CATTTCACTTCCCCACCCACCCCTTAATGCTCCCACmNGCAATNATCTCTCm 
ANAAAGGGGGAANTNGNGCCTTTGrnTNCAGNTNTGCAACAACACANCTTTm 
AmA>rmGGAGAACTCTAANACAAAAATAATTrTTTTNATNAAAATO 
ACTTNCTCATGGCCOTTACAANGGNCTCCTTANNGGTTTTT^ 
ATTCA^™AAANANGNTT^^IT^^NCCA^^mGAA^ 
AATNCNATATGGCTNAT 

SEQ ID NO: 407 GOmTGAAAGATirGGGGCCCCTTCTANGATTGCATTTCTTCGANGCCC^ 
CGCNCAGTGGTGGATGGNATATC^^GCATAAATT^^^NNTTCrTANCGT^^ 
TCrrCTTNGANTTAC>rrrTTNAAGNGN'ri7"rCn'ATTANC^ 
CTGNNTCTCANTTNCCTCAANACATTNTTCTAA 

SEQ ID NO: 408 cgtacccatctcagatgaatggntacggatcatcacctaccttttcccagacg 

GACAGAGAACATGG^TCANA^fNCAAGTGCTAGGCCC^TATGANCAAANGANGAATNATGCT^ 

GACAGTGTAANNATATGTTTATTATCNCNAAACGOGANGArWANTTAACTAC^^ 

CrGAAAATG^^^iAAAAAANTCr^CGATGTr^AAAANACCCANGGGCTTG™ n ] 1 iCAAAAAAN 

T^«^ATT^mACNAATNTOTTATTGGGGGNN^^CAAACT^^ 

CTTTT 

SEQ ID NO: 409 ACCCGGNNGGGTNGCTCNTATNAAAACCTCATNACAAGNCATITATTTCTGT 
AAACTGCCATACAACTTACATGTmAhrrATTAATTGCACNAAGTNTATCAANACTTA^ 
CACAAAATTAANTKCTATTITAACGGTTCKrCTTTTCA^^ 
GNACATTTTTGTTAATANTNANTTTTAT 

SEQ ID NO: 410 GNNCGNCCGGGCACGAACCNCNGGGGANAGATNNANAATNATTGCCCAGNC 

tagattaccaggtggtaacccctanacc>rrctgcataatgaantanctataaataactcttcanng 

ggagccanagctaacgaccccctaaaccanacgaoctacctaagaacactcngaktngcac^^ 

n^^ggtatgtaan^lgannantggnaana^^^tatatgtanaggcnacnaacctaccgctgct^ 
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ATATACNGTTGTGGATACATAGAATGTTOTCrTANCTNAAAAGTCGGCCN(>IG>^ 

TCNNTNNTGGNGTTGAChrrGTTATTCCAAATGAGGAACNGGTTTTTGGNCACTAT^ 

TGTTANACCGTGTGNTTAANGAATNGTGCTNCAAAATTTTTCTCCCNTT^ 

GGCrAATTTNCA>mCACrN>nSIGGGGCGGNCavlANCTGGTO 

ANAATTANCNGCNTTCCGTGrrCNCTNTTGNNANmTITCTO 

CGANGTTTACCTNC 

SEQ ID NO: 4 1 1 ACCTAACAAACCCACAGGTCCTAAACTACCAAACCTGCATTAAAA ATTTCGG 
KTGGGGCGACCTCGGANCAGAACCCNANCTCCGAGCAGTACTNNGTTTT^ 
TTT^^AAGGTAGTGGGCGTGGAGC^^'GAACGCITmr^AATTGGTGNCCT^ 
GGTGTTTTAGAGCTTNCTTTATGTATTCATGGTTCATTGGATNTm 
GTTACTNGTTTOGAATATATTTTCAChWGCATATm'GGTTATTT^ 
TTNAATAAANCCmmTATCATTTATTGNCATTTATGANAAAANGTGT C^ 
TNTTCCOTGmTTATTGNTGNCTNACCTTGTTG>™TCATTTT^^ 
AANACT77^JTNCNCTATACCCTTTTTT^r^TNGT^ 
GGGTTATGTCrrCTNCTTGGGGAATTNTTNTATAANCTTITOT 
NTTGTGNGTCCCCCAANCTCrTTGGGAGGNNTTTCGTGGCG 

SEQ ID NO: 412 NTGTACii'i'n'i i-jTrrj'i'i 1 n J J l i i J tttngagaggaaaaccc ggta atga 

TGTCGGGGTTGAGGGATAGGAGGAGAATGGGGGATNGGTGTATGAACNTGAGGGTGTTTOrr^ 
GTTAATNAGGTCTTrATTTTGTCNNTNTTTGTGATTTATTNA™ 
TNT>n^rrCTNNNTCTNTNArrCCrrNTTmAT^^ 
NTATCNTATTTCTTNATACTNTTCNTTCTTmTGTTAT^^ 

>nsrrcTNTrrTTNTTcrrTTATATTTn^ 
NTT^^^ATT^^T^mT^ATNACTAT^rITCT^^ 

i UU li ririNATATNrCTAANNTATTTTGTmCTATNTTTNATATC^ 

TTITmAT^m^•CTmT^^TCTTNG 

NGTTTTTTANTTCTTCTTCTTTTC 

SEQ ID NO: 413 GGTACl l - l UU- l ll "l ll-l l "ll'lUl T TrrTTCCCTCCCCACANAACCCATCTCAAAT 
CATTCTGTTAACCACCATTCCAACAGGNCGAGGAGAGCTTAAACACCOTATTCCTNNGC^ 
C^CTNCTATTTTTAAAAGGTTCNCANCAAAANTANANANANC^^^ 
AAATNTACAAGGGGATTTACANGGTNCX:GTCGGCCC^f^rITAANGNAGAACT 
GGACAACCAAGNATCNCCAACTCNGGAGGNTOnsrCNCCCCTACCTATAAAANTTCCCCNATN^ 
rrACAOTAAACCCGGNGTNCCCNmACCTGNTCKrAAGAAAACCCCTCTN^ 
AN>rmGGNTCTCCTTNCAAANGTAAmCCAANNNTANNC^ 
TNAAmCCONCCTNANTAANNCChnWGNATAAAATATTCNAANAm 
GNCCCNANCXACNCTTAAGGGCNAAAATCCCAAhn^CANTNGGGGGGNCCTTATT^^ 
AACNCNTGACNNNACCTTNGT^^'AAATT^mTGNN^m 

SEQ ID NO: 414 TCNAGCGGCCGCNCNGNCGGGCNCTTTANANATTGGCNCTCATTGGNTGANC 
GCGAGACNCTTAGNGCCATGAAATNATTACACAGTAGGAACAGGTTCTTTATNGAGACTGGACrG 
CATNGCTCmAGATGATGTNTGNGGGGTAATCNCAGACTCTAACACTTACTGTATTACTTTATATG 
CGTTAGTTAACNNTCTAAGNNCTAAAANACGTATAATGTTNNNTAGGAC 
TGGAATACCCANNTNNTGGNNTACTATNGATNCTAAGGGGTCAGGCrTATC 
TAGNGCTNAANGAGCXJTAAAATGGCATCT^^TNAAAATCTCTATGCATAATCTATGC^^ 
>mAGCACAATTTCNTTCNAAGAAAGCCWTTCAACATGAAGNGNACTGCNTGGANGGGAAAGAAC 
ATTGGTACTTTTACrmAAGANANCGTTAATTmCNCAAANGNACA 
TGNAACATAATATTGCGCTTTCNAAGNTTTATCAAGNGGCANNNAAATTGrn^ 
CNNGGCACCCTANGGNGA^^^CCNGNCCKTNNGGGGCGTAC^^^GGGACCANNTTGGTC^ 
GGNAAAATGGCCNGGTGGNCCCGGGGAAANNTTCGGTCNGTTCNNAATCTTCGNCNGGACNTNA 
GNTANG 

SEQ ID NO: 415 QGTACT ri TJ- i lT ri i ri 'l'l' l lU'i T l Tn T niTi - lTlUTil U-rJ'N'iUTlUU-^ 
^T^CT^CrITrcCCGNGTNCCCGGGTT^^rCATTGNAATATAGNGACCTGAGTCCAA^ 
NTmTTTANAANAGGNGGGCNTGANTTAAAATGTNTCCnKTGNGAAAAAATACC^^ 
GGGGNTNCAAAANT^r^CCTAACTCasrGGGGCATAAT^^^GGANAAATAT^m 
GGNCCCNAAANTAAAGGAAGTGAACCCNAGGGCTNANAANCTGTTNTNATTTNAAAACAAT^^ 
CTGNCCATTTTTThrrAGGAAAACNGGGANTNCTTTTT^ 

A^^^TTCCNGGGCNGGCTTTTNAAANGGNGNATTTCANCACANTGG^GGGCGTTA ^ 

NATCNTTGTCCAATCTTNGNGNAAAAATGGNAAAAATTNriTTCT^ 

ANTTCCCOTAAATTTOANNCGAANTTOvIAATGTAAACCGNGGGNCCAATATGTO 
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TTANTGG>nvJNNNTN>n^ATGTCNCTTT™ 
CCGNANANNGNTTNTANTNGGCNTTCCTCCCTC 

SEQ ID NO: 416 TCGCGGGTTCTCGTGAGATCTGGTNGTTTAAACGTTTTTGGCACCTCCCCACA 
CTGCTTGGATCTGCTCCTGTCATGTAAGGCACCTGCTCCCAGTTTGCCTTCTGCC^^ 
TTACCTGAGGCCTCTTCAGANGCAGAAGCCGTCATGCTCCCTGTACAGCCTGCTTAACTATGA^ 
AATTOAGCCTCTTTCCCGTATAAATTACCCNGTCTCANGTATTTCTTTATTC^ 
NACTACTAAGACTT^ATTGAANAAAmTAGTTGCAAAGACAATA^TGC^TGTG^^^ 
AAAATTGAGCCNTTTGAGAANAAACCGAAACrTCAGNCATTANACTNGNGTrANTrr 
NACTGATGTTTmTATAAANNCCTAOTGGTGGAAGTCATACW^ 
GGTGANATTTAACTGNATNCNCXrrGGGGTANAAAANGGTTTrTANNGGACAAA 
NNTTAGGNGNTNGGANANNNANNGCCTACXrAAAATCCGGAAAGCNOTGNCC^ 
TTANGGNNAANTTTGGNGNATCGNGNGNGNNTTTTTrrGTGTNCCGAATTNTAN^ 

SEQ ID NO: 4 1 7 TTGAANCTTCCTTrrGGGCCTTTCTTTTCCNACOT 
GCCGCKTTTTCCTTTNGGAANGNGGGGCTTTm'ATA^ 
NGNCTTCCTCCAAACmfGG 

SEQ ID NO: 4 1 8 ACAACTAANATTTTATTAGKNATCGCrraGCTTACACACTCCANGCAGGAAG 
TTATTrAAATCACCrCANANAAAACCTGNGTGACCrAACCNAKTACOT^ 
TACtTCCANANTANANTCACCNCAAG^rI^SINNANGGNC^^rAGGCCCT^ 
TANNTGTCTCANTGTAGGGANAOTAAANAGCGTGTCTAGCrCNCNTOTCTAC^ 
ACTNNCATGATGAOTCGTACATANTANTCCGTGCGNCACAAACATNGCTGANGNANGTGGC^ 
CATTTCCTl^CACACGATOTCTGCGACCCGCCNGGGATTAATACTCCNGTATGCT^ 
CTCCACAGCTANATTAACAAAATTGCTOlGGNAGANCANTACAANNCTrCN^ 
GACCTNGGCGGGGCCT^^^^ATTCCTCTACAGGNANC^^^fGNCNT^^ 
ACCTTACC>mCTrTGTmmCCTTTTTCCGGCrm 
CNCTTTNCTTTCCNGGGCGGGCGGTGAAAGGGCOANTTTAAACN^ 
CC>mCNTGNTCNACCNTGGAGAANAAANT7NJNAANTGNrrCNNNGNGAATGT^ 

SEQ ID NO: 4 1 9 GCGTGGACGCGGCCGAGGTACTAGAACGGGACTCATCCAGAAGTACTATGCC 
CTCCTNNGTTAANGCTCAATCATTTAAGAGTAAACNCAAGGAGAAGCCNTTCTGTAT^^ 
TCTATGTGGATTAAAANGAN(>nWANTGAGANTGGNCTATGATCTNCCNn^GGCT^ 
CTACNANrCCTGCTGATCCGGAACTA>OTACTATGNCNATAAAAGNmGNGAGAAAGN^ 
TAGNCAATCTATAANGCGGNNNAACANCTGCATACNNNAATGCrGGCAO^NATNAAN^ 
NAGTNGGTGACCrATGCnTATTNTATGCATGATTGNGTAGATTTGGATTNNGACGNNA^^ 
AAGAATT^^^AGNATCT^^CCGTATCTNATATTNAGAAAGAAAAAATGGTGCr^^ 
AATGAGTNOTCGGCCCXKrGACNACCCTAATNGGCGATrTCCAGGNCACTGGCCGGCa^IGTC^ 
ANTNGATTCCCAACCTTGGGNCCNACCTTGGNGGAATCATGGa^AGCCNGTTTC^ 
AATTGGTATTCGGNCN>nSITITC(>n^CAANAATANNANNCNGNT^ 
NCCAAAGAAGGNCTTAACNCNATTATTNGGTTGGTTChmCCCCmAA™ 
NTTTTTANTAANCCC 

SEQ ID NO: 420 Acr rrrn r rr ri - i - i ' i - n - i n i i i aaaggaagggggngtnnacctnnanccctt 

riTNAATGGGGGGNNGNTTTAAGNCCNAChmNGGGGGTAAAATTm 

^TmCNNAGGGCCNAAAAACCTGTNC^^T^TNGGAasrAACAGTAAA^^^ 

GGGTTm'GGGGNCAAATTNAAAGTTAANNTAAAATTTTNTT^ 

TNGGANGGTmGCCCCCT^^mCCTATAAAT^^^TCCNANTNT^^ 

TTmAACTGTTNTAAGGAACCTCGNNNGNTrrCGGGGGriT^^ 

TTTNNTAGrrAATTCrnOTNCAAAAGGGATAGGGGTAACCCT^ 

TTTTAATTTTTCCTTGGGGGANCCTGGGCCGGNACCACmTAAGGNCNAAT^ 

GGCCNTTTTAGNGAACCNAACCTCGGGACCAAAmGGNGAAAAAAAGGCAANANT>r™ 

NGAAATTGmNCCCNTNAAAATTCCNAAAAAAANAACCGGAACTTAAATGTAAACC>^ 

CATNGNNGGCCNCCNNTTNTTGGNNGCC 

SEQ ID NO: 421 ACi-j"j-i-n"ri iTn ' rri 'i' j m i 1 1 iTTNGGTrnTnTrrm-i-i-rrri nniii i 

TTTTTT ri - rri T ri - l ' l 1 1 i l U 1 1 1 1 1 1 rnGGGGGGNTTrn-AAANNAATTTbrrrAGGGGGGGAAAAA 

AAGGGN>rn'ANTNTNTTGTCCANGCTGGCCTTNAACNCCrGGNC N^ 

GCCNCCCANANGGGNNGGGTTAANAANAAAAATTTbmTNTTW^ 

AAAANCAAGGGG^^^GGGGANTITGTTT^^^TTTNAAACCNNT^^ 

TCTNACTGGGGCa^CCTGGCTTTGGCNGTGC>n^ACATTGNT^ 

CCATmGGATTAAAAAACACCTCCTTTGAAATTAANNNCCCTIT^ 
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GNCCATTTAAAAAAATNGGGGGTTGGACCC^ICCAAAGAAATTGGGGGAACACAGTNACCCC^^ 

CAAGGNCCNTTTTAGGGTTTGGTTAAAAACTTGGCCAAACCTCCCCTANGGNAAA/^ 

GNAA>mJAAATTITn'AAAGGAAACCNCCCCATTTGAAAAAAAAAAAAATT^ 

SEQ ID NO: 422 ACANTATGTNTAATNNTTAAAATGrmArTATTTGGAAAATAANGCGTGTAA 
TANNATGCCAGGGACTGNCAAANGACTTGATACAGGATGGNTANNCTTGTCAGCTAAGGNCACAT 
TGNGCCOTTNTGACCTTATCTrCCTGGACTATTGAAANCNAGCTNANTGNA^ 
ATANCGATTGA>nWGGCAbrrAGrrAAAGTNATNAGCATGATNAGAGTTNCTGNCA^^ 
ANAACCTGATT>rmAGN>rmACANAAATGTCAGANTTGCAGCTATTGCN 
GCCNGCTAGCTAGGTTAAAGAr^GGTT^AAATCr^GGGAT^^^^GCTT^ 
GANGACAT^ATCTGANAGACAAGTTTGTNGCNCAGTTGCCTGNTANAAAAAACCTTTG^^ 
TNKrTNTNCAANAGCCAriX}GNAAAAATTCCGAGGAGNTTGGA>n^GAAA^ 
ACmGTTGCCATTTTAAANANNANTTAATGTATOTTNAANT^ 
GC™GATAANAGACCTATCAA]mmTrGCTCAAANGCTTb^ 

SEQ ID NO: 423 GGTACTGTNGNNTCATOnsrrGGGANNGCCCACACCNACANNraCa^^ 
TATGa^CNACATNANTGCCNATCNNATGGNNANC^WC>^AGANCN^n^GCT 
NTTNANNNCNCCNNCANGANGATCNCACANTATGGACOTGCTCCTGTCCTTTO^ 
CGTCNNGCATCACGAmGGAGTTGCTGGCCAAGGTGGCTCTGATAANCAGCCNTGGTGTNT^ 
NGATATrrCACGAAGACTGGCKrTANNGGACCATACCXrrGNAimrrTCT 
rmATNCCATGGAlWIKnNAATCAANGTNTGCTNTGGTCCTGAAGC^ 
ACNCCTNAA™AAmATTAAAGGGAA>nsn^CCCTATNCCTGANGTGGGGTQCC^ 
TACNTAANTATAGACGGGCTAACCTGCAAACCATmnrGAGAAATGACTCTTNT^ 
GGTTTTCCAAGATGTCNhrrACCANACNChrmCNTTGAGAAGGNTTNTTCCCC^ 
TGTTATC^TCCCTTTTCNCTTGAAGGGNAGATCTGC^^mAGGGTTCCNTA^^ 

SEQ ID NO: 424 GGTACrrTTrTTTTTTTT>m 

CTTTAAAGCCTTAGGCCGTATGACTAAATGANTAGACTG>IANTGACNGCGGGGAGGAAGAA>1CA 

NANGAAAGATNTTAATGAGGNGGTCNGGTTGGGGGAAATAANNCGAANATTCNCTNCCAGGGTG 

AGTCCTCACACTGGCCTNATGCCCTTGl^GANTTGNCNNCCAAACACAGGCTNGNTO^ 

CTGCAC^AGCAGAGAAC^'GCNANATTAGGGN^^SrACCTNACAT^fNOT^ 

GCANAAATGOTGTNGCTTNrrrGNAGG>™AGATAAGTGNTCCGGGGCT^ 

ATTrCmCCCATNGCTAAAGNGAATNCATCCCATACANNTGKTATTTANCNTN^ 

NNTTGGGCT^mGGTGNCCCT^^^GTGGGGTAT^GGNACCCAATTCATACN^ 

ANCCAATG>mACTNTNATCNNGGGAGNAACNANAAGGATACCT TCAT brrCNCAA^ 

NNCCTGGTTTCCAACAGAGGCCTTTTCNTGNTGGmAATNANTGTm 

ATCACATNCTGGGNmWCTNTGNGNhn'CTAACNCCOTT 

SEQ ID NO: 425 tcnagcggccgnncgggcgggntctttttatgagaagcgtatggccacanaa 

GNTGCTGCTGACGCTATGGGTGAANAATGGAAGGGTTATGNGGTCCGAATCAGNGGTGGGAACCA 

CAAACATGGGTCCCCCATGATGCAANGGTNTCrTGACCCATGGNGCGTGTCCGTCTGNrrCCGNGC 

AAGGGGOTTCKrcTTNNAGACCACGCGANAACGGNTNAANAGANT^ 

TNCACTTGANAGNTNCATATNAGATCGTNCrTAThfmTN^^^ 

ATTNTTCCTGNACTGACTGATACGGAKNTITGCCnTrGCCNGTCT^ 

ATCC^n^GGATT^^CAAr^C^^'CTITAAANAATATNA^r^GTCT^ 

CCTTACAmTAAGATNGGGNATTAATTCTNTGACCNATGClWTCNCATCAT^ 

ANrmNNNNNCTNGAACTCATACCNGCTGCGTNAT^ 

CbnsfCACNACATATGGCNNANTNCrrCCCANCTGGTGNCCNACNTAATTGGAA^ 
NCmTWmGGTNAAAGThNAGTNGTANNTNCTTCNTGGG 

SEQ ID NO* 426 CACTTCAGCAGCGNGGCGGGAACCTGGGGGTATTGAANAACNGGCAACCNC 
AAANTANNATTTATNCAGGGGNCAGAACGNCANGCTAGACTT^WCTTCCACT^ 
NGGCAACCTGCAAGACTNAAAAGNAGGAGANGANGNNGAATNGOAAGNATCANCGGACCGACG 
GACTGGGAAACCCANGGCNGNNAAACGGGNGAAGANNAAACAAGAAATCCNCCCTGAAGAACC 
GAANGANGGGGACAAGAAGGGGNNNACCTGACNCACACCCCGGAAGANGNCGAAGGGANCGNC 
AGO^GGAAACCGGAGGANNANANAAACTGGGGAACNGANAACAANAAAACATACGGGGGCCGG 
NAANGNGCCCCGCAACANNATGCGGGGGNAAAAGAAACCACGCCCGNNGGGCNGGGAGNAG 

SEQ ID NO: 427 ACCGATGATACTGNCNCTTGCNCTGA^^^ATNTAAACACTNCACAGTGr^NAT 
ATNGGGAANATATNGGGAAGGAAATArnirmNTNAAANATGAACGCTGNC^ 
>rmAAACTGGCTCACTTANAOTCTTTNfNAGGATGGGANG 
NCCTmC\TCTGATGTTCNGCGGT^rt^CANGGACC^TAGCTACCTATC^^ 
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NGAT^ACTAATGAANANTTNTNACNANNCAANACCTGANAAT^mWN^ 

AGGNACAKGAGNGGGATCATGGGCNTTTNCANTAAATCTNCCTGQOTTCCCC^ 

TATTATGNCNTNGCTTAArrGNmTKrCCCTGAGCTTTA^ 

CCGNTC^mX3CCTTTNCAAGGGGATT^AC:NAAA^rmGGTGNATAC^^ 

CrrrCACTACTNGAGGGTCNACCATCCTNATTAAATGTOTGACACCATCAATNA^^ 

GTTTCATCTNTAANAAGANCa^ANTATTTTTATTTACANGCGGGTN^ 

SEO ID NO- 428 ATTTrrGGAAAAAAATAATTa^CCCCCCCCCNCCO^TTAC>n'GTCGGNCC^^ 
CCNNATGGTGGNCTAATNANGAAGCNGNCAGANTITANCGANCCThWANAA^ 

agctgnagctatnaaaaaaccancnggggatgatmgataccatcagritcacanann^^ 
ntgaagcattactnnaaaactgggaaaaaaggcccncttcggaangaactgg™ 

CCACAAAT^GCACAGNTGGGGGATCTTGTGNATNCCNCGAANCACAACNGAA^T^^™ 

gcnangccngggcncccagacgcngcngcgcaaaaccggcgaanacncnacctcccnaaacaan 
ctanaanagnngcaggcaaaanacagaaagcctt 

SEO ID NO- 429 AGCGGNCCGCCCCGGGCANGGACATTCCCTTGNGGATCCTGCTTGCTTTCGT 
lu^AAAGCACCANTTGGNACAACCTTACCCCCGAGTGGCCNAACCAACCr^ 
ACTTTTTnCCTTGGGCCCTTAATmTTTAANTTAACm 
NTTGTTNAAAATGCTGKrrCCCATACNAATITANNTTANCCNCm 
AAANCCCm"GGGTTrANNTGGN>n^AAAGGGGGGGTrrAAAGNNGGGGGGGG^^ 
AAAAAGGGNNTrnTTAGNNGGGC>nTNTTGG<XA>rmAAAAAA 
TTAAAAAAAAANNCCNGGGGANANAAAAACCC^WGGGGGGGGNACCCNTCa^^ 
AACCNNCCNNrmGGGGAAAAAAGGGTTrrAAAAGGGGGGCNNC 
AAAAAAAANGGGNNAGGNAAAANNNNNCNGNCXmriT™ 
ANn^GCCAAAAANAAAANAANCNTTNNGGGGNGGGNAAAAAAAAAAA^ 
NTTTTTTTTTTTAAAAAAAAAAAANC 

SEO ID NO- 430 acagaaagtttatactataaaattacatccctaagngattagggtcctcagt 

AACACAKGAATTAAGAAATTOAAAAGGGNCATTGCTCGGGAATCCACATAACTACAGANTAGTA 
GCGCAAGCrmTTGTmCGTGATCAGAAAAGAGACTTTITOAAGAACAT^^ 
CArrATGCCCCTCNTANTTAAAAGGGNNGCCTANGA<>nmWCNT^ 
ANCCCCTrrTTTTTNCCNCAAAANGGKTTTm 

CCNNANGAATGTNCCCTCANTGGAKNATmCTTCANNNGAAGGTNCC^ 

CANNGNGT^^^^rITGANNGTCNC^^■AGGTTGN™AGNANNA^ 

ANNNNCCCCCTONAAAGGANGAANTTmriT^ 

NNGGAGCNGNCAANANACCNCnTTTANANTNAGTATNNTTNGGANTN^ 

CNCNANANTTTNTTTmriWAATNNCTNAAAN^ 

CNTGNTGTTAA 

SEO ID NO- 43 1 ANCCGTGGTCCGCNGCCCGANGTACTTTTGGCCTTTTCTTGGGGATAGNAAGT 
TATTrCAGCCAGGGCCACAACAACAAGAAGGCAAGNTrCCAANAATTTrNAACTO 
AANAArrGGCCGGGGNAANAATTAAAAAANCAAAArrGGNNCCNACCCCCAAG>nTNG>m 
NGG^mCCAACCCCTGTTTCCTTTTGGGCCNAAAAAGGGGGGAAAAAGGATAAANT^^ 
T^r^TGAAAAATNAATNAATTTNCCAAAAANrmAAGGG^m 
AAGGGGAAATTTTTNTNCCCAAAATTCCCTGGGGGCTrOAAATT^^ 
GCNAAAATGGAAACCCCChrrTTAAriTrrGGGGTTrrrGGGGG 

nccnatttncaaacto^gaatagggccntaactt™ 

TTNCCCCAAAAAANCCAAAAAGGGGGGGGTNGGGAAAbrrrGGGNCACC^ 

NGGCCCCCNGANAAAAAAANGNAAAANANNGACCCCCCCAAAAimTTNAGGGGGGGGCCCCCC 
NTAnrCCCCCCTTGGTNNCCAA 

SEO ID NO: 432 Ac rrrr r r iT nT rm i u 1 1 1 1 u rTTTNGG'n-rrriTn i u i u i n 1 1 1 1 u f 

TTITITITITITITITGNANAAAT^^ 

ACNCTCAAATAATAAATNAAATNTAATCAAATKrrAAAAATTGGTNTrAAAACATO^ 
GATNCCCCGTTrGCCTNTAAT>nTrCCNACANAAAAC>fAjm'AAAN 
NTmAANAAAATTTNNTTAANTGGNAGOTGrrTAAAANTACCANT^ 
riTNAGGCTNTNAATANCTTTAGGGATCmANNAGGGGNGGGANNA^^ 

AATACCTNGGCCGGNACCACCCTAAGGGNAAATTCCANCANANTGGNGGCCGTTANNANGGG^^ 
CCAAGCTTGGTACCAANCmGGGGAAATAANGGNCANAGhrrGTTTCCTGGGNAAAATTGT^ 
GCTNAAAATTNCAAANAANATACNANCCGGAAANCANAAAGTNTAAANCTGGGGNGCCNA^ 
GNGNANCTAACCC^mA^TAAT^GGGGTTGGNGCANANTGGCCG^^mAAAmGNGNAAAACNT^ 
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GTN 

SEQ ED NO: 433 CATATATACCCAAGNGTGCGTATCnTGATCTGTATGCTCTTANATGCGCm 
Tm'ACAGCTACNGCACATA>n^GavJCACATANCATNTTrACNCACAANGGT^^ 
GATTNNATATCATCANCNAGCACG>fNTGCTGG 

SEQ ID NO- 434 GGCCGAAGTACGCGGGCCTGAACCCAAGAGACAGAGGTTGCNGTGAGCCGA 
GATCGCACCATTGCNCTCCANCCTGGGCANCNAGCANAAAAACTNTGNCTCAAAGAAA^ 
NANTAGANCNAGACGANAATGGCrmCNGGACAGGAGCATrGCTCATTTGCTGGCGGGAra 
NANAATCAmCCNTGGWGGTCCTTCTNCTTACCCTGGCmGTT™ 
NTAANACTITNTTGGAAGCNAAATAA 

SEQ ID NO: 435 Acr rn - i -i' i Tr n ri 1 1 u 1 1 u iNTGTNTTAriTi-ri-rriTrTN-ri-ri-ri rrn in 

TTmAANGCTTTGTTGTTTGGATNTATGGANGATGGGGATTATTGCTAGGATNAGGATGO 

AANAGGGCAAGGACCCCr^^mTAACTTGTTAGGGACGGATCGGAAAATTT^^ 

AAATATCATTCCGNNTTGATNnsnSfGGGAANGGGT>rrTAAACGGGTTNGGCTA^ 

TKrGGGNCCCCCNNAGAAGNC>mGGNANAANATCGNTAATGNCATTANAGGNGAAAATNAACAN 

AANTNANCCNNNAGGCNNhTTTTNATTNTGAANANAANGGNGGAN^^ 

GGAGGTG>rrTCCTAGGGGGri>nKTNAACCChrmCGTNCAAAAAAAGGAGGNGG 

TAGGGCTmA^^^A^^mAANGGCACATNAAATTGAAAGGTAAAAAAANN^^^GN 

GTCTNCTGANTAACCTOCmAAAAhrrAGTTAAANNAANGNTGACCCAAOT 

NAAATTAAAATNCTAATTACTNGGGCCCCANANAAATNANATTITGNGCCTb^ 

NNCCTTTA 

SEQ ID NO: 436 ACCTTGATACACATAATCAGCCTTTTCAAAAATGCCTGACAAGAATTAGTCrT 
TCCTTTGTGCTAAAAGTCTTCCCACCCATGGATGGAAACAGGCTGACTCCTGGAGGGTC AAGC AA 
GGGGTGGGGAAAGGGGAACACAhrmCTTmGGGAAGGCNAAAGCAAAAAAGGGGTNT ^ 
CAAACCAACNTTGGGCCAGCTCAAANGGGGNCCNAAGCm^^CCCCNAAAAAAGG^^T^ 
TTTTTTOTAGGGCCCTGCACTTTANAAATTTGAANGmTGNA^ 
CTTTTTTCCANANGGAAAANTTTCCGGGGCCANTCNCCCCAAGGNAANm 
CCCNNriTrGNAAAGNAAGGGCCCCGGGGGGTrCTTNCCAAAAAANOTC>^ 
AAAAAAAANCCCOTGGGGNAAACCGNCCNNCTTTANAAhrn'GGGCCCCC^ 
TTTTATTATGGGGGGGGGNNTACT 

SEQ ID NO: 437 CNGACGGGCCCNCAGGAGCANNACAGGAACTACTGGNTNCTGNNAACCTGN 
GGNTGCTNATGTGNCC^CTGGGCACCTTATTGCNAACCTGAACCAAANANACCrCCrrrGNT^ 
TTGGGCCTGCTGTCCAGCTTCCGAGGTGCAGCAGGGTTGTGGGAACAAGAGACGACTTTNAGGAT 
NAAANGACCAAAGGANAAAGCTGCCTTACATGATTTGATTGGGGCCTAGGANATGGAANTCACNC 
mATTNTTNAGAGAGNT^^^TTNACTAATG^mGNAGGCTGAGGNGCANNCCTTO 
ANANGGCCGNACGCGGTGGOTCCCCCTGCAATCCCNNTACTTSTNGTGAGGCNAAGGTGGGCNNGC 
CANCCrN>fNGCTCGAA>rrCA>JVGANCCOTCCNT^ 
TNN^mATCCAATNA^T^ATCNTNNCATCT^mANCTCA^^^ 
GCATGTGGAAANATGAAATT 

SEQ ID NO: 438 AC LTiTi ' m rrrn i I ' i j 1 1 nTn u T r i u- r i-iui-Juiu'iiGAmAATTNTNAAN 

CAAAAACANCGGAAAANGGGATTAATNATrWGGTTGTTANACNGGGNC^ 

NNAGCCCAAAATGTCNCAGGACCGGGGCAGAGGACCAACATGGGCNTTTTGTNNATNACCANGG 

GGGGACCNANAGGGGANCGGCNATNAAAGGGCAATNAANNNCTAAATOCATTGAAAC^ 

ANACAGTNTCOTGCAOTCCCACATNCnTGTACCTNGGTCGNAACCCACNCTAAGGGCAAATO 

GGCNNAOTGGCGGCCNTTNCTAGNGGATCCAAACTGGNTCAAANCTGGGCNAAATCATGGNNAT 

ANCTONTTCCTGOGTGAAATNGGTATCCANTCACAATTCCCCACAACATACCACACCGGANNCAT 

AACGGGTAAANCCTGGGGTNCCNAANGAGTGNNCTAAGNGAGATTAArrGCGm'aGNTAANTGG 

CCGCTTTC 

SEQ ID NO: 439 ACTATGTCGATTCGACAGAACANTTmANGATTCTCGGCCTTGCCCCTTCAC 
GAGCCGCCACCAAGCAGGCAGGTGGATTTCTTGGCCCACCACCTCCrrCTGGGAAGTCTCTTGAAC 
TCAAGACCTCmATTTNCTATCATTCTTTOCTAGACACACA^^^ 
mGAACAAGAGCCATNAGGTANCC^r^TAOTACTTGGGCCNCNmCTNAG™ 
AAANCCTTITGGGTAThmAATAANAGTNAAAAAGGCAANCCCGCAAACAmGNANGTGACm 
GNCCrTTAAGATCriTNNNAAATNAGTGGATTGNATAGTAANNTCi^^ 
GAAAAAACAANANNTCCCTCNTGGGKTCATTTACNTAAANGTTTTTACNTGGGGNAN^ 
AAAGNGNemTGTANGCCCTGCAAGTrGGCTGGG>nTrGANCATrrTNGAGAm 
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ANTTANGTTTGNTAAACC 

SEQ ID NO: 440 AGCCGTGGTCCGGCCGAGGCAClUl'llOTllUU'l^rril'ri'ril'ilTTCGTTTNC^ 
GAAATGTTCGAAGTTAACTCATTTTATTTCTAGGATTCGGATTNCAACATTTO 
ATAAGNCACnrrTTrGCAANCTAAAAANTAGAATCAAACTANNGGTGANCTAGTCCrCTAGGC^ 
CCAAGGCNGATCCTTGGAATCOTGANCANAANGATNGACATCCTACANGGTGCTNGCAATACGGC 
TATAAACCTTCTAAANACTTNACNNCTTACATTGTTNNATTAGGAAAC^ 
OTAAAGGAATNAAThrrGCNAAACATmTGNTTTANN^ 

GCCCACCTACTNGTGAACTGNNANCANAAATGGTCCCAGGGACrcTNAANCNN/^ 

TACATTTTTGAAAACTTGCTCTGCAGGATAAAAAAANNCCNGGhrrGCTGTCNGTOT 

TGANANAA>riTmCCGGTTGANCCTCNCTGGACTNnGhriTO 

TGNAGGGGGNGGGGATATIsrrGTACCTOCCCGGCCGGCGTTTAAAANGGGNAATTTCANCCNAC^ 
GNGGTCNNNACTTNGGGGNNCCAACCCNGNNTCCCANC 

SEQ ID NO: 44 1 ACACGGGGGTCGGTGGCCGGCAGTCATNTCNCGGCCGTNTCANAATTATAAG 
GCTGNCTGCAGAGATTNCGAANAAATGGCAACANATGAAAGCGTCANCANCTTTAGTTCAGCATC 
CTTGGCTGGGGAATATGTAGATTTCACTTITAC>riTGANAATTC^ 

ANATNGCTTGGAAACTATATG>mTGNATAAATNATNCNAAAGTTCCANTATTGGCA ATCATT GG 

GGATCCCCTTA>rmGTTCAANGAAGCCCTATTATTNCTTAATNCTGNNTNA 

TTNTCANTNATNCCTTANTANNAAAAANTTCNANAATTCAA 

SEQ ID NO: 442 CGTGGTCCGGNCNAGGNACNAGAATGNTTCATGAAATCCGNTTTTAAAATGA 
A(>nTTO4TGNGNGCCACA>riTCCTANGACTGGGGC>IAGGNa<CNOT 
NTNAATCTNTNAANAAACNNAATTCCTGCCmAATGCNCNCN^ 
CXrGCCANGGATNCTTTGACTTTGGTTTGCTGCTGNTGCTN^ 
CANGTTNNAAGAAANGGKTGTGGGTTAANGGCTGTCm'AAAAGANCCCTGG CT 
GANTCa^GAr^GCGTT^GTTACCC^r^TTGNAACTGACCCG^^'AATTTNAAC^^ 
TTNAAGCNTmrrANGAAGCCTTCCCGGGANGNAATTTTTTCCA 
GGGCCrGTTNAAAAGGGGGAATTTCNACCCACrGGNGGGCNTrrCTAATGGGAOT 

SEQ ED NO; 443 actititttntttttttttt^^ 

NTTCANNNCCTGAA^^rGN(>NA^M^AANTNAAAAT^^mAAAANATG 
NCGAAGAANTNAATCNTNNCmrGGGGATCmGANCTNCAANGACTG 

GGNAANNAGAAAAACACNACTAGGGNACCCCAAAAAAACCCCATTTATTTTCCTTTGGAAAAAG 

GNNGGGGNCCANGNTAAAAA>rrGNNANGGGG>nsrAAATTACNTTNA>nvrCAAAAGATGG>^ 

NCCTAAAAGmTNCNCNOTNGGANGAAAGNGGGTTAAAA>WCTNAAA^ 

AGGGACTCTAATNGGTCChmT^GGGCTNCTTNAAAANGGGAAANGGAANNA 

AAAGGGNGGCNTTNAAAACCC^^S^NAAA^mGGAAANAAAAAANTTTCOT 

AlNf>nvmTrGAAACCNGCCAAATTTCCnm[TNGGGNAA^ 

TTTAAAAAAAAAATGGNTTNGGGGGGNGNCC 

SEQ ID NO: 444 ACCCGGNGCCCNNACGGNGCCNNACAGATGGCTGGNTNNGACATNGGGCNA 
NNCTGCCAG^^^GGAGCATTGNCGGCNNCCGAGATTTNlm^^CAT^^ 

GAATATNTTTGCCCNGAA(>JCAGAANCTGGNATTCTCATGGNTGAGNTGAGGTGACTGNATGTCA 

ntgagagactgaacacanatcancatacatcttacccatgctcmcaaagactgtgctaagaga 
gaacttgtgcncattcccttccntatnggcac 

SEQ ID NO: 445 ACirmrnTiTTTT^^ 
TmrmriTITITTTNCAANCC^ 

ATCCAAATTTTGTCTTAACCGGGGGGT^n^^GNAAACACAGGCTTmt;CATCCCAOT 

GNAAAGGCTNNTOmGAAANGGCCCCACCATGGACCCTOmGTAATC 

TNGCCGGGKITAATmTAANTANACATGGGCGGCCAGTTAATNAAATCTNN 

imTTNCAAGNGGmTCCNANTATTTTGGCATGNTNTmAAAANA;^ 

ANTGNA^^ITNCCCCNCC^mGGGGNCCTNAACC^^^m 

SEQ ID NO: 446 ACTTNCTTTTNTTTTTITNTNNTr^ 

™ATAAGAACANATATTTAAANTCGAAGGCCANTT^m'AGGTCTCATrrAGCT^ 
CACTTGTATTTACCTITnICCCTAGNGNGGTGAGNAACTATCAAGAAACAAACCTGTGA/^ 
GNTAACATTCAACANATATTTGGTATATATANCGGTCTNGGANGCAAANATATTTNTCAACACrrA 
ANTGGNGNANCAAAGNGWGTCATNGGGANATAAACAGGATNGNTTTAANNTTGAGAhnrrA^^ 
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NA 

SEQ ID NO: 447 GTANTCTTCNTNTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATCTT 
GCAGAAT^CGCCCITNCGAGCGGCCGCCCGGGCAGGTACNC^TAGAGGTA^^GT^^^ 
AGNNNTGTN^^SIA^^rTAAAGANNCTAANGTC^™ 

SEQ ID NO: 448 GGACKirTTTTTTTTTTTTT^^ 

TGNNrTNTAAAATGNATATTGNGAATAAAACNGCNTTThf>f^ 

mGNNNCANGGGNGNNGCCGANNNAACCATGANCTGGGCTGGGTTTNTATNTTTGATGAA^^^ 

AGCCT>nS(NCGCTTTTGATTNGGGANANAAAAAANNGNNCATNGAC 

CNGNCAOTTTNNAACANTAAATTGCNGGNCMfGGGGGGGAGGGCTNT^^ 

GAGCCCITCCATTTTrrANGAGGGGANGTGANAAAAAGATTACACNGTGGNTTGAGTGNCCC^ 

GGAANAATTNGGACCTAACNTGCCCGGAAANGAAGNNATAGGTCTGATNCGTTNGTANAANACG 

ACAANNTNTCACCCTCCCCCNTCCTGGTGATGCACTGANGGGACAGGTGGAGNGANANTACGNTTA 

TTGOsIACCAGTlSrrCTGTCACCTNCTNGGGANGACCGNGTm'AAAGAAT^ 

CNTTGCCTTTTNA 

SEQ ID NO: 449 GTACTANACACATCNGGGACAACCNCCATTTCGGANATGATGCCGAAAACNC 

aaggccanaagcnnaagcnaggggatgganagtttgnggaagotattrctttacccannaatga 
cctgntgcaaagacttgatgcnctggtanctgangaacatcntcacngtggacgccangg tctat 
nnctacgcxm'agcgctgaaacatgi^aagcaaagccatttganngtgcccttot 
>nsicccnaatatgacacttggccatotattgtnaacgaaaagnnccangcc^ 

TG 

SEQ ID NO: 450 cGCGGCGAGGTAcrmTi'riiiTiTi"i"i4'iUTi"iTiu'ii"ri"iTriTrm''nTrrr 

TTTTTTrrrTTTTTGGACNCCCAAAACCATCC^ 

AAAAACATTTCAGGNGGAATTNANAATTNCCGG^™AAAAAACTNGCCC>^C^ 
TANNAAAGNCAATTCATNAAANGGNATAAAACCNNTTGNNGGGCATGANGGCANGGGACAj^ 

t^waacttggccctggncc^ttngaancc^^nggnagg^^^jgancnttt^^ 
ncccccggggcaaaaaagaaaatccncntaaaaaaaaaaaaaaatittagcnaggggggc 

SEQ ID NO: 45 1 acgtgccgcaggaaatactccggtagcaatcgcatnatcggtgccaaggacc 

ACCCATCCTTCCAAATGANCGNGGCCNAGGTTGAG\TAGGTAAaSfGGNAGNNTAAANGGCCAATT 
NANANNTGNNNCNTTANGGGGGGCCNTNNCANGNATGGGNGAGNNAAAT>n^ 
TNGGCCAAGGCCNANGGCATCmOTNAAANAACrmGACNGGAAAGAATCACAAATGTGG^^ 
NTTTGNNATAANTAANTAATGAAACCCAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 452 CNCCTTAACCGTGGCTTTNCCGACGTACANTCCAATCGTCTTCGNGGGGNTm 
CNCTTAGCCGANGAGTTCNCNACNNNTTCCACAAATTmAAAGANGAGGTAGACCCACCT 
CTTAAANACTTT 

SEQ ID NO: 453 AACTACCATCTTTCACATCAAATNGGGGANCGTGGAGGTAGTGGAAAAGCTA 
TTNGGATTNAAGTGNNCAGAATTCNTGTANACCAACAGCAACTGANCCNCTGTGTAAT^ 
NNTCCAKGGGNAATCANCTANTTCGAGCNTNTTNCGCTATTTAGGCT 

SEQ ID NO: 454 GACGGAGCAATCGANGAGGCATAACCACACTTGGGGGTGGGCTATAGGGGC 
TGGGAAAAACCCTGAAAAATGAACTGGCrmCACTGGAGGGCCANGGGTTTGGAAATATTTGGC 
CAGNCrmGAAANGTNTTTAAAAGNCAAANTTrCCTTTAAGTGANTCT^ 
OTCANCTTCCATTGTNCCTTNACANAAGGNCCCCCTGCGTTCTTGCTTGNA^^ 
CCrrGATGATNAANAANGGCCCCAATTATTGATNGCCCCNCNNAANCCNNANTCNGGGCCCANGG 
CACbfNACCANGTTCTCTGTAATCTNCTGGGANAAAGGCTTTGGNACNNT/^^ 
TNAGAAANAANGNTNATTNCAAGCCCCATGCTCCACCCTGCANTCGTAAACNTCCTCAAT^ 
GCNGGAAGGGGAAAATATTNGGAACrrCGGGAAAAGGGGGNTTTCTTGGGCAAGGAAAANTGTC 
TCTGCACTCCrmGNNGAANGGTrrTNAATTTTAAACCNCNTGACNTTO 
ATTTTTCTANAAACANATNAGGANANTTTGGNTCTNTTCAANTTCACAAGGG^ 
CAGTTTCNTNTNGAAGNAANCTNCC 

SEQ ID NO: 455 ANCNrraGCCGCGGNCGAGGTACTNNCTNNACTGTGAACGG^ 

CATGGNTCTGCANTCAAAATAATNATNAAANGGACAGGCNrmGCNAAAATGCATNGGGNCCNAC 

TAANCNTbnWCACNATCAAGGNNACCAACACTNNAN>n^GNTTTNGGC 

AT^^'GNGGNCr^GCTNaTOmTTT^ATTCANNGATCAAACNm 
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TGTGGTAATCCNNAACNCTGCTGTCTCNNTAAANTTTCATANCATGACTTACNGTTNGANNAAACC 

CNGNANTTGTAGANGTCCATAACTGGTI^GAGAACGNCGGTANACATNTCTTTTTTNATAGAGGT 

CGCTACGNTCTGATTTNCGCNCAATTCNCNATNCTNNTCNANAANT^O^CTTTNANCT^ 

TGNTNGACCATACTGGGGGNCGGANACCTTAGGGANATACATCTTTTGTTTTGCANAAACAANAT 

TAATTTAAATCCTTGGCCTGGGGGGGCTCTCACACTNGTGAGNTTT^^^ATTGNAATATNCNGCTO 

NCNATNACGGAGGNCTGGG^^^^^n^^ATCTTGGTTGCT^^^^GCNCTCTGCTCTTANTGTANANAT^ 

GNAGGTCCG 

SEQ ID NO: 456 ACTTTTTTITITITrTOTOA 

ATNTTTTAAACTTNCTCCNGTAAAACNAGATGAANAGGCNGCNATCCTTATNAACAAATTGGhW 
CATNGGNCANTTCGGCAGGGCCANTGATNGNAATGCCTNTCANTTTTNTTGAANACNCACAGTTT 
NTTCAAATTNTCCTNAATGAATAAATCTCATCCNCCCTTCTTTCCCTTTTNAAATATNTCAC>^ 
TTTThTKTTONTTAAAACCTTCTTTTNGCTGTTCNGN^ 

GCCCCTTAANAGAANGGGGGCCATTTTTCTTTNGACCCCNATTTCATNTrrCm 

AATGGNNTNANAAATTTOCCNGGGNANATGNANCAATTTCTTTGANTCTNTGAAA/^ 

CCAANCTNTCTTTTTTCTTAAT^™NTTNTTGNTGGCN^^ 

AAATTTTTATCTNANNGNGTNTATGGTGGTAANTCATCAGATCTTCTGNA>mGNGKNNTCTNTANA 
TNG 

SEQ ID NO: 457 CGTGGTCGCGGCTCGAGGTACACTGGGAACTCCAAGAAAAAGCTTTGAANAG 
AAAATGGAGGAAGCNCGAANCCAGAAACTCTAAATCTTGAGACAAGAAGACTNTCTArNATGAA 
NGAGATTCTTTNTGANCAANACCNAACNCACAATGTGTGGGAGAAATCTNATGANCTAATACCAT 
GCCTGCGAAANNAAAANTGCNAANKNAATNTTCCTNTCCGGNCGGCCGCTNGAAA 

SEQ ID NO: 458 ttngtactgatttnaaaaactaatcacttaaatgtgccacnccgcaaaagag 
aaaaccaagagtggtccacaaacacatgctccrn'ctcttcngaaggtttta cnann catngtaat 
acataacccantcttttantantaaactnaannggccantngaaacaaacagttttgagaccgtt 

CTTCCCAACCTNNTNAAAATNGGGGGGGCAGGGTATWGGGGATAATATCNNTTATATCCGACNT 

GANCTTNCTGGGCGAAACTTGGGTGACCTTGNNANNNTCCAGCTTNTCTGCTTGTCCACTbmCm 

TGATGACTTGCACCAGGNATTNGCTGCTTCTTTTGNACGNACNNGTT^vJTTCCGCCCTAG 

NTAGNNTTANAAAGCTTTAAACTTNGAATGGGGCTTmTNANGGAACCCATNATO 

CGGTTTTTNACGATACTNTAANATANTTAAAAGGGCNCCNTTmCANNTTGT^^ 

NAAATANTTTTTTTCTATNATNNTTTNTAAATCTTAAACTTAAATNCTNCN 

NTANNATTGGGNATNNTCNGACCTNTNTATGNNT 

SEQ ID NO: 459 ggtacgcggggactgcaaggcggctgcanagagaggttgtggcgctamtttc 
tctaagccatccanngccatcctcgtcgctgcnnngacacaccgctctcgccgccgccttgantga 
ccaatatnaccnttcgtggcaccctaaagggccacaacggttgggtaacccaaatcnntannncn 
cggatgttncngnanatgatcctttccgcntttcganatanaaccntcacantgntggaaacnnn 
ccagggtngaaaccanctatggaatatccahn^nnngtcctttgcggggtnactccnacttgtgng 
agnnattntggattattnccctnatatggcncanattgcccatctaagggatcctggnnatgnaa 
ccnttgcgcctttgngtattttacnaacttngatcccctcnnaangggattttgtgggccat^ 

CATGAATTGCTTGAAAGNTGNCNTTTmNNThrrANANTAChWGNNAN^ 

AATATAAAACCATTAAAGNTTNGGNGANTACCTGGGNGATGTGAAAANACANTGC'mNGGAANT 
ANACCNATNTAAANGNGTTTTTT 

SEQ ID NO: 460 accgcggggctgnctctcttttnngactcagcccgcctgcacccangtgaaa 
taaacagccatgtggntcacacaaagccgtgttnggnggtnttttaacatgnacccanatnaaat 
tnggngccatnantnggatcgggggacctccntngggaaannaatcctccgncctcnngntcttt 
gctccntnaaaaaaatccncctacaacntcaggtcctcaaaccgacagagcccannaaacattta 
acacanttmaaaatntggtaagcagcctttttttattttttm 
aaaccatttttctctctttnantctnt>n^cnnttnnactt 

acatataccntattnatttttcntggtaaganaaaaaaaagaanananhtktntttatcccg^ 
gccccaaaantttcggncnntttggtcnaaag>fntngnaaanngnaantcttcccthmggntnnc 
ttaaataaatggaagggantgctcttnttnatttatacacnctatgattatanggggtgntantaa 
cantcggggaaaannngttttggattttt 

SEQ ID NO: 46 1 GGNACCCCCGAGTCCNTGNCTGGCATACTGAGAACN ACCAAACACACACCCA 

agctcggtctcctnttggtgattctggggagcanatcttnatnaagggcnnccgtcccatgagag 
ggggaaaacatcccntctcttgnaaatntaaggaaanttttcncntatnantctgtaaagnagaa 
tttcctgnatttcagactantgnnkaahfntnancccccwagaganktnat^^ 

>WGTTCTTACCTTGANANTTATATAAANGAAAGTGrn'GATATATTNATTGCANNAGNGTTCAANA 
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AANAAACATCTTATGTTATGTGNANNTNAANTANNCmANNTGAGGm^ 

GNAATGNTATTNAATCACTAGGrrNrrTTNTNATGCANTAAANCC^^ 

TKITGNNAAATGTGAGATANTANNGTTTITITTTTNGC^^ 

NTAT^™CNTAT^mTGTANTATTGGArITmANNGGAGAATNTTC^^ 

TNTNTArrGATTTTATNO^CAACANTCTrNCTTT 

SEQ ID NO: 462 CGAGCGGCCGNCCGGGCANGNACTTGAGGCATTACCTCCTCTGAGNCACTGA 
AGTCATAGGGGNCGTATGAANCCCCATNTTTGGCATNNGGNTGGCGANACNTACmCTOT 
TTNNGCNCNNGGTCNTCTTGGArm™:GCTCITCATC^ 
TNANTACGGTTANTGTNC^^^CAC^nS^^AACCTT(>^AAAAATAANCAN^ 
hnSfACNCTCTTCTATTCTGNAGNGACACAGCTANTGCTCATACGNNGGNTTCTGATGGGCT 
ATGNNAAATCAAANGATTTNTAGAGTNCCGGCTTGTAACTTGGNGAAGANOTGGTCGNCG 
NNCCANAhnSfCOTATTCCTTGCCNGNGCATGTCTTWAy^ 
GCCTTCCGTTGANATGAAACGCTANAATATTTNAAANCCCmGGTNAOT 
GGAGCCCTCNTTTNTNTAAANTTNGGGGGTAAAATAACCCTTTTGCNGGGGCTTANCCG^ 
NT^^^ATTATATTTACACCGAAGGGGCCCCNCT^m^^T^GGGGCANGGC 
>rrmCCTGTCTCasrCTANGGCANNTTTGANAACTCCGGCCTGTTTi^^ 

SEQ ID NO: 463 ACriUl"lllll'lUl"lUUU"14"rjUl'lUGNACACAriTNAAGGGrrTNATTTANANA 
AATmTATGTTAANCCNTTGAAAATGAGGAAAAGATGGTIT^ACAAAACCCAAGATTNAATGANGN 
TTGAANCNGNTAAGGCAAAGTNTTTTCTTTGGTTCCGGAACACCCGGAAGAGrc^ 
TGGAACCANCCTTTmTACTAATGGCATCTTNCACANACTTNGGNAA^ 
GAANATNAANTTGGAAAAANAANNNTTTNAANAGGCNCATTOCT^ 
CCACANNTTCTTAGGG>n^GANACCTNCATGANCNNCTGATNCGACTTTTO 

GAAACACCAGGGGACCCCTATGACACTGGAGNCGAAACATGCATCCATTGGCNAAACA^fNGGAT 

NCTGGTGATNNNAGGGAAACCATAGTTCGGCCTGTTATGGGGGCTCANCCTGGAGCACTGGAA^^ 

GGGNrACAGGCTTTTTGNGTGACTGNTCACCANTGGGNAACCACAGGGACTNGGNTCCATGAAAA 

CTTATGTCAANGTCANTrTTCCCCCGNTTANCrrGGCCNGGGACCACGCITANGGGC^ 

GClsnrrrGGGGGGCCGTTCTATTTTGGATCCCGANCCT(>rGGTACCAAGCCTTGG^ 

GGCATTAGNAGGACCCCCTGNGTTNAAANhrrGGATTTTCCCCCTTNCAATTN 

ANAAANCCCGGTNAACATTAANNTrGGTAAAAGCCTTGGGGGNGCCCTTAAANGGAGGG(^ 

TTTACKNNNCTTTTTTATTGGGGTNGGNCCTNACTN 

TTGTTNTNGCCAACCTTGGNTTTTTTTGGAAATTNGGNCAACC>™ 

TTNCCTTTTTTGGGGCGCCTCTTTCCCCCTTTGGNGNGAANTAy^ 

GGGGCG 

SEQ ID NO: 464 GGACCTCCCGNTCANNGCTNNTCATmCCCOTCNCCNCNGCCT^ 
NGCTGACCNNATCCTCTTNNTGNATGGAGGCNCTNTCNGGGANGGGGGAACN^^ 
NGGNNAAAANGGGGTGCTAANGNGCCNCGGNGCACGCTCNTGNANAAGC>ia:NTAATGAA^ 
TTGTNAGACCTGCGCACmACATCTCCCTCCCTT^^^CT^CTCTNANGTC 
NCNGAGACANGCAGGCTGCCNCCAG^^S^^GAGTT^fN^™AAAN^ 

TTTCAATCTCCTCATTGATGATGCAGTACTGNCCTANGNACTGTGACCCATNCACNAGTGGNCm 
NGAAGNNNCC^GTTTCTTNTTCNCT^nr^m 

ATAACACGGAGAAAAAACNAAGC^^^IGGTCCCCAGTGTGATGNGCCTTNTCAGNCACTAACTGTO 

NCOTGTACCNGACCCGCNTCNATNGAAAGGTTCAATNCAACACACTGGNGGCNATTCT 

CCN 

SEQ ID NO: 465 GNGGNGGCCNGGCCGANGNGCACACTTTGGATACACTGGATGCTCATGTCAA 
ANGGGGTCAACTCATCTTCACTCTGAGATNCAAACNrrAACNCnTGGajGCATCAACC/^^ 
CAAACTATCrbnTCaSfOAATATTTATAGNCTCCACTNGCTT^ 
NGCTT(nTOATCNCCCTCNGTTTTGTGNTNTGACTCCCACNCTGACATGNAGGCT^ 
GGTGCATGNAAACGACCAACTTGGACANAAAACCCCCGCGNTCTNCCNGGGCNGNCGNTCGAAA 
GGGCTAATTNCAACTNACTGAAGGNCGTmCTACCGGATNNGANCTCGGTGCrc 
NATCATGGGCATGGCTGNTNACCTGTGTGGAAATTGNTTACGGTCATNATTGCCNCAACATACTAT 
GCGGGNAGCATAAAATGTCTAAGCCTTGGGNGCCTAATGANTNAAGCTTACTTAGATTGAATNGC 
GTAG^n^C^CACTGTCCGGTTNCCANAGGGGAAACX:CNTTGTGC^nSfGCTGTATT^ 
NCCCCGGGG 

SEQ ID NO: 466 GGTACCAAANCNANTNACAGGANGGGCGGGACTACCGGAACTACAGGCTGT 
TATCCCTTCCCGAATTTGGAATTTTGTANGAAATrANTGTTTCTATGG 
CAAGAACnWAATATNNCACrrAATTCGGNCATTGGATNaSfGNGNTG 
AGCCNCCTGGT^rGCT^OTANAGGAANTGGATNNCTANNGGCCa^TC 
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ATAACNCATGCCAC>IGTAGTCCTCACTGGAATTGGAGCANATGAANANCTTGCACG>rrATTTTCG 

NNANCCCNTTONfGTTra^CNCANNTNGNCTGTGACAGAATGNTTATGGAAAN 

GGTCCAATTA^^^TNTCNAAA'l^m^GGTNGNGNTGACACAGATATT^m^GATCATO 

TAATGTTCCTNNCTGGGTGGACNTGTGCTGCCTTATTAATCTCTNGNAGTNGGA^ 

CTTACNCCCAGQATTGGGGAAAAAGGCCTNTCGCTTCCGGGGGGACT 

SEQ ID NO: 467 CNGCGGCQ^CCGNCGGGCrCNCCCNTTNCCCAGAAAAOCGGNACITGCT^ 
O^AGGGNNNAANGNCCAAAGNNNTNGTGCCTGCGTGNACNGACACCCTTGAAACGAT^ 
TNATNGCTACAGGCATGNCNGCCANNNTTANACNCCANAGGAANGCTGNNTNCN(^ 
CAGCGCChWAOTATCCTTCCACNGATAAGGCGTGCANCrTTTGTCTCTACCAG 
TATNNNTrGATAAAATGhnsrCNTKrANAAGTGGATGANCANCATTGAANGGANA>^^ 
GGATATNGANGGACNTNNNNTAAAAGANTNNAAANACGNTGmTACC^ 
GAT^^^GCTOGNGC^CGTCCTAC^GATGACNNNNTGTTACCAGTGGAT^^ 

ANTGGCGGGACGGGACGAANGATCTACCAACGTAGGCNCCCCACCCTATNGGNGAACTCNAATTC 
Am-GGCGGNO^TTCAAGAGGACNCCAAANCGNTrCCAGCTAGGCACAAGCrGGTATNTCCCTGNT 
GCGGGANCmATCCCANAAAATTCACCCCCNTNTACTCCCAGGCTNANGGTTAAAGGGGGGGTC 
CCNNNA 

SEQ K) NO: 468 CGC^^^N^ffaCCNNNANGACCTATNCATGC^rrATG^mG NATGTO ^ 
TANTACCNNNTCCNANCATGNGWCGAaWGTGNGNGAKCATACTNG^^ 
GANCCTGGANGGAAGGATCCACACCCTAAGCANGGAGGNGCIWGCGANTTNNTNNCT 
CTGNNNAGNTTGQWCmAANAGNNAANhnsfAATGNTGCC^ 
TCGCGANNATANATACCACACTGGNNTCCmTNhrrAATATNGChrrCTTCGGT 
NCANNACATAATNCT^^^fGGACGNTGTGGCAAm•AGGANTACTGGTGCCCNr^AGGCGNATGNT^ 
ArcCGAGTCGANGAAGACCCACTGCrCTGOCAACTTNTrGNGATTGATTTATACT^ 
TTTAAAACTGGCTGTTTCCCACrGGGrrAATCTGAAGAACAANGCTCOTGGTO 
NAAGTTTAAANGCACrrCTITATTTGGATGTNTTCNGCrTACTATGG^ 
GGmTTGGGCTAhrrTGGAGCGAAATTCCTTTGTCANATISrmGGTA^ 

SBQ ID NO: 469 titttttttntngntnctatntttttc™ 

TGACACTOANGACAGGNGN>nsrGGGNNTAAACNG>rrGCTTCTAGGGGCACGACCAAGGGGGCAGG 

GGCTNCAGCCCNAACGTNCAGGGCCTGCATTGCACAANGNNGATGCANANGTTGCNAGmNTGG 

GCAGGAACTAAGGAACCCCNCTTGACCCGTATTATCTNAAACATAGANTTGGTAGGACTGNTNCA 

TAACCNNANNCCATCANCCCGANNANTCCCATGGTNATGANGNTGCCCAAAATCNGGGCCCATNA 

GTTO^CGCACmGCGGTGGNGGCATTAANCCCNGNCCCCANTAACOTTTCCCACCATNT^^ 

NCCCTGAOTTMWTGGAGAAAGNGAATGNTTNAANATCCTAACANAAGN^ 

GGGTGNTAACANGGACCAAAATNATGNTNGG>rrGACTTCGAAGGTCIT^ATTmGGA 

ATNGNA>RSrmcrrTNAAAGGmGNGCrmGGNGGGACCCTCATAh^ 

AAGACTTTTTCmGGCTTTGGGGAGG^^^fANC^^mGGATAA^^ 

ANAAGGCNTCATTGTTN 

SEQ ID NO: 470 GGTACWTNTrCANNNNATAAGNGCTGhn^GNNNCNCANAATGANGG^ 
TNGNmXjNGGTGATNNCCATCNCGGmAGCTCAGTCNNAANANATTAT^ 

NNGGAGTGATNANCATNANCTGGGACNAACTGAGCATCTCTACTCANA>nsrrAATACCGTGAGANA 

GGCACNTANGCACNAGCTTGTNTNACATATGTCCAAGGCTGTAGGANNOTT 

AACGTGAAThrrGCTNAATAGTAATGANAGCTTGTGAATATTAGGOSrTAAAGGTGNTGGTNA^ 

NNANCNCNACTGC^CCAT^^■GAATTATCTACTGNGTGT^ACTTACTGAAGNTGAAATGT^ 

TGANATATNGTTNNAOTGNGCGTTCGANGANGATNNGCTTTTGANNN^ 

TGAANAaWNNNTCTCACNCTGATmTNCAAAA>^ 

ATCmATGGGGATCCTGNGACCTNGAhfNTTGGAAAAAANATAATTACAGTCCCT^ 

CCATGATANCTATAATGGGrrCNCNCTTCnTNNCCGTNNNCNCTCN 

GGGCGTTCT 

SEQ ID NO: 47 1 GCGAGGTACCACNATCACCAACCAATACAAGTTTGAACTGGACCTGGGGCTC 
TCCCTGCGCAGCCATCGa^GCGTTCCTACCAGAAGCGTTTCCGCGCCCGNCCGACTGAGGCTGGA 
AAGArrNNGGAAGCGCCCThnrACNTGCCCGGGCGGCNGTTCGAATGGTCTNAT^ 
NGCATNCTGTTNNTN^ITNGAT^^^^NANCm 
NCmCTTTGNG^^^^mm'GGTNATTCATC^rNANT^^ 
A>rmAATTCNNTTrrmCNANATATTrGAGTTGGTATT^ 
NATATTTAAmTCr^^ATTATTGTTT^CTATr^IT^m^^^ 
AG^^ITITIWTTATANGA^OTN^^NGT^m'ACT^ 
TATGCTGTATTTTNAGTTOTATNTTGTC>rrCTATTTTm 
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T^r^TTT^^TAATTN^^ns^ITCGT^ 

ATCTTANTTTTTC^^^^GTm'A^TATATNAAATATN^^ 

SEQ ID NO: 472 GTACGCGGGGGTGGGTGGGATTGAGGTGTGCCCTNCNTCATAAATACAGACT 
CANTTTTGCTTGCACACTCTGAATCTrNNACCGCATNCTANCC^ 
GTGAGGAAhrrCCAGANTTGCCATGGAAAAAAATANNATTGTTATCATNATTNCNCC^ 
CITNCCTACACTCTGGCCAAAATTTCCATATTANAAACCTTANCCANAAAGGACA^ 
CNGACNCATTCTGNCCNANACCCKCTNCAGANGTrG>n^GTGACCAANTAATCTNGACTCA>^ 
TTTNAAAAAACCCTCTATATAAANCCAATANNANCANNAAAKNCCTGGANNATCA>OT 

SEQ ID NO: 473 TAGGTCCNGCCGACGTACACTCGAAACCAAAT^WCTAAAACTTGGTTNGCOT 
AAAAAATNGTNGTTGTAACATTAAACCATAACCTAATCAGTGTGTNCACrATGCTTO 
CAGTNTNCTCACACTNKTTCTGGTTTCAAGNCTCAANGCCNGACAN^ 
TTTTNCTTNACAGTACAANrCTATCAGCAACmTGAGAGCm>^ 
ATCTGTNTCTGCAAGG 

SEQ ID NO: 474 GGCGGNGACGACNCGNNCCAACGTGTGCCCTATNAACTCTCCATGGNAATCC 
CCNCGCCTACCATGGTGNCCACCNGGTGACGGGGANTAAAGGNTCATTCCCGATANGGAGCCNTA 
NAAACCGCTACCNGATCACAGNNGAANGNAGTNTNGCGCTCATATmrc 
NGAGGTAGNTNAa^IACTGAGTACNTG^fNNACNTAA^fNT^^^GGCTCTTTTG 
GCGAAGTGNCAaWACrGGNGGNCGTTNTThTOGATNCANGCTNGGATCNAOT 
GANNTNATTGCCTTTTTCTTTTGNAATAT 

SEQ ID NO: 475 CGTGCANTTNANTGCATAAAAAGGCCTCTCTCCATNANACTCANCACTTTAC 
AGATGTANAATOTATAAGCATGCCAAATNGTACrrATCTGCCACATACAAAGCNTCATNCCANGT 
GCTAGNGAGGGGAAAAAAANGTANGAGATNTGGCCCTCAAGGANCACCAGATNTTAATCTACCT 
AACNAAGTCCNNAGNGT^n^CCAGGCATGNAAAAATTAGTGNTGCTACATGGAT 

SEQ ID NO: 476 ACNTCrrTGACATTTTCAAAI^GAANAACCTGNCNNNTTTCAm 

NNCCTTrrrGCACas[ACTNG>nsrCAAGTGATAGAAAGGTGGNGAGACAGGATTGGATCACAAT^ 

ACATAACCCAGAAGGmT^CNTTNNTNATATTCCTTAAAACAAATANCNGNTAACGCCGGAAm 

TTGNTTCAANGGCTTCCTTC^n^CGGCCTGGAAGTTANAGGAATTCTNGTAAAANGCGGN 

AAACANCCCCCAAGGAACCTCAAAGGCCTAANTNTGGAAGGAACCCGGNCNATGGCNGGATCAA 

GGCNNTTNl^CTCCGCCGCACCACCCCIOTGGGGGATNATTNCACCT 

GNTACCTCNTTNCACCTNGNGGAATATGGTNGTAACNGTTGCTTGCNNGAAAAANT 

AANANCCKCAACATCTANCNGNAGGATCArrcTANNNCTGCGNGCrmCT 

ATTTGTNGG 

SEQ ID NO: 477 CCGNCGGGCNCTGCCAGCANCGGACCCTCANAAGAAANCNCATNACNTCAG 
ACTGGTTCTTTCTCCATAGCTCCTGGATGTNCAGTATGTATCATGATT^^^mACAC^ 
TTNACCAATTCAAAATNCCGNAGCTATATGAGTATTCTNATAACCAAGAATACACTACACCANCT 
NATGACTGm'AAAAAAAAAAAAGATGNANTCCNGCCTTNNTTGGGACNGAAAT^ 
AAGNNGGCNNCACCCCTCTANATAAAAGCGTTGANGNAGGTCTGATTTAAATGCTGCATTCCC 
AATGCTCNTTTAAAATGAAACACACANCCCCNGAATACANTATGTTTTGAAAAGAAT^ 
ACATCTATACCArrAAAATATTANCAAAGNGTATTGNCATNGAATNAATGNGTCCTTATAGGGCCT 
NNGNCGNA 

SEQ ID NO: 478 Acr j ' n i iTi -i ri nn 1 1 1 rrrrrrrnTTr ccGNTrrn-i 1 1 1 1 n i rrrn itit 

TTTTTTTTTTTTTAAGGNTTTGCTGCm^ 

CAANATTACCCTNTAAAANNAAATCCATTTTTTAAGGNCTCATAAGGNAAAANAAACT^ 

NTTNTNCTTTCANATTACCCAGTCTGATAAACCm'AAACANOT 

ANAAAATTTANAAAANAGGCCCTTmAANTCACCmCTGNNT^ 

TGNANAT^mTCATTTNATNAAAGGTT^rrCCCCCA^^^^NAACCA^ 

CAAANAAATTTrmGTTCCAAACACTNTITrorrGGTTGA^ 

AAAAAATNTTTTGAAAAAAAATNAAAAACAAAAACCCCasrAAA^ 

NAANAAGGCCTTTCAATGTAAAAANAAAGGTTTTAAACCCCTNNTTTm 

AAAGNCCCAAAAAAAAANTCNrrCTNCCNANGGGGAAAGANACrGGAAGGTTGGCATTTAC/^ 

T 

SEQ ID NO: 479 CGCGGCGAGGTACNCGGGGGAGTGAAGGGTCTGCTGCTGAAATTTGGGGGC 
AAATAACCGNAGTANGTTTGTTCCTGTGCCTTGGNGAGTCNCCGGCCTACTGGGAACGGGACTTha 
AAAANGAACTATGTCTNGAANGCTGTGGTCCTAGGCCATTTTTGCTGGCnOTAAGCGNANGTC^ 
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>n^ANCNATTTGGAGCACACATQ4CTNTTTTAANTGGAAGGNAGTm 
ATTbrmTTThfNGCTNGANANTNCNCTTATT^^ 

SEO ID NO- 480 ACTCGGGACTGTGTCANGAATGATGGAAACCAAATCATGGAGAGTTTNNTTG 
NCAATATCNTTCNGCAANANAANAATTCCGGTACAGATTCNTTATNAGNTC 
TTNGTTACACT 

SEO ID NO- 48 1 ACGNGGTCGAGGTACACANTGGGGGCTCCrCATACATGGCCTCACATTGAGG 
CAACCTTAACTTTGNCTAANANACTTCATCTTNATACNTNCTCGCC^ 
ANATTNGATGATGGGGCAGCCNGTGGGGTANTAAATCGTGAA 

SEO ID NO- 482 GTGTGTGTGTGTATGTGTGTGGTGTGTGTGTmAAGTTTANCCTTrTGTT^ 
GNTnTTGGrrGGCANNAACCGATTITAATGACrrAGCTTTTAAA^ 

CTGAATArrGTNTAGCATGTCNCTTGAANCTACTTGNATCTACGGNGGNGCTCCTAANGGACC^^ 
ANTNCNTNATTTNGGANAGAGGTGTGGAAATCITGTATTGCNACNCCTGGAj^ 

ttagagtggnggaaaaaccaatctgngangnacaancttcn™gnccnttnatnaaccat^^ 

ATGCTmANbrrGGNCTirmANCCCAATTh^ 

ANCITCAATGNNGGAGNNNNi^GGCCATAATAAATrANAhn^GT^^ 

NTTNNTTAACTTTGGNCKCTTGAACTGNNTCCATNCCAAACAT^ 

NCTNAAAANGTNGAAAACTNTTCCTCTATGGAAAA 

SEO ID NO* 483 ACOTCCNTCTTCCAACTGCITGCCAGCAAAGATCATOCrrCTGCT^ 
GGAATGGCrTCCTTATCCTGAATCTAGGCCmCATTITCrATC^ 
GGGTGATGGTCTTCCCNTANGGTTTrCACCANAAATCrGCATTTNGGGGGGGCrCC^^ 
CCGTATCGAAAGGGCCCNhnSANNNAAACANTTGATACCCCTNCCTATCCAAAAATAA^ 

gnng(:ottgaaaaattnccaga>otggngggggaactttaanato 

AACCAANTAGATGTNANCTGGGGAAANTAACNAANCCCCCAATGCCA^WGAAAACCTA^ 
TTGGCNTTAATTTAACCCNGTGGGGCAACC 

SEO ID NO- 484 AcrmN rj - i - i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I TTO^AAGGGTGATAACGTCTTTTCANAN 
ATCATAGCACATGAAGAACCCATGGACACTACACAGACTATGAACAGTTACCAAAAAAAAACTCG 
TGACTAAAGNGGGGATTANCNNCAAAAAAAAAATTTCCAAAGGNGAAAAATTGGNAAATCNGGA 
ATITITACNAAANGTTAAAAAArrATITATCTTCANCCAAAATGAGGCCCCTTCN 
NCNrrGCTrrCTTCTCCTTTCGTTCTTGCNCGCGTTTCTTCCGNTGCTN^ 
GGTTNTITGTAAATTTnmGAAAANTTAATrCTGNTGACAT^ 
AAANTGCCCCCAAGGhrmAACrTCCTrmGGNGGGTCATANNTGGNNGGGACA^ 
-ITTAAAAAAANNAACCKmGTTTTTTTm^^ 

CITNGAGNGGGGAAANAANNCNC>n^AAAAA'rrCCCCCGNGGAGGGGGTT^ 
CCCAAAACNCCANCCTCCNCNTTTTGGGGGGGCCCCCCNAAANNCGG>n^AATm 

TTTG 

SEO ID NO- 485 TGCCTAATNACAACATGGATGACmGCAAAGGANGGGCrCTTTACTTTA^^ 
ACCATANAAAAAATCNAACGCNCANATGGNTNATTGNmTCAGNTATGNCNCTG^ 
TGGCA^^^AGGANTGAAANCAT^TGGNAATNTGNCA•ITAACATGTNTNATAAAAGGC>JTO 
AAGGAATNNACCCNCCATNNTTGAAANGCNGTTTTNCAACT^ 

CGTCTGTGTCCATCOTTGNAAANACTGGCAGCNGTTTGNANNTGAANNAGNATGAAGAAAAAGA 
ATGGGAGCriTmCCCCTTTTTTrTmCT 

TCCACNACAT^^^TAAAAGCCNATTGGTTTGGACCTTGGGCCGGGGACCAC^W 

SEO ID NO* 486 ACTTTTAAGAAAAAAAGCAGGGCCTTGGAAGTTTTGGGTTCT^^ 
CTGTTGCAAATTTCTATGGNTrGGGTTTGGGTGGCGGNAAACCGCCG>riTCNTNC^ 
ACTGCCCACGGTGGGCNGGCGGTCCCrCTCTAOTCTAANGGGGACCCACCGTNTANATTCTGNA^ 
CTGGAAGTGTGNAAGGTGAATANGNTCANGNNGGNCTTNrmTTTTAN^ 
NTCGTGACAAAGCATANCT 

SEO ID NO- 487 ACnTSfATNNTGNTNNCCCAATNGATACTGNTTGNACTAACATCCACTCTO 
GTNGCTGAGATAACTTACTITGACTGNCTATTTGGATATCrrCTCANACANACOT 
AAAGGACANGTTTGTGGACTGTGT 

SEO ID NO- 488 ACCCATGCTCACACNCACACACTTCCAGTTTTATACAGAAl i 1 1 1 1 lAANGGA 
XvGAAACCAACCCAAAGTArrGCATrrGAGGNGACACTCCCTGAAGANTTNTATACAGAGGTANT 
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^^^ACNNCrmACa^CAGCTGAAGGT^CTrmATTCATTCT^^ 

TGNCNTTCCNAGATTATCCCCTTNGCNACNACNCAAGCAATGATATTTAACNNAGGTN^ 

NTOnIAACTNCATATAGAGAAAAAATCOATNACGAAAATNCNTGNAATAAGACTNAATO 

CNTAACAAGGNCTTAGTAGATATNTAA 

SEQ ID NO: 489 CTCNGNGAGGCCCCAACGCTGCTNNTGCTACACTATNGNCAGGATNAATCCG 
GTCCNGTGCGGCANATGGNTANTCTCCTCTGGCCTTGGCNGCCCTCCTTGTTCGTGGACANGGGAN 
AimcmGTGGCAmHAGGAAAGCrCCMTTTTTCANGAi^CTGCNCCA 

SEQ ID NO: 490 ACA(:NCGC^^WCTGG^^m^A^fNACGCAAGCrmANNATAGGTCNGGN^ 
GCNCTCTGNNANTGGGTNNNTGGTGCAACNNTm'CNCAGGC^ 

TNGCTGGATAATGAAATCNAGGATGACGATGCCTTmrCCTGCTGTTCTAGGGCATCCANG 
GCCAATCGNnrGTCCnrnCAAACCTATACCTTCCATTTTATNACTCC^^ 

NAAGTCnCANCGATGCOTGGATAAACATCCACNGCNAANGAAAAAGTAATGCCAAANANAAANG 
ACTATGANTCNTGANTACANGNGAAATAAANA>rrTTTNCTafCCA^ 
TTTAAANGAGNAACNCmCNTTATNCC^n^CNAAGGACGTas™ 
ACNACTGTGAACACNGANCNGGANCAAGAATCCTCCCAAGGATCATN 

SEQ ID NO: 49 1 ACCACAAGGATGTGAAGCATATGAACTCTGCANGANTCCTGNCCACACTGAG 
GAATTATGACTACTATGCATGCCNAGNTCCANAATGTATTAAAAATGCACTTGCTTCOT 
NAGTCTCATITGGNCGNANGCGNGTGGCTCATNCNTGTAhrrCCCOTfGCACnTSITC 
GC^fNGTGGTNNATCITTTGCTCAGGAGTNTNAGACT^ 
CnTrACANANOTANNANTANAATTGGGCATCGNGTATGCCCCCAG 

SEQ ID NO: 492 GTACANAGTAGCCGTGATGTGGTCATTNGTCCTGATGCCAGCCTTGAAGATG 
CAAAAAAAGAGGGACCATATGATGTGGTGGGACNACCANGAGGCGATCTGGNCGCNCCNGANTC 
TATCTGANTCTGCTGNTGTNNANGANATACTGNNCGAACNNGAATACCCGAAGNNGCC^ 

ccgccatcttntgctggttcctactgctntggtnggcrcntnaaaktahnt^^ 

tcaacacacccttttgctanagacaaaatgatnaatggaggtnattncacctactntganaatcn 

tgtggaaaaagacnntcttattcttacactgccnggggcctgggaccntctt^ 

NCNATTTbrrrGAANCCTCTGTANTGGCANrraAATGTNGGCGGGTCAAAGANAANG^ 

KTTCCTTAAANAACNNCACNCAGNGAAATTTTGACTCATTCNCTTA>^ 

ATGAANCC 

SEQ ID NO: 493 CNGTGGNCGCGGCCGAGGNACAGCATCANTGAAAAACACANTGTCATGAAA 
CACANATGCTGTNGCATGATGACAGTCACAGANCCATGCTNAATTGCTCAAAGAA GCTCX:CAN CA 
TNCTACCCAACGGOSIGGAGTANAATrGNrhnTOmmANAAA 
ATGGTATTTTACCTTTIWTGAANNTCATTACANGGAGTTNGTATAANTA TT^ 
TGNCAGCTCATNGGAACCCAGNGGATGNGATNCCACCCCANCGAGGNGGTTTITNTGTO 
CNGGATCATATTCTTGGAAATAGGAGCATCTCCANTGGCCCCAATTCTGGNGAAAGGAGAACCCC 
TTCAACCCCCA^AT^TmACAGGTTGATTTGGAGAC^^^AANGCGGGGCACCT^ 
AGNNANGCATCCCCCNGNGGGTTGCTCATGGOATGGNTANAGGGO^TNGNGTGAAAACra^ATO 
CCCANAATTCCCTATTGGGGG>n^TGGCX;AITmTISrCNCATGCNCCNAAANAA 

SEQ ID NO: 494 GGNTCNCGGGGAANAGGAGTTGGANTATGGGGGACGCGGNAGGCGGCNTAN 
ACAATNAGTTTTCTAGGNTGCrrTTTNGNTCCANTn'GGGAGATCGATAm 
GGAAACCANTATAANGGCANAA^r^GAATAC^r^GANGACGGCTTAGGGGAANTGNACTTNCT^ 
TTTGACGGANANNCCGCKrCAAGGAAAGGTAAACCTCGCNTTTAAGNCACCTm 
GAACACCATNGGAATTAGNATCTGAANTNNGCANGGNTGGATCGTNAOSITT^ 
TCNT^WCCTATTGNATTAATAAACCNA^TNATAAGAACmGCCCTACCATGOT^ 
ACNNTANTTATNhrnrrAAATTTATGGCTm'CAN^ 

CrATJSrrCGCAITOTGATGGCATATrCGTTTAAANANATCAATTNriTAAANAT^^ 

ACNO^TCGANATNANCmCANTGGNTCNTCANNCTCNNCNCTNT^ 

ACTTAATATGAGATTGGGChrrCNAACANNNNTTCCCTATGAAAATTNNAACT^ 

NNNANATTNTANGGNTCNANTGTCGCGTATAAOTATCTCTACNTNTGCTN/^ 

NNTGANNCTGCGNCCG 

SEQ ID NO: 495 AC n - j -i 1 1 1 r i 1 1 1 I N i IH - l - ll 1 1 i r m GGCTCAGATAAATTATTTArrA TATTC 
TGGATTATCCNTGGAATTTTTCTGGATATGAATAAATAACAATGTNTCTCANAAAAA 
AAAAGCCTTATAAAGTTAAAACTTTAAATGCTTTTCATCAAAATGTTATGACC^^ 
TTCATGTATCTCATCrCACrrrCTATACATTTTGGAGNGATATGTATTTATACATACTGCAGTTGGA 
AATAGGGAATOCTTTmGGTTTCCCACGGTAGGCTTGCAGGGTTTAAAAGCANAAGTC 
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ThrmAAGGCAACATAATGAAAAGTATTN>riTITGAAACCCTGGA 

TTTNTGGGNGAGGGCCANTNTNTGTGGCTCANNCCTGTATCCCNACNCTTTAAAAGNTTO 

AAGANTTTTTNGANAANNATAAGTTCGANNANAANNTTGGGCATCNNAAGm 

CNTAAAAAAAAAANTNCNTGANCCCATGGGC>n^JCCCCTGGANNCCCCACTCl^ 

AGGGGAANATTTGTTGNNCCCNCNAANNGANGCGGCNAAANCCTNAAGNANNCT^^ 

CTGGAAAAAGNGGCCTTGNTTCNAAAAAAAAG 

SEQ ID NO: 496 GCGTGGNCGCGGCCGAGGTNCAAGATTTACCAGAAAGAGAGTGGTGTGTAN 
ACATGCCTGGAGCAGACACCTTGGAGCCGCTGACAGAAGGTGAAGCATTCNAAGAAAATGTGGA 
AACTTlTCCGNTGCTCTACACNGTasrANAAACCTGTCCATTTAAm 

GATAANCAAATAGACAGTNn^AAGTAAGTTATCTCAGCNACATTTGGGGAGTGGATGCTGNTGAAT 

TGCGATTTATTTGGGGGAGCCNTATATGTANCrGAGAGGTGGGAATTTTTAGAT>rrA^ 

ATGNNGGCCAAGTTNCATGANTGCATCAATGNTGTTATNNTGGGAACANGTGAATTTCTATNNOT 

GTGGGCTNGTTNGNNGCITIOTTGATAAAAANTNAAGGCCCCNGm 

CAATCNTTGGTTTNCCTTCNTTTNNAATAANNATCNTCAC^J^ 

AACCTTAATAACCTGNAANAAATAATm'GCTGTGCTAATTAATNATCNTAGCNGTGhm 
NGCAGCCTAATGGACTATOCNTANCCATTTATCTNATTAGTGTTATTAGTGATCTCTm 
TGGNATGGCCCTTCGTGCTATATTT^^^^'AT^^T^AATNCTNGG™ 

SEQ ID NO: 497 GGTAc riiTiiTiui iriU ' riTriiTiu GGGG'iTniTriTr iiuuuiiTiTJu - rr 

i r il ii ri - 1 1 i 1 1 1 1 1 i r GNNTCCCCTCNTACTGGTGGGGNTNGAACTAAAANTCANGCCTNTGCCT 
TTTAAGCTOAAGGNCGGNTGNCCATNTGCTTITrACGl^GCCTNCATN 

AAACCCCCChTOCCAAAAAANCCCAGGGNGGGANCNAa^CTNGNGANGAANNANGGNNCCCCNC 

AACTTTTNNTGGGTTTGANGGGGCM4TTTGGCTTm 

GTTNACTTTTTTAAAGGTCAhR^GNCAATTCCTGNGGGGCN'ri"^ 

TTNnSfCNTGNGGGNATAAGGGTTrATCNAAGG>n^TTTAAANAAACCCCCCGCAAGGCCCT^ 

AAAAATNTTTTTCCCCCTTTCCAAAAGGGAAANAGGTAAANGNNGGmCT^ 

TTNAAAGGGCAANTTCAACNANCTGNNGGCGTmTAANGGGATCCCACNNGGNNC^ 

NGNAATAANGG>nsrTACKrGGTCNCGNGGNAAAATNTTTCCGCTCNAATTCCNNAAAT^^ 

AAATATAAAGGTAACCNGGGGCTNAG 

SEQ ID NO: 498 ACGCGGGNNNGTNGNAGCCTGNGGGNCCTANTGNNNNATNGATNGNATNAT 

caaotatacaataacctttgngnnntgtgncnaantggagngaatttgnn™ 

tgcnctncagcatanggnaaanctgnaaagagctatgccanangactattagantggcaagcctc 

tgatcanagcgttanagatnacgagananangggtgtnat>mgtgcctnggctgtgccntncatg 

ganngan^^^ccanagngganntgtgtttatgacttggcaactcaattgcattggaagcaataa^ 

ccgacttatcaagtcctaaatgtacc 

SEQ ID NO: 499 CATTGANCTCCATAGAGACAGNGCCGGGGCAAGTGAGAGCCGGACGGGCAC 
TGGGCTGACNCTGTGCCTCGCTGANGAAGAATANTrTAATCGTGGGCAAAGGANATCCn'i^ 
CCAANAGGCNTAATGTCNTCATCTGCnTTTATmGTNCAATCTGTGTNTNGAGAAAAC^^ 
ATNCGCCCKrAANNCAACGTGANGAATTTTCTGAA 

SEQ ID NO: 500 ACCNNGGNCNNNGCCGACGTGCTAACATGCTTNACNNATCANTATGGAGNCT 
CACTCTGNCACGCCCAGACTNGAGTGCAATGGCACCATCTGTGGCTCACTGNNAGCTCTGGTTNC 
NCAGTTCAAGCC>nrTCTCCrGCCTCANNCTCCCAAGATACTAAGACCACAGCCATGTGNTACCGAC 
ACCTAATAGTTNTNTTATATAGNNAGNTGGGGTTTCANCAATGGTNGGNCAGTACTGGGTCTAAG 
ACTACT 

SEQ ID NO: 501 CCGGCCGNCGNGCTCAGTCTTNNNTATTACANChmTCATTGANTATAAAAAN 
TCANTTN>fNNTTANCCAATAAAGGNCACTTCNAAAAGCAAACATTC 

CCATATCTCAAACATTTCACTTTTGCCTGATGCCGCAAGCCTGANAGGNGGGTGGCNCTGTNCCCT 
ANGGACNGGTCCACATCTAGAACACATGGCTCTATGCTCTCCTTTGGGGCTTANAC 

SEQ ID NO: 502 AATAAANACCTTATCCGTGGNCNCGGCCGANGNACATGATNCANATTGGTTT 
TGCANTTATTAACGANCTGANCTANANATGThrrAAATGCAGCAGANTTATGNCTGTNCTGCA^ 
ATGCNGTTAGGNNTCNNTATNGTATrmX}ATANAAGAACATNTGNTTGANCANATC^^ 
ATCTGANTAGGNTNCTATTTACCTTCTGNATTTTAACGAAAACCCTNATAT 

SEQ ED NO: 503 ACCCCGGGGGCATTCCGTGTCCTTNCGGTGCTNGGCAACAAANCCGTCCAAA 
CCGACACGCNTGGTATNCTCGCGGTGTCCGGCAAGANACTNCCAAGACANACCCTATTGACTGAN 
GCTNATGTGAATNCAANGGCCTATOCCCTTNCCCATNCCCACTOAACAANAAACT^^ 
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mAACNAATCATTTTANTATTAACANCTTNTGAAAAGGAACCCATT 

SEQ ID NO: 504 ACGCGGGGGGGGTCGGAAGGGGAAAACAACTACGGCTGCGGTGTGGTTGGT 
GGTGAGATGACGACCTTAGTGCTGGATAATGGAGCTTACAACGCCAAAATCGGTT ACAGC CATGA 
AAATGTGTCGGTrATTCCTAATTGTCAGTTCCGGTCAAAAACAGCACGTC TTAAA ACl-i'i'TACTGC 
CAACCAGATAGATGAAATAAAAGACCCrrCTGGACnmTrACATCeTCCCTTTT 
CTTGGTGAATTGGGATGTTCATGAGACNANTTTGGGATTACCTTTTOG GAAAA NAAAT^ 
GTTTGATITmACrATACTNTATATTATTTANTCa^CTNGT^ 
AATTCCATANNAATTNATGAATTGAAANTTCrArm™AATAAATANCOT 
TTATATAANTAANATTCCTrGGGGCTrrNTATTGCANTTAAGNTmr^ 
a^CAANTNATTTTGTTrCATTGGTO^ATTGNATNri^ 

SEQ ID NO: 505 GGTACTATATTGTTTAATTACTGGAACATTGT AGTAAGAA TTTATATCAGGAG 
TGTAAGTTACTGAATTTTGTTATTTTTCAGAATTCTTGTGTC 
GATrrTAGGAAGAGCTTGTCAAmATATTAAAGAAAAGCTACATTGGAATTO 
AGATAAATTTGTTGGGTATTTTGATCITCATATTCAGTATTGCT^^ 
CAGrrrATrCAGCTCTrrOATTTCmCAGCArmAAATAATm 
TATrrrGTTAAGTTCATTTCTAATNNCTTTTTTCTAGGATACTA^ 

TTATmGGGTATTCCTATAATATAAAAGTCCmCCCGGGCCGGCCGNTCGAAAAGGGCGAAA^^ 
AGCCACTGG<>rGCCGTTCCTAATGGATTCCGAACTCNGTNCCCAAGN 

SEQ E) NO: 506 ACTACTAAATTAATAAArrrATTCCACTTTTGAAATGACAGCCAA^^ 
CTAATTGArrCTCATTTGGCACGTTCTTCTCAATTCTGTTCACTAAATTA 
CTGCCAAGAGCCATGATGTTGTCTGCACAAGACAACATITTCCATCACTTTCAGAAAGTTA 
GGCATGTTAAGGAAAAAAAATATAATCCCACAGCAGCAGCTATTTAAAA TAAGAC AGCCACAGG 
ATTCATGCAGAATATTTTAAATATTCCTGrrGAACACAATGATTTAATTGATTT^ 
GATGCTCCCAAACATCCTATATGCATCCATGGAAATTTAAAGATCCrGGAATAGCGCTT^ 
GGACATCTTAAAAAGATAGTATmGGCTTCAGTGAGTTACACAAATGAATCACC^ 
ATTCAATGGGTCTGGTTACAGAGTGNGGNAATITCGTCNriTAAATTGaNlGAATTTATA^ 
AAAATTCCGGCTGNNCCCCAAAACAAGACAATTCCTCCATGNAACTGGNTGNATNAAAGNAGACN 
TTrCCCAGGTGGGAGGTGGGGTACCAANOTOTTTTTGTACCTTGCCGGACNCCCTANGGNA^ 
CCC 

SEQ ID NO: 507 CGAGGTACrmGTATTTTGATATGGACAGTTTATTCATTrGCATACAGTrATT 
QACTmrCCCAGCTGATTAAAAGATAGTCA AGAAA TTCrG CAATATAGC TGCCA AAAT ^^ 
CTACATTTTTATGATATTGTCATCTTTTCTGN rrrrri'lUViri 1 11 1 1 CTTTAGCTATrTTACTTAAG 
CATAATAGCCCAATAGGACATATAAAAGATTATAAATACAGAGCTTTATTATCCTGACGTCrrGGG 
TCTTTTAAGTATATACTTTTCTGAAAGGTATCCATTTTGTAGGCT^ 
TTGGTTATTTTTGCTGCTGTTCTCAACATCATCATTGCCTGCTGATGTGCCACGATGCT 
AAACAGCAATAAGAATGTCTTAATTTGAGCAGTAACATGATTGCAAGAAACCAAGTTTCACAACT 
TGTAAAGTCTGTATTTGGGATCTTGGCnTATTmCCGCCX}NGTTTTC ^ 
AATTGAATCAOAGTTTTCnTGCTATTGGGGTGGTAACCNCTGGAGAACTITCT^ 
GAAACTGGACATCTTTGTTAAATGGATAANAAAACACm™TTTAAT 

SEQ ID NO: 508 ACTGGTTGGGGATGGGAATCGTGCTmCTITAAACTTCAGTTTACGAGATGC 
TTTGAGAGCGTTAGGCAAAAGCAGAAATAAATATTAGGAGCAACGGGGAA AGCTT TATAAAAGA 
TCATGGTGGCCACTGTTGCAGCTTTGAAGAATGAGTGCTGGCTTGAACAGTrcm 
TTGGTAGCTGCACTGAAAGGAAAAAACTTTCACCTTAAGAAmGAAAAGGAAGAAACCT^ 
CTGGTCTTCATGGCATTTAGACTGAGATGCTTAAACAGAACAGAAGTAATACGCATTTCCT^^ 
AGGATAGGGAAAATGTAACAAGCTGGTTGCTCTTGAGGTTAGAAAATTGCTGTTrCTCT^ 
AAGCTGGATrrACTTGAAAATGGGAGAAGTTGGCrTATTGGlTGAATATTGGGACATCA^ 
TATACCCAGTTTCAGTCGCAACCAGTTTmCCTTTGTCTGGGGNAAATCNAACCNAA^ 
TTGAATCCCGAATCCTAAATCACCTTTTrnrmAATCCCA^ 
CCCCTTTGN 

SEQ ID NO: 509 ACATCAGACTAGATACAACATGCAGAATGTTrrCCTGAACTTATCCGGAAA-IT 
CCAAAOAAAACATCATGAAACAGCTTACAAAAAAAAAAAAAATATATGCCCTAGTTATTCACCCT 
GCTTCAACACTGTCAACGTAAAGGCAGAAATAAAGCAAGCTATCAATACCTCAGAACTACTGATA 
TAAGACATCAAATTTCTAAATCAGTGTATTAAAAAAGTGAACACTTCCrCTTTCTrCT 
ATTTAACTAGAATCATGTTTAAAAAAAAACTGATArrAAATGTGACACrrCAGAGCTACTACTGOA 
AGGAGTAATTCATAACTTCCCTACCCTCCn:CCATCCCTGCrGArrCAAGAGAAGGGGGAAAAAAC 
CAAGAAAACNAAACGAAAAACX:AACCNNGGTCTCTTGNAGAATTGCTGCrATrC^^ 
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GCATTGCTGGCATGCCNCAATGTGGGCCCTGGAATANGATTTTGGGGAACCGGCACCGTNGTA>IT 

CNCCTNrrGCAGTNCCITGGCGGACCCCCTANGGGNATTCCACCACTGGGGGCGGTTTA^ 

CANCCGGC 

SEQ ID NO: 5 10 ACTTGTAATTAGCACTTGGTGAAAGCTGGAAGGAAGATAAATAACACTAAAC 
TATGCTATTTGATTITrCITCTTGAAAGAGTAAGGTTTACCrGT^^ 

AAAAAATGATAGTGATTTTGATGTAATTTATCTCTTGTTTGAATCTGTCATTCAAAGGCCAA^ 

TAAGTTGCTATCAGCTGATATTAGTAGCTTTGCAACCCTGATAGAGTAAATAAATTTTATGGG 

GTGCCAAATACTGCTGTGAATCTAriTGTATAGTATCCATOAATGAATTTATGGAAATAGATATTT 

GTGCAGCTCAATTTATGCAGAGATTAAATGACATCATAATACTGGATGAAAACTTGCATAGAATTC 

TGATTAAATAGTGGGTCTGTITCACATGTGCAGTTTGAAGTATTTAAATAACCACTCCTTTCACA^^ 

TATTTCITCTCAGCCGTTTCAAGANCTAACATGTGGATTTNAAAAATTGNCTCOT^ 

ATTTAAGGGATGGTCNAAAANTTTTGCNATTGGATAGGCCNAAAAATGGGA^ 

SEQ ID NO: 5 1 1 ACATCAGACTAGATACAACATGCNGAATGTmCCTGAACTTATCCGGAAATT 
CCAAAGAAAACATCATGAAACAGCTrACAAAAAAAAAAAAAATATATGCCCTAGTTATTCACCCT 
GCTTCAACACTGTCAACGTAAAGGCAGAAATAAAGCAAGCTATCAATA CCTCA GAACTACTGATA 
TAAGACATCAAATTTCTAAATCAGTGTATTAAAAAAGTGAACACITCCTCTTTOT 
ATTTAACTAGAATCATGTTTAAAAAAAAACTGATATTAAATGTGACACITCAGAGCTACT^^ 
AGGAGTAATTCATAACTTCCCTACCCTCCTTCCATCCCTGCTGATTCAAGAGAAGGGGGAAAA/^ 
CKNNAAAACCNNNCNNAAAACCNCCCNGGGCT>nTGGNNAATTGNTC 
GNNTTTGNTTGCNNGGCCCNANG>M^GGNCCaWGAAAAAGhn^^^ 

SEQ ID NO: 5 1 2 ACGCGGGGGGCCACGTTCAGCGGACACGGGAGCAAGATGGCGATTCCGGGC 
AGGCAGTATGGGCTTATTTTGCCAAAGAAAACACAGCAGTTGCACCCTGTTTTGCAAAAACCATC 

agtgtttgggaatgattctgatgatgatgatgagaccrctgtgagtgaaagccttcagagggaag 

crgctaagaagcaggccatgaaacagaccaaactggaaatccagaaggcccttgcagaagatgc 

tactgtgtatgaatatgacagtatttatgatgaaatgcagaaaaaaaaggaggaaaataatccx:a 

aattgctttrggggaaagacagaaagcccaagtatattcacaacttgctaaaagc^^ 

agaaaaaaggaacagggaaaaagaatggnaaagaaaatnccgagagaccagaaatgggaaang 

gggagttggtgataaaaaaccttttgtgcctttgcotttaanaaaaac^ 

NAAAAAAAAANAAAAAAAAGGGTTNTTNCTTGGAACCTIT^ 
TNNGGGGTTTTTTGGNCCCTTTAATNNCN 

SEQ ID NO: 5 13 ACATCTCTCTATTAACAGGATTrGTTrACACAATTATATTACACTrCACCAAC 
CTTTATACTGCATTTCATTAAATACAAAATACATTTACAAAAAGAGTCTACCACGGTGTTCOT 
CAATGCCAGCTTAAGGTCTTTTAAAACTrCCTCTTCTACATATTTATAGTGGTTACAT 
ATCAACATTATGAGTTTTATGAGTTTATmCTAATCAAAGAGAATAGTGTCAGCCTGTITCT 
CCAAATAGGAAAAACAGCATGTGAGATGATTCCCTGCACATAACCAAGGAATCCTTrrCA TGCAC 
ACAACATTGGACTTTTACTTGTGCAGTCACTTTAACATACAAATCATCTT^ 
AATTTTCTTGAAATACCAAGTGGGTGGAGAGCTTCTTTTCAATGAATCCACAC^ 
ACGTGCCrCANGCNTCAGGAGNGTATATTTAATTATATTGAGAGNGAGGACGGGGNAAAAAATTT 
GCTGAAAAANAAATTTCTTTGGCTGACATTTTTCAACCITAAGCT 

SEQ ID NO: 514 NACTN>rNN'ri nTri n-i i rrrrrn ri"i"i i 1 1 iaagaaaacttagggactaaa 

ATTAATATAAAAATTGGCATAATGTTGGATTGAATCTACAr mGGCAGAAGTTAAACATTC CC^^ 

ATAATGTCAAAATTATACATCATGCAGTTCTGTTTTmGTTTGTTTTATT^ 

CTGGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGC 

GArrCTCCTGCCTCAGCCTCCCGAGTAGCTGAGATTACAGGTGCGCGCCACCACACITGGCTAA^ 

TTTGTATTATTAAGTAAANACGGGGTTTCAACATGTTGGCTAGGCCGGTCTCTTCTGACCTCAGGG 

NGATCAACCCCCTCGGCCTCACAAAATGCTGGGArrACAAGCGTGAACCACITGCCAGCCCACAT 

TATACAAmrGNAAAGNAACTTTGCACAANCAGTbmrGCCGTGGCACACCATmTsfTAC^ 

TGGTTGAAAAAANGTTTTTnmTrrATGAATCCGCm^ 

SEQ ID NO: 515 acgcggggggccacgttcagcggacacgggagcaagatggcgattccgggc 

AGGCAGTATGGGCnTATTTTGCCAAAGAAAACACAGCAGTTGCACCCTGTTTTGCAAAAACCATC 

AGTGTTTGGGAATGATTCTGATGATGATGATGAGACCrCTGTGAGTGAAAGCCTTCAGAGGGAAG 

CTGCTAAGAAGCAGGCCATGAAACAGACCAAACTGGAAATCCAGAAGGCCCTTGCAGAAGATGC 

TACTGTGTATGAATATGACAGTArrTATGATGAAATGCAGAAAAAAAAGGAGGAAAATAATCCCA 

AATTGCTTTTGGGGAAAGACAGAAAGCCCAAGTATATTCACAACnTGCTAAAAGCAGTTGAGAT^ 

AGAAAAAAGGAACAGGAAAAAAGAATGGAAAAGAAAATACAGAGAGAACGAGAAATGGAAAAG 

GGGGAGTTTGATGATAAAAAANCATTTTGACATCTGCTNTTAAAAAAAACTGCCAGA^ 
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GAGAANAAAAAAGAAAAAAAANGCnTGTTCCCTTGAACOTNTTTGGT^ 
NTTNGGTm 

SEQ H) NO: 516 ggtactttggtcaccacgggaaacacaaccaaccaagtgttttgccaagaga 

CACAGATCGGAACGCTCATCACAACACTGCATGGCTCATTCAGTGCAGGTTGCCACAGGTCTACA 

GTCACATGGTTAATGAAAAtCCAAAACAGGACCAACAGGATGACACArrTTCTGTTCTTAATG^^ 

ATTGCCArrATTGCTAGATCTCACTCTGTAGCCCAGGCTGGAGTGTCCGTGGCCAGATCTCGGCTC 

AATGCAGCTTTGACCrCCCCAGCrCAAGTGATCCTCCCACCTCNCCTCCCGAGTTGAGTAACTGGG 

ACCACAGGTAAGGGCCACAAGTCTCAGAAGAGTANGAAGTGCCCAGCATNACACAGGAAGCAAG 

TTACAGAACCAANGGCTCTGGTCATAAACTGGTCAACATCnTTATCACATAAGGGCT^ 

CACTrACTOAAAGCNTNAATTTTGTGGGAAGAAANANAAACCNAGGCTTG 

ACGGTGTTTACNCCNCTGAATTTNT 

SEQ ID NO: 517 ACrrGGACCATCCACAGCCCAGCAAGGCAGAGCAGGATGCTTCTATTCCTCC 
TGGCACCCATOAGGCCCTGCTTCAGACAGCCCTTTCTCCTCCTCCrCCTCCCACCAGGCCT^ 
CCTCCCCAAAAGGCAAAAGAGGCACCAAACACCCAAGCCCAGCCCATCTCTGATGATGAAGCCAG 
TGATGGGGAAGAAACCCAGGTTAGTGCAGCTGATCTGGAAGCCCTCATCAGTGGCCACTACCCCA 
CCTCCCTTGAGGGAGATTGTGAGCCTAGCCCAGCCCCTGCTGTCCTGGATAAGTGGGTCTGTGCAC 
AGCCCrCAAGCCAGAAGGCGACTAATCACAACCTCCATATCACAGAGAAGCTGGAAGTTCTGGCC 
AAAGCCTACAGTGTTCANGGAGACAAGTGGAAGGGCCTGGGCTATGCCAAGGNCATCAATGCCCT 
NAAGAGCTTCATAAACCTGTCACCTNGTACGCCGGGAGTACTGCACCACThn'AAAGATGGCGANC 
CAGAGCCAAGAAAAATCCTITGGTTCNNAAAATTTCTGAAAAAAAGGAAGhOT^ 
AGO^CCCAGCNAACAGGCCmTrGaSfAAAAGGGGTATGGNGGGAACCAAAAAN 

SEQ ID NO: 518 ACCGCTGAAGACACCCAGAATGAAGGAAAAAAGACAAAAAAGAATAAAACA 
GCTTTTAGTAACGTTGGAAGAAAAATTAGTCAGCGAGTTATTCACrrAm 

gatttgggaaacatgcaccgagcaaatgtgattagacttatggatgagcgagacctgcgactggt 

tcaaaggaacaccagcacagaacctgcagagtatcagctcatgacaggattgcagatcctccagg 

agcggcagaggctganggagatggagaaggcgaaccccaaaactggaccaaccctgagaaagg 

aactgattitgncttcaaatattggacaacatgatttggacacnaagactraara 

ggattaaagaaaaaacaca'agtccagattaccnttaagaaaggnaaaaatgttgac>^ 

aaatgaaatggaggagatatttcatnaaatcttcagactttcctggaatactacttctca 

cncaactgm'cangngggaaahrmaagggggcttcggccttggccnaatgagggaaggt^ 

gaacttcxjagaccccgaagagacom'accaaa 

SEQ ID NO: 5 1 9 ACAAAATGAGACAAGGGGAAATATAAATTAGTGGGGCAGGTCGGGCTCCGG 
TGGGTGGCAAAGCAGAGAACATCTATCCAGTTAGCCTATGAGGCTTGACCCTCAGAGTTGCT^ 

ttgggctggacttgaccagctgtaacttaagataaaacagtcccacccttaaggtcatcaaatga 
aagacacaaggacagcatagcagaggcctagctggcttgtcagaaaaatgcacgggagatttca 
gttggctaaaacctcaatatgagccactgcatggccaggctgtgcaaaggagtggggctcca™ 
atagtgcacggtgtggccccactgaggaggagagagtcctccgcacacagcactggctgtacccr 

GACTTTGGTAGACACAGGCATTGGCATGACCAAAGCTGATCTCATAAATAATTTGGGGAACCAT^ 
GGCAANTCTNGNACCmGGCCGCGANCACGCTTAN 

SEQ ID NO: 520 CGAGGTACTrCACTTGGAACTGCTTGGTAAGTrATTGTTTTTGCTGGAAATAT 
TCCATCCAAAAATGGCTGCCAAGAAAC(nX}CCATAATTCrGCATTTGATGGCACATCATAOT 
CAAGATAGAGCCAGTATAATGCCAAATTTTGTATCCATTATTAACCCGTAACCTGGGGGCACATGT 
AGCTGTTAAAATATGCrCACCATCCGGGCACCAAGCAAAATATGTAGAATCAGAAGCCACCGGTT 
TAGAAATAAGTTTGTAGTTTTTCACATCCCACACTTNCATITGC^^ 

AATACTAATATATGTCCATGAGGGCTATAGTAGGCTGCATTACGAGGACCAGTTCCAAAGTCAAA 

TACAGGATCACATTTCAAGNTGAAAAATGTCGCTTTGGCAGGC ATTAAA CCN^ 

ACTCAGTAGAACTAGAAATTCCAACTACTTATAAATGGGGGCATTTTTNGNAAATGCAC^ 

CTTTCTCATTTGGTGCCATGGAATGNAAAAGTTGGTCTTCATANTAGGAACTCTGNCTTGGNA^ 

TTGGNTACTTTTNCCACANAG^OTACNTTTTTTTC^ 

SEQ ID NO: 521 GGTACATACAAAATCTGAAACTGACACTGTCAGTTCTATACTn'GCACACGTG 
AAGTGTCAGAATATTlTCrrCAGTAGTACTTACAAGGTGACTATATCAGCATTGGCTG 
GCTATGCTCAAAGGGTCTTTCCCTTCTTCATCAGTGGCATGTTGATTGGCACCTCGT^ 
AACATACCTGCCTGTTTAAGAACACAGCATTTITGAGATATTGTCGAAAAAAAAAAAA^^^ 

AAAAAAGT 
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SEQ ID NO: 522 ACCTGCCGGCCCACTGTGCACTGCTTGTGAGAAAATGGATTAAAAACTATTCT 
TACAmGTTrTGCCTTTCCTCCTATTTTTTTTTT^ 

TTATATTGAGACATATAATGTGTAAACTTAGTTCCCAAATAAATCAATCCTGTCTTrCCCATCT^ 

ATGTTGCTAATATTAAGGOTCAGGGCTACTTTTATCAACATGCCrGTCTTC^^ 

GTTCTGTGTCCTTCCTCATGATTTCCAACCTTAATACTATTAGTAACCACAAGTTCAAG 

GGGACCCTCTGTGCCTTCTTCmGTTTTGTGATAAACATAACTTGCCAACAGTC^ 

TACATCTTCTACTGTTCAAACTAAGAGATTTTTAAATTCTGAAAAACTGC^ 

TAGCCACTCCACAAACCACTAAAATTTTAAGTTTTAANCTATCACTCATGTCAAT^ 

GACAAATGTCTCCCGATGCTCTTCTGCGNAAATTAAAArrGTGTACCTCGGCCCGCGAACCACCCT 

AAGGGCGAATTTCCAGCACCACTGGCGGGCXjGTTCTTAGTGGATCCCAACTCGGTACCCAAGCCT 

TGGGG 

seq id no: 523 acgcgggctatgcggagacaaaccctccatctcagctgtcaccgtggagctg 
agcaaagaaacactggacaccatgttagatggcctgggccgcatccgagaccaactctctgccgt 
ggccagtaaatgatccagccagctgccagggccactgccatgacccagctgctcatgagtgataa 
atgtctccccatatgcaggctgcccttgcagctgcagctgacaacaggcaggatggtggggacag 
cagggggctactgccatccagaagttacagttggattgggaagaagcagccagatcccccgctgt 

TCTCACTCATOTCTTTCTCTTTCTOAAGCTGGAGAGCAGAAGCCCCCATC^ 

GTGCAACITAATTACCACCATGGCAGGGTGAGGGAACATTTGCATCGTCAGCTGCCTCTGGAT^ 

ATGTTTrGANAAATTCANGCCCAAAl^CATGCNGCNCTATNCATTAAGTAAAGTTATm 

SEQ ID NO: 524 ACAGAGTTAACAAGTTTTGAGTTTTTTATATAGGAAAAGCCTAGTCAATO 
ATGCnrrCTAGAAAAATTAACATTAAAAAACAAATAGAAATCCATGACTAAAGGGGGAAAATAAC 
TTTCAAAAGTTACCAAAATTCGAATCATATCAGAGACCATTATAAATTTCAAACAGTAGAm 
ACACATATTGCATTTTCAAATTCTAATGTAGCAAAACGTAACCACATAATTTGGCrACAGCT 
GmCAGAAAAGTTTAAAAAATTAGCAAAGTTATATCTATAAAACTTTTGTAGTT^ 
AGTAAAAAGGCTTAAATCTTTAATAAAGGAAAACAAAACAATCCTCTrAAATTTCT^ 
CrCTCCAGACATATATTACAAATCTGCTGTAAGClU'lClWl'ACCTGAGAGAACTTCCCAGGATCCTT 
TATCCCAAAGGATTACCrrAAAGAGTTCTTCCATCATTTTACTCATGTGAATATGATTAAACTCCTA 
TAGAAGTGGATTGGGACATATGCATTCTTAATCTGCCCITNCCCATrrGrri'ClU"lCT 
TTTGCTTAANGAAAAAAAAAACTCmGGTAAAAGGCAAAAT^ 

CCTCTGGGGAAAGAGTTGGTTGGGGAAAGAAGAAAAGGANAGANGGGAGAGAGAGAAAGGGT 

SEQ ID NO: 525 ACGCGGGAAATTAGAATAAGCTTITATCAAATAGATAATTGATGCAATTTAG 
GATTCACGCAAGTTTCAGTGTCAAATGGCGGTCTTATAGTTTCAATTCTGAAAATAGCAAAOT 
TAAACAGCCAtmTAAACnGTTCTGGCAAACCAGACCCTGCTGTAGATATAGTCTAAGGTAGTTA 
ACCATATAAGCCrmCAACTCTTAATGCCCTCCACATGAATCAGCAGTTAAGAA^ 
CCATGAAAGCTTTTGTATGTATTACTAGGTTTTGTrTTTOT 

TAAAGCTGACCTAAATGGATCAGTTTATGTGTAATATTCTAGTGCTTTAATGACTCTTT^^ 
GGAGGGAGGGTAACATTATTTGGACAGATGCAGAAGGAACTGTTAGTGAGTCAAGACAAACACA 
TCTGAAATAAAGGAACTGTGTATTAACATGTTAACAATTCATAACTGCACrTTrrATG 
AAAATCTATTTATAGGTACCTCGGC 

SEQ ID NO: 526 ACACACACGAAGAGAGAGGGAGTTAGCTGAGAAGCAGCATCCAGGGAAAGA 
AAGGTCTCCACAAGCTGAGATGATGGCCCTCTGCCCrCTGAACTGTrCATTAACAAGTCACCAGTC 
CCAGTTCmGTTCAAATCCTCAAACTTGGAGAGGCACTGCTCCAATGAGACAGCTACAGCAACCA 
CAGGGAGGAAGTGCTTTGGGCTGCAGGAGAGACAGTCAAAGCCAAATCACGTCCAGGCGGGATG 
AAGTAAACCTGGACCGAGGAGCCTGACCCAGCTGCGGAGCTCACACATGCCCCATAGGGACGGTC 
CTGCCATATACACAAACAGCACACTCAGCATGTGTGCAGAGGCAGGCAACTCCTGAGCTTGGTTC 
TCGCGGGGTGGGGGTGAACAATATACnTAATGGCCAAGAAGATCCCAGCTCTCTGCATGAGACC 
CACATCTCTCCTGGGCTGTCACTCAATCITGGGAAATACTGGTrCAAGGAAGAGAAACAAGGGCT 
CCATTGTGTGAGAGCAGCAGGCAAGCTCCCCTTCTGCATNTGGACANGACCCCCCCCTTN TCCTNA 
AANCANGCTTCANCTTAACTGNCTGACATTGCTTTCCTGATGGCAAAAGGTC^ 
TGNCATACAAAAAAANTGACCCTANCNTTAACCGTAAAANAGAAGGTCAAGGGAACACCCCCNA 
NAATG 

SEQ ID NO: 527 ACCTAGTATAAGCCAATCAATCTCATTTTATCATTTAGTTACTGACTGAGATT 
TCAATCTTCTTTAAATGTTCTAAAACTTATCAGGCTCTATCTGCATO 

AATCAAAAGTCrrGATTATATTCAGCTAGCATGAACAATGAAAGTGAGTCAAACGTTCTCOT 

ACTGAGAAACTGGATTACATCrriTGGTATGAGTTCAATTAATTTCAAGTATTCATCA^ 

ATTAAATTTCTTCCTTTCTCAGGTAAGAATAATCATTAAACTCAAGTGCT^ 

TAATAGATAAATATATAATTCTATATTTGGATTGCAAGAArrAAAGTAAAAGCAAAACACATACA 
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CACACAACCAGAGTCGCCAGTCATTTATCTTCATATAATCAGACAATAATnTGATGATAGAATAC 
rrGGATTAAATTACCnTrTTTTTITCTGTGCm 

AAAGTAAGCACCGGGTATGAAATTGTGAAGTAAAAANATGAATACTGGATATTTTTGTAATAGTT 
TOGGGNCTCATTCAAGAAGAAGGCCTATTTCAGGAAGGATAATAAATACNATATTGCATAANGAN 
mTGGTACCTCNGGCCCCGGACCCCG 

SEQ ID NO: 528 ACACATTTAACAGTAGCGAATTACACCAAAATGATTTACTITGAGATTTGAAT 
AATTTGCATAGCAGTAAAATGTGTTrTGTGTAACATACAAATAGAAAAATGACCCAGTATCTTAAT 
TGATACTTACTGGAGAGTATCAGAATTACCCAGCAGCTCrTACAGAATGCCATAAATTCT^ 
CTAAATATTGAAATCAATTATTTGAAGTAATGTTTCTGATTTACTGTTAAA AGTT G 
TTTTGGAGATATCATTTATGCCTGCCTGTTCCCITATGACAGTGAGGCCTTCTTrG 
TATGATAATCATGGGCTCTGTTTTAGTTGATGAGAAGTGGCTCCTATGAATGCCTCTGCTCAATTTC 
TTTTTATTTTACTTTATTTTATTTTTAGGGGTCTC 

CCACCTCCCCACAGTGCTGGGArrACAGGCATGAGCCACCACCGCCTGGCrCTCTGTTCmTCAG 

TGTCTNCGNGCCATCAGTCAGCAGTGCTTACATGTTTAGCATATTTGCATGCAGTTTCTCTTC^ 

CCCGAmATATTTTTGGCCAAAAAAATTGGCAAAAAGTCCTmGGNCXiCGACCAC^ 

SEQ ID NO: 529 AATTCGCCCTTAGCGTGGTCAGCGGCCGAGGTA CTGTGT TCCCCAGGCAATTC 
GAATa:AGACTGCATTCrCCrCAGTTTCTCTGAAGCTOACrGTCACATTIT^ 
AATTTTCTTGAACGAATTTTTAAATTGTTTCATATCAAAAAGGTCAAC^ 
CTCATCTGAAAAACATNCCAAACTTTCTGGTGCTGATGAAATNGCATATAAATGATGTCT^ 
GCAGCATCACTGATACTTGCACGCTTTTCCTCACACAGATGGATCAAGTGCTGAACT^ 
TTTCTCTGTCGGAAArrTACAGTCTGCAGGTTGATTTTCAGACAAAAAATCCCAGG^ 
TGTTGGCAGTTCATTCATGGGGATTTTCAANGATGGTCCCTCITGATNAAATCAGCAACAGTCC^ 
TATCCACTCTCTTTGGCACCGGCCCACATNrmCNTrGCNGTCTCACTCTCAAGAAGC^ 

SEQ ID NO: 530 ' ACCATGACATAAAAGGTTAAAGAACAGGCAACACAATGAGCACTTAAGTTTT 
TAACATGTGGGGAATAGGGCATTTTAAAGGCTGGAACCAGTTCAGAGGAAACAAGGGTTTGGGTA 
GAGGTANAAAGGriTAATTAACCCTGAAGATCTGCAAATGGAGGGAGGGGTGAAGGAGGAATCT 
TAAGACCNANGGAGAAGAGCCAAGGACAAGGTCAGGTTGAATGAGTAGGAGGTGTTCACACATC 
CTCTTGTGTGCTAAGTAGTGGCGTGCATCTGATCTGAGAGGCANTAAAGCAGTGATGAAGACAAG 
TGCAGCNAAACATTGAAGTCCCTGATTGCAGCANTGACTCTGCAACCTTGGATNAATTATm 
TCrrTATTTTGTAAAAACGGAGATAACCGGCCTATAGGGATTTGTTAAAACrGAATAT^^ 
ANAGCTTTAAATGGTGCCTNACANATGANACCNArrAATACTGTTTATTNGCC 

SEQ ID NO: 531 ACTTGGAGAAAGTATAGCAGCAAACAATGCCTATAGACAACAGGAAACAGA 
ACATATACCCAGAAAAATGCCCTGGCAATCATCAAATCACAGTTTTCCAACATCAATAAAGTGTTT 
AACTCCTCATTTGAAAGATGGTGTTCCTGGATTGAATATTGAAGAATTAATAGAGAAACTTCAGTC 
TGGAATGGTGGTAAAGGATCAGATTTGTGATGTGAGAATATCTG ACATA ATGGATGTATATGAAA 
TGAAACTATCCACATTANCTTCCAAAGAAAGCAGGCTACAAGATCTTTTGGAAACAAAAGCr 
GCCCrrGCACAGGCTGATAGACTGATTGCTCAGCATCGCTGTCAAAGAACTCAAGCTGAAACAGA 
GGCACGGACACITGCTAGTATGTTGAGAGAAGTTGAGAGAAAAAATGAAGAGCTTAGTGTGTTGC 
TTGAAGGCGCACCAAGTTGAATCAAAAAGAGCGCAGAGTGATATTGAGCATCrCrTTCAACATAA 
TTAGGAAGTTAGAGT(nGTGGCTTGNANAACATTGAAATACTGAACAAATCCCrmCTGGGAOT 
TCAGAGAAATGGAAAGTNCCTCNGGCOjGGACCNCCCTTAANGGGCGGAATrrCCANCANACC^ 
GCGGCCGTTTCTTA>rrGGAATCCNNCCTCGGTNCCCNA>nm?4GGGGNAATCATGGGGC>^ 
CTG^nTC 

SEQ ID NO: 532 GGTACCACCGACTTTTCTrGTTCCCACATAAGCATTTCCCriTAGGGCTCTAA 
GATGAGGTCATCATCGrrTTTAATCCTGAAOAAGGGCTACTGAGTGAGTGCAGATTATTCGGTAAA 
CACTCTTAGGCCTAACCTAGCTAGTCAGTCAAGCAGTAATCTAGCACAAC TCTAATGTTGAGA TGA 
TGGCCTCGGTGTGAGTGGCACAGCATATAAGGCACATTCAGGAATCAAGACl'n'l"n"n'rrnGGC 
CrACTCCTCTCCCTTCTCAAAGACCTTACAGGCAAGGCTGAATTCTAAAATAGCCTTAT ^ 
AAACAACACTGGTATAACTAACTCCCATTTCTACTTGAAAAAATTCTTTGGAATA^ 
GATCAAATAAAAAAATCAAGCrrmATAATGATGATAAGGAATTAATTACAAril 
AATATAGTCCATACAAGGCTTATATACrTTGCTCTAAACCTAGCTCACCTGGTCTAGTAGC^^ 
CATTTANTACTACAGTCAGAAAATCTAAATTCTAATTGGTAAATTCATGTCCrCAATA^ 
CTGACCAAATGAGACAATGAATTAAAAAGACrrrGCAAAGGTTCCAAGCNTCATCCAAATATATA 
COTGGAATCATTTATAATTTTCATGGGCTCATCCTGNCCATTCTATTCTTAATCCCC 

SEQ ID NO: 533 ACTTACATGTGTGAACACATATAAAGTGTCAGGTTTACAGACCCTGGCTCAA 
GGACAGTCTAGGATGGGAAAGGAGGTAGGGCGAGAAGAAGCACATATTTTCTCCCTGGTGCTTCA 
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GCCTCACCCTATCCAAGGGACAGACATATGGGGTGTGAGAAACCCATCCCCAGGTCCCAGCCTTC 

AGGACTGGAGTCCITITGAGTCTGGTGGAGTCACAGATCCAGTCTTTGGGGGACACTGGGTCTG 

TCCTTTTGAAAGCCCTGGAAAGGTGGGAGGTAAGAAGTAAAGGGAGACAGGTCCCTGCTAGAAG 

AACTTGACGCCTCGNCCATCGCTGACAOTOATGATCTCGGCCTTNTGCTCCTGCTNTA^ 

AGAGCCCGCAGTAAAGTGGCTTCATCCAGCCCGTGGAACTCCTCATCCTCTGTGTCTTCCCCATTA 

GTCAAGTTCATACAGGGTAAAGACNGAGTTGTTCnGGCCACTTCCTGGAAACNCACT^ 

SEQ ID NO: 534 ACATACACAATCACTCAACTGGAACAATCAAAACCATCTATGAGTGTGGTTA 
TTAAAAAATAAAATTACGTTCATACAATGGTAGAAAATGAAATGTTTTTATTAAT^ 
TACAAAACCACACATATATGAATTATATAACCTAGTGTTA TATAT TTAAAAATCTTTATGCTTGCA 
ACTGAAATGTCrCTACTCCAAGGGAAGTTTCrGATTTTTAATTTTOT 

ATTCACAATGATTAAAATGCCTTACACATAGGCAAAAAGCAGACCCAATCCCAGCAAACAGAAAA 

ACCATAAGTCTATCATATCACCATATGTTTCACCATATAGTTTTGAAAAATAATCCTAm 

TGGTATGTCTTCATATTTATACTTATTATCAAAGTGATTGCATATrGAGGCACAGAGCTTAAAGAG 

GAAATATATATTACrrATAGGGGAACCAGACACTGAAACAAGAATATCAATCAATGGCTTCAAAC 

AAA 

SEQ ID NO: 535 GTACGCGGGTGCCGACACCTCCCTCATCTCTCTTATAGTGGAAGGATGGTCAG 
CATTAGGCTGATGGGGACTGAGAAGGATAGGAAGGGATANAAATTGCCATGTGTATAAAGCTTTA 
TTCTTTAGCCCTTAACCCTAAGGCTCAGGGAAATACCCTATGTTATTGTGCrCCCTGGAT^^ 
ACTCATnrCCTTCCACTCTGGAGCAGGGTGAGGGGAATGTTATGGGTAACAGACATGCAGGCAT 
GGCTCTACCCATTTCTITGCACAAGTATGGGGCCCATGTGGTAGTCCCCATACCCCTCCAGTO 
ATATTTTTGTCTTCTTCCTTTCCCCTCnTrGCCATTCCTACCITGCATT^^ 
CAAGGCAAGGAGATAAGGATGCTCTTCITGCTrrTTATATCTGCACATTCATACCTCTCCA^ 
CAGCTTTTCCCCAGCCAGGGCCCTCAGCCTTCCTGCTGCCCCAGTGATTGATTGAGAGAGCTGTTG 
GGGTTTCTTCTGCCAATGACCCCTGGGAAGANGGACCITTGGTAAGGGNCATGATAAANTGGCGG 
GGGTOTGGTCCTGCTCANGNTTTTCATNCrrCCTCCTTTCOT 

TTAAGGGGGGTGCACCCNGGGANCCCTGANAACTGGGTGCNCAAAATTCCAAAAAANAANGGNG 

SEQ ID NO: 536 ACCTCTACGTATrGACAACmGAGTTCTGTAGA TAACA AGCAGATTTGGGTC 
TCCTGTGATTGOCTAATGGTCTCCATCTCCCAGCAGACTTAATTCAGGTriTGCTTCTGCTAC^^ 
CGCCAGTAAGGAAGCAGCAAAGGTAGAGAAGAGACCnnrrTTCTCTATCAAAGGCCAGAGATGCGA 
GAACAAAAATTCArrCCCCTTTGGAGACAAATGTAGTCCATCTGATAAATAAGATGAGAAGTCCT 
GGCTGTCCTGCATCAGGGTCCACAGGTCAAGTACC 

SEQ ID NO: 537 GGNAC' j ri-j-n i 1 1 1 i' j 1 1 inm i n i rn r rrn - i rii i acncngaaaaattg 

GAACATAbmTSrrNTTGGCTGGAGGTTTGGACATTCCTANAGCAATACATATGCCriTC^ 

AAANACCTCACTACCGCCTCCnTNTTGAGCTTTTTTTGGAGGAGCATTCACAC^ 

AGCTThTTAKITNTGANATGAGGTATGTGAGCAANAGGAGGCNNTTbr^ 

TGGTCACATCCCCANATGTCAAGTGAAANATACNGGCAGTTCATTTCCCCTAGTAGCCCACTCACC 

TCAAGCTGGAATOm'CAGCTTCACTCGGACTGTTAGTTGCTTGCACCGTT^ 

AGCACnmAATTCAAAQ^CCANCCAAGCACAGAGTTTGGTAAACTCGGGGGAACT^ 

AAAGACTGCCTGAGANAGCGCTCCNTCTTCCAACAATGGGCCCTTGTAACCrAAATNTTCCAACG 

ACTNCAAAATGTCAOTCmCATGAGGGNCACACTCCATGCTACTGNGGGTATTCAAATTCCCAGN 

CGNCAGGCTCCCCCTGTTCGCCGGGAACGCNTACCTGNCCCGOGCGGGCCGTTTNAAANGGGCGN 

AATTCCANCCCCCTTGGNGGGCCGGTraCTTAANGGGATTCNNAGCCNGGGTNCCAANC^^ 

SEQ ID NO: 538 ACTGGGATTACAGGCGTGAACCACCGTGCCCGGCCTTCCCCAGATATCTTCA 
AAGCAACTGCrAGTCCrGCTTTTGCACATTACACTCTACATTCrCATTGCCCCAOT 
CCCTCATGTCCTTTTCTAAAGATGTAGAAGACTTTGAAATAGCATAACATTTAAATGGTC 
CACCTGGCTTGAAAGTGACTAATAATAGGAAAATCAGACCTCAGGACTGAGTTCAATTTCAAGAT 
TCTCX:CACTCCACrrTTGCTAAGAGGGAAACAGAAAAACATTACTCCAGCCACACAAATATGATA^ 
TGTGCAAGTATCCCACTCTTACTCATTCAAATCCTACCCATTTTTTCCCATCGTGGAAC^ 
GCCAACTTATACCTCTCTCCAATAACCCATAAAGATGGTGATITGAATTTGGGCCAAGAT^ 
TTTTATGGAATACAAGAGGAGAGTITTCTGCAAAACACAGGATTTGCAGACAAGGAGAOT 
TCAAATTCCAGTTCTGCCATTTArrACCTCAAANGGGGAT 

SEQ ID NO: 539 GGTACGCGGGGAGTTTAGCGTAAACCGGGAAGCGGATCXjCGTGGAGTGAAA 

gtcaccgcagcggagatggacaaaccatgtgggtgccctccaggtgtgtgcgaccatgaaatggg 
agactagcaqgacccacgttggccaaccgtggacctggtccctccagtaggagtcgtgagccagc 
tgaatttgaatgcaaagatggagcaagggccgactggagtcacactgacatccacccccataaca 
tggggacagatcaagaaaacaaagcagoaagctgagaaaatgctggagcgtccatgtgcttggc 
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CATGTTAGCAGTAGTGTCCTGTGAGGATTGATTTCCAAAAACTCCTGTTACATGAGGAAGACACCA 

AGAAGAGCAAAGAAAAGGAGCCAGGGTGGCTCITCCTCAGATGGAGTCTCGGTCTGTCGCCCAGC 

CTGGAGCGTAGTGATGCGATCTTGGCTCACTGCAACCTCGACCTCCTGAGGTGGGAGAATCGCTTG 

AACCCAAGGAGGTGGAAGTTTGCAGTTGAACCCGATAGTGCCATTGCACTCCANCCCTGGGCA^ 

AAGAAGTGAAAACTTTTTTCTNAAAAAAAAAAAAATA 

SEQ K) NO: 540 GGTACAAAGTCCAGAATAACTTGTATCACCACTGTGTAATCAACAAGCTCrrT 
GTCTCATGGAGTGCCTCTGCCTAATTGGCTTATAAACAGTTACAAGAAGGTTGATGCTGCTGAATT 
GCTTCGTTTATACTTAAACTGTGACCTTTTAGAAGAAGCTGTGGATTTGGTGTCAGAATATGTGGA 
TGCTGTATTGGGAAAAGGACATCAATACTTCGGAATTGAGTTTCCACTGTCCGCAACAGCCCCAAT 
GGTGTGGCTTCCATACTCXTCrATTGATCAGCTTCTCCAAGCrCTGGGAGAGAA^ 
TCACAACATCGCACrGTCCCAGAAAATACTTGACAAATrGGAGGACTACCAGCAAAAAGTrGATA 
AGGCAACACGGGATTTATTATATCGTCGGACCTTGTGATTTGGATTGTCACCTAGCCTTTGTAACC 
GCTTGGTGCCTCTTAGGACTTAAGACTACCCTACAGGAACCCTGTCCTGCCCCGGGCNGGCCKn'CG 
A 

SEQ ID NO: 54 1 GGTACATGAGGCTTCCAGCCCAGCCCTAGGAGATCCATCCCAAAGACCCCAC 
AGAGACCTGCATGGGAGGTGGGGCCACAGGTCTGGTATCAGGCAAACCTAGGTTGGAACACTGGC 
TCCATAAAGAGGAAGTCACTrAACCTTCTCTGGGGCATGGTTTCTTCATCTGTTCCCACCTCTGAA 
GACTATCGTAAGACAGAATGAAAGTTAAGCAACTTAACGCAACGCCCAGGATACCAGAATTATTC 
TAAATGGCAGAATCCTACTTAGTCTGTCATCITGGGAGTTCTCTAGGCAGGCAGGTrGCCAGGGGT 
GGGGCTGAGATCCAGATGTGCTCTAGGTCCCTGTCTGCTGCAGAATCATGTGGCTGCTGGAC CTGG 
GGGTCCCTCANGTCCTTGCAGGAGCTGAGGGTAGGAGACTCCATTTGCCAAACAACITANAACTr^ 
GGGCCTCAGTTrCCTTGTCriTAAAAGGAACAGGTAAAGAATTCATAAAAACAATTGACAANGGG 
CTGCCTGGAATTmCCTGTAATCTCTTCCAGCANAAGANGCCTCANANGTGANAGTAAGAACAGG 
AAAACANGGGACAGGTTTCACTCTGTGCCAAGTGTGGTGGCTTAAGGTCTGTAATCCCANCNC^ 
GGGGAGGCTNAAATGGGAGGAATTGATTGATNCCCAGAAGTTCAAAGGCTTCNATTAAACCTGTG 
CCTGGGGG 

SEQ ID NO: 542 ACAAAGACTTTGTAAATGTGATTCAGGGCCCCCAGCACCCCTGTGTCTGCAG 
AGTGCCTTCAAAACTCAGCTGTTCCAGCCGGTGCCAACCTGTGAACTTCCCACCATATCCCAG)^^ 
CTGCTATTCCCCAAACCACTTCCCAGTITCCTTTCAGTAATCTrTCTGAAGGAGCCAGGAC^^ 
GGCCTGTTGTTTAGTGAATTTCTrTATTATTTTCAGCCmAAAATGT^ 
AATrrGrn-CCCTTTTTTTrGCTTCATTrrGTTT 

TTTTAAATTTTTTAATTACCTGTTGTAGGGTGTTCCTCCAGAAGCAAAGAGCA^ 
TGATGTACC 

SEQ ID NO: 543 GGTACGCGGAGGAGGGCCCATGTGCTGAAAATCCGAAGTGCCGCGGAAAGT 
GGAGAGCTGACAAGGAAGGTn'CGAGCGTTTTGCTGGCAAAGGGATTTCTTACAACCTCCAGGCA 
TGCGTCTTTCTGCCCTGCTGGCCTTGGCATCCAAGGTCACTCTGCCCCCX:CATTACCGCTATGGGAT 
GAGCCCCCCAGGCTCTGTTGCAGACAAGAGGAAGAACCCCCCATGGATCAGGCGGCGCCCAGTGG 
TTGTGGAACCCATCTCrGATGAAGACTGGTATCTGTTCTGTGGGGACACGGTGGAGATCCTAGAAG 
GCAAGGATGCCGGGAAGCAGGGCAAAGTGGTTCAAGTTATCCGGCAGCGAAACTGGGTGGTCGT 
GGGAGGGCTGAACACACATTACCGCTACATTGGCAAGACCATGGATTACCGGGGAACCATGATCC 
CTAGTGAAGCCCCCTTGCTCCACCCGCCAGGTCAAACTTGTGGATCCTATGGACAGGAAACCCACT 
GAGATCGAGrrGGAAAATTTACTGAAAGCAAGGAGANCCGGGTACCTGCCCGGGCCGGCCGCTCG 
AAAOGGCCNAATTTCCAACAACACCTGGCCGGNCGTTTACTAGTNGGAOTCCGNAGCTCCGGT 

SEQ ID NO: 544 GGTACTTGTITAACCCAGAGTTAACTACCCTGGATAGCACAAATrGTTTTGGA 
CTCAGAAGTAAATCTGAAGCCTGTATGTTGGTTTACTGCTAAATTCTGATAGTGCTATTC^ 
ATATTACTGCTGATGCAAAGATATTmcnTAGAGGAATrATGAAGGAAGAGTGAAAGAGGATGA 
GGGTGTAGGGAGAAGGAACAAAGATTAGAAAGAGGGAATATTTAGGTTCTTGATrAAGAGGCAG 
AOCTOATrTAAAAGATACCAATATGGAACGAGTGGCTGGGTATCCACATCTATCnTrCTTA^^ 
AAGAATAAAGCCATITCAATTAGATCCAAATAGTrAATAAGAAATCTGCTAATTATGGATTCTTTT 
TTCTTTTITnTGCTACCAAACTACAGATAACCATrrCT^ 

CCTCTCATGTGGAGCCTAGGATTCTTCANATAGTAAAAGACAGNTGGAGTCTGTC AGGACAGCTA 

CCGCCACATTCrrGGCAGCATTTCTTACCAGATTGCTGATTNCATGACTTGGATTTT^^ 

TCTTGGAGGCATAATrrCCGAAAGAGTAAGATAAACTTTCTTATTAAAAACT GGGTTT AGGTCCAA 

ATTATGGAAGATGTTGAAAAACANCCTCCAGTCCCCTTTACATTTTGGAANTCTTm 

TCA 
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SEQ ID NO: 545 ACTCACAAGTAAGAAACmCTCTACTGAAGGATACTGTCACAGAGTTTGrrO 
CAGAGCATCTATATATATATTTATTTATTTATTTTAAAAAAATAAACAACAATGATGAACG^ 
AGGTTCCTAGAACCAATTCrCTTGATTCTCrrACTTCCACAAAATAAAGTGTATCATTTGGCCA^ 
CTACAGATGTGTTTTTTTTTTTT 

SEQ ID NO: 546 GGTACX3CGGGGACCTCGGTGATGTCGTGGGTTCAAGCAGCCTCCTTGATCCA 

gggccctggagacaaaggggacgtgtttgacgaagaagcagacgagtcgctcctggcgcagcgg 

gaatggcagagtaacatgcaaagacgagtcaaagaaggttatagagatggaatagatgctggca 

aagcagttactcttcaacagggcrrcaatcaaggttataagaaaggtgcagaagtcattttaaac 

tatggacgactccgaggaacattgagtgctitgctctcctggtgtcaccttcataataataattca 

actttgatcaataaaataaacaatcttctggatgcagttggccagtgtgaagagtatgtgctcaaa 

catctgaaatcaatcactccaccgtcccatgttgtagatttattggactccattgaggatatggac 

ctttgtcatgtagttccagctgagaaaaagattgatgaagctaaagatgaaagactctgtgaaaa 

taatgcrrgagttraacaaaaactgtagcaagagccataatggggatagattggtcatatgtana 

aatgttgtagaacacaaggagcatgcacattcagaaaacccaagccccccatggaatttttggaa 

cagacagcccagtttaattaaaccagctgggcctnrrca 

SEQ ID NO: 547 GGTACTTriTITTTTTTrTTT^^ 

ANAAAATACTTGAATTGCCTTAAACAAT^OTAAAT^^^TTAAACAACATGANAAANAG 

GTCAAAACATAGTATTTAGTTCACTGAGTTGCCCTGACANATAATGAATGGGGArrGATTTAATAG 

TGACCAAATACACTGGCCAThrmACTAAAGNGCTGTAAAATGGCCAAGNGAGGACAACTGCATN 

TAAAATGANATCAAATCCTCGAGTCCATTCCTTTTAGCAAAAATGATTAAAACCATNrrGGCAGGA 

CCAAGTNTTTGCAAAGCCTATCAAATGGATGGATGGCTTCAANACAGCANANAACCCACAANCTT 

GAAAAGGCCTCCGGANAGTTCACTCAGGATAAACX}GGGTGCTGGCATCGTCCTGGTCCT 

SEQ ID NO: 548 TGTACTTTN'n i rrn"ri"m"i"n j"n i i'incattggttaaacagtttanttccc 

AAAGCTAGTAATTTTAGrrAAATATNCATTANANCCnTmANATGGCTGCT^ 

CAAAATGNGTAGTTTTAAACTCAAACTCGAAAGCCAANATAANCAACTCCTTCAm^^ 

ACCAAGGCOTAANAATTCACTTANACAAAAAGCTTTCAAAACCTACCTAAj^ 

taaattttcaaaactgtt^m'ccctgttgcggacagccxritgatct^ 

ggcatgctctcatgttagctttttaagttactgaaaactataaatttagcotcatt^ 

tatagttttctcattccgaangcttaaacatttaggtcaaaaattaaaatccag 

cotctttagccaggttgtaxagttaacancatggcaanaaactggtgagaacattta^ 

gagcataaaaatacttcaaagccttncgaaacttgaacttaagccattttcctai^ 

ttccaactacagggaaaataaaaactgccctagggactggagaatgatttaancccnccttcaaa 

ATTTTTTCCNGGTTCCCTGANATNAATITmTAGGhriTNr^^ 

SEQ ID NO: 549 GGACCCTTATATTTCACAACTTTCTGTTCATAAGTTATAGTGACATTGCTCriT 
GGTAAAAATGCCTGCTTTCCAATACTTTGATTGCATATTAGACATTCTTAACAG^ 
GTGTTGAAAGTTTrATTTTTCCATTTTTCirn 

TAGGTGTGGTOGCTCAGGCCTGTAATCCTGGCACTTTGGGAGGCCAAGGTGGGAAGATCGCnTGA 

GGCCAAGAGTTCAAGACCAGCCTGGGCAACATAGCGAGACCCCTATCTGTATTAAAAAAAAATCT 

GAriTAATTCTTTTATTTATCATAAGGGGriTAArrCCTGAAGTAAAGGTTTC 

AAAACTGCCAAATGATTTTTCTTCMTrATGTGCGTGATAAAAATACAAAGAATGGTGTGGCCACC 

TCCTCCCTTTCAAGCTNGGGCAACAGGTAGCTCrrrCCAGCCCCTGANCCCAGC^^ 

GGTGCCCGGACAAAAAACTACATGGGCC^n^^^CGTGTCTTGGGGGTGGAAAAGGGAGGGATGAAT 

TGGGGTGANAGAACCCTGTNGAATT 

SEQ ID NO: 550 GGTACCAAATTTGCATAmGAAATTAACACTTTAGCATTTGCTGAACTCAGC 
CCTCGTTAACTCCCTTAACAAGTTCAATCTGAAATCGAATTTGCATTCAAACAGTTTAATGCCACC 
AAGTAGGTCTGAACTAATGXATAAACTCAGCGCCGCCGCCGCCACCCCTACTTTCAGGGCAGCTG 
CTCGGGGAAGCCGGTTTTTTITITGGCCATTTTGCAAACAAAACCAACCCA 

AGCACCCAAGGCCCATGGCAACTTGGTTCCACAAGGGAGAGCCTTCCAAGGCCATATTGTCAGTC 
TAATTAATATGAGCTTTTITTTTTITITCAGTGCTGOT 

GCATTTATTAATGGTTGCTGCCNAAAAAAAAANGAAAGGAAANGAAAAAANGAAAATCCGAAAC 

ACCCCTCCCCNGAACCACCCCCAATACTGCTGCGTGGAAATGAATCGGCATTGTTCCTAGAGTTTG 

GCNCTC riU ' ril ' ll ' ll 'CTGCATTCATTCTaTmGGCAGGACmtlAC 

TATTOTGTAGAGTGCACCATTTGGGGAACCTTTmGAAAAAANACCCAAACCGGGA/^ 

TGATTTTAAAAAAAACNNGGGGGATGNTCTNGAAAGGAAAATGGAAATGGAGGNG 

SEQ ID NO: 55 1 ACCCAAGGGAGGAGCCNAACCTAANCGGCGGAAGAAAGTGAGGAGGCCCTT 
CCAACGlTGATGCCCCTTCTCnTCCTCAAATCAATGTCAGGGAGTCAAAAGGGCTGTAGCACAGG 



74 



wo 02/29086 



PCT/USOl/30732 



ATGGAGTTTGATTTATNCCTNCTCCCCCAACACCTAGGAACTGAATCl 1 1 TlUl l-n i ATTTTTTGA 
GATGGAGTCITGCTCTGTTGCCCAGCTGGANTGCAGTGGTGTGATCTCANCTCACTGCAACCTCTG 
TCTGCC 

SEQ IDNO: 552 ACATGGCTCCATGGAGGTTCTCCAGTCGGTGTTGCTGCTGCTGTTTTCGAGCC 

ttatctcgtctgtgctcctcataagtgtcgtcatcagtgggcagctcatagcggcacaagggacag 
gaatttgtcttgcttagccagggcagaatgcagctggaatggaaaaggtgatggcaaggcatctc 
, aatggcagtctcctcctccrcaaattccaaaagacacacggggcacttgagctcagcctgagagc 
ctctgatgactgtcctggggaggttctcaaccacagtcttggcagctggtggaggcaggtggtggt 
cccaatctactaccaaccccaagtcttcaaagtccatcctattgaa^ 
gcagcatgttggrrcgcgtctcctgctcanggtccgacggctcgcagtccgtgttcatcgaaatag 
gacgccatggctgcccacccttctgacacaaccccccngcgtacctnggccgcnancacgctaan 
gggcgaaattncancnacactggcgggccgttactahnsftgggatcccgagcttcgggtaccaam 
tttggcggtaatcatgggcataaacttgttttcctggtgtgaaaattgntatncgotcaca^ 
cccacaacatnccagcccgnaagcnttaaagtggtaaaagcccnggggtgccctaaag 

seq id no: 553 acaaattgggtcaaatggctgcttctcatggctcttggctccaaaagttacag 
ttccagaaagactaatttccattgattitgggaacttctggccaataatccagatc^^ 
tcrctcgaaatacttcaagctggccaaaactagttttgtattccaaatgtgtaattggacctct^ 
gtaaaaaggtatatgggcttcacagaattcaaaattatttttcacactttc 

seq id no: 554 acatgatgaagcacttacaatagtgtctggcacatacaaatactctgcaaat 
attgcttattatcataagtctctaatatt crgaca tctgaagcctitggggttctaaattagct 

TTTTATGCTGCCCACTTAAGGTGTTTTTTATTmCAACGTCTCTGATCTTTGA 

TTGGAGGATCTTTGGGAATCCTGAGGCTCTAAATTTGGGAAATTTTCCTCCAGACCATCTGCAA 

TGAGA<XACATTTACTGCCATCAGTTATCCCTGCTTCATGTAGGAGCCCTGGTTCAGCTCCOT 

TTCAGTGTGTCCAGGrrTAGCTCCACTQCCAOCCTTCATATAGGTCTCG ACCCAAGATTTTAG TGTA 

TTGCrcCCAGATCATCACCTTAGTTTCCACTCACAGTTCTGATGGTTTTGTTTTA 

TCTGAGrrrCTATTTTCTTTGTTTTTGGCATTTGAGGAATTCTGTTGm 

CG 

seq id no: 555 acgcggggagcgcagcgggggcgggaaggttgtagtgccgcgagttgagct 
cctcttgcctaagtggtcgcgcccccntraagagcagcgattgtaaggagaggcggtcccggtgtc 
ctcgggtcccaggtgattgtgaagtgctgaccaattgccactggacatacttgaaacaaaatagg 
aaaatggcagcaaactcttcaggacaaggtrttcaaaacaaaaatagagttgcaatcrrggcaga 
actggacaaagagaaaagaaaactacntatgcagaaccagtcttcaacaaatcatcct^ 
gcattgcactctcgagaccctctcttaataaggacttccgggatcacgctgagcancagcatattg 
cancccaacanaaagcagct™cagcatgctcatgcacattcatctggatac^ 
actcttgcatttgggaaccntattttrcctgttitacctcgcctttacccan^ 

TTGCGATNGAAAAGTGACTTTI^^ 
GAACTGGTGACTTTTAAAAAATTTT 

SEQ ID NO: 556 ACGOjGGGGTTTGGATCCGGGTpAGTCGGGTGCCGAGATTTGGGAGAGACGC 
TCTGAACTGACTGCCCCGCATCACCGGAGCGTCCCAGCTGCGAGGAGTGTAAACAGGAACATCGA 
TAAGTAGTOTAAAAACTTGCACAATGAAATCCGAAGCCAAGGATGGAGAGGAGGAGAGTCTACA 
GACTGCTTTCAAAAAATTAAGAGTGGATGCATCAGGGTCTGTAGCATCTCTGTCTGTTGGAGAAGG 
CACAGGTGTCAGAGCACCAGTCAGAACAGCAACAGATGATACCAAACCTAAAACCACATGTGCAT 
CTAAAGACAGTrGGCACGGGTCTACAAGGAAGTCTTCACGAGGAGCAGTGAGAACTCANCGTCGT 
CGACGTTCTAAGTCTCCTGTCOTCATCCTCCAAAGTTTATACATTGCAGTACC 

SEQ ID NO: 557 GGTACATAAAGATATGTAAACCACATTAATCTTGCAGTAATATAATGTCAAC 

atcatcatgaacaccttgcagaagcagctctcctggatatgttacaatcrrccttcgtttt 

ttaagagttcctaccaatgcttcaaagaggttggcacatttatcatcacggaagaggaccccaaa 

tttcacgcntaacmccatcagcatittttgaacccaaacgatgaatt^ 

acctcgtgatccacattcattgctgccatgcgtaggtcctcttcatggtatcctgccaggcaccaa 

aaatacaaaaggtgtgtagcacaggatctgccctcggggatctcacattctaaaggcagatgagg 

gcctctctgcgcggttttccttcccgccggagaactggcgcx:gcgtgggtctggacccggttacct 

tcttcaatgcactggggagggtgaggcgaaggaagoaaagtggctgaggcaaaanaaaaaaacn 

gctgtgggaggaagaagaccagggtanggggaaaaagtaatgtttcttgcgctcaaagaaaggg 

gccggccaaacttcccccgngtaccttgcccgggccgggcggtncgaaaag 
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SEQ E) NO: 558 GNOTACTGGTTGrrGTGGAGGAACCTCrGGCTTGCTCATTAAGTCCTACTGAT 
TTTCACTATCCCCTGAATTrCCCCACTTArrTTTGTCrmCACrATCGCAGGCOT 
ACCTGCCTCCAGTCTTACCTAGTCCAGTCTATCCCCTGGAGTTAGAATGGCCATCCTGAAGTGAAA 
AGTAATGTCACATTACTCCCTTCAGTGArrrCTTGTAGAAGTGCCAATCCCTGAATG^ 
TCTTAATCnrCACATCTTrAATCTTATCTCrrTGACTCCTCm 
TTCTAGCrCTCrrTCAGTTCTTTGAACCTrCCCACCTTAGGGTCTATAAGGrrCOT 
TGTTCTACTCTCCCCTCnTCTTCAACACATCCTTCAGTTTAAAGCACTT^ 
CCACrTCCTTAGGGATGTCrCTGTGACCrCCCTGTGCTGGATTAGCTTCCT 
AAANAACTATCTACTTCCTTTGNCATAAACTCATCATTTTCCTATAAACATTAATATTG^ 
CTAGAATGNAAAGCTCCCGTGAGAGCAANGATCCCTCCTGTTTACCCTGT 

SEQ ID NO: 559 GGTACACACACACACACTCTCTCTCTCTCTCTCACCTGGCTAAGGCTTTTA^ 
ATTATGAAGATAAATAATCTGTTTCACCAGCTGGAGTGAGTTGCAAGGAAGATTGCTGGAGCTAC 
TCAATCTAGCGCTAATGGTTTGGATTCATTACTGCAAACCTACATAAmAACATATn'Gm 
TAGTGTGACAACTGATGAAAAAAAATGOAGCAATCTGAATrGTATAAAATAACTTAAGAAGGAAG 
AAAAGTGATATATAAATATATTITGCAAATGTCACATTAATTTAAAAATGAGTATGATTG ATm 
TTTTTAAAGTGGGCATTCTTCACTGTTTCGAGACCTTTGTATGTATTT 
TTCAGGCCATrATTATAAGGTGTTATnTGGCCCTCNAATGTAGAAGTTATGTTTAAAT 
TCAAGGCCCT 

SEQ ID NO: 560 ACTTTTAATGGTGGGAATITACAGTAGAAGCATCCnTTGCTGAGTrATACATT 
CCTTTATCAATCTCrmGATACAACATTTAAAACAAGTAGCITCAAGA^ 
GGATAGTATTTCTAAATAGCATTCAGGAACAGAGTATTATTGCACAGATCTGAAGATCAAAAAAA 
AGCTCAAGGAAATACAGATCGGAAGTGCTGATGAGTTATATTTATTGAAAACCCAACTTTTAA 
AAGTGCTAAGATCAGTCACCCATGTGAATAAGAAGCCAGGAAAGGAAAGATGGGGAAGCCCAGA 
TCACCAGGCTTCTATTAAGGAGGAAAGCAACAOAGGAAACAGTGAAGGGGAACAGAAGGGGGTA 
GCAAAGTGTTACAGAAAAGCGGACTGGATAGACAAAACTGCAGAAGGTGTATGTTGGGGAGAAC 
TGAAAGGGAAAACAAAAATACTTGACATAGTCTTAAGTAGAAGAAGGCNAGTTAGAGAAAAACA 
AAAGTATCTACTGGCCTTGTCAACATACAGACTTCAAAATCCCCTTATGAGAATCCAAAGAATGAT 
GTGTGTAAGGGAAAATTITATTTGCCCTTCCGGGAAGAAATCAGTATCTrTGCCAAATOT 
GACGAAATCAAAGCCCCATTAATGATTCANAATCAGTGGCTTGACCTCCTG 

SEQ E) NO: 561 ACTTTNTTTTTTITTTTTTTTT^^ 

GTTTATAANATAGTTCCCATTACATATAACATTACGGTCACGGATTCTACAGCCACAAATGCCCGC 

AGTCACATAAATATATCCAATCCAATCAATGCCTTTTCCTGCTAACANAGGCATCTGAAGTTCAAA 

GGGANAGTCNCATTTTGAGTAAAAGTCGTCCTTAATGGGAGGGCTCCTGTCAGTGCATTAGGAAC 

TAGCCAAGGAGCCITGCTTGCCAAAGCTGTCTGACTCAAAGGAGAGGAAGGGACANATGGCCT^ 

TGACTGGGGCTGAGGCANAACTAGATmCTCTCTTGNGGTTTAAGATATTT^ 

TCAAATCCTATAGTGTGAATATCTGGGGAGTTCTAACTTCrGGATGAAAAAGGAACCAATTTA^ 

GTAAGAAATANAAGCCTGCriTAANAGGGACCCTAACTGCCTCCTTGAGGAGTAANGGAGTCANAA 

GGAAGACCCTAACTCACCATTCCITGGCCCAAACCANTTGGTTTTACCCCATACTCC^ 

GGGGTTCAGGGACCCTGACACCCATTTCANGGGAGTAAACCATTTANAAANCTCCCNCTTGCT^ 

ANCTAAAAATTTGCAAGTCAAAATTCnrrrnGGAATTTTAAAAG 

SEQ ID NO: 562 GGTAClUU"llU"riU'lUUUl"l'l"J"ll"lUUl'lU"lUU"lNGGAAATAAACATTTATrm'A 
AAAATTAGTTTTGACbTITrTAAAGNGAATGCANACAAGGNGTT^ 

NAAGCTANANAAGTAAATTCCAAGG^r^GGCAATAACTGACTCACATT^mTACAAGTGGCC TANA 

CAATANGGAACCTTTCACCTNAAATTCACAGAGCCATGAATCACCTNTGCTTCCCCATGACCm 

CCATATCCTTCCTACTCTGTCTTCCAACCATGACACAGAACTGAAACATACTTTAAAAATCTNATT 

CTTGGCTAGGCACGGTGGTCACATNTGTAANCCCATCACTTTGGGANGCCAAGGCNGGCGGATCA 

AGAAGTCANGATATTTGAAACCACCCCGACCAACATGGGGAAACCCTGGTTNTACTAAAAAANAC 

AAAA 

SEQ ID NO: 563 GGTACmCTGGATATGGGATGAATTTTTAAAAGATCTGGGAAACAACAGTA 
AGGGGGAAAAAAGGCCAGAGTATGATTTrTTTAATGGCATCAAAAAAATAGCTArrTAAA 
GTTATTACAACTACTACATAGATCCAAGAAGTGAAAATAATACAACAACAAATCAGTTGACAATG 
CTGATTTAAGCATGCCATAGGCATCTrTCAATTATGACTATGATCACCAAGGNCACTGGACCAGAC 
AAATATCCCTGTTCTCTGGAACATATGCCTCAGGTTACAAAAACTAAGGAAACTACCACAAAGAT 
TTTGAGCAGCTCTATTCTTGGACCTGCACAATGCTCTACAATGAATATTATGTGATAAAGGCA/^ 
GTOTAACTGAGACTTTATTTATAGTTACAATTCTTTGCrTGGATGAATAGT^ 
CTTTTAAAACTATATAGTACCCATTGCAGGACATTCTCACTGGAGATCAGCrrCTTm 
AGCCAGAAGTCATTGGGTGAAGCCTTGAATCAGCTAGTTCCTTAAAAAAACAAAATCITGTT^ 
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TGNCTNGGTGCTAATGAGGGGAAAATGTGAACCTCAANGGANAAAATGGGTTITGGAAACrc 
ATANNNATAGAAAGATATNGGAAAACTCTTGGGGCOTGACCTGGGGGAATANGTAANTTTC^ 
ATTTAAATT 

SEQ ID NO: 564 ACATGCTCTTCAGGTTCTAGGGCTCCTGTTAGGGGAGGGAGAAATGTTGAAT 
CAAGAGGGAAAACAACTACTATGATTTATAAACATATTTTAATGTAAAAATTTGCAm 
AGTGGCCCTGTTTTCTGTGTTAAAACCCCATTTGGTGCTATTGAGTTTGTTCm 
AGTGAAAATTGTTGATCTTGCTGTAGGGAAAAATTAAACTCTTTGAATCTCCAAACAAGGAAGm 
CAGCATTCCCTTATGGATCAGAGGAACCTTAGAGACCTGAAATTGTTGCTTCCAGTTTAGCTG^ 
CTCAAATTCAAGTGAATATTITCCCTrCTCCCITrACCCTTCTCCAGAAATAAA 
GTTTTCAGAATCTTAA 

SEQ ID NO: 565 AATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGTACTGTTCTAGGACTGGCCA 
AAAATGGGCAAAATGTATCACTCCAAACACTACrrGATTCAGCATTGTTTrCATGTOT 
CACCTGCACTrrGTTTCTGCACTATrATGTAGTGCATTTTAACnTA^^ 
ACTTATTTAAGATACATTACTGATATTTCATTATAATTAGTTCACCrrCCCTGTGAAAC^ 
TGTAAAATGTTGNGGAAAATGATACATATGTGGATGCTAATGAAATCATAGTATTTTGGGGTAGCT 
NCTCTGAANACCTNAAAGACCTGCGGCrmGGTTTATAAGTGTTGGGGCCmATCAAGCCCCATCT 
GATCCAANATCAATATTTTTTTGAANAAA 

SEQ ID NO: 566 GGTACACAGGCGAGGGCATGGACGAGATGGAGTTCACCGAGGCTGAGAGCA 
ACATGAACGACCTCGTCTCTGAGTATCANCAGTACAGAATTTGGAGCTAAGGACTGTGACTGAAA 
TCATTTTCCCATATGAGCAGACCCTGTGTGTCAGGCCTGTTTCCCATATGAGCAGAGCCTGTGTGC 
AAGTCnXiTTTCTGGCATGTCCCTCATTGAGGAAGGGAAGCAAAAGCTGGTTATTGCCAGGCCT^ 
AACACrrAATATGCAAATrCTATCATNCTGAAACTGGGGCATCTGANGAAAAGGTGACCrr 
GATGGCrrrATTTGCATGGCTCTGCCTGTCTGCAGTGGTTGAGTCCTCATNACCTGGTATTN 
GAGCANATGTGTGC^GAACGTTGATGCCCAGGCAGATACCTC^^mAAGACTGCTGCAGGCANTGC 
TTTAAAAATGANCCAGT 

SEQ ID NO: 567 ACTCTAATTTCACTAACTGCCAAAAGGTTTTCCAGAATAATCTCAGTTGOT 
ATTCCTTTAAAGATGAAGCCCGAAGAACGCATGGCGATTACTTTANAGGAACAATTAGCAGCAGA 
GGCAGGGCTGTGCTGATCCCATCTGGCATCGCTGGGAGCTAACATTAAAGACATGGCACTTTGGO 
TCCGGGTCCAGGTCCTGGTTCAGAGCAGCTGCCACACCGTGGCTACTAGAGGATCCTTTTCCGGCT 
TTGGAAACTGAGGCTGACTGCACCATCATCACTAAAGGCCTGAGACTGCTCGCCGTGCTCAACAC 
CGACTGGAGTGGCCATTGTCTTCCANCCACGCGGCCGACCTCGAAAAAGCCCCNCGTACCT 

SEQ ID NO: 568 gggtaccttcagaagctggatacagcatatgatgaccttggcaattctggcx; 

ATTTCACCATCATTTACAACCAAGGCTTTGAGArrGTGOTGAATGACrACAAGTGGTTTGCC 
TAAGGATGICACTNATTtTATCAGCCAnTOTTCATGCA 

SEQ ID NO: 569 GGTACCAAAGGAGAATTTGGAGAGCTGGCTAAATTATTTGAAGAAAGAATTG 
CCAACAGTGGCGTTCANAGCCTCAACAAAACCAAAGGATAAAGGGAAGATAACCAAGCGTGTGA 
AGGCAAAGAANAATGCTGCTCCATTCAGAAGTGAAGTCTGCTTTGGGAAAGAGGGCCTTTGGAAA 
CTTCTNGGAGGTTTTCAGGAAACrTGCAGCAAAGCCArrCGGGTTGGAGTAATTGGTTTCCCA^ 
GTGGGGAAAAGCAGCATTATCAATAGCTOAAAACAAGAACAGATGTGTAATGTTGGTGTATNCAT 
GGGGCTTCAAGGAGCATGCAAGTTGTCCCCTTGGACAANACAGATCACNNTCATTAGATAGTCCN 
ATCrTCATCGTATNrrCCACrrAATTNCTCCTNTGCGOT^ 

TGAA^I^r^AAATTAAAANCCGTATGGGAGG^ITGCCNAGTG^fNATTCCNTTTCCCAGNGCT^GA^ 

CTCNGANCANGTAAGNTACCrGGCCCCNGGNCCGGGTCGCl^CGAAAAGGGCCGAAATTACCA^ 

CNCCANTTGGCGGGCCGTCTTCTATTTGQAATCCCNGCCTT 

SEQ ID NO: 570 TCGAGCGGCCGCCCGGGCAGGACGCGGGGATGATGATGAAACAGAAAATGG 
CCCCAAACCAAAAAAACGACGTCCACCAAAAGCAGAGAAGAAAAAGGCTCCCAAGCCAGAACGT 
CTGCCTCCATCAATGAAGGGAAAAATAAAATCCAAAGCCATAATTTCATCAAGTGATGACTCTTC 
GGATGAGGATAAACTTAAAATTGTTGATGAAGGACATCCCAGGAACAGCAACAGCAACAGTGACT 
CAGACGAGGACGAACAACGAAAGAAATGTGCCTCATCAGAGAGTGATTCCGATGAGAACCAGAA 
CAAGTCTGGCAGCNGAGGCCGGCANTCCCCGGAGGCC 

SEQ ID NO: 571 AOCTAGCTCTGAAAACACATCTACAGAAGCAAATGAACTCATCTGCAAAATA 
AAAAGCACATATCmAATTTCTAATGTTTrATTATAGATTTrrAAGATACATAm 
ATTAGCTTAAAGAAAGTAAGTCACACAAGAATAAGCTTTTGCATGTCTGCT 
AmTAAGTTTGTTATGAAGTTATAAATAAAACACATATTGTGCTTATGTGAATAAAACCCAGCNT 
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AATCGGAGTGCATTTGAAATGCCAGAGTATTCTCATTGTTTCANTAAGACAAAAGAAGAACATTTC 
CTTTTAGAATTAAANCTTGGCTTCNGTTTrCCTITAT(^^ 

ATTCAAAAO^AAAATTTGTAAAAANNNAAAGGNGACTAAATNAAACATGTTCCTGGGCC^^ 
NAANNANCCATTTTNGGAANAACTGGAGGCAAANACCAAAGTTA 

SEQ ID NO: 572 ACTAAGTGTATACGTATTmGCCACTTTTTCCTCAGATGATTAAAGTAAGTC 
AACAGCTTATTTTAGGAAACTGTAAAAGTAATAGGGAAAGAGATTTCACTATTTGCTTCATCANTG 
GTAGGGGGGCGGTGACTGCAACTGTGTTANCAGAAATTCACAGAGAATGGOGATTAAGGGTAGC 
NNANAAACCTGGAAAAGTTCTGTGTTAAGANCTTGCTGGCAGAAATAAACTTTTTGGAA^ 
ATTANNCACA 

SEQ ID NO: 573 ACGGTCTCACAGACAACGTTGAGAGAATAGTAGAAAATGAGAAGATTAATG 
CAGAAAAGTCATCAAAGCAGAAGGTAGATCTCCAGTCTTTGCCAACTCGTGCCTACCTGGATCAG 
ACAGTTGTGCCTATCTTATTACAGGGACTTGCTGTGCTTGCAAAGGAAAGACCACCAAATCCC^ 
GAATTTCTAGCATCTTATCrmAAAAAACAAGGCACAGTTTGAAGATCGAAACTGAOT 
AAGAACAGAAAAATTTAGTTGCTACTGTAGATTTCATGATTNAGANGCAGCCTrrAAm 
GATCATmCCrmTrrrHGmmrAA^iAANCCTTNCCGGCCA 

SEQ ID NO: 574 acgcgggtgtgcttcctttcaaagggttggacctttaaattgctgcaaaaggt 

AAATTGTATTTTTTTTTAAGTATTGGTGTTCnTrACTCTAGCT^ 
GGTTrCTTTAAAAGTTCATGTAATATTrCTGATTTITCAGAATATTTGC^ 

AAAAAACACATGCATACACACAArrAAGAGCTCATGTCTTAGCAAGATCTGGGAAACCAACATTG 

CGAGAGTAGCTATTTTGAAAGAATAATTCTCCAGAAGTTAACATCTAATATCTAGTATCACCAAAC 

AGTATCGCTGGTCTTTTTArrCATTTGGAATGNATTTATTTITrTACCT^ 

ATTG 

SEQ ID NO: 575 ACGCGGGATTTCCGTAACTArrGTAATTTCCACTTTrGTAATAATm 
AAATATAAATTTATTTATTTATITITTTAATAGTCAAAAATCTTTC^ 

AAAATGArrGTGTTGCTTTTAGGATTGATCAGAAGAAACACTCCAAAAATTGAGATGAAATGTTG 
GTGCAGCCAGTTATAAGTAATATAGTTAACAAGCAAAAAAAGTGCTGCCACCTTTTATGATGATTT 
TCTAAATGGAGAAACArrTGGCTGCATCCACATAGACtnrrATGTrrTGTTTTCAGTTGAAA^ 
CCTCCTTTGGCAACATTCrGTAATGAANCANAATriTrrTT>r^ 

GGTTCTTGGAAANANGGNTCAATGGGTATmGGGGCTGGGGTATTGAACACGAAATTTTAm 

CATNGNGNTCAAAATATACCANTGGTNAGGTTTTAAAAAAGTATTCTTGANGGGTCrm 

ANTAATTNAAACTTTCATAAANNGTCCCTCGCCCGCGACCACN 

SEQ ID NO: 576 ACGCGGGGCTTTTnrCATTCCCGTTGTTATGGAGGGCCACATCTGCCAAAGC 
CTGGAGTCTGCGAAGGCCGGGACCCGGTTCCCCGGCCCACAGTGGGGGTGTGCAAACCCGANAGA 
ACTGGGTTGCAAATTCGTGAAGAATCAGCATCATGTTTGGCAGCTGAGTATTGGAGCCAGNAGCC 
TGCCATGANGTTTl^GAGAACANAGTGCTGTTTTANANCTGGCAGCAGCATCTCAGCCCAANAGAA 
GGTTATATTCCCAGAGGATGTCAGTCCCAATGACCANNAGCTGNCATTAGATTTGGATTCTGAAAN 
TAANNGGCTTAACAATTGGGTGNATAAAACKTGNNTITCmnT^ 

SEQ ID NO: 577 ACTGCCTAGCATGGTATCTTCCTCATCATCAGAGGTGCTCTATACATCTTCAG 
TTGAAACGTGAAACAGACATCCATATGCAAAATATmCAAGGGCTTTGTTGGCTTTTACATGm 
TTCmAGATAACTGGTAATGATGCACATTACAAAGGAGACITITCTAAATCTCAAGTCC^ 
AATTTTTCTITGGAACAACGGCACATTTTCAATGCCAAACCTTCTCCTAC^ 

ATGiXAAAACTCTGAATTCTTGTAACGGATCCTGCAACTAGTTCTATCCAGAAGATGGAGACAATA 
TTCCCTGGAGTTGACTGAACATGTGAGAAGGCACAGCTCANAAGGAGAGGAAGGCTGAGGGCAG 
TGAAATGAGAACCTATGCATCACCTGGCCTTTTTACATGTTAGTCTATCCTACrrATCCCAGGAA 
ACTTCTGCTGTACC 

SEQ ID NO: 578 ACGCGGGGGCGCTACTGCCGGAGCGGGGCGGTTATGGCGGCGGAGACTGCG 
GGCCCGTAGCTGGGCTCTGCGAGGTGCAAGAAAGCCTTTGAGGTGAAGGTGTATGAAAGTCATCA 
TAACAGATGTTTTCCAAAAACTTGTAGAAGGTTGTGAAAAAACTACTAGGATCACGCGGCATGTA 
TTGAGCATATAGGTTGCTGTAGATGAATGTTCITAGCrGTCATGTTTAAAAATACTTCT 
ACCTCAAGTGTGGCATGCAGCATTTTGGAAGGAAAArrGAAGACGTGTTCAAGAAAACATGAACA 
GAAGCAAATGATGAAAATGAGCATTTTAOTGATGTTGATAACATCACAATAAATTATGCCNANA 
AAAAAAAAAAAAAAAAAA 

SEQ ID NO: 579 ACTGTANAATGTGATGGAAAAGCATTGATGAGAATTTATTGGCAGNTCAGAT 
TGTGTTTTCCCAACTTAGTCTCTTTATTAATTOGhrrAAGGTTITCTCCAAAAA^ 
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TGGGAATTATTTAATGTAACAGTGGGCACAGATTACTrATCTTCCTTCTCTGCm 

GCAGTAACACACACAATCCACATCTTGTGCACCTCAAATGAACAGACTTGGTrrCCTTGC^ 

GACATTTCCATGACTGmCACATACAAACTATTGGGTGAGOTTTTTCAaCTGITACCGACCCACG 

TCCTGCTGTCTCTGTGTGGCCrACAAAAACTGTCCATTCCCACCCCTTTGCTTTGCCAmGC^ 

GTCTGGAATTGTCAGGTCTCAGCTTCNAAAAGTCCTGGTTCCACrGACAGGACACATTC^ 

GGAATTAGACCTCAAAGTCTAGTTTTOTATGTNGGTATGAAGGGGAATTTm 

AAAGCTGTGAACAGCATTAGAACT 

SEQ ID NO: 5 80 ACGCGGGGAGTTGTAGCTCAGCGTGGCTACAAGTAACTGTGGTGTGGAAGCA 
GAGTAGAGAGAAAACTTGTTCCTCATTAGAGAGAGAGCCACACTTCTCACTGCTCACAATGAGAG 
GCCAAAGATTACCCTTGGACATCCAGATTTTCTATTGTGCCAGACCTGACGAAGAGCCTT^ 
AGATCATCACTGTTGAAGAGGCAAAGCGCAGGAAGAGCACATGCAGCTACTATGAAGACGAGGA 
CGAAGAGGTGCTGCCTGTCCTACXjGCCCCACAGCGCGCTCCTGGAGAATATGCACATCGAGCAGC 
TGGCCCGACGCCTTCCTGCAAGGGTGCAAGGGTATCCATGGAQACTGGCCTATAGCACGTTAGAG 
CACGGGACCAGCTTAAAGACGCTCTACCGGAAATCGGCATCACTAGACAGTCCTGTCCTArrGGT 
CATCAAAGATATGGATAATCAGATTTTTGGAGCATATGCAACTCATCCTTTCAAGTTCAGTGACCA 
CTATTATGGCACAGGCGAAACTTITCTCTACACATTCAGCCCTCATTTTAAGG 

SEQ ID NO: 581 ACTTCATTmCTGTGGCACAAGATACTCrAGGCTCATCTTGTATAGTTCATAT 
CCCAGCTCTAGAATCAGTTCATTTTCTAAGGAGCCCTGGTTCCTTTTATTGGAAA^ 
GCACCGGGTGTGCTCCCATTCTAGTTGTTTTCTGACCACATAACTGCTAACAAAGATGCTTCACTCT 
GGCrACACTGATGTGAACTTTGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAATGAGCTOT 
ACATTACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAAAATATGCA^ 
AGTATGAATGAAATAAGACAAACCATCAGGCTTGCTGTATTGTAAACCAACACAATATAGTTATA 
ACAGATCTGTAGAAGGATCCTTAGAATAAGAGAGTCATTTGTCGGGGGTCATCAGGGAGAATACT 
GATAGTATCITCGGCTTTGNCCGCATAACAGACACANCATGGGATACTCCTGAAATTCATCCATGA 
TACTGAATGNATACTCrGTCTGGGCTATTACATGAAGGANCAATTCTTA 

SEQ ID NO: 582 ACCACAAATGCAGAATCAGAGCAGTAGGAAGAAGGTTAGTGCAGITATACTT 
TCATTAAAAAAAATrCTGAATCACTGCTATTTAAAAACACCTTGAAGCAAGTCTm 
TGTTTTTTAAACTAAGGTAGCAAACATTTTGCCATGTAATGGCAGTGTTATATGCCGTTATOT 
TTGTATAAAGAAAACAACATGAGAOATTTTTAATACTGGAGTTTGGTTACATTACATATTTAAGCT 
TCTACACAGAATGATGGACACTTCGAGAAGCTAATCCTTATCCAGAAACATTTTAATCTCTTAAAA 
A ACAAAG CAAAACAAACAAACAACAAAAAACCCAAAACTACGrrGCTCCTTTTCACAATAGTG^ 
CATTmACCATAATTTAGTTATGGCTACAAAACATCAGAAGAri"! 1 rri'i'AATGTATCTTCTCTAT 
GGTAATTAAAAAAAAAAAAAAAAAAAGT 

SEQ I D NO : 583 ACGTGCTGGACACCACTTTTAAAAAGCAATCACTGTGCTAGAAAAGTATATT 
GGCTTTGTTAGGATTAAAGTTCATTAACTTCAATGTAATCATGCCTCCrrATTACTGAAGTCAGATTG 
GAACCACTAAAGATCCAAACTTTCTGTCTGGTAATAGAAAGTAAAAATCTAGACATCATTTACAT^ 
TGAGAAAGCTGTTTTTAACATTATTTTAAAATGCeAAATATGTTCmCTAGAAA^ 
TGTT TTTGTTGGATAGCriTITAATTACATTTCAGAGAGGTGTAATm 

rmGAAAGGrr TATGA TTCCA AAAT AAAGATTTATA TGACTGGT GATACTGGCTTTACAGAAATT 
TCAGAGAACTAATTl iTAAAATCTTTAGCATTTAAAACTl' 1 ri'l'lGnTrGNTTTCTGACATATTCTG 
ACAAAAGAGCAGCAAACCACTGCTGTGTGGCATTCTTGGAGGTGTGCTGTGAATGTGCTTTTTAAG 
AAATTAAAAAAGAGATCCTTCTT 

SEQ ID NO: 584 ACTGTAGGTATTTATTAATAATAGCAATGAAGATGAAAGAGTGATGTATCAG 

agaggtggagataaaatcagtaaaacttagacactaaatgataggggaaggtggaggagaggaa 

tgagcctagaaaacttagaatataatggttctaaaattaaccaaagtaagggacacaggcattag 

agtaggttitgcagagaatgaatgttttaagacacacacaggtgtctctgggacaaccaagaaaa 

gtgcaacaggcagatggattgaggagtctggctaaagataaggatttaggaactgctgaattaaa 

attacccaagcgtgagaagtggtgttgtgattaagagagaaaaaaaaatggaggtctgaggaat 

aacatttaaggaataaatgaagaggccaaaaggtggggggtgottcangagtgancaaaatgta 

agaagtcaagggaataaatctttaaagnaggggttggcaaaaaatgtcnaatccaaaaaaaagn 

CAAGTCCCTCGGCCGNGANCCCCNC 

SEQ ID NO: 585 acttntgaaatgccatcancaggcctcctacaggagtgcx:cancaaggctcc 

AATNATGCCACCAGCCACCAGGCCACNCAGGCCrACGmTATCCTAAAAAGACTTCCCGTGACAG 
CTCCTGCNATTACAAAATGGCTTAAGGCATCTTrATTTCGGTATACATNCANACTAGTG™ 
TGTTGAATATAGNNACAAACACrGCNmTCTCCAACCCCAGCGCCNGCCATAACNAATGAANNCT 
CATGTGGCANCAa^^ATGTGCATATTGCACAGCATCNAACCGGCTTTGATAAAmCTGCCTGGCTC 
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TGCTNAATGTATTGTTGTNTAGCATGAATAAAAGCTGGTATTTCCCATACACCNAATCAATGATGC 

CTGCTGTAGCTGCNGGCNTACACAAAATTAGCAAGGGGNCCTOTGAANTTCTrrGCTGOT 

GCCAANCANCTCCCTGNGGCNGNTCCbrrCCATATTCCGGATAATAGGGCTTTGGNACXj 

SEQ ID NO: 586 ACCAAATAGGGhTTNCCCACCCCACCCCTGCGACAAGTGCTCTTCTAGAACAG 
GTTCCTACCAGCAGCACTGGNGTGAATGAAAGAGAGACCCANCCNCGTCTNACA(>INGTGGAATT 
GCACTTCTTANCNAAAANGAACTTTATAAAAhrrTNGGGATTT^ 
CC 

SEQ ID NO: 5 87 ACATTTGCCAAGACAAAGGTrCAGAATTACTAATTTTAGATATTATGATATTC 
raAAATAACTATTmATCCTGTAGTTCTATGATTATATGATTTGTAAATAAGAAGCCTAAC^ 
TAAAATTCXrrGACTTTAGTCTCATATATCrrGTCAGACTGTCTaJGTTCAAAT^^ 
CTTCAGTTGTGCGACCTTGGGCATGCTACCTAATCTCmGTGCCTCAAm 

GGGATGATGATGATNACAATAATACCTACCTCACAAGGGTGTTGTGAGCATCAAATGAGATAACA 

CAAGTAAAATGCATAGAACAGTTCCCAAGCACAGAGTAATTCAAATAAATATTAACTAGTNATAG 

TAGNGGNGhrrAACTCGNGAATCTTTTTAATAACATAATANGCnTTGATTTrAI^^ 

GTTAACTTClU-ilCCCTTGTATAAGTTTTATNTCAAGTAAGGTANGGTGTTTAAGTTA 

GTCCCAATCAAGGGACCTAATGAAGATATATATrAAATAATTNCCTTTTTTATTGGG 

SEQ ID NO: 588 GCCCACTGATGTCCACATCCCANANAAATTNTCAAAAGACGGAAATCAGGGC 

aaagacrccatataacaaagcgcaaggatatgctactaaaagttgctgtccttccatggctaang 

cagaaataaatattttggaagcagaaaaactacaccatccaataatcccngcagtgagaatgatt 

cctaccattccttgcaaagaaaatatcactgcanagctggaaagtaagatcatgggcagaagaca 

atatccaaggacacttgccacacnaccnaatgaaacacctgncatacrcattaagtttnataa^ 

aaaacattcctagacattcaattgcactgatccccgnntacv^tnnncaaacitg 

cagtatcattgtggctccnnatgcangcaaaanncchntggacctgccaaattagtt 

seq id no: 589 acttgaagatatctcggcaaactgaaacaatggtgaotctgacgtaattca 
ggagattctgaaacatcaggcctgcatgtgccagaggatacctctggtgtgccaagagcatgttc 
tccagctaacagcagcatactcaggcctcccatttgagtaggatcagccattccaaagttaagctg 
aggcatttcttcagtctttgcitrcttggggctrggcaagtctot 
ccaaagriu'cluu'cgtggattgacagtgggtgttggggatttcacangcttgtttgtggtangaca 
ccatttgtagccaggatitgctitcataaatgcatncttanactncttgccatgtctgng 
gmrmcttrgggatcaaggaacngccnacccaatcngctagntcttggttgcccct™ 
aagcttgggggggttccttacgtacctgcccng 

seq id no: 590 acatcttaaacacgacattgacactgcaagtgcrtggatgtcctggcacctca 
gaacacaactttgtgcaaaaggagtgctgtgcatctgaatagtgcaattcatttccaatgcaaga 
atgccagritcttagggcccccggtcaggatattacccacttggtttaagttgtgatttcaaatccc 
attttctatgggagatacactggtggaacaaatctgcagagggccctgcacctgcgctggtggat 
caacggtgtcttctcacctccagggctagctgcttgccttccctcgtggggcttaagtggctgi^ 
agcagatagacagtgtagatgaagtcctgggccctgctgctggactgagcagtgagatgactggg 
ctggcatttgttcntrctctgctgtgtgtgaccrrctactatgtgcctggaaact^ 
ttaatgctgcaaacaggattttcttattgcctaagcatgaaotggnatgaaaaty^ 

ATTTACTATGTAACTrGGATCTTATAACCTGGACTGTAAGTTTGTTTACCTCCAGTGGGAGTT^ 
TTTAAAG 

SEQ ID NO: 59 1 ACAAAGACGCAAAriTTCATAGTGCCTAGAAATAGCACAGATCTATTCTACT 
CAAGATTATTTGAA'll'rrriCAGGGTATTCrACCTAGAGCCTGTGGTTAATCGCCTCCCrGCTCCCC 
CTACXnUllATTCCCTACCCCCTCAGGGAAmGGATACATGTGAGGAATAGTCCmG'riU'ri'Crr 
ATGAACCTAGAAAATTACAGATCATAAAATCTGGATATrAAAGTAGTTTCCAAAAGCATCTCATG 
GGAAATCAAAGTGCTCGGCATTTCCGAGCTGGAGAATAAAATCAAGAATCCTTAANAGAGAAAA 
GAAAATGGAGATGGAAATATGAATTTTAACTGGGAAAAATAATTGGAAAGCTAACCrm 
ACTCTAATTATGAAAGTTGGAAAAGGTTGTGATGCTCAGGATTATCXrrGTAATGTGTGAAGATACA 
TAAATTAGCACCTAATTTATAGGCAAATTTTGGTAAAACACAAAAAATTATGGTATGTCTAA 
ATGAAAAGTATGTAAATGCAOTGGCCTAAAACCTGTCAACTGKAATACAGTrrAATGTGGAACTT 
TTCTAATAATArrGAGCTCTTG 

SEQ ED NO: 592 ACACACCAGTCmCTTTGGTGTTTAACCTAATGGGCTGAGGAAGGGAGGCA 
GCCAGGAAArrATACCCATTAAAGTGTGTGGTTAATGCCTATATATATCATTTTTACTCACAGTTA 
GTGGACITATCTTTTAAAACAGAGGTATGTGGCAGTGGGGTAAGAGAACTTCCCTTArrATAGAGA 
AGATTCCAAGAATAGGTTCAGACATGATTTGAAGAACCATGTAACTCTGCTGCCTTGCACGGTCCC 
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(nrriTAGCACTAATAACTCCrAAGTGTCTGATAATTrCAAGGTTAGCAGCACTCCCA^ 

GTCCGGGCTGGGTGGTGGTGGCTCATGCCTGTAATCCCACAATTTCAGAGGCTGANGCGGGCGGA 

TCACCTGANGTA^^V^GA^m'CM^AGAACCAGNCTNGC^^^CCANGNGGAAACCCQ^m 

TACAAAA>™ANCCAGCGTGGGGNAGGGGCCCTGTATrCCCAGCTACrrrGAGAGCTGGAGGNNGG 

ATAATNGCTTGANCCAGNAGGCAGAGmGNTTWGAGCNAAATCCGCCArrGTTTC 

SEQ ID NO: 593 ACCXCAACTTTGCTGGACCTCATGCAGCTTTAGCTAATAAAAGTTTCTT^ 
GCAGATAAAGTTACAATGCTGTGGAATAAAAAAGCTACTGCTGTGTTGGTAATAGCTAGCACAGA 
TOTTGACAAGACAGOAGCTTCCTACTATGGAGAACAAACTCTACACTACATTGCANCAAATGGAN 
AAAGTGCTGTAGTGCAATTACCAAAAAATGGCCCCATTTATGATGNAAGTTNGGAATTm'A^ 
ACTOAGATNTOTGCTGTATATGGhrmATGCCTGCCAAANCGACNATTTTNAAC^^ 
CCTGTATTTGACTTTTGGANCTGGGNCNC>INATGCAAGNCTAm"ATANCCCTNATGGANA 
TAA^^mTTAACNTGGTN^TGNAAANNCCAGGGGNNAAATGNAATGGTNGGG^^ 
TNCAAhfN'l-J U - r JNTAATACCGGNTGGrrATGNATTNTACCTNATTTGCTTGTACCC 



SEQ ID NO: 594 ACATACCCAAAAGAATTAAAAGCAAGGACITGAACAAACATGGTCCTAGCA 
GCATCAATCACAGTAGCCAAGAGGTGGAAGCAACCTAAATGTCCATCGACAGATGAATAAATCAA 
CAAAATGTGGAATAGTCATACAATGAATATCAGCCnTAAAAGGATGGAAATTCTGACACATnTA 
CAACATCGATAAAACTTGAGGAGCTTATAGTAAGTGAAACAGGCCAGATACAAAAAGACAAATA 
GTGATAGTTCCCCTCAGATGAGGCACCTANAATAGTCAAATCCACAGAGACAGAAAGGAGAATGG 
AGGTTTCAGGGGGCAGGAGAGAGAATTGAGGCGTTAGTGTTT^OTGGGTGCAGATTTTCANCTGG 
GGAAGATGAANAGGTTCCAGGGGTTNCCCGCACNACAAGTOGNATNNTACTITNAA^ 
AGANGGCAATTNrrATNTTAGGGTATTTTTNAAACACAGA>riTAAAA 
GGTTTNGCANGANCNATACCCAANANCCNGGriTrCC>Om'CNGCGCCCT 

SEQ ID NO: 595 ACTCCAGCAGCAGGAGAGCGGATTTACAACATCTCAGGGAATGGCAGCCCTC 
TTGCTGACAGCAAAGAGATCTTCCTCACTGTGCCAGTGGGCGGCGGAGAGAGCCTGCGATTATTG 
GCCAGTGACTTGCAGAGGCACAGCATTGCCCAGCTGGATCCAGAGGCCTTGGGAAACATTAAGAA 
GCTCTCXAACCGTCTCGCCCAAATCTGCAGCAGCATACGGACCCACAAATGAGACACCAAAGTTG 
ACAGGATGGACirrrAATGGGCACTTCTGGGACCCTGAAGAGACrrCTrCCCTTCAGGOT 
rrGAGTGTGAAGTTCCAGAGCAAGGAGCCATGTrCCTCTAAGGGAArrCAGGAATTCAGACGTGC 
TATTCCCACACCAGTTAGGTAGAGCTGTCTGTTCACCCTCCCATCCCAGCTGATCCCAGTCACTGC 
rrGCTGGGGCCATGCCATGGAAGCTTCCATCAGTCTCrAGCrGAATCCTCCTGCTCTCTGANCrc 
TGGCTmGCCTNCTGCACTAACATNCTCTTAACCTTGCCTGCCTTGCAT 

SEQ ID NO: 596 ACTGACACATTACGACTCTTGTTGTTGACAAACATTTAAGCAGGGCTATCCAA 
AGCTCACAGTCrrcAAGGCAGGGAGGTGTGTAAATTGTAGGTTmAATTAAGCAGTAAGCAAGC 
TGAGATTTTTCnTGGTAAAATAGTTTGTTTATTCATGTAGATCTAAAAGGCTG 
AGCACAGGAGAATGCCAGAAGAGGTCAAGGGCATGGGGATGTATCGACATGAATCCATCCCAAC 
TTGTGGAAAACAAACGACACAGCAGTTACCATCACAAAGAACTCTATTAGTAGGAATAGAAGACT 
TGAGGTCCAATTCAGTTCTCATTrGGArrCTATAGTATCrCTAAGAATTTGGTTAAAAAAACAAAA 
CAAACAAAACAAAAAAAGGGAAATCCTTACTCCTTTCATTAACAACTGOCACAAGGAOAAOAAAT 
AAAAGACAACTAATATAGATNGGCCTTCTGATTAAAAAATCTGTGAGCTTGAATACATTTTAATAT 
GTTGGAATCTAGAGTGGNNAACCAATGGGTTAATATACTACCAAAACrnTA 

SEQ ID NO: 597 ACTTTTCrin-ri'Cl^'CCl^TnTlTGGAAATTATTTTCCTGAGC CTm 
CGGTATATTGTAAACnrrrATGrrAAAGAAAAAATATACATITACAAATTGTGAGATTT^ 
AAATmCTACGATGTATACTGGCTTATTTTTTAATTTAAAACGGGGTTTCCGT^^ 
GGGGGTGCGCCGTTAGTCCCCTCGCTCCTGGCnTrGGGGGTTGGGACTTGGNGGTCCAGAAACTCT 
GGGAGCTTCTAGAAGAAATCTACTGAGNGTATrrCTGTOrrTTGTNAATTCOT 
ACCTGCNNGGNNGGTCTGANGTGAACTGNGGGGGNTGNGCACAGNCAGCCGAGNGGATCCCNCC 
CAGCGCTGANC(>TOCCNAGATGGAANGCCmCTNCCAAACCCNNGCCTNGNGGGGCGNGTCCG 
CCATTNACCACTCTTGCCACTTGnTN 

SEQ ID NO: 598 TTCGCCCTTTCGAGCGGCCGCCCGGGCAGGACGCGGGAACGACATrTTTTGT 
AACTTTACACTrrmGGrrATmATTrrAAAAAAATGAAAAATTAAm 
ACTGTTGGATTATTTATTTTANAAATTCCCCCCmGTGTTGGACTOCAAArrGAGm 
TAGGCCTTTCACAACTAGGACTGAGAATGTATGTAAAAGTrCTGTGACAGTACCTCGGCCGCGACC 
ACGCTAAGGGCG 

SEQ ID NO: 599 accaagtgagtgggaatacatattctagttaaagcatttgtgtctagctacac 

ACCGCTAACAAAGTTACTTAGTTATCAATGTAGGATTCTTAAGGAGCrrrAAGCTAAGGAAACCT^ 
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TTAGTGACTTAGCTTATTTTGTATCTTrrCACTTAGGAAGATTTTGGAG 

AGGATACCATCTGGCGGCnX3CACATTGTAACAGTAAAGGCAGAAAGCTGTAGTGATAACXrrCrCT 

CCTAAAAGAGTTAACTGGTCTCATCCAGCAGAAGCTATCTTAAATCTGTGATGTGTCAGGTGCAGC 

CAAATATCACACCTTCTGATCTTAGCCATCCCAAACCAGTATCTGTCCCGA GAGGAA ATTCCCCCC 

ACCCCCAGAAGTTTACAGAAAACTGCCTmCAAOTOTTTGCCTTArrCAGCTTT^ 

TTAAAGCAAGCACTGTAGCAAAAGCCCACTTCCACATGGCCCTGGCAGGGAGCACTGCTGCTCCA 

TTGCTCATTCTCNCTGTACCTCGGCCCGCGACCACCGCTAAGGGGCG 

SEQ ID NO: 600 ACAATATGCAATAATAGTGACATCTTTCAGAAAGAGATGTTGAGAATGGGAA 
TTCATATTCCTGAAAAAGATGCrrCCTGGGAATTAGAGGAAAACGCTTATCAAGAGCTTCTGCAGC 
ACTATGAGCGTTGTGATGTTCGAAGATGTCGrrGCAAAGAAGGGCGAGACTATAATGCACCTGAT 
AGCAAATGGGAAATAAAGCGCTCTCAGTGTTGTGGrrCCAGTGGCACACATTTAGCCTGCTCCTCA 
TTACGGTCATGGGAGCAAAATTGGGAGTGTTTGGAATGTAGGGGTATTATCTACAATT CAGG AGA 
GTTCCAAAAAGCCAAAAACATGTATTACCCAATTCTAATAATGTGGGGATTCAGATTGrriTGTO 
GAAGAGTCATCNCCTANATTACCCAGACAGNCNCCTGGATCCAANAGTAAAATCTNCrGNNGCNA 
GGCAGCAAATTTAGAAAAAAATGT^n^CNACANCT^mTATATTACN^ 

SEQ ID NO: 601 accagtgcgaatcatcgggctatccaggtccgagatcctagtctcctgtcgg 

CTCTGAGGAGGATGGATCCTTCTGCGGATACATGGGACCTCTTCTCACCTTTAATATCATTATGGA 
TAAACAGGTTTTACATTTATTTGGGCTTTGCTGrTAGCATTAGCCTTTOT 

CATCAAGACGCAGGGCAAGAACTTACAGGAAAAATCTGTTCCAAAAGCAGCrCAGGATTTGATGA 

CAAATGGTTATGTCTCCCTTCAAGAGAAAGACATCTTTGTGTCTGGAGTGAAGATTT^ 

CTCAGACnXjGAACAGCGAAGGGATTCGCAACAGTTCrrGCTGAAGCAGTTACATCCCTGGATCTG 

CCTGTGGCCATTATTAATCTAAAAGAATATGATCCAGATGATCATCTGATAGAAGAGGTGACTAGT 

AAAAATGTCTGTGTCTTCCTGGGTGCGACATACACTGACGGCCTACCAACTTGAAAGT 

SEQ ID NO: 602 ACCAAAACCTGCAACAGGCTCATGGAACAGAGCCTAGGGATCTAGGAGCAT 
AGGAGGTGGTGGTGCTGGGCAGGGCTCTGCATCCCCITTCCTCANCACAGCACCATCTTCACCCTC 
CTGGGAAAGCAGCATTGGAGCCTACACCGCTTGTGCTTTTCTCACCAGGGTAANAAATGCANGTA 
riTGCAGAGGGGAGTGAGTCIGGAAGGTGGCAGAGCACAGCTAGGGa^AGACTTANGGGAACTT 
GTGGGAAGAGTAACTGTGGAACCTACCTATGCTCTCTTGACCCCAAACTCCCCAAAACCCCrCACN 
TGAGGACTGTCTACCCCCGGGGCTCAAAATAAACTGCITACTGGAAGATGGGTGACTTAAAGGCA 
AAANGGAANGCTGNCCCCTGGGCTCCCCAATCCCCTGCTTGCAANANCTGGTTTGTGATNCTNNG 
AAAACCCCTGCATTTTNCCCirrCAGCCAANCTCCTCANAGNTTNANACANAAAGGGGNTGGAG^ 
NGANGGTCCTATTNTNACirrACCCCTCAANGCCX^nTmCACCC^^ 

SEQ ID NO: 603 ACTATATAAAAAGAAAAATATTACAAGACCITTrGAGGATCAGACATCACTG 
GAATTCTTTTCAAAGAAGTCAGATTGrrCTTrATTCATGTTTGGCTCCCATAATAA 
AATAATCTAGTAATAGGTCGNATGTATGACTACCATGTGCTGGATATGATTGAATTAGGTATTGAG 
AATIOTGTCTCrCTAAAAGACATTAAGAACAGTNAATGTCCTGAGGGAACAAAACCC^ 
ATTTGCTGGCGATGATTTCATGTAACAGAATNATNATNAGAAGACTAAAANGNCrnsn*^ 
TTCTTCANAGGCCNCACNGTATAAAATATCCNNCTGCCNGGNTTAGAGTATGTTCT 
GCNTTGNATGGGNGA1TCACTTCGATCT 

SEQ ID NO: 604 ACTGGAACAGGGATAAGTTCTTGGATAAGGTGCCAACATACCTATAAAAGCT 
GATTTTTGAGTAAATTATCGATTCTAACATATGTAATGGATTTGGTGTGATAATm 
ACTATAAGTGACTTTrTATTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTG 
ATTAACACAATGGAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTOT 
TCAAATAAGAGGAAGGAAAATAAACTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTG 
CAGTGGATCACGCCTGTAATCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGANGTCAGG 
AGTTCGAGACCANCCTGCCAACNATGGTGAAATCACGTCrrmCTAAAAATTCAAAAATTATCTG 

SEQ ID NO: 605 ACGCGGGGCTTTCACATTCGGGAAGCGTCGGGATTAGGTGAAAGAAGCTGAG 
CTGAACACATTACGATGGATGATGGAAACATAAGACTATCAAGAAATCCAAGTGGTAATGGGCGA 
AGTTTATTCAGCATCCGGCAATGGACTTATCGTAGTTGGGGAAACGGGTGTTCCGAATAATATCCT 
GGAAGTTATGAGGACACCrArmAAATATAGGCCTGAATTTTGTNAAGTAATATTTAAGGTM^ 
CGTGATAATTAAATAAAATGCTTAATTCATGTGACTAANAAAAAA 

SEQ ID NO: 606 ACGGAAGGATGCTGCAAGCTGACCCCAATAAAGTTTCTGCAAGGGCGAAGA 

aaagaggccttcctcagttggggaccctgggagcaggcaaccattatgcagaaatncaagttgtg 
gatgagattitcaatgagtatgctgctaaaaaaatgggcatcnancataanggacaggngtgtgt 
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GATGATCTCANGNGTAAGCAGATGCTTNGTCCA 

SEQ ID NO: 607 ACATTCTTCAAGCACAGGGGGCCATCAATAGTAAAAATCTGCCCTTCACCATT 
GTTTAAATCTATGAGACGTTCAAGGCATTCCAAGGAATCGGTGCrCrrGGAGCTCCTCCAAAT^ 
CATGCACATGATATATAACrrAAATGGGAAAGGAAAAGGTAGTGGAAACCTGTTAGCTCTCACTT 
CGTTGATTCTNAGTGGNTANGGAATGTCCAGNGAThnmTATNATTTGCCGANACATCTCTAT 

SEQ ID NO: 608 ACATGAGAGTCCTGCCITAATTTTGCTGTTTGCCCTCGGATCTCrGGGTTTGAT 
TTTTGCGTTGATTITAAACAGACATAAGTATCCCCTTAACCTGTACATGGTAAAGCATAAATCGCA 
CTTGAGATTCTGAAmTTTGGCTCCTCCATrrCTGGAAATTGAGACTCAA GCTT TATGAA 
AAGAACTTAAAAATGAAGAAGGTCACAGATTGATCTTITATAAGACCTTATTr TGATGCTT TGTG 
TTCAAGGAGATGATACCTGTCATCCATATAAGCAAACTTTTTGGCTTACAACTATTTT^ 
AGCCTTCTAGTCTGTAATGGAAATTGTATATTTTGATAGAAGTTTTTCTCCATTGGTTA^ 
TTACnTAAAATOTGGTTCTTTAGAAAATAAATGCAGGTTATAAATGTGTGTATAT^ 

SEQ ID NO: 609 ACrrGTTACCAACrrrGCTTTCAAACCArmAGGTCCACAAAAACTTCACTG 
AAGTGCTCCCGATCAAAATCTGATAGTTTCACAAGGTATTTCTCAAATCGAGGTAAGTTGAGGTGC 
CCACTTTCATTAATATAACCCCCAAGTTCTGGCNGGATGGTAACATATGTTCCATAAAGAAGAGGC 
AGTGCATCATGATTAATATGTAAATGAGGTANATGAGGGATTAAATCATT CCAACAAG AAACNCC 
NTCAAAATCCAATCATCTATTATCCTTTCAATATCATAmAAATGTGANClU-l'lCrrri'AATACTG 
AAAACTCTTAGTCAATATNCTCrCTCATTAAAGACAAGTGTAGAANGTGATATGTT 

SEQ ID NO: 610 ACGCGGGGACTACGATGGTGATGAGTTTCGAGTGGCCGTGGCAGTATCGCTT 

cccacccttctttacgttacaaccgaatgtggacactcggcagaagcagctggccgccrggtgctc 
gctggtcctgtccttctgccgcctgcacaaacagtccagcatgacggtgatggaagctcaggaga 
gcccgctcttcaacaacgtcaagctacagcgaaagcttcctgtggagtcgatccagattgtattag 
aggaactgaggaaaaaagggaacctcgagtggttggataagaacaagtccagcttcctgatcatg 

TGGCGGAGCCAGAAGAATGGGGGAAACTCATCTATCAGTGGTTTCAGGAGTGGCAGAACACTCCG 

tctttaccctgtatgaactgactaatggggaagacacagaggatgaggagttccacgggctggat 
gaaccactctactgcggctctgangcctacacangagcacaaggccngatatactgnagcntgcc 
gagcgtaagtttttacaggactgctccrracttttactccactttcagggtt 
nncccaaatggttttgatccaactaaagnctctc 

SEQ ID NO: 6 1 1 ACATCATTGGGAATGGAGGGAAATAAATGACTGGATGGTCGCTGCTTnTAA 
GTTTCAAATTGACATTCCAGACAAGCGGTGCCTGAGCCTGTGCCTGTCTTCAGATCTTCACAGCAC 
AGTTCCTGGGAAGGTGGAGCCACCAGCCTCTCCTTGAATAACTGGGAGATGAAACAGGAAGCTCT 
ATGACACACTTGATCGAATATGACAGACACCGAAAATCACGACTCAGCCCCCTCCAGCACCTCTA 
CCrGTTGCCCGCCGATCACAGCCGGAATGCAGCTGAAAGATTCCCTGGGGCCTGNTTCCAACCGC 
CACTGTGGACTCTGAGGCCTCTGCATTNCGGGTGGGCTGCCTGTGATATTTTGTCATGGGCTGGTC 
TGGTCGGTrrCCCATTTGTCTGGCCAGTCTCTATGTGTCTTATTCCTTTGGCTTCATTAAAACANA 

SEQ ID NO: 6 1 2 ACGCGGGGGAAGAGGTGAAAATTCCCCTGGTAAATATTTCACTCCTTCCAAA 
AGACGCCCAGTTGAGTCTCAATACCTTGGATTTGCAACTGGAACATGGAGACATCACTTTGAAAG 
GATACAATTTGTCCAAGTCAGCCTTGa-GAGATCATTTCTGATGAACTCACAGCATGCTAAAATAA 
AAAATCAAGCTATAATAACAGATGAAACAAATGACAGTTTGGTGGCTCCACAGGAA AAAC AGGTT 
CATAAAAGCATCTTGCCCAAACAGCTTAGGAGTGTCTGAAAGATrGCAGAGGTTGACTnTCTGCA 
GTGAGTGTAAAGTGAATGGCATGACCAGGTCAGAATCCACCCCTGGCTTGGAGACCCACAAGATT 
TANAGTGGAAACTCACACCCAAAAACCATAGGCGGAAATGTGACAAAGAAAANCCCCTCTCTGAT 
GTTCACTGGAAGCCAGACGAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCCGGGGGAACCAANC 
TANACCCCCAACCAACAACTCCTANAACAC^^•AAANNCCCCCTrITNCAAAATNGGNAA^^ 

SEQ ID NO: 613 ACTTCAAATCACATAGCTTAAAATATGGAGAAAAGACAGGTAAAAAAATTAT 
CTTAACCTGTAGTAGTCnTTTTTCTTTTTAAAATTTTrAT/^^ 
AATArrATGACACATTGGTGTCAACTTACACAAATGAAAGCTGATGCCAGCTTrTO 
GTTCATTTCACCAAACCmATATATTACATGAATATCCAAGTAAAGTATTTTTrm 
AACACACAGGAACATAATATACATTGTrrTTTriTTTAAAAA^ 

TCrrCACCACATAAGGKrCTTTTAAGACCATTGAACCNTTTGCATT GCAAGACT TTCAGTCATGG 

CTmCCCAGTTTTATTCTTAANGAAAAAAAAAAGAGAAAGGTTTATTTT^ 

GGNAAGGACTATT^m^^CAGCTNGCTTCCTTCTTGGTTCCX}ANT 

SEQ ID NO: 614 ACGCGGGGGACCTTTGTAGCACCTCAACATGAAAGGGCATTAGCTATGTTTC 
CTan'rTrACAGTGA'irACCAAACAGATCrrGCCACTTrGATTGTTAAAAATGAACCACATTCTA 
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CCCTGGTCTGGGACTTTGGAGGGAGATGAATITCTTGTTGGAAATGTAAATCTAGTGTCCATATTT 

AATACTCTCACAGCTTTGTGTTTATTCTCTTTGCTCATGGAA TAGC AGAACAA 

TAAGACATrTAGGAGAACCTGCTGTATCTAACCCAGTTGGATTTTCmCATGCTTAACACAGTAN 

TGAAAATAGAAGGTAGGCCGGGCACAGTGGCTCATGCCTGTAATCCCACACTTTGGGAGCTGAGC 

AGGTAGATCACCTGAGTTAGGAGTTCAGACCAGCTGCCAACATGGCGAACCCTOCTTACCCAAAT 

CAGAAAATTACTGGCGNTGTGGCAGCGCTGAATGCnOTCTTGGGACCC>mGCAAA^ 

NTGGAACAAGTT 

SEQ ID NO: 6 1 5 ACGCGGGGGTCACAGACCAAGCAAAGAGACGCAATCAGCAGCTGOTCTCAC 

ccttcctctgaacagtgaccaaaccntrcacctgatgagcaacctggctg ggga tgtratcacagc 

tgcagtgactgcagctatcaaagaccagttagagggtgtgcagcaagcacntctcaggctoccc 

ccatccx:agaagaggacacagacactgaagaaggtgatgactttgaactacttgaccagtcagag 

ctggatcaaattgagagtgaattgggacttacacaagaccaggaagcagaagcacagcaaaata 

agaagtcttcaggtttncnrcaaatctgcnxsggaggccattaatctaggatcanc^ 

cacanaaacaccaaaaaaaattcaaacagaaaaaaaaaaaaaaggaaaagaaaaamtgactg 

tactttatgatacrragattrgtttatttcctctgcanoaattaatgntrtaatcact 

ttgantmgncgtattttggtanaagcatgaangacttna 

seq id no: 6 1 6 acccaatgggcagggaagatcaggaagagatccatggga cataa ggaagtt 
aogttactgcgcatagctcccaggagattgtcccctccatttctcccacrcattcrmgc^ 
gagtgttccctctaagctgatccactaccaga^gctggaacactccccraaaaagctagcttggct 
tcctgggcagccagttcttnaggagcaaggcttgttaagttcttgaagctgncact^ 
cccccttancgcacanaggacaatcctcaccaacacgccaccanccactcnaatggaattcctat 

TCCCAAGGCTGACGCCACCrrcrri'll'CCACTrrGGGTGGTGAGGCTTGNAGTATCTGCCAATTACT 
TCAATCCCTCATCTATTAACTTCTOTCCTCATAAOTGGTGATGGAAATCAANNNACCAAGTTAGGA 
NGAANGACCXrrNCCCAAATAGAGCCnTGCAANll'CAT 

SEQ ID NO: 617 ACAGGTCTGGCATGGTGGCCACCACGTGCATCTCCTGAATGATGTC ATTTA GG 
TCCAGCTCGGATTCCATGAACrTCTCTGGATTGTCTGGAAACTTAATCCGCAATTCTTGGTT^ 
ATGATCTCrrrTCAAATGTGAGGATCATTITCTTCACTGAGC TTTCA TCCA^ 
CTCTTCCTCTTCCCCATCTCTGTCAATAATCTGCANCAGCCTTTTm 

CCACAGTCATTTCTTCITCCCGATAGCGGCGNGTTCTCGAGTACCrGCCGGGCGGCGCTCGAA^ 
GCG 

SBQ ID NO: 6 1 8 ACTGCGTITGGGCCTCAAAAGGACATCCTTGAAGTCCAGTTTCACATCGTTGT 
CAATATGAGGCATGGCGCITANCCTCGGGGTAGCGATGAATNTGAGGGCTCGNGCAATCCCGCTA 
AGCCTTCTGGGCCACGCCAACAGGTTCCTCTNGTTGCCGGCAAGAACTACTAGCCGACGAATTCC 
AGCTTNGGCGGCGGNAGGGCGAGGCCTTmTGGATAACATAGTTCACGCACTNNCCGANACTNTG 
CTTTGTACNACTGTAAAGGGCA 

SEQ ID NO: 6 1 9 ACTAGTAACAGGCAATTAACAAACTAATAAGAAAATCAGCATTTT AACA ATT 
TAAACGCTTCATGACAGGGTAATTCATGTCCAACATATCAAAAACATATTTATAGA TAAC TT^ 
AAGAAAATACATAClUlU"illlGATAATCACAAGTAGCAATGAGATTTrCTATATTArmCAGTCT 
CACTTTAGAAATGTTTTAATTGTCTAAATTTAATCAArrCATCGATTAAAGGA/^ 
AGTAAAATTACATGTGTTTATATATAAGTGTGTGTGTTTCAAATAACAAAACGCAGGTTGTAAACT 
AAAATCACTGGAAGGCAAATTGAAGACAAAAGTGATGCTGGTTTAAGTTGTTTGGTCTT C^ 
AATTCCAGTTCCAAGTTGTmTrCAACTCTAGTAATATCCAGATAGATAATTCACCTGCACllU"^ 
TTCTATCCrGTTTCTCTTGTa:ACTGAATITGCrmACTGAACTGTAA 

SEQ ED NO: 620 ACTrri'rrril'l^-l-lU-l'i'riUU'l-l'GGTCCTTANACTGGTTAATTTCAATTG C^ 
ATCmCAGTTTGCrGATTCTTTCTTCTGTCTGCTCAAATCTGCCATTGC^ 
TCATTTCAGCTGTTGTAATTTTCAGCTCCAGAATTTCGAmGATTOT 
ACGTTCCCTATTCArrCATACATTATTCTCCTGTTTTCCATCAGACTTTTTC^ 
ACTGACTATATTTAAGAGAGTTAATTTGACATCTTNGGCTAGTAATTCCAGTGCTGGGCT TCCT CA 
GGAGTGGTTTCTGTCAACTNTTTTTATCTTGTGAAAGACCATACTTITATG 
ArrCTITGTAAGACTGGATATTITGCGTTrGAATGTGGAACTTTGGAATCAATT^ 
GANGTGATTTTGTTTNGAGGCTGCTGCTGCC 

SEQ ID NO: 621 ACAGAACTCAGAGGAAAAAAGAAATTAAATTTTAGCTTTCTGGAGAGCAGCC 

cctctctggcaccatcaaacacttcrrrgtttcccttcaacttggaactc^ 

gtgagggtttggccattcttttatcttgggtccatgtgagtgacagaaatggtgcggcctgggaaa 

gatctccctcctttacattttctcttctccctcctcctccttattctaaaaact 
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GGGGCAGGGGCTCnTGTAGAGAGATCCCTGGCCCANGACAGGAGATGCCAAATCrAATTTATCTC 
ACTGAGGGCCTTTGAGAAAAACGCTTCAGGGCCAGGCTCAGTGGCTCATGCCTATATAATCCCAN 

SEQ ID NO: 622 ACGCGGGGGCCAACTGAATGGCGTCTGTCCTGTCATCCATATGGTGCCTGGA 
AATATTrACCAGTCAAGGTCAAGGTCAGCATCTGTGGTTAAAAATATAGCAT TCTGAC CTAAAAA 
AGTTArmGCAGATGAATGTGrmCAACrCAGGACCTATCCAAATGAGG^ 
TTTTTTTTCCTATTTITAGACATCAATTCTATAGATrCTGAC^^ 

AAATGCTGGCAAAAAGANGTGCriTITGGATATGGCAGCAOTGTAAAAATAAAGCAGTO^ 

AATCCTITTAAACACAGAAATCXTrGAGTTCrTCrC^nTGGTGGCT^ 

ATCCTTTGAAAGAGAATTAN 

SEQ ID NO: 623 ACGCGGGCTTTTACGTCGGCCTTCGCGAGCGTCTGGGCGGGTGGTAGGAACA 
ATGGCGCTGTCTTAAGTGACACAGTGGAGCANCTCTGAAGATGCAAAGATAC ACGAA AAAACTTC 
CNGAACAT^^'GGGAGAATATTTAATGGAAAATCGCnTGGTTAAAACCTGACACTm/^ 
ACAGCGTTCTGAGTGTGGACGAGTAGCCANTGAAGATAATGAATGTCGAATGTGACTGACTAGCA 
ACTTCATmGAATGAGGGTCNCTNGTCTGCCCATTTGATAGAGGCCAGATGNTTGA^ 
NTTGCANCTATTNTTGCTAGTGCCANAAGGTTANTTGATGTGGGGGAAAGCTGTTAAGAAANCCN 
TCNANAAAAANNTCTTTmTTACAACATGAAANAAAAA 

SEQ ID NO: 624 AClTlin- riUll 'rri^-lll^U'l'rr riNGAATTAAAAAATCCATTTTATTG CTTGG 
GTTTAAAATAGTTGGGGGATACAAGTATrrACAATGCTATTGGAGTCAATTArrGGCAACACT^ 
CAACAGTAATACCATTTCTAGCrmCAArrGGCAATAOTAAAACOT 
TAAATACCATATrATATTTACrAAGTTAANAGCTAGTTTTTACTCTCTTCCATAAm 
AATGTAANATGATGGCTCAAAAATGACGACTNATAGTTTGAATTTATGTGTATGCAATATACATAT 
GAGAACCAAATTCAACAAGTGCATGATGGTACTCATGAACATTGATTGTATGGCCTGNC AGTT ATT 
CCTTGGTCAATAANACrGAAGGGNCAACCCTTTTCTTTCAAGAGTTGGCCTTTCT 
ATTAATTGGATArmCCTC^ITGCCTCTCATATGATTAGNGGGAGGTTCATCCACAAACAAANACA 
GGAAANTTTTGCAACCTTNCrGGAmTNCCTAGGGGACCAAAGGGATTNA^ 
NNAAAAAAAAGCTCrmTTAAGGAAAAACNA 

SEQ ID NO: 625 ACCAAATGGAATGAATAGGGGAGAACATGCATTAGTTCTGTTTGAAAAGTGT 
GTGCAAGATAAATATTTGCAGCAGGAACATATCATAAAAAAGTTAATTAAAGAAAATAAGAAGC 
ATCAGGAGCTCrrCGTAGACATTTGTTCAGAAAAAGACAATTTAAGAGAAGAACTAAAGAAAAGA 
ACAGAAACTGAGAAGCAGCNTATGAACACAATTAAACAGTTAGAATCAAGAATAGAAGAACrrA 
ATAAAGAAGTTAAAGCTTCAGAGATAAACTAATAGCTCANACGTTACANTAAAAATGCCAGTTCA 
GCAGTTACACAANAGATGGCCCCACCGGATGGAACAGGCCAACAAGAATGTGAAGAGGCCGCCA 
AGAAAAAGAAGCATGGTATGAAATATGTAAGAGGGAGAAGGATCmAGATCTTCGAAGGAAAA 
AGAGCACTrGAGAAAAACCrrAGAGTGCAATAGGACTTGAGAAAACACTACAATTAAC ACTT CTC 
AGAGAANGCGGTGCCCACTGTTGAACTAGGAGGCAACCGCTAG>nrCTCAGAGAATGANTTTTAGG 
GNNNTTCTCNCGNCTTAANGAAGGGCCCAACATTAAGCNGGTGNTCCCNGGACCAAATACNCTGA 
ACCCC 

SEQ ID NO: 626 ACAATGATTCTTAAAAAATCTITGGCCTTAGTGGCCriU'lUUUl'i'CACTTACAC 
ATTAAAAATGCTGCTGCAGTAACCAGTGTTTGGGAAAGGACATCAGTCTTCAAGAACCATAAACT 
GACAGAATTTCAATACAGTAGGTTTCCAAATTGCAATTTGTAGTGCACATGACAGTAAGCGAGGTT 
TTGGGTAAATATAGATGAGGATGCCTATTCAGACAATCTACITCAAGTAAAAAAAAAAAAAAAAA 
AAAAAAATTCACAGATACCCATCA^^TCTACTTTAGGNT^^'AACAGTGCTTAATCT 
ATGCriTCACAAAATNTAAGTTACTGGGGTGATAATTAAAAACCANGTGGTAATAACAATATCT^ 
AATCCAGGCCATAGCTGAAT-AATAACCAGTCCTCGGNCGGACCCGCTAAGGGCG 

SEQ ID NO: 627 ACAGAAATAAAATAATGGGAATTATCATTAACTTCACCCTGGTTTTCTAGCTT 
AGTAGAACCAAACAGAAGAAATCATGGCAATAACCATTAAC TATAGAAAA AAGGTAATGGAAAA 
ATGGTTGCAGGTTTAATCACAAAATGAACTTAATTTTTGTTGATTTTGTm 
AATATCTATAAATATGAACTGACAGCATCGrrCTAAATTTACTTCTGAAGAGCTGTCGAGA 
ATAAAANATAAGCAAGTTACTGGATCATATTTATGGGCTGCTGAATTAACTACCCGAAAAGTATC 
AGTTCTTTCAAAGAACACAAACAAAGTGAACGTGGAAAAAAGCCTCTTTGCAAAGTCCm 
GTCCTATCCTCTAAAATTCCAGCCCAGANCrrGATATTCCTGGATTCTGTTTAAGT ACCT^ 
AATATGACACTGGGATTGCACATGGGAAAGGNAGGArrGTGACCAAAATTATTTCTTTTCCAAGN 
AGCATTTCTTAATCTCTATCACmGCCCmCCTGTCCTGGCGGACCCTANGNGATCAC 
GTCTGNGNCCACNGGNNACTGCGATATGCTCNGTCCG 
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SEQ ID NO: 628 ACGTTCCCATTATATTCrTGCTGTCAATCAATCACAAATrTATATCAGATTAG 
GATAAACTAAGCCATTTTATOTATTTTATTTTAAACCTTATTTTGGCAGAGTAATTC 
GAAAAGCrGTTACTTTGAAATTACCAATTTAlTACAAAACATAGAAATGTATTGTAGCTA^ 
CAACCAAGCATTTTCTGTOTTTTAATOAATATCTAAAAAACTACATTTAGTT^ 
TGAATGATTTTTTACTGGCTCTATTGCmANAAAACTAANAGATTAATGATTC^ 
CTTTTCTTGATCTTATTTTACATTTCGCAGAG>n'ATATTATAGTTTAGTAACAATGG<^ 
C^TGGATAACNGAANCAACTAANGTGTTGGGNATTAGAATATAmNGTGAGCAGATGGACACTGG 
GNATATG 

SEQ ED NO: 629 ACATGACCTTTAGTGAAGATTAlTrGTCATCAAATTACCCATATCCAAGTTTC 
CATGGGCCTGGAATTTCCTTTCCACTTGATAGAAGTATATATTAGGAAGTCCAGTTAATAGTATTT 
rrArrTAAAAAAAAAAAAGGAAAAAAGAATCAGCAGAGTCAAGTTGTCrrAGTCTTAAGGCTT^ 
TGGATTTCnTCCTTGGAGGAGGTCAGQATCTrCCCAAQGCCTGGOTCCTCGAATATTC TTCCAG TCT 
CAAACTTGGAGCTTTTGATTTITCATATTCCGACTCTAAAGATTTT^ 

CTCANGACCATTTTACTCITCACAGCATCATATCGGNTTTGAGAAACTCCGAAGACCAAAAG^ 

CACAATCAGCANCAACATGGGGGACTCCTAGCCGAGAGCrTGTCTTGCNAAAGCACCATCACCGG 

GGTGCAACATGAGTGACTCrCCTCGCTANACTCCACGGCCTAGCCANACTCCAANCTACAGCTCN 

GCTCTNCAAACGATACCTGCCGGCGGCGTCGAAAGNGAATTACCACGNGGCGTNTATGGTNCCCr 

GTCAACTGCG 

SEQ ID NO: 630 ACAGAGTTAACAAGTTTTGAGTTTTTTATATAGGAAAAGCCTAGTCAATTCAG 
ATGCirrCTAGAAAAATTAACATTAAAAAACAAATAGAAATCCATGACTAAAGGGGGAA^ 
TTrCAAAAGTTACCAAAATTCGAATCATATCAGGGACCATTATAAATITCAAACAGTAGATTTACC 
ACACATATTGCATTTTCAAATTCTAATGTAGCAAAACGTAACCAC ATAA TrrGOCT ACAGCT AATC 
GTTTCAGAAAAGNTTAAAAAATTAGCAAAGTTATATCTATAAAACTrrrGGAGll'J'ClU"JU4GC^ 
GAAAAANGCTTAAATCTTTAATAAAGGAAACAAACAATCCTCITAAATTOT 
AGACATATATTACAAATCTGTGTAAGCTTTCTTTCCTGAGAGACTTCCAGGATCCTTATCCAAAGG 
ATACCTTAAAGAGTCTTCATCATTTCTCATGTGATATGATTAAACrCTATAAAGGGATGGGNATAT 
GCATC^TATCTGCC^TCCCATTGTTCTTCTGAAGNATT^m'AAGAAAAAAACI^ 
NTTACCrCAAATGNTGCTTGGGANANTGTGAAANAAAAAAAAGAAG 

SEQ ID NO: 63 1 ACTAAAGGAGATAAGTCTAAGTTCTCCCATGACTTGACTCTGGAGAGAAAAT 
GTGAAAAGCGAAGTGTTTACATTGATGCAAGAGATGAAGAACITGAAAAAGATACTATGGATAAT 
TGGGATGAGAAAAAGCTGGAAGAAGTAGTGAACAAGAAGCACGGTGAGGCGGAAAAGAAAAAA 
CCAAAAACrCAAATAGTGTGCAAGCATTTCCTGGAAGCTATTGAAAACAACAAGTATGGCTGGTT 
TTGGGTATGCCCTGGAGGGGGTGATATTTGCATGTATCGTCATGCACITCCTCCTGGATTTGTGTTA 
AAAAAAGATAAAAAGAAAGAAGAGAAGAAGATGAAATTTCArrAGAAGATCTAATTGAGAGAGA 
GCGTTCTGCCTAGTCCAAATGTTACCAAATCCTCTAGATCTTTTCTTGCCTGGAAGAAAAGGAAAG 
ACAAGAAAGATTGATAACTTGACAAGATATGGAAGAAGGAAGCTGCTTCAAAGCAGGGAAGCCT 
ATGACAGGGTCGGAGTGTTGATTCTCCTGACTGNCATGTGATATNAGAACAATGTCCCCTCNCCNG 
ACAGGGNGTGAGGTGTATCNGGTGAATGANTAATTTACCX}CCTGCGGACCCTAGGNGATCA 

SEQ ID NO: 632 ACGCGGGGATTTAATrmTC'l'J'l'i'J'i'i'i AAGTGGGGAGGAAGGGGAAGCTAG 
ATGGACTAGGAGAOACTTGATTTTGGTGCTAAAGTTCCCCAOTTCATATGTGACATCTTm 
AAAATAACAACAAAAAAAAATGAGAGAAAAGCTAAAAAAAAAGTAAGGGGNGAGCAGT TAATG 
GTATTCATTCCACATACAATATCTGNGTAAAACNATmCTGGTAAAAGTANCTTCNATGGT^^ 
GCTTTTATAATACCGGTANGTCTATTC^^'AAANCCTCTCGNCNATGCTTNCTTTC^^^ 
CTTTATATAACTTNAGA 

SEQ ID NO: 633 ACCTTGGGTCTCAAGGGGTCACTCAAGCGCTCTGCTATCTCTTCAGCTAAAAC 
GGGTGTCAGGTTTTCAGCTGCTACTAAAGATAATGAGCATAAGCGTCACTGACCAANACTCCAGC 
CAGAAAGCTGCACATGTGACCGTGTCTGGGGGCACCCCAAAAGGCGAANGCTGTGCTTGGGACAC 
ACAANTTA>n^GACCATCACGGGGAATTCTGCTGCTGrrATTACCCCArrCAAmTNACAACTGAGG 
CANCGCNTANTCCANTCTCCAATAATAAACCANTGTTTGATCTTAAAGNATGTGNGTNfTCGT^ 
TCAACTATGACAANACAAANGAAANNTTANACCATGGGGGGCAATCCAAATGAAAANAATTATC 
NANATNA • 

SEQ ID NO: 634 ACrrnTTTTTTTTTTTT^^ 

ATAAKTrmACTATTTTTTACATAAGATAGCAACCACAaAANTrrA^ 

GGNTAAGGAGGACCCANTCCrrGTGGGCTGTTTTCTCAGAGGATAAAAAGCCAAAGTTCACCAGGG 
AAAGGGGTTAAAGACTGCCCAAmAAGTANAGGGGAANAAAAGCTAAACTGCNGGNTTTCANA 
TAAAGATAACCGATTTTAGGCCTTCANTTTNTCAAAGACCACTCACAAAGACAGTCCCCCTANTNC 
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TTAANGGGGATTCnT^TCCm'GNTGGGGAAATCCCTTCAAANANNNNC 111 Crj'1'AGGNTTTGCCCA 

ANNAATCCAANAANCirGATNGGATTATKTTTTT^^ 

GCCACNNCCNAAATTTG^^^TTCGTTGACTN^ITCCCAAaSIAAAAAAAACA^ 

SEQ ID NO: 635 ACGGCAGGGAAGAACTGGAAACTCAGAGAAAGAAACTGCCCTTCCATCTAC 
AAAAGCTGAGTTTACrrCTCCTCCTTCTTTGTTCAAGACTGGGClTCCACCGAGCAGGAGAT^ 
TGGGGCAATTGATGTTATCGGTCAGACTATAACTATCAGCCGAGTAGAAGGCAGGCGACGGGCAA 
ATGAGAACAGCAACATACAGGTCCTITCTGAAAGATCTGCrrACTGAAGTAGACAACAATTTTAGC 
AAACCACCTCCGCTTrTCCCTCCAGGAGCTCCTCCCACTCACCTTCCACCTCCTCCATrrcrrC 
CTCCTCCGACTGTCAGCACTGCTCCACXrrCTGATTCCACCACCCGGGTTTTNCTCCTNCACCAAGG 
CGCTCCACCTCCATTCTCTTATACCAACAATAGAAAAGTGGACATTCCTCTGGTTATTGAATAGNC 
CGTTCTGCACGTGCATTTCATATGGCAAAGTTGCCTTTCCCCATOTCCTGGGTCTGCTCCTTC^ 
GGCCTAATTCTTGGNGGACACCM^GCCAAGCCNKrGGGGANCrrATTrATNGa^JTA^ 

SEQ ID NO: 636 ACGCGGGGGGGTTTCCTGGGCTACTACGATGGCGATGAGnTCGAGTGGCCG 
TGGCAGTATCGCTTCCCACCCTTCTTTACGTTACAACCGAATGTGGACACTCGGCAGAAGCAGCTG 
GCCGCCTGGTGCTCGCTGGTCCTGTCCTTCTGCCGCCTGCACAAACAGTCCAGCATOACGGTGATG 
GAAGCTCAGGAGAGCCCGCTCTTCAACAACGTCAAGCTACAGCGAAAGCTTCCTGTGGAGTCGAT 
CCAGATTGTTTTAGAGGAACTGAGGAAGAAAGGGAACCTCGAGTGGTTGGATAAGAGCAAGTCCA 
QCTTCCTGATCATGTGGCGGAGGCCAGAAGAATGGGGGAAACTCATCTATCAGTGGGTTTCCAGG 
AGTGGCCAGAACAACrCCGTCTTTACCCTGTATGAACTGACTAATGGGGAAGACACAGAGGATGA 
GGAGTTCCACGGGCTGGATGAAACCACTCTACTGCGGGCTCTGCAGGCCCTACAGCAGGAGCACA 
AGGCCGAGATCATCACTGTCAGCGAATGGGCCGAAGGCGTTCAAATTCTT 

SEQ ID NO: 637 ACTTTTTTTTTTTTTTTl^^ 

CTAATACAATTGCrmAAAATGTAGCAAAGAGTCATTTACTACTCTCANAAGTGGCACATACATG 

GCATANAAAACAATCTATAGTCAGTTAACTATTAAAACAGAAACTTGAAATTTAAGTGACAAACA 

TTTGTAGCACTCCCTAAAGAAATAGGAAATAAAAATGCATTTATCCATATGAACTTGATTATTCT^ 

AATTACTGACTATAAAAAGGCTATTGNGAAAGATATCACACTTTGAAACAGCAAATGAATTTTCA 

AriTTACATTTAATTATAAGACCACAATAAAAAGTTGAACATGCGCATATCTATGCATTTCACAGA 

AGATTAGTAAAACrGATGGCAACTTCAGAATTTATTTCATGANGGGTACATTTTGATAGTATT^ 

TAGGC li 1 lUU CCAGGTCAAATTAATrrAGrrGCTTGCNAAATATAAAATCAAGCTTGCTCCAGTTC 

CACAAGGACTCCNCCACAGTCTTTAGGATGGGANAAAAAATCACTNG 

SEQ ID NO: 638 ACNTNnNfTTITrTTTTTTTT^^ 

GCTAAAATATATmOTACNCAGGGGCAGAGCTTCCAACTTITTTAACAGOT 

AAATTAAATTTNTTCAAAAACCCCNAANATGCATANATTAAAAAGCAAGCTGCCTm 

TAAAATAATATTCAATGAAACTCTTAACGTTNTACNCCAATGGNGCAATGGANAATATGGAGGAC 

AGCAAGTAAACNCAGTGAGCAACTNTTATTCTTACAAGGCAGTAGGTAAAGTATAATGTGAAGGA 

CAGNCTAAGATAATCTTTCTGGTTATAAAAAATGGGTCGGTTTTGTGATAAGTGCCAANACTGT^^ 

CTGTTTAAGGATTGATGATAATGTATTACCAGAATACCTTTCITATCCCTGGGAGAAAATATTT^^ 

CACTGATGQAATGACATATCACCAAGGGAGAAAAAAACCTGGATACTGNCAGTAACTAANATACC 

TATTCTGACTTTAAANAAAAATTAAGGACTTCATGGGAGAAAAT 

SEQ ID NO: 639 ACAGAAGGTAGATTTAGAGGTGAGGGGCAATAAGTAAATGACTCGCACAGA 
TGGTGTTCATGCTCAATGTTTCTCTTCTGAACAATTTTCAGATTTGGTGTTAAAGCACCTTCT 
AGGCGTGGTGGTTTATGCCTGTAATCCCATCACTTCGAGAGGCGGAGTCGGGTGGGGA'ITGCTTGA 
GGCCAGGAGTTCAAGACrACCCTAGGCAACATAGTGAGATGCTCGTCTCTArrAAAAAAAAAAAG 
AAAAAAAAAGGGGACCCAAGGCCATATATOTjTTTCATTCCTAGAAACTCTCCACAGTGTCAGAT 

gaagactgcaaatccagtggtcctacattcacagataattcctagcctttggttagtgaggagaat 
gggatgggaggacacaaaatgagaacttttitgtggtatgctcaattcgtatctggcagtacctcg 
gcccgcgacccccctaanggcg 

seq id no: 640 acgcggggacctacctgggataacggcggcgagcggacggctgcatttacg 
gggtctcccggagggccagagtcgtggcttacagaagagacgaaatgtggtctgagggacgatat 
gaatatgaaagaattccgagagaacgagcacctcctcgaagtcatcccagtgatggctacaatag 
actagttaatattgtgccaaagaaaccaccactgctanacagacctggtgaaggaagctacaata 
gatattacagtcatgttgattaccgagactatgacgagggccgcagtrntctcatgatccnagaa 
gtggccacctcacagaggagatgaatctggttatagatggacaanagacgatcattctgcaagca 
ggcaacctgaatacagggacatgagagatggctttanaaaaaaaagtttctactcttccattatg 
cgagagagcggctcttataaaagggacaatacrrrittcagagaatcacctgitgnccgaaagga 
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TCTCCCACAGCAGACTGGNTCCrrGTCAGTAGCAGANCTCTCTCCAGAAGGGCAGATCATCTCT 
SEQ ID NO: 64 1 GGAC nT l'l TrriTi - l ' m i'l 1 r i CAACTTAATTTACTnTTATTAAAACCAACC 

tgctgaaaagaatacaaattttgactaggagganaaagggggagnggatttatgattttatggca 

attataacaggagtgtttcaatggagacmaggcaactagaactcancatgaaaccanaaggta 

naagctoataaaggcaggggtaaaaaggagaaaatgagttcaaaacaactaactttcggccagg 

tgtggtggcrcacacctgtaatcccagcaatttgtgaggccgaggtgggccggntcacaaggtca 

gganatggagaccatcctcctaacgtggngaaaccccatctctantaaatacaaaaaattngccg 

ggtgtgacagcl^tgtgcctgtngtccctcractcaggatgctgaggcaggagaatggtgtgaacc 

cgggangcagacttgcattgagcaaanatcggcccntgcctccaacctgggcgnacagacgaga 

nttttttnaaaacaaaacaaaacactagagggcattttgcatgcaaa™ 

seq id no: 642 acttgagtgcrgctgagcrrcagcccacrgagagtttacctctggagttcagt 
gatgacttggatgttgtgggtgatggtatgcagtgtcctccttcacctttgctttrrgatcot 
taacccttgaagatcatttagtcaaagaaattgctgaagacccagtggatattttgggccagatgc 
agatggctggagatgggtgcagatcccagggatctcgaaattctgagaaagcctctgcaccattg 
tcrcagagtggattggctactgcaaatgggaagccagaacccacttctattagttgatagtn'ggg 

GAACCATTTTACrrTGGTGGATTTAAATTTCrCGTTCrrCAAAGAAGTAT^^ 
AAAACACAGGCAAC<XAGACTACCTTGTTTITOTCTGGATTGAGTATTATAGTCAAACGTAAAGC 
CAAATTTTGNGGACGTGGAC(X)TGGGTTCTTCCCACTCAGGGCAAGGAAGAGAAAGANAGATATC 
CCTATCAAAGCTCTGGGATTATTTGGGACTATACCTAAAGGAAGTGACATCT 

SEQ ID NO: 643 AC 1UUU4 in U'rJ'l"lU'144U"i'l'lUATTNCA(nTATTATTTAT^ 
CTTCCAATTTCCTCTTGCCAGACTCCCATCCAAAGAGTCATAGCAGCCTTmrCCACCTTNTACATG 
AAATACATCCCCACCTGAACAAAGGCNCACGACAGGAGGAGGGGAATAGGACTTNGCAAACTGG 
ACACGGNATCGTTCAAATCTGGACTANGTTCCGTTGTTACTGGTTTCACAGTTACAGGCTTCGGAT 
GGTCTGC^^CGTGCTGTTTCAANACTAATGGNAGACTCTATTGCTTCTGTTATGTCCTTATNCAACCT 
GGTCAGCCTGCCTCTGCTCAAATATNGNGTAATCAANTGGNGAAATCTGCCTAAANNCATCATAA 
CTGGGGGNGACTGTTAATAATANA(XACCTGAAATATTCATCCTCTCCAGNCnTrmTCAT CCTC A 
TATTCrrGNCCAAGATAAGTGGCACAGCAAAAATGGNTTCAAAGAGGAATCCATTCTGGATnTG 
CCTTTTNTGGCCCCNCGTCCTNNGCCGGACACNCTAAGGOCG 

SEQ ID NO: 644 ACTCTAATITCACTAACTGCCAAAAGGTTTTCCAGAATAATCTCAGTTGCTTC 

attcctttaaagatgaagcxcaaagaacgcatggcgattacmagaggaacaattagcagcaga 
ggcagggctgtgctgatcccatctggcatcgctgggagctaacattaaaoacatg gcact ttggg 

TCCGGGTCCAGGTCCTGGTTCAGAGCAGCTGCCACACCGTGGCTACTAGAGGATCCrnTCCGGCT 

ttggaaactgaggctgactgcaccatcatcactaaaggcctgagactgctcgccgtgctcaacac 
cgactggagtggccatgtcttccagccacgccggccgacctcggccgcgaccacgctaagggcg 

SEQ ID NO: 645 ACCCrAACCTGACAGGAATTAACTACrGriTrnTGTGGGGCAGAAAGCAAA 
ACCTGGTGTTGTGACnTITATCCTAATGGrrcTTAGGCAAGGr^^ 
NATGCATGCATTGTGCATTATTrrGTAGACAAGCTACTTTTTCITCTGNCCCm 
GCAATTACCCTCCCTITGGGGTCTAGAGTGAAAGCTAATTTGTGGGTAGATGAGATTGCANAAGA 
ATGGATGTCCATGGCTGTGAACACTGCACACTGAACATTCATCrCCAGTGCTCACACTGTGCAGCT 
ACCACTCCCTGGCTGCGTGCCATGCTGTCGGGTNCAGATTTGCACACATAAA TTCCT CAGGANGAG 
TTTGCATGAGCATCCCTCGCAATATTCTGTACCGCAGGGGAAAGATGAAAAATm 
ATATAGCAAGGACTNCCCCTATCCrrCTGCATAATGAATTAACTAGAAATAACTrrGCAGGAGAAC 
NAAGCTAAGACCCCCGAACCANACGAGCTCCTAANACAG 

SEQ ID NO: 646 CTTTTGTTNGCGGCCGAGGTACACAAGTAAAATACTACAGAAATTAATTTCIT 
TCAGCATTGAAGTGTTTGCTTTCCTCTITATTTCAACTAAGTTGTAATAATTTCTG^^ 
ArrGATAACTAAGATAATATCTACAGAGNGGATGCCATTAArrCTCTTAGCAATCACGTGCAGAA 
GGGAAGGGTTTGGGCCAACCTAGGGNTGTGCTGNCTCTGNCGCTAGTCTAGGAAAGCACCCAGGN 
GTCAGGNArmGGTCCCTTCCACCACACCTCAGCAAATGTAAAACAAGCCCACAGTTAATGGCCT 
GTGGCAACCCTGCCXrrGGAGAATCTNAmAANAGCATTraAAATGAGGAAANCAANArrCACA^ 
TAAATATCTNGCrrTGGCACACAGTGGCrmATTGATTAATAAGCAAAT 

SEQ ID NO: 647 ACCACrCTA(XCTATACn:CAGGACnTCATCTTTCTTACTG AAAA ACCCAAAC 
CATCAAATGTCCCCrrGTTCTACCCAAGAATTGTTCTGCAACATACGCCACCTACTTTCITGCm 
TATGAGGCTGAAGACTANACAGGCAGGAGmATTATACAGACCTGCCATATACTTGAAGTAGTG 
TGTrAAGTCAACCAOAGGCCAAGTTCTTGAATGCTTrCTCTGTTTGGATTATTTCCACGCT 
TGTCTGCCTGGACAGGTCCTGTCTGCACATTCTCAAGGNTGAGAAACCCCAACTAGGGNAACTGC 
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CCATCAGAAGGNGAGTTCACTGGGTAATGGGAGCAACTGCTCATTGTGAAGGTGACATGATGTTA 
AAAGOCrrGAAAGGGGCTGCATGGTGGCTAACGCCTGTAATCCCAGCACTTTGOGATGCTGAGCA 
GACGGATCACCTGAGTCACGAGTTCGAACTAGCCTGCCAACAAGGAAAACTCCGCTNTCTAAAAT 
CAAAAAAAAAAAAAAAAAAAAAATTAGTACCTNGGCCGCGACACNCTAAGGNG 

SEQ ID NO: 648 ACTGGATTCTATAOGTCCACCATTAAAAGCTGGTATGGGACACCAATTTATAC 
ACATAAGGGTTAGATTAAAATTTTAATTITTTGGTTGATATTAAACTGAAAT^ 
TCTGAATCTTAAAAAAAGTAAATGATAAAAATTTAATATAGTTAACTGTTCACTGATATGTCT^ 
CACTTCATCATAACCTATATATTTAATTAAAAATCAAATTATGAGTCTGCAAATCAGATGCTATCA 
AGCAAATTGCCATCCAGGGTCCATAATTCTTTTTATATTTTrATCTCAGATGAATAT^^ 
GTAAATTTTAATGTTCCAAATTGTTCTAAAAAAAAAAATTATCAAAAGCn'CCAGT^ 
CTAATTCATTTGCTCCCAACGAACTACCTGTTTGTGTTGTGAGGTAGCATCAAAGACTATTGATCTT 
CTGTGACAGTAGTAGCCTTAATTCATACGCATTCCCTCTTCATAGGAAGAGTATGGACAACCA/^ 
AAGGGACNNATGANGTCNCCCTTTCATTAATCATTTGCCTCCNGG 

SEQ ID NO: 649 ACCGGAAGAAGCAGCTGGCAAAGCAGCTCCCTGCACATGACCAGGACCCTTC 
AAAGTGCCATGAGTTGTCTCCCAGAGAGGTGAAGGAGATGGAGCAGrrTGTGAAGAAATATAAGA 
GCGAAGCTCTGGGAGTAGGAGATGTCAAACrrCCCTGTGAGATGGATGCCCAAGGCCCCAAACAA 
ATGAACATTCCTGGAGGGGATAGAAGCACCCCAGCAGCAGTGGGGGCCATGGAGGACAAATCTG 
CTGAGCACAAAAGAACTCAATATTCCTGCTATTGCTGTAAACTGAGTATGAAAGAAGGTGACCCA 
GCCATCTATGCCGAAAGGGCTGGCTATGATAAACTGTGGCACCCAGCTTGTTTTGTCTGCAGCACC 
TGCCATGAACTCCTGGTrGACATGAmATTTTTGGAAGAATOAGAAACTATACTGNGGGCAGACA 
TTACTGTGACAGCGAGAAACCCCGATGTGCTGGCTTGTGACCANCTGATTTTTCANCAATTGAGTA 
TACCCAGGCAGAAAACCCAGAATTGGCV^CCTGAAACACTmTGCTGCCTTGA 

SEQ ID NO: 650 A Cll - llU - ri ' 17 - i ' l - lU " iHH - ini - rri ' rrril " l^ ' l 'CGANAAAAAAACCGCCAGTNTTT 
TAmrrCATGGAAAAACANAACAAACCa^CAAGTTGGAGTCNCGGANATAAAATNCANATG 
TGGAAAACGGTCTGTTGTCATGAACTNTCACTTTCAAATACCArriTATNTC^ 
NGGGGCAAACANAAGGCCATGCTGGAGTCT>rrrACTTTTGGAAAATGGANAATCAAAAAm 
ANTCAACAAACAAAAAAGGNGGGAAACTCCITGGTAAAGCThn'ACAAACATAATTATNC^^ 
NTTTTACCAATAAAANATAGCTAGGGTANAAAAAACANATGGTTANAAACTGGNGCCAAACCAA 
AGNGAAAGCTTrGGTGCCTTCT>rrAAACTCCTATCCTGTTTCTTTAAA^ 

NAAAGATTACrCTGAATTCCCCAGGGTTCmCNCCCCAATTCATCCCTCCCTTTCCCCCCCA^^ 
CCGAAAGGGGCCTTGT N l - l ' 1 11 1 1 ' N TCCGGCNCCACTTTGGGAAGG 

SEQ ID NO: 65 1 ACTGATATAATCTAACAAATGAAGGTGCACCTTTACTTCCTGGAACATAGAC 
AGCCACCTTGTATGGTTGGGGTCCAGGTGATAATACAAAATCATTAATnTTTGCAAATGCAAm 
ATTrGCAATTGTGTTAAAATTGTTGTTTTCAAAGAAGTGAACrrCATTGTTAACATTGC^ 
AAGAGTITCATCTTCTGACCAGGATGGACACCAArmGCATTTTTITCTGGATG 
ACATGTCCrAGTTTTCACATCATAAAGTTGTAGGTTGGGTATCCCAGCTGTGCCATCTTTAGAAGT 
AGTGTAAGGCTGCCACGTTGCCAGGACAGTATTTTTGGGTGAGAATTCAAGGCAAACrrc 
NGAGGTCGAAGGAGTGCAOTAGTCCCTTGTTAGTGACACTGATAATATTTACTTTTTCTCCATTGC 
CCCAGGCAAACAAGGTCCCATCCTTACTAAAAGATACAGACmGCAATTCTTCCCAGATTCCOT 
GGAAACACTGTGCTTTCTGTAAAATGTGGTGGTCCATTCACCATGTCC 

SEQ ID NO: 652 ACTX;CAAGGACAAGrrGATTTCTGGCCAGGCAAAGTrAACTCAGTrrm 
ACTATAAATTTGTGTCTTATATGCTTTAGGTTTATGTATCTATAAACCATTCACCAAAGACATC 
AATTmAAGAGATCAAGGTGTAAArrATOATGATTTATTATTTTOGTCT 
AGTATGTTAAGCATTGTITAAAAATACrAGTAAGTCATAATrATXjCAGAATr TTCAC AAAGTr^ 
TGCACAGAGAAAGCATATCATTTCAGTTACTGATACATCTTAACACTACTTTCrm 
ATTTAACATACACAAGTTT 

SEQ ID NO: 653 AClU"l"lUU-i- lU ' J ' lU ' riMU - l MU U - l TmGGCrmCAATarm 

ATCCAGGATGGATTrrANATCrrGTTGAAAGCAGCCACATCCATGGACTGCACATAGTCCTCAAAA 

GCAGNGATCTGCTCCTCCAGCATATCTGTTCCAACTTTATCATCTTCAACTACACACTGTAm 

GTTTCTTAATTCCGTATCCCACTGGAACTAGTTTAKATGAGCCCCWJACTAAGCCGTCTGCT^ 

TGCTTCTGACGCACTCCThrrAATITCGCCATATCTGTCTCATCATCCCAAGGTTrC^ 

GATGGAAGACTTGGCAACAAGNGCAGGTTTmGGCnTrCTTTGATTCATATTGNGCAAGAC^^ 

rrCCCTrAGCCrCTTrGCTTNTTCACmCCTCCTCATCATCAGATC 

TCATCmACTATCNGNAGCrCCACTTCCTGTAGNGTNTTCCACATCGGCNGGACCATAm 
AAAGCTTTCTTCACTCCTGGCAGGCTGGGCCTTITCCTTTTCGTA 

AAACGGTAGGGCATTGAACACAAAATTCGGNCn^GCNGGTGGGGCCTGGGACAACCGGGGTTTA 
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AAATACTGGCNCCATTTG^^S^^TGNGGATGGNNCAAAACCCCNTNGATGTTAACCTC^ 

SEQ ID NO: 654 ACTCTAATTTCACTAACrGCCAAAAGGTTTTCCAGAATAATCTCAGTTGCTTC 
ATTCCTITAAAGATGAAGCCCAAAGAACGCATGGCGATTACTTTAGAGGAACAATTAGCAGCAGA 
GGCAGGGCTGTGCTGATCCCATCTGGCATCGCTGGGAGCTAACATrAAAGACATGGCACTTTGGG 
TCCGGGTCCAGGfCCTGGTTCAGAGCAGCTGCCACACCGTGGCTACTAGAGGATCCTTTTCCGGCT 
TTGGAAACTGAGGCTGACTGCACCATn'ACAGAAATTTCATCATGAAGATCCTrACACTTAACACC 
TrAAGGTGATGAGTmCACATGACTGTTTrCAAGGCCACAGCCACCACCTTTAATCATCATCACT 
AAAGGCCTGAGACTGCn'CGCCGTGCrCAACACCGACTGGAGTGGCCATGTCTTCCAGCCACGCGG 
CCGACC 

seq id no: 655 accctaaggcaggcccactggctctttttgatcaaggattctgagaaaagct 
gcccttggaggcccttgaaataacatagggagcagaatgagtgctcgagtcgtggctgacacagt 
ccagctcacactgccatcacagaggctgagtgagcagtcacccagggagggggctcccagctcat 
tccattcccatggggcaagtgactagaaggtaagagcacccgagtaagccagtgccrrcctgtat 
ccacacccaggaataccttccagttgtccaagcagccgtagttgtaaggattcctaaatactctgc 
ccttggcctgtagccgacgtctctccttcttgttgatgtgcxntrcgatgctagtctcacctcgact 
gatgagaacagcatgccatacagttagggcacccagggcaagtgccacagaactgcacaggaac 
cagaggtagacaagactcttgtgagtcatccttrctcgaaaggagaangtgggtggtggggtcrr 
ggtgataagtctgggtggcaacccgcctgtagtttggtc1tgtcgagctgtttca14 
cagcattaagccrcccgaaaanggcccaactttccatactgcagttanacacaaccccanagtcr 
tgaaaaagccnnaaagagaagaaagttcccttggccgngaccaccnctaangggcgaatttcx:a 

CAACACTTGGCGGGCCGGTTCTTATGGGATTCCNANNCTCGGNACCAAAC 

SEQ ID NO: 656 ACACTTCAAATGCTGATTCTGTTGAATCrTCTrATTTTT^ 

TTAGGGCCTATGATAGAGAAGATACTGAAACATGCTAAGATAATAATTGGTCTTAGAAAGACTIT 
ACAGCAAGTATTTAATGAAAGCCACTACACITCCAGCTCTAAAAAGAGCCAGACACAATCCCTGC 
TTCCAGAGTTCACCATTCAGAATACAGGAGGTCAAGCACTGATTCTGAATCATTAATGGGCirrGA 
TTrCGTCirrCAAGATTTGCATAGATACTGATTTCITCCGGAAGG 

CACATCATT CTITGGATTCTCATAAGGGGTATTrTGAAGTCTGTATGrrGA CAAG GCCAGTAGATA 
CirrGTTTTCTCTAACTGCCTTCrTCTACn'AAGACT 

TCCCCAACATACACCTTCTGCAGTTTTGTCTATCCAGTCCGCTTTTCTGTAACACnTT^ 
CTGlTCCCCrTCACTGTTTCTCTGTTGCTTTCCrCCTTAATAGAAGCCT 
CnTTTCTTTCCrrGGCTCCTTATTCCATGGGTGACTGATCTTAGCACTTCCT 
AATAAATATAACTCATCAGCCTTCCGATCTTGTAmCXTTGGACCTTT^^ 

SEQ ID NO: 657 ACTGTAGAATGTGATGGAAAAGCATTGATGAGAATTTATTGGCAGTTCAGAT 
TGTGTTTTCCCAACTTAGGCTCTTTATTAATTGGCTAAGGTTTTCTCCAAAA^ 
TGGGAATTATTTNNTGTAACACNNGGGCACATATTACCTATCTTCOT 
NNCNCTTNCCNNCTCANNTCCACATnr 

SEQ ID NO: 658 ACTG ClU ' l ' l ATTTGAGTTTATGAACAGAAATAGAAAGTATGGTGCrTGGGTTr 
TGCCCrXTCn'ACTCCrGAAAGTTAAATCAGAAGACACTGATTTCATTTTGTGAAAm 
GACTATTGATCTTTTOTTTCATTAATATGAACAACTATTAGTAAAAAATAGCTTTAACA^ 
GCTGATATCTAGTAATCTArrCTTTTAATGTGAAAATAAGATAAj^ATGTCCTGGAGCT 
CXTAAATrTGCCAGTATrrCTGTATGTCATTAAGTITTTTTCCTCTA^ 
TAATCTTTGCATACCTGATGGCATCTATGTCAATGCTGATTGGGTAATTATAAATTCT^ 
TAAAACITAArrTGCCTCrTAAGGTGArrGTCCTCTGAGTAATGATTGT^ 

CTTGCAACTATACTATCACATGGGTCGTTAAGTAAAAATAAATAAACCAAATTTGTCTGAGACAG 
GCTAAGATCAATCTTCTCATCAAACCAATrrrCTCTAAGAGCAAATTTCCTr^ 
GCCATTCrrGAATGCCTCAAAATTAAACCGTTATCTATTTAAATCTTCCCGGAArrAGNCT 
CAAAAAGGANGGGNGNGGATNTNTTTAANGGNGTAAATATTATCNCCATATTTGGGGGNGG 

SEQ ID NO: 659 ACTTCrATATATAAATTTGGACGAATAGAAGTAAATATGTTTATTGGTGAAAA 
AGAATTCCAGAAACTAATGGCAGATCCrcGAAATCCAGACTTOTATCATGTATTAAGTGTTATCT^ 
GCAATTAGCTTGTGAGATTAAGGTTCTGCACATGGAGCCTTGGTCATCATTTGATATATACACCCG 
GAAAGGGCCGCTGGAAAACCCAAAGCGTAGGGAArrATTAGACCAATTACAACAAAAGCTGTATC 
TTATTCAAATGATTCCrCGTCAAAATTTATTTACCAAGAACTTAACACCTATGAACTATAATATATT 
TTTTCACTTGTTAAAGCACTGrmGGGAGGCGCAGCGCCACTGTAATAGACCACITACGTO 
GACTCCACTTGATGCGAGAOATATATTGATGCAAATAGGAAAACAGGAGGATGAGAAAGTAGTTA 
ACATGCACCCTCAAGACTTCAAAACACTTTTTGAAACTATAGAGCGTTCCAAAGATTGTGOT 
AATGGCTGTATGATGAAACCCTGGAAGATAGGTAGCAACTAGACrGTCGTTTTTGGTGGAGCGGT 
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TCATTThrmCGAAACCTATGACATGAAAACCAAATITrGAAAACTCCAT^ 
TACTGGTCrTGTCTTGCCCAANCCAGCCAGATCANTITrTCCTAAGCCT 

SEQ ID NO: 660 AGCAAACTTTCTGGATGCCCAACATGATTTTCAGTAACCACCCTTTAGAGTAT 
TTGTTTACTAAGTTCACCACATrrTGAACATGGTAGTTTTAGACTGCAATAATATITAGAOT 
TATACTTACTGCTAAGNAAAATCTAAATCCTGCAAATGCACAGAATTCAAGCTGAAATATAATGA 
TITATGTITAGCTCACATTGAAGTATTGGTTGGTTACrTATGTATrAATGCAGNGTGCATTCACAT^ 
TAATCAGGTTTAGTCTGTITCTATTTTAATAATTTrAAAAAATTATACAAGCh^ 
CATGNTAGTTACAATGGNACACATTTTrAGGNGTCQAAACACAATTrC AAAA ATCCCTAATGNAA 
GGTATAAAAATGNNACANGAATTGAAAAATGGCCAAAGTANCNAATATTrrTCNAAGCNAATT^ 
TTTAGNAGGNATAATTTACATTTTGCTTTTCTAGNGGGGTTGAAATGm 
ATTTATANTTTTATTC 

SEQ ID NO: 661 ACTGGATTCTATAGGTCCACCATTAAAAGCTGGTATGGGACACCAATTTATAC 
ACATAAGGGTTAGATTAAAATTTTAATTTTTTGGhrrGATATTAAACTGAAAT^ 
CTGAATCTTAAAAAAAGTAAATGATAAAAATrTAATATAGTTAACTGTrCACTGATATGTCTATTC 
ACTTCATCATAACCTATATATTTAArrAAAAATCAAATTATGAGTCTGCAAATCAGATGCTANCAA 
GCAAATTGCCATCCAGGGTCCATAATTCTTTITATAriTITATCTCAGATGAATATATNCGANnrC 
TAAAATTTTAATGTTCCAAAATGGTCTTAAAAAAAAATTATCAAAAGCTTNC^ 
GCTAATTCATTTGCCCCCANCGAACTACCTGGTTTGGGGTGGTGAAGGNA>WATNAAANAA}^ 
GAATCTTCTGGGACANTAGT 

SEQ ID NO: 662 AC l ' i I T l ri - l in AAAGATTACTAAACATACAGGAAGTGATAAGAAGTATCAT 
TCATCAGAAGCATCATTCATCAATCAACTTGAANAAAAAGGNGATATATTATTTCTTTAAGGTGCT 
GNGGATGTGTTAAGAGCATATTAGAAGGAATGGTTTTGTCTAATTTTCTTCATGAGTTATGGNG 
TGAGACATCGAGTCTATATTTrGGGGCAAAAACTAAACGGNAGNACNAAAGGAAATCTATTNTAA 
TAGNATArrriTGTrGAACANAGGAGGTTNGATAAGAACTGCAAACCAACANACTCNGCAAACAA 
GGAA>INAAACGNGrmNCCOTAAANACATGrrCANGNGAATCGAANTNCAATAACT^ 
TGAAGAAAAAGTrrCNAAAATTTATNAAACANGCCCTNGTAATTACTCCNCCANAAANNANOT 
GCCThWCCCNAATTTATTANAANATTTACCANACCAGTTGGTCCCGAAGATAril'1-iriN 

SEQ ID NO: 663 ACATTAAATGrrACTITGGCATTCCTATTTCCTAGGCTTACAGGAATTATTAA 
GAATTCCTTTGTAATGCAAATAATCACNCTCTTGGAAATTAAATTTAAGTAGAAAATGTTACAm 
TAAGGCAAGAAAACATTTGTAAATATTmATAAAGGCATTTAATTCAACAAATTAACGGAATCAT 
AGTAAATGGATGCAAATAAAGAATATTACTTTATGAGAAAGACAAAGTTCTTTGGAGCTTACTGT 
GACAGTATGGAAAACTTGTCTTCTTTTTTAAAAGATAATTCCATTTTGATAC^ 
GTTAACCTTCAAGAAGAAATAAGCACTCTTAGTTGAACAAAAAATGTAGGGAAATTAAAATACAA 
AAAATGAAATTAAAAGAAATACCAAATGTNAATTGCCTCAAAATrTATCTTCTTTC^ 
AAGGATATTTTCrCAGGAGTGACrrAATTGAAGCCCTCrrATAATTTACATATGGAAGTGGATGGT 
AGAGCTCATGGTGGCAGGGCATTCCATGATTAAGTGTTAACAGGGNTTAC 

SEQ ID NO: 664 ACAGTCCGGCCCGGTGGGGAGGAGGGAGGGAAGGCAGGCACACGAAGACAC 
AGGTATGTCGGGAAGTGCACACAAACCGTTGTCTTTCCTTTTTGGTTAAAGAANAAAAACTTTGTA 
ATCAATATCCTGCTCATAAGTAAAAGTGGAAAAGAAGAAACTTGATTGCTrrCATCTGGCGTTTTG 
GCATCTCCTCTCCCATTTCATATGCACAGTTTATTTGGGTAATGCTACCGTCACCAGCAGAACACCT 
GTAAGTAAAAACAAATGTCAGGAAGGAAAAAGTATGAACAACAGGAAGCTCCAGAGGCGGCTCC 
ATGCGGGCGCTGGGCTCANTAGAAGCAAACGGTGTGCAGAAAAGGTGCCGGCCACTCTTCTCCTC 
GAACTCCTGCAGCAGCTCGTCGATCTCGTTCTCXIATGTCITCCGTGTGCTGGATACACTGGCAGAG 
CTCACAGATGAAAAACGCCCCCAOGGTGGCCTCCTCGTGGTTGTCCTGGATGCCACATTGACACNT 
NCACAGCTGGCAGGTCTCCTGTTCTCTGGAATTAAAnrCCACCACCTG 

SEQ ID NO: 665 CGTCAAGTTCTTCTAGCAGGGACCTGTCTCCCmACTTCTTACCTCCCACCTT 
TCCAGGGCTTTCAAAAGGAGACAGACCCAGTGTCCCCCAAAGACTGGATCTGTGACTCCACCAGA 
CTCAAAAGGACTCCAGTCCTGAAGGCTGGGACCTGGGGATGGG'nTCTCACACCCCATATGTCTGT 
CCCTTGGATAGGGTGAGGCTGAAGCACCAGGGAGAAAATATGTGCTTmTCTCGCCCTACCrC^ 
TCCCATCCTAGACTGTGCTTGANCCANGGTCTGTAAACCTGACACTTTATATGTGTTCACACATGT 
AAGNCCTGCCCGGGCGGGCGGTCGAAAGGGCG 

SEQ ID NO: 666 AC i ' rri - l - lUUlUUU"iU -lU4-l l"IUUU - ll - 14 T AATTATTCAAATAAAATm 

TTCAGTTCCTGAAAATGTAGNGNCATTAAAGGNCATTTCCTGCTAAATTTCAAATTACANATrTGN 
GGGCCATTCCTGANCAAAAGCATCATTTCTACTAAATATTCAGCTTGNTAACTAAGGNAACTGACA 
GTNTCATGGCAGGAGGNGANAAGGAGCAATGATCAGCTCATAGCTNAAAAAAGGGAAAAAAACA 
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NTAACCAAAGCAAAAAACAAAATAGCTNCNGATATTNGTAAAATAATAATACAATCCCAAACATC 

CCCT^^TAAATTNCr^ACNAAAACAATACCCATTAAGGmAGCAA^^^ACAGOT 

NAACTTTATTGCATTrTATAGNGATTCTTAAGGCCTATTTCCAATGAAACCATTTTAAA^ 

GAGGAGNGGAATmANATGTCTATTACACTTGGCrmAAAAAAAAAATGCnT 

GAGCAAAAATCACTTTTGGTNG 

SEQ ID NO: 667 ACCTGATTTrrrrrTTAAATGGACAGTCTATGCTTTCATGAGGCATGCATATTG 
AGCACCCCGTGTGGGATGGTCCCAGCTAGGGGAGAAAGTATGTGTGTCTGTGCTCACCCGGCAGC 
ATAACTACACAACTGCCCCTCTGAAGGATGGAGCAAAGGACATCTCCTGAGCACTGAGCATCAAG 
CACTGAGGCrCTGGGAAAAGGAAGATCAGACAGAGAGGAGGTAAGTGGAAGGAACTGAGCAGCT 
TCGOTTCAATCCCAAGGCAAAAOCCrGAGCAAGGACACACTCCCAAAGTGCTCAGGGTGGTTCCC 
CGGGCCCCAGGACGGGTTGACACANGCACAAGCACAANCAAANCCTTGAAAGCrm'GAGTTTm^ 
GCnTGAGTCTAATTATCTTTAAAGGAAANAGACrGGATCCrCAAGTCTTCCAATCCATC 
GAATACAGGACTTCTGTCTGATGTTTCAAAGGATGAATATmTGAANGCTGCCTCAAAA^ 
CAKTCTGGGCANACAimmGACCTTATTTCTGTAAAAAAAAAAAAAAAAAA^ 

SEQ ID NO: 668 ACATTGGTTGGGGGCCATGGCGGGTTCCACATTCTTATGATTCTAATAGTCAG 
GGCGGGGCATGCTGCTGTGTCrTGAATAGGGArnTGCATCTGGCTGTCTCTGGAGGCCGTTATCC 
TGACCTTAGGAAAGACGCTCCCnTCTGCTCCAAGATTGGTGAAGCCTGTCNACCGCAGTTTTAGAT 
TTTGAAAGTCrrCGCAATCAAGTTCTCliuriCAATATGCTrCGNTTrGTTATTTAA 
ACAATCAATGCAATAATCATGNTNANAATAACNATGCCANCGANNGTGCCCACANTAGTNANNAT 
CACATGAANTTTGCCTTACA 

SEQ ID NO: 669 ACAAAGGAGCCTAGGAACCCTCAAAGACCCrrTCCCCTACTTACTTCCCACA 
AAAAAGAGCCTTGACAAACCAAGGTTTTATTTCCAGCAACATTCAGCTGATACAAACACATAGAA 
ATACAATCTCACCAAAGAGCTAAGAAGCCCTCCAACACACGCCCAATTTCAGCATACAGCTAAGA 
CTACACAACACACACACAAGCTCAACAAAGAGCTAAGACCACTACACACATCTCCACAGCACCAC 
AGGCACA TCTCAAAAGGGTCTGTCCAGT AGTTCCAGCCAAGGGAAGGGTGATGGCrAAGCCACCA 

GCTCCAGCT I I'crn ' rci" rrc 1 T n m lu gagctgg agtttgctttgttgcaaggctcctgacagtt 

GGACACATGGCrGCACACATGCTGCTACTTGGATCCCAGAGGGTTCTCrGGTGGCAGNTTGACCN 
ACTCTGNCTAGGNATC 

SEQ ID NO: 670 ACCCGCCTGCCATGGACTXSGATCTTCCAGTGCATCTCCTACCATGCCCCCGAG 
GCTCTGCrGACCGAGATGATGGAAAGGTGTAAGAAACTAGGAAACAATGCCrTGCTGrrGAA^^ 
TGTGATGTCTGCCrrCCGGGCTGAGTTCATCGCCACAAGGTCTATGGATTTCATTGGCATGATTAA 
AGAGTGTGATGAATCTGGriTCCCCAAGCATCinClM"lll'CGATCACTGGGATTAAACTTGGCCTTG 
GCrGATCCTCCTGAGAGTGACCGACTTCAQATTCTCAACGAAGCTTGGAAAGTCATCACTAAGCTG 
AAGAACCCACAGGACTACATTAATTGTGCCGAAGTGTGGGTGGAATACACCTGCAAGCATTTCAC 
GAACGAGAGIW4ATCCGT^TNGCAGATGTCATCAAC ACAT G ACTC^^sfA TCGGCrIT^ GAAN ATTCT 
ACCCCA^^^TCA^r^ATATTAAAAGTATTGCNCITCATGCT^^CATTTT^^ 

TGNATGTCCAAAAANANGGNNGGNGAGGTTNAATGCTATGGACCCTTNTAAAAAAAAAAGCCCC 
AANGCCGGNTTTAAGCCTTNTGITGAAAATCTG 

SEQ ID NO: 671 ACGCGGGGACAGGAACAAAAGCAACCAAlU'i'lU'AACmCTCrrTCTCATTCCT 
GTTTTCATTGATTTCCCACATGTAGTCCTTTTGCTCAGGAAGTCTITGGGGAAATTAAGGATCm 
AAGCTCTGAAATAGGTGATCAGGTTAGTGGTGTCTGTCAGCTGTCTAAGAGGTTGGAAAATGAAC 
TACrCAAGATAGTCACGAAAATACrGAAAGTTTGATTTrrCmCCATAm 
GTITGACTGGAAGGGGTTmOTATAACTAAAACCTCAGCGCATAAAGGAGATTTAAAAGGAGCA 
CATGATTTAGTGGGTGGNCATGAAACTAGAATGGGATTTGGGGGNGAATTTGa^AATTCT^ 
TAATCCAAACTC™TGCTACAAGCCTTGGAAANGNCTTNAATACrT^ 
TGCTTAATTATTTGNAANCCCTTCANGCCTNATTCACAANTCCTGNCGGCGGCCGTNAAAGG^ 
AATTCACCATGNGGCTGTATTAT 

SEQ ID NO: 672 ACrril^rrrri-ri-l-ll^lUi ri'i riNGGNGANATTATTTACTAAATAATTGATAT 
ACATCCAACATCACTGAATGGAAATAATrrAAAAATAANCAAGGCTTAAATNGGGGTCTTrCAA^ 
AAACTCAAAACATTCTNGGNGACTGTNrrCmAACAAANGGAATGACAAAACTAGn 
TTATCrCAATCAACTCCAAAACTCAATGTTGCCAACATGTTTTTNTAC^ 

CAAAGCCTCATAArrrAGTAATTACCATGTTlTm'GGTrrCTGTNCrCAAATGCCCAGAAAGATCC 
TGTGAAAGGAAGCAAAGCACCTAhTTNTCATCAOTCTmTAAAAACTITAAGOTGCCTCCAACT^ 
TAAAATTTANATCCTNCrGACAATITCTTCCTATGGGAAGANGAATGCCTCGAACCCT^ 
TTGCTTAT<>fAAGCrGTTNm'AANCNCTTGNTTTNTGCT^ 
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AAAACCAAAAAATNTAATTGAATTT 

SEQ ID NO: 673 ACACCATrACTCTTCTCAAGAGCCTGGTAAACAGCAGATGCTTTATTCAGAA 
AACTAGAAAGAGCAAATACTCCTGCCACATCTTGATGATITCTATATGCTAAATGCATTGCCATGC 
ATCCTCCCATAGAGAATCCrCCrArrAATATCCTGTTCnTCTrGATGCCACTTm 
ATCAAATCAGTAAGCACTTGAGACATGACATCAATTGATTCAAGGTGTTCTGGGCAGTCATTGGTT 
ArnTAAATCTGTCAAACCATACATTGGAGArrCCTCCrrTCATAGGAGTATATGATCTGGGAGGA 
GCTGTTGGANAATAATTTTTATGTGTGGAATGTTAAATCTTGATTAAACCTGNTGATCCACATCTT 
AATCCrGGCCAGATCACTGACrATGCAGGAAGACAAAGAGGCCTTTGCTCCTGCGGAAAACNTCA 
GCGTGANAACCNACCCGACGCACCGCATGNTGATCCCCGGTNCTGCCGGCGCGTTGAANGCNATT 
CNNNATGCGGCGTATl^GATCGACTGGACANCnT^GGGArrGGGTA>mTCCTGGGAATGTTCCNT 
NNANTCCACACANACCGGAGATAAG 

SEQ ID NO: 674 ACCrCAACATAACCTGTAAAAGTATTTCTAGATAAAACTTTACAAGTGAAGA 
AAGAAAACCCATGATGTTACACTTACACACACACACACACACACACACACACACAATCATTCTTA 
AGGAAGAACAAAAACATGGTAAGAGTGTGAACAGGAAGGGAATGCATCTTTTTTTGTAAAGCTTA 
TA1TAAAAAACACAGCATGAGTAAAATAACTTCNTATGCCAAGAGAAGATGCAGAGAGAGGAAA 
CAGAANGCNGGGATGAGACrAACTCATTAATAAATAGTTNGAGAATGTCATTCAAAAACAGTAAA 
TTTGGGGAGTTACANATAATCCCCAGTTGCACCACTmAAAiriUl'lGTTCCGATAACTATTGACCC 
TAAAAAANNCTTNTATGNGT^^SfTrTI>^GGGANGGTNTTNGA^TGG^ 
CCTNNGCCNTTTOAAGGG^rnvrAATCCCCAAAN^rACNa^ANGGG 
AAGGCCCCCNTTTTTAGCCNAATThTOACCCNNGTTTATm 

GGGNAAAATAANGNGGCCCATATTTOCNTGimTGGNhrmAAAAATNATTGGNTGGNAAGm 
NGGCCCC 

SEQ ID NO: 675 ACCAGGCTGGCGACAGGTGCTACCAGGAGTGGGCTGAGGGGAGAAAAACTA 
TCTCCCACrcTTTTGGarCAGGCAATGTCAACGACTrCCACATTCCCTGGCCCACITCCTGAGC^ 
CCCCAGGTTCGGCTCTGTATAAGGACCCTCCCCTCCCAACCCCAACCCCAGAGTGCAGTGCAAATC 
AACCAACAATTrACTGGTGGAATGGCAATCAAAGGAAACAGTTAAACACCAAACAATTTCTTAAA 
GCCAAAAAATATrrrrCATGGAGTTGAACATTTTrCGAGTGTGTrrrm 
GACATTTTGTTCAAACAGAAACAGCATCrANGAATTCTGGCACTTGGGmrrAGGGGGT^^ 
>n'CATCATGGATTCTTCrCCTrGGATITAAAAAGGCCTCGNGTTTCTATTCCTGAmTATA 
CCTGCTAGCTTTCCKITITAGCGGACAGTGGGTGGGCAACCAGCCITCCTGGTTANATGGGCAATG 
CCAANCAAAAATTCCTTATTCACTTGTNGGCTTGGTTTTTTATTCAAAGNA^ 
AAANAACNTTACAAACCCAACCCCCAAAGGCGCCTTGACNGGGACCCTTTCAAACTGATTGGTGG 
AA 

SBQ ID NO: 676 ACAAATTTAAGACTAGACAATTAAAAAAAAAAAACACAAAATTATAGCCGC 
AACACTATCCTAGCTGCATTGTGGTGATTATATTGTTCCGTTAGCTGTCCATTATATACAACTGGAT 
TTTAATCGTGCTATTCACAGAGGGGCCAAAGCACTCATCTAAAGGTAAAGGCGCTCAGGTTTAAC 
GTGCACTTTCATAGCCCCATGAAAACACAACCA1TATATACACTGTAAACACCACGGGGCATGAA 
ATC CACCTCCAGCC ACAAAC ATCAA TCTCACAGGATCAAGCATAC GAGG TCTAAGGTGGGGTGCC 
TGACTTTTCTTTTTAAGAATACrnTCATCTAAAAGTCCTGCTAT^ 
CTTATTTCAAATATGTTGCCAGTGCTCAAAACATTTCTGGAACCCATTTTTGAGGACTA 
CATCTGCAGTGAACTCTTCTGAATGTCCTGGATGGTGGTGAGTCTTCATCCTTK3A*GGATGGGm 
GATTTANGAAAACAGGCAGTTATTCACAGCCAACTTGAATCCAACTGNGGGGATCCCCCGGGTTC 
AAACATTGANATGCCCTATAAACACAAGACCAATTrrTTGTACCTCGGNCGNACCACGCTAAGGC 
G 

SEQ ID NO: 677 ACGCGGGGGAGGCCCGWTNTCTCATCGAAGATGGCGGCGCGATCTGTGTCG 
GGCATTACCAGAAGAGTCTTCATGTGGACAGTCTCAGGGACACCATGTAGAGAATTTTGGTCTCG 
ATTCANAAAAGAGAAAGAGCCAGTGGrrGTTGAGACAGTAGAAGAGAAAAAGGAACCTATCCTA 
GTGTGTCCACCTTTACGAAGCCGAGCATACACACCACCTGAAGATCTCCANAGTCGnTGGAATCT 
TACGTrAAAGAAGTTTTTGGTTCATCTCTTCCTAGTAATTTGGCAAGACATCTCCCTGGAA;^ 
TCCGTCTAAAGTTCAATOTCTGCTCAmAACTGATGACTTGGGTCATGNAGTCCCTAACTCCANA 
CTCCACCANATGTGCAGGGNTAGAAATGrrCTTGATTTCTATAATGGCCCTATTCAAAGATAGATC 
TAAATTrGATGAACTCAGTGCCA>rrAATCTGCCCCCCAATTTGAAAATCACrrGGGAG'rTCTAACA 
ATTrCGGAAGAAAAACCCATTTGAAAATCACTGTCTTTCCCTGAGCAANGGGGGCTGCTCATTAAA 
ATCTTTTGATACTTTACCATGGGAAATTCTNCCANAACnX/TTCrmy^ 
AG 
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SEQ ID NO: 678 ACTTTNTrrTTTTTITITITmG^ 

TACCTTAATAGmCAGAAAGAGGAACAAATTAarrCAGTCCAAATTGATTGGCAGrrGGCTT^ 
CTAGGGAACCANGGGTTCTGACTGCTAAGGATTNATTTGGGATATTTTNAATACTAANCCTm 
CCTTTNAGGCTTACCCCANAATAATTGCTTCACCITCCTTrTCAKrTAAT^^ 
TNGTTNTANAAhTOTCNTTCATlTTGATTCCAAAAGGCCAAGm 

ACATGGCCAAAAAANAAAACGTCAAAATAAACCTCTGGATGATAATTACTAGTCTAAGGAACCAA 

ACCNTNCATTNTmTAANAATAATTCCNTTNCAAATGCCCCTAATAGGCTISTO 

GT^^^AATAACTGAAATGGGTGGCrITmACAANATCCNAAGCAGNT^CCGGGGCCCCm 

CANTGGCCCmAAAACCAAAAGGGGTITmCCGTGGNGGCCCATTrTAAAAATCCNOT 

TTTCC^TC^WGAAAAGCITGCCX;ATA^nTAGGCTGGAC^^ 

SEQ DD NO: 679 ACCTGGGTGTATCCTGTGTTTGCCAAACTCAGCCTCTTGGGTCTAGCAGCTTT 
CTTCTCTCTCAGCTACGTCTTCATCGCCAGCATCTACCTACTTGGAGAGAAGCTCAACCACTGGAA 
ATGGGGTGACATGAGGCAGCCACGGAAGAAGAGGAAGTAATTGCACACCATTTTCCAAGAACCA 
AGAAAGAAGAAAACACAAGAGATTTrrCTCATCri"l'l"i-l"l-l'rrrrri"l-rCTGGTGGAGGGAGGTGG 
TGOAGGAACTTANCAAAOTAGGAGGGACAGANAGTGATCCTTAAAriTAATAANAGTTCGTGAAG 
GTAGCTTAACTTGANAAOT^TTGGTTrnTGAAAGGTTGCTGN 

SEQ ID NO: 680 ACAAGATATrTGCGGTmGTTTTTTATAACCCACTAAGCCAAGATITOTATC 
TCrQTATGGAACTGTTTTrCAAATGGACAGAAATGGTCTTTGATCrrTTCTGAACCAOT 
ATTCTTCTGAGGATACAGTCACCAAGGCAGTCAGGGCTACGGANCCAACACACrrACCTCTGGGT 
GAACTCATCTTTTATTTTTTCTGGGATATCTTCTCCCATAACCTCAGCTATCAACAGC>^ 
TCTTTGAAGCTGAACCCrrCATmATCTCAA'rrCCrGCTGAGTCCTTAAGTrCCTCTCC^^ 
CGATGTTTGCTGAGGTTGGGTAAACTGAAGCAGAACCTGGTTCGTATAAAAAANGCCATGCTTTC 
AACTGAGTCTCTATCAACACTGTTCTCTCTTTrCTGNCATACAAGTrrTGGATATTrGN^ 
TCTCATTTITCTGGCTATTACTGCTATTCGCTTCACANAAAATAAAAGCTCTCAGCAAGGTTC>^ 
AAAACCAAAANACAGCTTTTCCTGAGTGTGGAAAAAANGATTTGATAGTTTTGTCOT 
ACTGNGCTGrmGAAATGACCCAAGTGCTGCTGAAANAACTGGCCCCAATTCANGTAANCCT 

SEQ ID NO: 68 1 ACAAAArrATCATCATTTATTTTTOATTrTTTTCACCAGCCCTG/^ 
Cn-GTAATATGCTOTTTCACAATCrrmATTAAATTACTTAATGArrCCAGTCT^ 
AGACrrrGCTCTGTTGTTATAGATCArrGAATTGGGGGGGAAACATAGTAAGCTAAACTAT 
ACTGCTCATGAAAGACCCACATTGTrGATTArrTTTCCCAGCACAGAAAGAAAAGGACAAGTGAT 
TCATCCTGCAATGGGCCTCTGCCAATCU'rrrCATATCTATGTAACCl"ri"lGTAGTTACCTGGTGTGA 
TTAACCGTTGTGCTArrGTAAATCTTGGATTAATTAACAAAACAACCAAAAATCTATCACC^ 
TATTGAAAAGAAATTCAGTAAAACAAGATGTGTCrCATAGTTAAGGAGAGACATAAAAANTAAAA 
ATGTCATC^AACAGGTTGGATITAGGATTrACTGTTAATCNAGAAACACCGAGGAGGCTTANCTCA 
CCCTTTNATTGGAGAATGTGGGGAANGGAAAAGAGAGTAAACACATTAACTTTAGTrANa^y^ 
GTGCTGGNTAAAAAAATTCCGTGAAAGGAAATGGTTACAAGACAAATTGGCTTTTATCCCCnTC 
T 



SEQ ID NO: 682 ACACATGCACATCAAAACACTTCAACTGAATATAGATGCCATTACATTATTTA 
GTTACGTTACAAAGCAAACGGCAGGTTCATAAACGrrGTTCTATrATGTATCAACTGAAAAAAATA 
TATTCAAAAAAAAAGTTTTTTGAAAGACTCATGGGGAAGTGOAATGGTGCCCCACATTAGGAAAT 
AAAGCrTTTACCAGGACCACCTTGCTCCAGCTGGCTCrCAAGGGACCACTGAAAACAGCT 
CCCTNAAAAANACAGAATGGGTCrrGGTAATAATTCAATGGACTCTCAAACTCATTCCTOT^ 
CACAACGGGGGNCCCAACCTGATAATGGCCCCATGGGTTGGCTTGGCCAGAAAGTCATTATCTCG 
AA>n'CGNAAACCCCX:TTTCAAGCATCCAATCTTTGGrrrcrrTAATGACAATAGCGrrAAT^ 
TrmCThm'ACNGNANTTACAACATCANAANGGNACnTCATOAGTTCATTTT^ 
ATTTCGCTGNATTrrCAACNANGGTTTTCTTCTrCTAAAACCTNAAG^ 

NACCCTATGGATTGGCACCGTITGTGGCCCATGCrCCCTGCTTAATQGTCANNATGATGCATTATN 
C 

SEQ ID NO: 683 AL MI I U 'I UC ri rJ T CUl 'C Cn - rri - rriU GGAAATTATmCXn'GAGCCTTTTC 
ACGGTATATTGTAAACTTTTATGrrAAAGAAAAAATATACAmACAAATTGTGAGATT^ 
QAAATmCTACGATGTATACTGGCTTATTTmAATTTAAAACGGGGTTTCCGTC 
AGGGGGTGCGCTGTTAGTCCCCTCGCrCCTGGCTTTGGGGGTTGGACrmGGGGGNCCAGAAACT 
TGGNAGCTTCTANAAOAAATCTACTGAGTGGNKTTCCTGTTTTTGGTTAAATCCCT 
ACTGGACCTGCTTGTAATGTCTGAGGGNAACrGTGGGGGTGCNACAGCCAGCCCGGTGGATCCAC 
GCAGGCTrAACCGACCNATAAGAAGCCTTCTCCCAAGCACCGTGGTTCAGOGCGTTTCCATTGACC 
AGTTTGACCCTGGTrGAATAAANAAAATGCGTTNGGTrrNA 
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SEQ ID NO: 684 AC111UU"1"11U rmnTrrnTnTl"l'AAAATTTGAACACATATTTAATAATTTG 
TGATAATGOTTTTGNGGCTAAGTrrAAAAAATGAAAGGCCCCAATCTTCAAAAATCTACTCTGA^ 
TATTTACAATTACATArrTAACAAGTNTATAAAAGTATAACCATATOTGGGATTTGCm 
ATAANAGATGGTACCrGGANAGATGTATGGATGANATAANATTGGCCACAAGCTGATGGTTACTA 
GAACTAGGGGTCTGAAAGTTAAAATTNTACCCAAAAAATNTCTAAAAArrrTGNGNCCAA^ 
TWAATCCCATTAAATTCAGGAAAANAmrrCAATNGCTTAATANCATTTNCAAATAATAAA 
TTTTGGAAAANAAACCTTTGGNTANANATTTTNCTGAATTCANACTN^ 

GAAAAANTATTTTGATTAANAANTAACAGTTTAGGAAACCCTTGATGNNTAAAGTAATGGAACCG 
TrrCCTTACAGGAGTATAAATANGCATAATCT 

SEQ ID NO: 685 ACTTTTGGAGGCCAAGGCAGGTGGATCACTTGAGCTCAGGAGTTTGAGACCA 
NCCTGGCCAACATGGTGAAATCX:CATCTCTACTAAACATTNAAAAATTAGCTGGGCATGGTGGTG 
CNCACCTGTATTCCCANCTACTCGGAANCCTGANGCATGANANTCGTNTGAACCTGGGAGGNNGA 
NGCTGCTATTACCTAAACATTGTCNANTGNATATNCCCTATGCANAGA 

SEQ ID NO: 686 ACGCGGGGGCTTCCGGTTCTGACGGACGCTTCGGCCGTAACGATGATCGGAG 
ACATCCTGCTGTTCGGGACGTTGCTGATGAATGCCGGGGCGGTGCTGAACTTTAAGCTGAAAAAG 
AAGGACACNCAGGGCTTTGGGGAGGAGTCCAGGGAGCCCAGCACAGGTGACAACATCCGGGAAT 
TOTGCTGAGCCrCAGATACTTTCGAATCTTCATCGCCCTGTGGAACATCTTCATGATGlTCTGCAT 
GATrGTGCTGANAAAACTCAAGCATArrGCCmCCATCTAGCACTGGGGCCATANTrCTGATACT 
ACTGGNAACGAAATTGNGAGATTTGCTGTAAATGGATNTAGGAAAANCCNAACGGGANATTITGT 
GAT^^^GGCC^CTNGGGTNCATCCTNGT^mGCTACCCAAAAAAAAGAGGCT 
lOTAATGATATTCCNmGTGGTNGGCTGCNATTTGTTNCCANCCAGT^^ 

SEQ ID NO: 687 ACGCGGGGGGGGCGCACCCGCCGATTGTGGCCATGQCGGCCGCAGTCTCTAG 
TGTGGTGAGACGAGTGGAAGAGCTCGGGGATCIGGCTCAGGCCCACATACAGCAACTTAGCGAAG 
CTGCCXjGTGAAGATGATCACrnTrAATTCGGGCCTCTGCAGCATTAGAAAAATTGA^ 
GTGGAGAAGAGAAAGAATGTTCAAATCCATCAAATCrrCrAGAACnTrACACACAGGCrrm 
ACATGACATATTTTGAGGAGAACAAGCTAGTANATGAAAATTTCCTOAAACTCTTm 
AAAAGACTTGATNA^ITTITmCAAAACCAOAAAT^T^A^r^A^ 
TGCATTTTGCTTGGGGATGANCTCTGGAATGCrjTCTTrrGGAACNANGACCC^ 
GCATTCTTTACCAAAAAANANAGTGGCTTTAAAAAATTCATTTGCTTAAAAN 

SEQ ID NO: 688 ACGCGGGGGAGACCTGGCTGCTGTGTCCCGCGGCTTGCGCTCCGTAGTGGAC 
TCCGCGGGCCTTCGGCAGATGCAGGCCTGGGGTAGTCTCCTITCTGGACTGAGAAGAGAAGAATG 
GATAAGCCCCTCTTCCCATTAGTGCCTTTGCATTGGrrTGCTTTGNNTCACAN 

SEQ ID NO: 689 ACGCGGGGTGAAGATATTATGGCTGCTGCCACGGAGCATAATCGCCCGAGCA 
GCGGTGACAGGAACCTGGAGCGAAGATGCAGCCCCAACCTCTCCCGAGAGGTGCTCTACGAAATC 
TTTCGCTCCCTACACACCCrGGTTGGACAGCITGACCTCAGAGATGATGTGGTGAAAA™ 
GATTGGAACAAGCTCCAGAGCCTCTCGGCATTCCAGCCTGCNTTGCTCTTTAGTGCACTTGA^ 
CACATTTTATATTTACAGCCTTTTTrANCAAAACrTCAGTCTCC^ 

GCTGrrGAANAGATAGGAAGAACAGAATTGGGGAACAAAAATGAAGTAAATGACAAArnTCCA 
TTGGGCGACCTACAAGAGGAAGAAAAGCNCAAAGAAGGTGATTTAANAGATGTGAAAAAGACAC 

SEQ ID NO: 690 AClU■^^lU"l■l■rlU'lU'lnl■l■rrl■J"lU■lU"14"l"lU4■CCAAAAACCGCCAGT^r^T^A^ 
CTCATGGAAAAACANAACAAACCCACAANTTGGAGTCACGGANATAAAATACAGATGAAATGGA 
AAACGGTCTGTTGTCATNAArThrrCACTTTCAAATACCATTTTATATC^ 
GGCAAACANAAGGCCATGCTGGAGTCTNTTACrmGGAAAATGGAAAATCAAA^ 
CAACAAACAAAAAAGGAGGGAAACTCCTTGGTAAAGCrmACAAACATAATTbrrCAT^^ 
ACCAATAAAAAATAGCTAGGGTAAAAAAAACAGATGGTTAAAAACTGGTGCCAAACCAAAGNGA 
AAGCTTrGGNGCCTTC^^^TAAACNCCTATCCT^rrTTNm 
ATTACrm-GAATrasrCAGGGTrTTNTCACCCCAATTNATCCCTNCCTTTCACC^^ 
AGGGCCATGTNTTTTTTTGTCCGGCCCCTTGGGAAGGGGCTTGGCCTN>^ 
CTGTTNCCCTAAhnTGAAANGGANGAGGGGGCCCAC CJ 4 4 ' lUUl - l - ri - ril - lM CNCNCCOTA^^ 
AAAAA 

SEQ ED NO: 69 1 acgcgggggggatgcgcttgggctccctgttcgttcccacatgcagggcagc 
acgaggagaatgggcgtcatgactgatgtccaccggcgcttcctccagttgctgatgacccatgg 
cgtgctagaggaatgggacgtgaagcgcttgcagacgcactgctacaaggtccatgaccgcaatg 
ccaccgtagataagttggaggacrrcatcaacaacattaacagtgtcttggagtccttgtatattg 
aaataaanaaaggagtcacggaanatnatgggagacccatrratgcgttggtgaatcttgctaca 
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ACITNfAATTTCCAAAATGGCrrACGGATTTTGCAAAAAATGAACTGGATTTGm 

GAACTGA^TATTNACNCAA/J^ACCGGCTTTGC^mTITCNAAAANA 

A1TAAAGGCAA 

SEQ ID NO: 692 ACCCCACACCCTGAAGGTGTCTATGAGTTCACATGGCTCAGGAATGGAGTTTT 
GGGGCCCTAGATGTAGACAGCTGTGGCGATGGAGGCCAAGGTTTTGGCCTGCTGTGGGGGTTGCA 
TTCACCTTCCCATCTCCACATGACGATACCCCAGCATrrACGAGGCANC TTCTG CTGACCTGGCCA 
AGCACCGTGGTCCCNACTTCAACGCATCTGAANCGTTGCCCAGGGANCCCmAAAAAT GCAAA T 
CCTTGCCCATCCITGAAAACmAATmGTAGGTNTGGG^^^AAGGGGGGGCCT^ 
CTAAAAACAAT^r^AATTTGGCCCC^mrGTTTAAGNATAACATTTCTNAAGCAC 

SEQ ID NO: 693 ACATGGTAAAAAGTTGTGGCAAAGATGGCITCCATATCCGGGTGCGGCTCCA 
CCCCTTCCACGTCATCCGCATCAACAAGATGTTGTCCTGTGCTGGGGCTGACAGGCATGCOAGGTG 
CCTTTGGAAAGCCCCAAGGCACTGNGGCCAAGGTrCACATTGGCCAAGTTATTATG TCCAT CCGCA 
CCAANCTGCAAAACAAGGAGCATGTTATTGANGCCCTTCGCAAGGCCAANTTTAAATm 
CGNCCAAAAAATCCCATTTAAAAAAANNGGGNTTTACCAhnSTrmAAAGC^ 

SEQ ID NO: 694 ACGCGGGTAGGCAGAGAACAAAAATGTTAAGCATGGTGTrGTCTATCTTATT 
GAAGCGGrrGGAAATGAAAGCTTTTAATTTGATAGATTTATCAGTA TAAAA TTA AGGGA^ 
GTGNGGGGAATGAATCAATTTAAAACTrCGGGAATTGNGANGNGACi'i-riGNAACl 1 irCGTCTG 
NGTGTGACCTGTGAACCACTAGGATGTGATCTGCCCTTGKGGGCAGGTCCAGCATAmTAG GAGT 
TAGGCTTTAACATAAATTTCTAGCTGCATCTGAKrctCCTGGGATGGGTGCTCT^^ 
GCTGCGGATGGTGANATCAAANCAA>rrhrrrCTGOTGKrcGCCCCrGCAAT 
CANTGCNAATCACTTAGTNGTNAAATTTTAATCAAACACCACCAGGTCC CAAAT GCAGGTCATGA 
ATmT^AAATTCTTAAATrrACATNAAAAGTNAAGTTTTNACCGK™ 
AAAGAAATTG 

SEQ ID NO: 695 ACTAAAGGCTnTGCATGAATTAGGAAGGAGAGTCTTGGGGCAGAAGCAATA 
GGGGACAACTGTGCTGGTG<^GTCTTTTGCAGGATGTGTrrACCAAAAC:AT CTAATGC AACT^ 
TCAGACTTTACAGTTTGTAGTGTTAACCTCTTTAGAAAAAGAGCAGCCATCCTITm 
TGTATCATCCCCAGTGATGAGGAGAAGCTCTTCTGTANANAAGAATGACACTGTGCTGGGGCAAG 
CGATTGCATACTGNANCGGNCNCANGTCCCCGCNTACCnTGCCTTNTANNGGACAAGGGGCCCTG 
GACCnrCCCANCCCTTTG 

SEQ ID NO: 696 ACGCGGGTATTGAAGAAGATTTTAATATTGCACrAGGAGTTTTTGCTITAGCT 
GGACTAACAGATTTGTTGGATGGATTTATTGCTCGAAACTGGGCCAATCAAAGATCAGCnTGGGG 
AAGTGCTCTTGATCCACTTGCTGATAAAATACTTATCAGTATCTrATATGTrAGCTGACCTA TG^^ 
GATCTrATTCCAGTCTACTTACTrACATGATCATTCGAGAGATGGTAATGTTGATTGCTGCTGT^ 
TATGTCAGATNCCGAACTCTTNCAACACCNCGAACACTTGGCCAAGTATTrrAAAT^^ 
CCACTGCTAGGGTAAAAACXCAACATrCATCGCAAGGGGAATCCAGCCAGNCCANTTATCTNGNG 
GGCAGCTTCTTTGGCAGCTCa^GTTTTCAACTATCCTGANNGNCTITATOT 
>mTACAAGTTTCACCCCAGCTGGNTCACCTTTTAGTTCCTATlSrrTA>rrGCCGGG^^ 
AGGNGATAAAAANCCTGATNAAAGCCATCCCTTNCTGTNNGNAAGGGACCNNCm^ 
GGGGACNGGGGCCCATTGOAAATGTCCCTGCCCGGGGCGGGCCNGTT 

SEQ ID NO: 697 ACTACAATGTTCTATGCATTTCrTCATCCTA GACAT TAATAAAACACATCCCT 
TTGGTCrrAGATACTTCTCmGGTCTGTGTTTTCrrCTtTCT^ 
GAAATTTACGTGAACCTTTCACATATCTATTTTTTCCTTGGGCCAOTTGATAAm 
AATNCCAGCGTTTACCAACTCOTCGAATTCirnGATCTTTACrATCCTCTGAG 
TTGAATAGAGCTGGCATCAAAATACAAATGCCACACAATGAATTCTGTCCTTCAGCCGATGGGGA 
ATCAATGNAGNC^TTGATGAATTTAATTTGATTGATTCCCATGGGATTAAAA CTGGT ATCTATCAC 
CGAATGTTGCCGGTCAAGATATAGAATATGTCATCCCTGCACAGGCCGCCn'CnTNTCACTCAGC 
CX:CAGTGAGGCACACAGAATAAACGGNAGGGAmGGCCATCmCCCGNCTCTAATAGAAGATG 
NCCTATACTTCTTANATrCCCAG>n'GGATrAAGGCCCACCAAAGCCTGATGCGTACTGCCCTTGAA 
AACANCCTCATGAGTTGAAAAGCTGGCTTCCCACTCCAArrGGACCAGCANCANAATTNGTTTTGG 
TGG 

SEQ ID NO: 698 ACTAGGGATACAAAGACTTGGTTATTCTrGTrGGAGTAATGATTCTCCTCTAT 
CTGGGGAGOTCATCCTCTGCAGTTGGGGACTGAAGGACTGGGAGATGGGAGATTGAAACrTAAAG 
AGGTAAAGGTATGTCTTGGAATAAAACTAGTTATCAGTCACCACCAGACCATCATGGCGGCTAGA 
ATTCCAGAAACGGTGAATAGAAGTTCATCTTCTATITrGTCTrCATGGAATATATCACrrCTGGTAA 
ACAGCTGGTCTAAATATGATGGGTAGACTrrCAAGGTCrrCATrmANATGTAATTCnTNC^^ 
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AGGGTGTCTAGAAACCGTGTAACACCATACCrrCCTGAAAATGTTTCTGAATGCCAAAATTGGGG 
GCANTA 

SEQ ID NO: 699 GTCGCGGCGAGGTACACAGTCCAGGCTCTCCAAACGGAACTCTACAGGCAGC 
CGCCCCGTAAGTGTCACGTATCGGTTTGGTTCCGTGGGCCAGGACACACAGCICTGTTTATGGGAC 
CTTACAGAAGATATCCmTCCCTCACCAACCCCTCTCAAGAGCAAGGACACACACAAATGTCATG 
AATGCCACGAGCCrCCTGCTGGAAGCAATGGGAACAGTGTTACAACACCCGGGAACTCTGTGCCG 
CTCCTCTGCCACGGTCCAACAGCCTTCCACATTCAGCAGTCTCAAATGCTGGCAGCAAAAGCAGTG 
TCATGGACGGGGCCATTGCTTCTGGGQTCAGCAAAnTGCAACACTTTCACTACATGACCGGAAGG 
AGAGGCACCACGAGAAAGATCACAAGCGAAATCATAGCATGGGACACATTTCTAGCAAGAGCAG 
TGACAAACTGAATCTAGTrACCAAAACCAAAACGGACCCTGCTAAAACTCTGGGAACGCCCCTGT 
GTCCTCGAATGGAAGATGTTCCCTNGTTAGACCCGCTGATATTGTAAAAAAGATANCNCATGAGA 
GACTTGACTTGT 

SEQ ID NO: 700 ACACCGATTrGGGATACTTrrCTTTCAGAGCCGGCCACTGAACAATGAAGTA 
ATCTGAGAGATGAAACAGAATCTTTCCGGACATGGATAACGTTTCTACACGGCAGATGCTTTCAAC 
GTAGACAATGATCACTTTCTTTATTCCTAGTATCCCAAGGAGAAGGGCAGATACACAGATAGGAA 
CACATGTTCCTGOTCCGTTACACAACACCAAATCTGGCTTCACCCTGTGAATTAGGGGAAAGGAG 
AGCCACATGGAGTGCAAGGTGGTGAAAACGGTGGAGGGCCAGGACTGCTGACCTCCCGGCTTCTT 
GGAATTCGGTGAATGTAGTATTTGGTATACATGTTACTAGGGTCTCTTCAGCTCGATCTANTTCAA 
AAGAATTTATTTATTGGCACTCANTTTCATNATGTCAGCAATGACATAATGTCTAGGTGAGTNGGN 
NTTTGGACAAAGCTCCAAGCAGCCTNAGGNTCTNNGTGGTTTGCCCCCGGGNCCCAC^^ 
AATTCTGNAAGACNNCCGGGGCGNNACGTCCOTGGAACGAACCACTNCCCATTTTCGGAG 

SEQ ID NO: 70 1 GTACGCCTGTAATCCCAGTGACTTGGGAGGCTGAGGCAGGAGAATCGCTTGA 
ACCCGGGAGGCGGAGGTTGCAGTGAGCTAAGATCGCGCCATTGCACTCCAGCCCANCCACAGGGC 
AAGACTCCGTCTCAAAAAACAAAACAAAACAAAATAGATTTGTTTCCCCCTGCACAATCTG^ 
TACCATTTCTTCAGAGCACATACAGGCATTTCATCTTTCAACCTGTAATTTCTCCTAACrrCATGm 
TCATGTAAATTAAAAACACATGCACTAACAAATAANTNTACAGCATATATCAAGAACACAATTAT 
AATITGAAAGTGTCGAAANAAAATCITATACAAAArrrAAATAAAAGAAATCATmGTCCAACTC 
TANTCATCTTATTGNCTAAACACATTATnXSCAAAATATTTTTCAAAAGATTGTATGAm 
AAAGTTCTTTAGTGGGCTTCATCCACTGGNTTTGGNAGTTTCAAGCATTTm 
ArriTrrTTTGCAAT>rrTGAACTTATTTCrGATTGCAGNNACTNACT^ 
TNTGAAAAAAANTNACTACATTTACCCTTAACCTGTTANATTTANTrTNTT 
NTTNGAACC 

SEQ ID NO: 702 ACCTGACAAATTATTGGATTCCAGCACAGTGACTCACTTATTCAAGATAACTG 
AAAACATTGGTTGTGTGATGACCGGAATGACAGCTGACAGCAGATCCCAGGTACGATCTNACTCA 
CTGCAACCTGTGCCTCCCAGGTTCATNCGATCTCCTGTCTCAACCTCCCAAATAGCTGGGATTACA 
GGCACCrGCCATCATGCCCAAGCTAATTTTTGTTTTrAGTAGAAATGGGGTTTCACCATGTTA^ 
NGGCTGGTTTGAANTCCCGACCnMAAGTGATCCAACTGCrTTGGCClTCCAA^ 
AGGGCGmANCCACCGAGNCTGGCCTTTTTTmTTTTTm 
TTTOOTANGNCCAATTTNAGATTTTTTTCA 

SEQ ID NO: 703 ACATGACCTTTAGTGAAGATTATTTGTCATCAAATTACCXATATCCAAGTTTe 
CATGGGCCTGGAATTrCCTTrCCACrrGATAGAAGTATATATTAGGAAGTCCAGrrAATAATATTT 
TTATTTAAAAAAAAAAAAAAAGGAAAAAAGAATCAGCAGAGTCAAGTTGTCTTAGTC^ 
TTCTGGATTTCTTCCTTGGAGGAGGTCAGGATCTTCCCAAGGCCTGGG TCCTC GAAT ATTC TrCCAG 
TCATCAAACTTGGAGTCTTTGATTTTCTCATATTCCGACTCTAAAGATATTTTATTCTCTT^ 
TlTTlCAAGCTCAGGATCCATTTTACTCTTCACAGCATCmATCAGATTrGAGAAAACTCACGA 
GACCAAAAGAACCTCCAACNATCAGCAACAACATGGGGACTCCATAGCCNAGAGTNTTGTTCTTT 
GCGAAAAGCNCGCATCACCGCGGGTGCAAACATTGATTGGAACTNTTCCATCNGGCTCANANCTT 
CAAGNGGGCCTTANCGCAGACCTCCNAAATTTAAGAGATTGACGCTCTCNCAACCCNCGTTCCT^ 
CCCGGGNGGCCGTTTAAAGGGCNAAATTC 

SEQ ID NO: 704 ACAGACTGAACAGATTAGGTCTTTGTCTGAAGCTATGTCAGTGGAAAAAATT 
GCTGCAATCAAAGCCAAAAnATGGCTAAGAAAAGATCTACTATCAAGACTGATCTAGATGATGA 
CATAACTGCCCTTAAACAGAGGAGTTTTGTGGATGCTGAGGTAGATGTGACCCGAGATATTGTCA 
GCAGAGAGAGAGTATGGAGGACACGAACAACTATCTTACAAAGCACAGGAAAGAATTnTCCAA 
GAACATrmGCAATTCTrCAATCTGTAAAAGCCAGAGAAGAAGGGCGTGCACCTGAACAGCNAC 
CTGCCCCAAATG(>nSfCACCTGTGGATCCCACTTTGCGCACCAAACAGCCTATCCCAGCTGCCTATA 
ACAGATACGATCANGAAANGATTCAAAGGAAAAGAANAAACGGAAGGCTTCAAAAATTGACNCT 
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atgggaacctaccatnggttgacactgaaatctttaaccggaggggtcatnttgcccggaagact 

CANACTNCTTGCAGCCCAAGCCATGNCCTTGGCG>nWACCACNCrmAGGGGCNATrTCNACACAC 
TTGGCGGGCCGTTTCTAhrTGNNTCCNGGCTCCATNCCAAACrrGGC 

SEQ ID NO: 705 ACTCCTTGGCTCITTTGAGTCTCCACTTTTACTCACTGTCT 
GATCCrmcrrCTCITATAAATCATCCTCTTAATGAAAATTAGCCTAACA^^ 
AATCCTACriTGAGCCACTGACTTGAAATAACTCTTrrGGCAAGTTGCCTGACATCCT 
AGGTGGCATATTTGCATTTTTACTGCTTAAAACAriUUrillUUU'rACCATCT^ 
ATATTGATGGTAGGACTAACAGGCriTITAGAAGCTGGCTTTAACTTTGAGTCTCAAGCTAC^ 
CTGTTGGGCAGCCTGGTCTTCCCACGTGAGGGTITAACTTTGTTTATTTGCCTCCAGTTAT TC 
ATGCTTATTAAATGAAAGGCCCAGGAACATGTTTATTTTANTCACCTTTGCTTm 
TTGTAATCAATGAGTAATTCATGATGAATTATTTTTGACTAATGGATAGCCCGAAGGCCAAAGCTT 
TTAATTCTAATAGGNAATGTTCTTCTTTTNNCTNATGGAAACAATGAGAATA 
AATTGCACTCCGATNrrATGCTNGNGG>rmAhrrCACATAAGCACAATTNTGNGGm 

SEQ ID NO: 706 ACAGTTCACATTAATATTCACATCCCACTTrCTGCTACTTCTGTCAGCTATACT 
CTGGAGAAAAATACAAAGAATGGACTTACACGCTGGGCCAAGGAAATAGAAAATGGTGTTTATTT 
GATTAATTGACAAGTTAAAGATGAAGATTGTGACCTATTAGAAGGACAGAAAAAATCrrCTAGAO 
GAAATACTCAAGCAACTAGTCATTGTTTrGATGTCAGAGTGCTAACGCAGTTGCrCCTGAATTCAG 
ACCACAGATCCACAGCCACAGTCCAGATATGTAGCGGTTCTGTAAACCTTAAGGGTGCTGTGAAA 
TGCAGAGCTTATATCCACAGCAGTAAACCCAAAGTTAAAGATGCTGTGCAGGCAGTAAAGAGGGA 
TATATrGAACACAGTTGCTGATCGTTGTGAAATGCTATTTGAGGATCTGCrmGAATGAAATO 
AGAAAAAAAAGATTCTGAAAAAGAGTTCCACCGTCCTCCCTTATCGAGTCnTTGTTCCCOT 
GATCCACTGTAATGTTGTGTGATTATAAATTTGGACXiATGAGTCANCTGNAAAAATAAGGGACCA 
TTTTATGGGAGATGTTGGATCCCACAATTCAAAATAGAAGATTNGGAA^^^^GC^WANGAAAC 
CANNCTTTG 

SEQ ID NO: 707 ACTTGCTTACAGGAAGAGTAATTCCCrAGCAAAGGTCATTAGCTCCTAAGGC 
ACTGAGTCAAAGTGACAGCCCTGAAGGAAATTGCACTCCAGCCCTCCTCCAGGATGTCTAATAAG 
ATGGGAAACTTGGATGCCCAGCCATTTTGGTGACCTGAGAGTCTAACTACTCCAGTTAGACCTAAG 
GGCACAAATGCAGAATTCATGACCITGTAGTTGTGGCAGGGTCrAGGAAGTCCIOTCTCCCCA^ 
AAAAAATATTCTCrrGCCATTCCTGAAATTCCACATTCATATAATGGCTGTGCAATACATGCTTCTC 
ANTAAGAAAATTAACTGCATGTTTACTGT^^^CTGATCACATCAAANTTTTTATG^^ 
NCTCATTNTGGATTGANTCCAAACOTAGCTCTAATAANANAAmAKNGC 
TATTCTTCATTTTTATTTGN 

SEQ ID NO: 708 A Crn ' nTi ' r n'l Tn r i -l l i' i NGGTATCTATCrAATAAAAGTTTATrTGTGTAT 
GTGCAATGCATAACTCTATCTTANATATGAATCCTAACAGGATGAAAATACTTTCTTGC/^ 
TTATGCTTATGAAAGGTGTGAACnTGCAATGTCCTCCTGTCTTAAACCCAAGTTGACAGTGCCCTC 
TCAAAACTTTTCATAAATAATGACCTAATTTCATTTAAAAAATGGTTTCAGCAAAT^^ 
AAAGTCCGTTATTTGTCCATTrGTAATATGAGAAAAAAAAAGATGATNCATTCCTCTACAGAAAA 
AGTGGGTTTANAGAACAGTTCTGGTAATArrrCACATGGTAAAGTATCAAAAGATCTAATGAGCA 
G<XCCCTTGCTCAGGGAAAGACAGTGATTTCAATGTGTTTCTCTTCCGAATTGCTTAATAA 
AGTGATTTTCAAATTGGGGGGCAAATTCTGGCACTGAGTTCATCAAATTTAAATCTATCTTGAATA 
GGGACATTATAGAAATCAAAGAACATCTNrTACCCTGCACATNTGGTGGAGTCTGGAGTTANGGGA 
CTACATGCCCAAGTCATCAGCTAAATGANCCAAAAATTGAACirrAACCACTm'OT 
GTCT 

SEQ ID NO: 709 AOJCraGGAAGCATATGTTACTrACCTTGTTATTAAATATTTCTTGAAAAGCA^ 
ATTTTAATGGTTTAATTTTATGTGGACGTATGTTAAATTATCCAACTACCCTATTGTTAAGC^ 
GTmAAAATTTTrATGCTAATATAAATGCTCAAGTAATTTAAAA TATTC 
ATAAATTTCTGAGTAAATGCATTGGATCAGTTGGACTTTGAACGCCTTTGAAATGGC^ 
ATGCTCCCGCCACAAAGTTGTAGGAAATGGGAAGAGGAGTCAACTAGAGGCAAGGGAGTTGAGA 
GAGCTGCAACTGTAAAGGGCAAGAACAGGCAGAGGTAAAAAGAT GATGO AAGGTGTGGTGACTA 
AGGGCCACGTTTATTGGGTGAAATTTGAGATTGTAGGCCAACrrGTAriTrCAAGCITCTGAAOT 
GGCAAAATATTCATCGCAAAGTCTCTAGCGTCATATTTTrCTCACCCAAATTACGTTTCCCGAGATT 
ATTTATATATAGTTGGTCTATCTCTGCAGTCCTTGAAGGTGAAGTTGTGTGTTACTAGGCTGTGTTT 
TGGGATGTCANCAGTGGCCTGAAATGAGTTGTGCAATAAATG1TAAGTTGAAACCTC 

SEQ ID NO: 710 ACTATACTCAGTATTTAAATATGTCCCAGTATAGAANCATAACTTCAATTAAT 
TTGCTNACACTAACTTCrrAAAAACTTACAAATATTCAAAACAAAGGGAAAA^ 
AAGATTTAAAANGCCATTGCCnTGAACATGGAGCCTCCATAGCAAAANAGGANATATACTTGTAT 
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ACTCATTATATTTTATTTANNACCCATGTCAGGAAACACACACTCrCTTATNCNNAATA 

NANTr^AAT^^^CATaWACTCGAAAACCCAAT^^^CANCm^ 

TTTACCAANNTNCCOTATTTn-ANTANAANTCANNAGCTT^ 

SEQ ID NO: 7 1 1 ACATGGCCTTTTAAAAACCGGAGACAACTGGAAATTCATrGGACCTGATCAA 
CATCGTAATTTCTATTATTCCAAGTTCnTCGATTTGATrrGTCTAATGGAACAAATTGATO 
TGAAGTGGTATGANGACCTGATCCTTCANCCTCTTTTCCCACTTCCAAACAATGmT^ 
AAGCNTTGGATGTGGCCCATCCGGTNAAAATGNTTCCTAAAAmGGGAANATANTAANAATTGG 
GNATTCCTTCCGCAAGTGACCTGAAAAAAAAATCCNNTTNCTCATGGCAAGGGACAAGC^ 
NANAarCAGGTGGATITGGCrGACTGNNCTGCTAATATCAANTCrGCGNTTGAAAAGCAACCC^ 
TCAAAACAAACTGNTAAGGATNGGCCNCCCCTrmAAACTGNTTACTNTTCCTT^^ 
GNGAAACCC 

SEQ ID NO: 7 12 ACGCGGGGACCTGGGATAACGGCGGCGAGCGGACGGCTGCATTTACGGGGT 
CTCCCGGAGGGCCAGAGTCGTGGCTTACAGAAGAGACGAAATGTGGTCTGAGGGACGATATGAAT 
ATGAAAGAATTCCGAGAGAACGAGCACCTCCTCGAAGTCATCCCAGTGATGGCTACAATAGACTA 
GITAATATTGTGCCAAAGAAACCACCACTGCTAGACAGACCTGGTGAAGGAAGCTACAATAGATA 
TTACAGTCATGTTGATTACCGAGACTATGACGAGGOCCGCAGTTnTCTCATGATCGAAGAAGTGG 
TCCACCTCACAGAGGAGATGAATCTGGTTATAGATGGACAAGAGACGATCATTCTGCAAGCAGGC 
AACCTGAATACAGGGACATGAGAGATGGCmAGAAGAAAAAGTITCTACrCrm 
AGAGAGCGGTCTCCrrATAAAAGGGACAATACTTTTTTC^GANAATCACCTG^^ 
TTCTCCACACANCANTATCTGGTTCCAGTGTCAGTAGCAAGAANCTACTCTCCAAAAGGGAGCAA 
ATCATCTCTTTCCATCATTCTTNAACATANAAAGTCCCGTCGTNCTGGTGCCTCTACAAACGGC^ 
AATTGAAGGGAATTCCT 

SEQ ED NO: 713 A ClU - lUU ' llUl - ri ' l l U^ - ri ' ll - lUUUUll ' riN AAANAACCnTrTTATTCATCATCTA 
ACCAACANAGGTGGTTGGCTCNAACTCAAACTAAAATGGCCTCAAAAGGCCCACCTNGTTACNAC 
ATGACAGGGCAAAACCANAAGTAGGGACAGAGTTTACCCTCAGTTCTCTGCAAAAAAAACCAAGC 
NTNT^mTACACACAGGTGCCTNATTAAAAACTGATTGGCAATGTTCCACCAGCACANACCCANA 
GTGTGCAAAAATCCGNGGGGGCTCTGTATATNrrGTAATTCAAANAATCCTG(>IATTTCrr^ 
AAACAANCTCTTGTTTlTrGGGANGAGGGTGATCAAATTGTTTrCTO 
CAANTCTNNAANTnTNAAGGGGA 

SEQ ID NO: 714 ACTTTCACTAATTTGCTCCTGCTATCTAAAAGGCAGAGCCAGGTATACAGGAT 
GGAACATGAAAGCGGACTAGGAGCGTGACCACTGAAGCACAGCATCACAGGGAGACAGGCCTCT 
GGATAACTGCGGGCGGGCCTNACT 

SEQ ID NO: 7 1 5 ACGCGGGGTGAGAAGGAGAAGGAGCGGCTGGGAGGCGGTTTGGGAGTGGCG 
GGTGGTAACAGCACACGAGAGCGGCrGCTGTCTGCGCTTGAGGACTTGGAGGTCCTGTCTAGGGA 
ACTTATANAAATGCTGOCAATrrCAANAAACCAAAAGTTGTTACAGGCTGGAGAGGAAAACCAGG 
TCCTGGAG'n'GrrAATTCACCGAGATGGGGAATTTCAAGAACTAATGAAATTGGCACTTAATCAGG 
GAAAAATTCATCATGAAATGCAATGTTTTAOAAAAAGANGTTAGAAGAGAGACANTGATATTCAC 
AGCTACAAANACANCTTANGGANCATAACAAATNCTGGCAACATCmnTTACCANNCA^ 
AACTCNGTCATAAAAAAAGNAAGAAAAAGGTCTTTTNTCCTTTNAAAAA/^ 
TAGGATCNNGNGCAATTTNTNCTTmTGTNCCTCNCITNfNCOT 
TAACCTITCCNAAimiAATTTANAAATm'AAAAANTGGGTITAm'C^ 
TTCNNITITGGGGGTTSfATA>rrGGCCrrTrm 
AATTTCCTT 

SEQ ID NO: 7 1 6 ACTAAATCTAGTAAAGACATTTTCATACACACCAAGGGGAAAAATAGGTAGC 
ATTACAGAAA'ri-riTGATGCAAGAATATA'i-rrn-rCl-lArri-rnTi'ATCATGGTCTACGAGAACAT 
TGTTrGAGATGCX;AAATATTTCATTAAGTTGATGTTCCTTTT(nTrC^ 

GACAGGGTTTCGCCATGCTGCCCAGGCTGGTCrCAAAGTCCTGGCCTCAAGCGATCCACCCACCTT 

GGCCTCCCAAAGTGCTGGGATTATAGGCATGAGCCACCTTGCCTGAGCTGGTATTTCTTGAGTATC 

TACTTTGCAGAAAACAACCA.\TGAGTCCAGTGGCATAGCCriTrCTCmCAGTATTT^ 

TTGTCTCATTTGTTGGGAAGACCTCCAGATGCTCCGGTCCACTGAAGACACGGTNACCAATGCTGG 

AAACCATCCXrrGATCGCACAGGGTTTCTGCCCCGTGAGGAANGGC^IGCTCTGCTTGGGGNTGCCC 

AAAANATGCGGCTNAAATGNGNTGGTCNACT 

SEQ ID NO: 7 1 7 ACGCGGGGGCAGATCAGGGATCGCGATTGCGAATCCTCCGCTGAGGTGATTT 

ggatatccctagaacgttgagggcacgagtcgggtcctgagacx:aggtcctcagccagcagagcc 
acgttccitatgagcaccgtgggtttattccattatcctncaccactgacccgaanatgcccggcg 
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CCNTGGGGACTCCNGCTT 

SEQ ID NO: 718 GCl"l"ri"ll"l'lU"114U"l'l ri'rrrri l'rrri ACl'rriGGGAGGANATAACCAGTCTC 
TCCCTTCATATATATTCTnrrrATTTCrn'GTTATACCTTCCCA^ 

TAGAATGGCCATCTCCCAACATTTTAAAAAAACTGCNCCCCCCAATGGGTGAACAAAGTAAAGAG 

TAGTAACCTANAGTTCAGCTGAGTAAGCCACTGTGGAGCCTTAAGTGGTGAGGTCTTCCAATTTCA 

GAGNGATGTGTCrrCAACTTGTATCATCATTTTAGCGGAAAAACATAATn'AATrrrGGTGAAATG 

AGA^^CAT^^'CGTAACAGGAT^AGTAACAGCATTCACTGAATTTCACAC^CT^TTm 

GTGAAAAAAATGAATGGTAGCTAAAANATCAATGGGATTCCANCTCCAGCTGCANATGGAATGAC 

CATTTACCAGGACAAAAAACCCrmTITTn'GATTTACAAAGCTAACAGCAGT^ 

CTTAATTCNGACCAACTACCCACACCTTTmAAATGGCAAAAATACAAAAAAGTGTTCAAAAh^ 

mCAAAAAAAGNGCATAGTCTANGTGCTTGTAANTAAACCCCTGAAAAATTNTTGCTAGAGGGA 

GTTCANC 

SEQ ID NO: 719 ACGCGGGATGGGAATGAGGNTCTACCACTCTGGAAAATTCATGCCTGCAGGT 
TTAATTGCAGGTGCCAGrrTGCTGATGGTCGCCAAAGTTGGAGTTAGTATGrrCTACATGACCCNA 
TTAACNAAAGTNT^r^TCCACCT^^AAACATGATAAAATAATTAAAAANT 

SEQ ID NO: 720 ACAAATAAAAGTGATGGTGAGAACCTGGCTCAGGAAATGCAGTAGCAGGCC 
ATATTGCATCCAAAGGAATTACrCACAGCTGTGCTGTGTGCATrCTCTGTGGGCCTAGCAGGGAAG 
GGGACAGCCCTGTGGCAATGGGCATGACACGGATGCTCCTGNAATGCAGTCTCAGTGACAAGTTG 
TGTGTCATCCAGGAAGAAGCGGGTATGAAGTGATTATCGTCCCAACTTTGTrGGTAACTAIXT^ 
TCATCCriUll'l'GGGGTCAhn'CCTNGTGCrrrTTATCANAGACAAAAGAAOT 

SEQ ID NO: 72 1 ACri-rri'lU'ri'llU'lTTCITITGAmCCTCAGGACCTTAGAGGGAAAACAAAC 
AGTAGCAGCTAATATTCTCAAGTATATTGCTGCTTANAAAGATCCTCAGGAACAATTAGCAGCAAT 
AAGCAAGCCTTTGAAAAGATCTGAATKnrmCCTGAAATATrTACGATACACAGGTGC^^ 
CTGAAATCTGTTGGGTCCTCCTTTTAGGCAGTCTCrGTGGGCAANAGAGTGGGACTTGCNAGGTGG 
ACAGCrGTGNGGGATCCTGGGGAAANGGAGTTTTNAAAANGGGTGGCTCAGGGCNTGNTAAAAA 
NCChmTTGGNANOGATTNGTATTGAAAAAATNAAAACCCCTTTGGTAAGGCN 

/ SEQ ID NO: 722 ACCGGAAGAAGCAGCTGGCAAAGCAGCTCCCTGCACATGACCAGGACCCTTC 

AAAGTGCCATGAGTTGTCTCCCAGAGAGGTGAAGGAGATGGAGCAGTTTGTGAAGAAATATAAGA 
GCGAAGCTCTGGGAOTAGGAGATGTCAAAOTCCCTGTGAGATGGATGCCCAAGGCCCCAAACAA 
ATGAACATTCCTGGAGGGGATAGAAGCACCCCACANCANTGGGGGCCATGGAGGACAAATCTGCT 
GAGCACVW^GAACTCAATATTCCTGCTATTGCTGCAAACTGAGCATGAAAGAAGGTAACCCAGC 
CATNTATGCCTAAAGGGCTTGGCTArrATAAACTGTGGCACN 

SEQ ID NO: 723 ACAGCGTTCACAATGCTGGTATTAATCAGCTACATATTTTGAACATCTACTGT 
TACTGGATACCAAAGAAAGTGAGrrATTTAANAATCTrCCATTCTTGTTATAAGCriTCKrA^^ 
CAGTAACTTCTCANANGCTTTNCAANAAGCm'AAGTTCTTGCrrrGAGANA^ 

SEQ ID NO: 724 ACGCGGGGAGTGCGTGCCGCTCCGCCGACCGAANAGGCTGGACATGACACC 
AGTGGCATATCACGGCCATGGGGTCTCAGCATTCCGNTGCTQCTCGCCCCTCCTCCTGCAGGCGAA 
AGCAAGAAGATGACAGGGACGGTTrGCTGGCTGAACTAGAGCANGAAGAAGCCNTTGNTCATTCN 
TATATGNTGAATNCCCGGGAGA 

SEQ ID NO: 725 ACCTCATCGQTATCCAAGGCCCCGACTATGTTCTTGTCGCCTCCGACCGGGTG 
GCCGCCAGCAATATTGTCCAGATGAAGGACGATCATGACAAGATGTTTAAGATGAGTGAAAAGAT 
ATCTCCATTCTCTCTAAAGTAGTGGTTCTTTTTGCCCTTAAACTrAAAr^^ 
CGCCTCCCGGGCCCAAGTGATCCTCCCATCTCAGCCTCCTGAGTAGCTGGGATTACAGGCGCACAC 
CACCAATGCCCAGTTAGTTmGTGTITnCATAGAGACAGGGTCTCACCATGTCATTCAGGTTGGT 
CTTGAACTCCTGGGCTCAAGCAGTCTGCCTGCCTTGGCTTCCXIAGTGCTGGGATTACAGGCGTGAG 
CCACCGTGCCCGGCTAAAAAGTATrmAAGTrCTGCATATTGCrTATTTCACTTAACACT^ 
GAGATTGTTTrATATCAATACATATAGATATGCTTATTCrrGTTGACAGrrGCATAATTTTCCATTA 
AATTGATGTATCATGGGCAGTrAACCAAGTTACTCGTTTTACTCTTANCATAACTrrGGGGG^ 
AATGTGGATGTTTTGGNGGTTAAAGCTATTAAAACAOGGGTrrTTGCCCKG 

SEQ ID NO: 726 ACOCGGQTATAGAGGGCTAACTCAGGCATTGTCTTXSTTTATTTGTAGACTGGA 
TTAAAAACAACCTGTCCTGTTTTGTCAGTTCCCAGCTTCTTCGTTTAGAATA^ 
AAGAAACGTGCTTGTCrCTCTATACCCGCAGAATGAAGTTACTGrrGTTAAAACrGGATT^^ 
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TTTTACTAGGTTCCGAAGAGTCCAGATGCTTGGTAGATGTTCAATACKTGATTTn^^ 
ATGIGrrCATTTTAAAATCCTCCTTAACATTTCTAG 

SEQ ID NO: 727 ACTTACATGTGTGAACACATATAAAGTGCCAGGT TTAC AGACCCTGGCTCAA 
GGACAGTCTAGGATGGGAAAGGAGGTAGGGCGAGAAGAAGCACATATITTCTCCCTGGTGCTTCA 
GCCTCACCCTATCCAAGGGACAGACATATGGGGTGTGAGAAACCCATCCCCAGGTCCCAGCCTTC 
AGGACTGGAGTCCriTrGAGTCTGGTGGAGTCACAGATCCAGTCTTTGGGGGACACrrGGGTCTGTC 
TCCmTGAAAGCCCTGGAAAGGTGGGAGGTAAGAAGTAAAGGGAGACAGGTCCCTGCTAGAAG 
AACTTGACGCCTCGGCCATCGCTGACAGTGATGATCTCGGCCTTGTOCTCCTGCTGTAGGGCCTGC 
AGAGCCCGCAGTAGAGTGGCITCATCCAGCCCGTGGAACTCCTCATCCTCTGTGTCTTCCCCATTA 
GTCAGTTCATACAGGGTAAAGACGGAGTTGTTCTGCCACTCCTGGAAACCCACTGATAGATGAGTT 
TCCCCCATTCTTCTGGCCTCCGCCACATGATCAGGAAGCrGGACTTGCTCTTA TCCA ACCACTCGA 
NGTTCCCTTTCTTCTrAATTCTCTAATACAATCTGGATCGACTCCACAGGAAGC^ 

SEQ ID NO: 728 ACCACTGTArTGATTAGCCTGTATGTAGCAGGGCTCCCTTCATTGCATCTGAG 
GACTTGTTTTCTTrrTCTrTATTrrTAATCCTCTTAGT^ 

TACCCAGTTTGTGGrrTTTTGGGAGAAATGTAACTGGACAGTTAGCTlTrCAA^ 
TAACCCAA 

SEQ ID NO: 729 ACTTCAGGATTAGGAATTTGGGTrrGTCATAGATGTATTCTCTGGTGAGGGTG 
GCTGGGATATACCTGACCCACCATCTTCAGAAGGACCCATGTCAGGTCTGACCATTGGGAGCAAA 
GarATGTTCACACTGACCTAATGCAGAGTATGGAAGCATTGGGCTGGTTATACATTTCTGTTTOT 
AGATTTATCCTCCGCCTCTGTAGGCATGGACAACCTTTAATCAGAGCATCTA GAGTGG CCTCTTGT 
TTATCCTGAAGATACTGATGGGTCTTGTTTTCTGTTAGTCTGTTTTGTAATATTC^^ 
CATGGGGAGGCTTAGTTTGTCCAGTCCTTCCATGCCCTTCTATCCCAGATTACCTAAATGTTCCCTT 
CTCAGGAATTCTGTCTCATCAGTTCTTCACAGTGAGAAAAGAGGCTAGATGATGGTGTGGGGGGTT 
GGAGTTTTCTTCTAATACCGAGGGTTCCTGGCTGTGAGGAAACAGCCACATGTTCGTCATGATTGA 
GCTGTGAAGTCnTCrrGGACCTGTTGCTGAAAATAAAGTTAATTTGTTTGAGGCNTCT 
AGGTGGAAACTATTGAAGTTANCTNACAATCACANCATAGGTTCTGATCCTTGGAAAGGGGGTTG 

SEQ ID NO: 730 ACCTCCn^rTTCTCTrCTATTTTTAGGAANAAGTTATAACAAGTnTAAA 
CAATTCTNAAAAACAATAGGCTTTTAAAAAATAAGACriTGmTACCAGAAACAAG 
TTACATAACCATTTTCATATCACTACTCATTTCCATNATTTACCAATTCATCTTTGATGCA^ 
GAAACmTTAANCAGTCACTAGACACCTTGTTTTAGAATCTGAAGAAATTATTATCCACCAACAGG 
AATCTAATGATATATATATTTGCATATATTCAAAAATTNCATGAGGGAAAAAGGTNKTA^^ 
CCTAGTTNTrrTATACCANATATOGATATTCCTAGNANNAACAACCAAGGbnSfTTNAAATN 
NNAGNTGAAACTGNTGACTTATATATGAAGCTTTrrTCCAGNACTT^ 
TNNATTTTATTTCCCTGCTTGGGCTAAATTAATCAATTATTTTATTANCAAGAG 
AGGANAAATmATTTCCTT^ANNANAACCANTAATATN^^^^^ 
NTNAAAAATCX™ANCNTTTAANCNTATNGCATNAATNCCCAACn^GN<>JT>^ 

SEQ ID NO: 73 1 ACCTCCTCTTCTCTTCTATITrrAGGAAGAAGTTATAACAAGTTrrAAATATC^ 
CAAirCTGAAAAACAATAGGCTmAAAAAATAAGACTTGATTACXAGAAACAAGTAAT^^ 
TTACATAACCATTrrCATATCACTACTCATTTCCATTArrTACCAATTCATCm 
AAACAATTAAGCAGTCACTAGACACCTGTTITAGAATCTGAAGAAATTATTATCCACCACAGGAA 
TCrAATGATATATATATTTGCATATATTTAAAATITCATGAGGGAAAAGGTAATAAACTATTCT^ 
TTATTTTATACCAAATATTGATATTCCTAGTCAAAACAACAAGGAATTAAGATCTTTCTCCAGGTG 
AACTGCTGACTATATAGAAGCTATTTCCAGCACTTTCTTCTGGGGATTAAAAATGAm 
GCTGGCTAAAATGATAAATTATATATTACAAGTAAGTCCTTCAGAAGTOAATATTAATTCCTTAAG 
AGACCATAAAATCTATnTTAAAAAArrCTCTTAAAAACTGAAAACAATCCrTC^ 
TGGCTTAATCCCTACTTGACCAAAGAAAAAAAAAAAAGAAGT 

SEQ ID NO: 732 ACTTXTTTTTTTTTriTITr^^ 

CTTTTATTAAAGATCTACTCATACCATGGCTGAAATCATCTATTATTGTTGCTAGTTAGC CT 
CTATAGTTGGGTAATGTTGCCTTGCNACTGTNTTTNCCATCTCTCCCA^ 
AAAAAAAATTANTTGCTCCAANTTTTNAGGCCCANGGGAGGCTCT 
GTCa^CmCAQGAAGGGTGATCTTGNGTATAAAATTITCATACTTAA>mT 

SEQ ID NO: 733 ACCAGGCTGGCGACAGGTGCTACCAGGAGTGGGCTGAGGGGAGAAAAACTA 

tctcccactcttttggcccaggcaatgtcaacgaortccacattccctggcccacrtot 
ccccaggttcggctctgtataaggaccctcccctcccaaccccaaccccagagtgcagtgcaaatc 

AACCAACAATTTACTGGTGGAATGGCAATCAAAGGAAACAGTTAAACACCAAACAATTTCrr^ 
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GCCAAAAAATArrmCATGGAGrrGAACATTTrrCGAGTGTGri 1 1 1 ilCAAGTGTAAAAGC^ 

GACATrrTOTrCAAACAGAAGCAGCATCTAGGAATTCrGGCACTTGGGrrCTAGGGGGTTACA^ 

ATGCATCATGGATTCnrcrCCCTCGTATTTAAAAAGGCCTCGTGTTTCTATTCCT^ 

ACACCTGCTAGCTCTCCCCTCTAGCGGACAGTGGGTGGGCAGCCAGCCTCCTGGTTAGATTGGGCA 

ATGCCAANCAAACATNCC^ATTCACCTGOTGGGCrTGCTTTCTGATO 

GCANANAAAGAAACTTACAAAANCNCACCACCAAAGGNANCT^^^^ACGGGGACCCTGNaW 

AKTTGNTG 

SEO ID NO: 734 CAGCAGAGATATATGCCTATCGAGAAGAACAGGATrTTGGAATTGAGATAGT 
GAAAGTGAAAGCAATTGGAAGACAAAGGTTCAAAGTCCTTGAGCTAAGAACACAGTCANATGGG 
AATCCANCNAGCTAAAAGTGCAAATTCTTCCCGGAATGTGTGTTGCXTTTCAACCATGTCT^ 
CAAATTOGAATCOTCAATAANGTTGCCAGATATTTNCCTTCAAAACCTTGT^ 
CAmOTTNTTTTAAATGGGTGGCAGAAANmCCOTTAAGAGNAAGmCATTG 
mCATGGCCTTCNCTGGCTGhn'ATTCCCTTATATGATGCrGNNACCCTTTATGGGACNGAAr^ 

GAAAC 

SEO ID NO: 735 ACATTGAGACAGAGCTAAAGAAGAGGAAAGGGATCGTGGAACATGAGGAAC 
AGAAAGTlAAGCCAAAGAATGCAGAGGACTGTCTTTATGAACTTCCAGAAAACATCCGTGm 

tcagcaaagaagaccgaggagatgctttccaaccagatgctgagtggcattcctgaggtggacct 

GGGCATCGATGCTAAAATAAAAAATATCATTTCCACGGAGGATGCCAAGGCCCGTCTGCTGGCAG 

agcagcanaacaagaagaaagacagcgagacctccttcgtgcctaccaacatggctgtgaattat 

totgcagcacaacagattttatcattgaggagctcaacgcgcccatacggagaaacaaagaagag 

cccaaggcccggcccttgagagtaggtgacacggagaagccagagcctgancggtccctcctanc 

cocaagcgtcctgctaacgagaaggtaactggatgact^^tnntittgagaaot 

gactaggcngtncct^^cccgggccggtct^wggccnangtccaaaaaaaaccttt^^^ 

GANTGNATTCCTTTACNGGO^TAAbrrGCTATCCNGTTCCAAOTCAAACCAGm 

SEQ ID NO: 736 ACGCGGGATAGACGGAAATGGAGAGCTGGATTTCTCCACmTCTGACCATT 
ATGCACATGCAAATAAAACAAGAAGACCCAAAGAAAGAAArrCTTCTAGCCATGTTGATGGTGGA 
CAAGGAGAAGAAAGGTTACGTCATGGCGTCCGACCTGCGGTCAAAACTCACGAGTCTGGGGGAG 
AAGCTCACCCACAAGGAAGTGGATGATCTCTTCAGGGAAGCAGATATCGAACCCAATGGCAAAGT 
GAAGTATGATGAATITATCCACAAGATCACCCTrCCTGGACGGGACTATTGAAGGAGGAG)^^ 
GAGAGCCrCCCCTGGGCCTGAAAACTTGGAGTAATTAATTITITITAAAA^ 
GGAGAGATGGCAAACACAGTGGCAAGACAACArrACCCAACTATAGAAGAGAGGCTAACI^^ 
ACAATAATAGATGATTTCACCATGGTATGAGTAGATCnTrAATAAAAGATTTGTATTGAT^^ 

SEO ID NO: 737 ACATAAACTTCAAAGAGATGCTGTAGAGGArTGGACTGCAGTTnTCCTCATA 
GCCAAACAGCTGGAAGCCAGTTCCCAGAAGACCAAACTGCATGCCCCAATCGCCAAGGTAATtTA 
TTCTrATTAOTGATGTCCrAAAGCTrCTTrGAGAriTGCTATAAAAm 
CAAATGTCCAACATGAAATrnTTGGCAACArrAGGTGAACTGAATTCAACCACAATOT 
GGGAAGTCCAGAGAAAAGTTCACTTTTTAATCCATATriTGAGCCATCTrCAATrACTO 
CACrGTCmGrrAAGAGCTCTCTGrrTATTTTGAAATTTACAGTCCT^ 

SEO ID NO- 738 ACCAAGTGAGTGGGAATACATATTCTAGITAAAGCATTTGTGTCTAGCTACAC 
ACCGCTAACAAAGTTACTTAGTTATCAATGTAGGATTCTTAAGGAGCrrrAAGCTAA^ 
TTAGTCACTTANCTTATTTTGTATCTTTTCACTTAGGAAGATm 

SEO ID NO: 739 ACCTTCACCTGCTCCAGTGATGANAGCCTCCAGCAACATATAGAAAAGCACA 
ATGAACTCAAACCTTACAAATGCCAGCTCTGCTACTATGAGACCAAGCACACGGAGGAAC^ 
AGCCACCTTCGGGATGAGCATAAGGTAAGCCGTAACTrrGAGCTGGTTGGACGGGTTAACTTGGA 
TCAGCTGGAACAGATGAAGGAGAAAATGGAGAGCTCCAGCAGCGATOATGAGGACAAGGAAGAA 
OAAATGAGCAGCAAGGCTGAAGACAGAGAGCTGATGAGArmCTGACX:ACGGGGCTGCTCTTAA 
CACTGAGAAGCGTTTOCATGTGAATTTTGTGGACGGGCGTTTTNACAAGGGC^ 
AGACATGTGCmANACACGGGATGGCATXm'AATGANACCAAGCNNGGTGAGCANATANNA^ 
CNCCCANAAAGAGATCATGGAGAACCrGTTTAAAATGCCCTTCNTOTANGGAAANNGAAGATTGA 

CTANGCCCTTG 

SEO ID NO' 740 A crnT r ri rrrrn - fi Ti'i 1 1 1 1 1 aacakaaaggtataaagtttattaacatct 

TTAAAAAAAAAAAAAAAAAAAGATGGGCXGGGCNTGGNGGCTCACNCCTGTGATCCCAGCAC^ 
GGGAGGCCAAGGCGGGTGGATCCnTTCAGGTCAGGAGTTTGAAACCAGCCTGGCCAACNTGGNGA 
AACCCCATNTTTACTAAAAATACAAAAAANTTANCCAGGCNTGGNGTCGCACACCrGTAGTCrc^ 
AGCTACrCGGGAGGCTGAGGCAAAAAAATCGCTTGAACCTGGGAGGNGGAGGTTNNAGTGAA^ 
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AANATCGNCCGTTGACTCCAACCTGGGCAAAAAGCAAAAA NTCCAT CTAAAAAAAAAAAAAAAA 
ANGAIGGGCCNNACCNCNGCTACTGAATATTGATATAGNTCCTTTThn^ 

SEO ID NO- 741 GGTACGCGGGTGAAGTATTGCTAATATTACCGNGGTrTATGAACTATGTTCAG 
\vrrGAAGAAAATCCTAACrrTCAGrrAGAGGTrAGTGACGGGGTTCAGGACACCCTACACAAAA 
TACAGCACTTTGACATATTGAATArmAAGCTGAAGGCATTTGAGGAAArrGCAGAAGCAGGAA 
GGTGACICrGACCTTCTGCCTGCTGTTCrCCCCAGAAGCAGCCATAAAACCTOGGAAG^^ 
OACCrTCCCOTAAGTAGATCATAAGACTGTCATGTAAGAGGTGCTCTCCTGGCACCCAGAGAAA 
AGGAGCATCCTTACCrCAAAAGCACAGGGACACAAAGAGGAATCTAAACAAACAGGCCTCTCAA 
rrTTCCCCCAGTTATTACATrTAGCITGGTCACACriTrGCCTATGACAriT CT 
OTCTTCATCAAACCTACTATNAAAAACArrCAAGTCAACTGTTTNTTTGGGCCr 
AGCCCCNThn^GGTNGGGTAAACTTTATTNAANAAAmGGasrmC^ 

SEO ID NO- 742 GGTACTGTTGGTTAAATGACAATTTATGTGGATnTGCNTGTAATACACAGTG 
AGACACAGTAATTTTATCTAAATTACAGTGCAGTITAGTTAATCrrATrAATACTGACrCA^ 
GCCmAAATATAAATGATATGTTGAAAACTTAAGGAAGCAAATGCTACATATATGCAATATAAA 
ATAGTAATGTGATGCTOATGCTGTTAACCAAAGGGCAGAATAAATAAGCAAAATGCCAAAAGGG 
GTCTTAAnGAAATGAAAATrrAATTrrGTrmAAAATATTGrrTATC^ 
TATAOTAAGTrrTTTTAGAAGACAATTTrCATAACTTGATAAATTATAGTITGrr^^ 
TrGa-CITAAAAAGATGTAAATAGATGACAAACCGATGTAAATAAriTrcTNAAGAAGCT^^ 
AOTGTrrATACCGTGGAACACACCTACOTGAAAAGCAGAAAATCGGTGCCTGGTTTGCnTCl 
CCNCNTTATTTTTNGGATTGGGGGCNATTTCCCATNCAAAAAATGGGGGCC^ 

SEO ID NO: 743 ACTGGGATTATATAGGCATGAGCCACTGAGCCTGGCCCANAAGCGTTrrrCTC 
AAAGGCCCTCAGTGAGATAAATTAGATTTGGCATCTCCTGTCCTGGGCCAGGGATCTCTC^^ 
AGCCCCTGCCCCTCTGTTGOAGGCACAGTnrTAGAATAAGGAGGAGGAGGGAGAAGAGAAAATGT 
AAAGGAGGGAGATCTITCCCAGGCCGCACCATTTCTGTCACTCACATGGACCCAAGATAAAAGAA 
TGGCCAAACCCTCACAACCCCTGATGTTTGAAGAGTTCCAAGTTGAAG GGAAACA AAGAAGTGTT 
TGATGGTGCCAGAAAGGGGCTGCTCrCCANAAAGCTAAAATTTAAri-lUl-ll 1 1 TCCTCTGANTTCT 
GNACCTTGGCCGNGACCACCTN 

SEO ID NO- 744 ACTTTTCTTTCrrrGCTGGTAATTTTATGGAGCAGGTTAA GAAGG CTGCT 
GTTAGGATAAACTGTATACCAATAATGTTGACAACCTGTAATGAGTGTTGCATTITAOT 
CrmCCITCCTACCTrGATGCCAGTAATCTATAAGGGATCrrTATAGTTrGAATGTAT^ 
CTTCAGTATACTTrAGTrCTACTTITrrATTTGACTCACAACCATTOT 

GTGTTrrAAAAGCCTGAAGTCAGTGAGATGAAAnCAACATCAAGAATTrGAAGTAACTTGTAAG 

GAAAAATAATATAAAGATACCATTGGGGCAGTGGCTCACGCCTGTAATCTCAGCACTTTGGGAGG 

CTGAGGTGGAANGATCACTTGAANCCAGAGTTTGANACCANCCTGTGCAACAAANCAAOANCCCN 

CTITCAAAAACTTAAAAAATANCTGGNTGGGGGGGTGCTCACCCCNANrrCCNCrr^^ 

CTGAGGNNNNAAAACCCTTNNGCCChfNNAGGCCATC 

SEO ID NO: 745 GGTACTCCTATAAAACrCATTCTTGTGTGGTGGCTGTGCTA TAGAOTCT GTGT 
ATTGCTGTTCATATTCGGAGrrCTGGTTTrGTTITrcCCTrAAAACCTG^ 
TGGGGGGATTCAGAACTCrrGTTTCCCATTCCATAGCACCTGACATTAriTCAAGTm 
TTAAGGTGTATATTITATITriTrTATTGGCnTAGTTGriTlTrGr^ 

TCACTGTrGCCCAGGCTGGGOTGCAATGGTGTGATCrTGGCn'CACTGCAGCCTCCACCTCCCGAGT 
TCAAATGATTCTCCTGCCTCANCrrCCTGAGTAGCTGGGATTACAGGTGCATGCCACCATGCCCCG 
GCTAATmATATTTrrArrANANACCGGAmCCCATGTrAA(XAGCmGTCTCGAAC^^ 
CAGNGAACCTGCCCCC^^"GCCTCCCAANGGNTGGGATAACAGCTGANCCCCATNCCGGCCT^ 

GCTTANNTTTTAAATATCCCCCNANN 

SEO ID NO- 746 A CiTr i l ' i ' n ' m l n n 1 1 1 I NGCCrGGAAATGrnTTAATANAATrGGTCTAG 
TAATCGrrCAGGATTTCGGTGATGGGCCCTCCCTGTCTGGACACTGCCAACCCACAGCrGGAGGGG 
CACrrAAGGCACGTCATTTTGTGArrAGAATTACACAAAArrTGATTAATATTATAGCTGCA^ 
TAACATACACAATmCACTCATAATrrAAAATArrrTGATGAAAT^ 

ATCAATGGTA Crj - JTn - 1 1 1 1 1 1 1 i 1 1 1 L 1 1 1 1 IGCGATGGAGTCTrGCTCrGTTGCCCGGGCrGGAO 
TGCAATGTCACAAATCTCGGNTCACrACAACCmGCCrrCCGGGTTCAAGCGATT^ 
ACCTNCTGAAAAACTNGGACTA(XGGGGCCC(XCCCCC>rrCGG>n-AAA>rmGNAr^ 
AAAANNGGAGTTTACAATTGTTGGGNAAACTGGNNTTAAAANTOWGACCTAAANGANCCCC^ 

CriTGGN>rrCCNAANGGCTGGGAAAAANGNTTNCCT 
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SEQ ID NO: 747 ACrGTGATGTGAAGCTGTTGCTGGTAAAATTAGCATTTGGCCTCCTGAAAGGT 
CTCTCAAATGGGATITmCAAAGGTTGCCAAGACrrGGACTCCCATTTTGTGCCAATrGCA 
TTCATGTTACCAGATGTAATGTGGTAGACTAATGTTGACTTAGGTCTGTGAACTGCAGTATACCAT 
CCTGCAGCTAGCATGAGATTGACGACrGTATCAACTGGAATTACOTCTGCCACAGCCATTGGAGTA 
GCnTrATGGCCCGAGGAAACCCmCCCAGTCGCAATAATGATTCCArrAGGTCCATTTATATTAT 
CAACCCAACCTGGGAAAGGCTCCTGCCAAGTTGCTCCCACAATGGAGGGCCTTATGATGGCAATG 
TTCGGGTTCCTGCTCTCrrGCTGCACCACCATmrrTCCAANGNCTTGGTGTA^ 
GGCCCATCTCTGATCAACCTTGGGNGTAATCTCGNCA'mAAAACCATNGCTAACCCCTCAAGGGG 
AATNAAGGA' l " lU " l ' 14U ' lG GCTCCCCCGGGCNCGGGTANANAAOTCn'CCAAGNGCTTTANG 

SEQ ID NO: 748 CGAGGTACGGGGGTCTTGAGCGCAGAAACACTTACITITCCCCCTACCCTGCT 
CCTCCTCCTCCACAGCCGTCTTTCTCTTTGCCTCAGCCACrrCCTTCCTTGGCCTCACCCTC 
GCACTGAAGAAGGTAACCGGGTCCAGACCCACGCGGCGCCAGTTCTCCGGCGGGAAGGAAAACC 
GCGCAGAGAGGCAGCAATGAATGTGGATCACGAGGTTAACCTCTTAGTGGAGGAAATTCATCGTT 
TGGGTTCAAAAAATGCTGATGGAAAGTTAAGCGTGAAATTTGGGGTCCTCTTCCGTGATGATAAAT 
GTGCCAACCTCnTTGAAGCATTGGTAGGAACTCrrAAAGCTGCAAAACGAAGGAAGATTGTAACA 
TATCCAGGAGAGCTGCTTCTACAAGGTGTTCATGATGATGTTGACATTATATTACTGCAAAGAATA 
AATGNGGGTTACATATCTTTATGTACCCCGGCTNNATTGGATCCAATACTTGCCAACNGAACCAGT 
TACCCTANGGATACAACGCAATNCTANTCTAGAATCCATATTAACCATAGGGTTACCAACCTCATG 
GTGGACAGGAATCCNATQONCAACCCTNTTTAAAGGTCCTTGGTCAACAATAAAGNCTACCNGNC 
TTAGTTAAAACGNGTAATCNAGGTGGGTNTATTACTTAAATCTCCTNACTNCCCGNGGCC^ 
GGAATTA 

SEQ ID NO: 749 ACGCXjGGATTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAA 
AAAAAAAAAAAAAAAAAANNACACGGGGTNAATTACACAGTTCTATTAAAACCTNATGCCT^ 
ATTACATm-AATTTGAACTCTNAACTTCATGTTACAGAATGCTTTAAAGAT^ 
NTTAANAAAAThrrATANATTTGTATGTCAGTTTATACTTCAAAAATCCATATATnGTCATAn'A^ 
TTTTTTAGAANCCTCCTAArTGGATAACTAGATGGTATTTAAAATGAATGCCCAAAAATOT 
CCCCGCCTGTTTACCAAAAACATNACCTTTAGCATCACCAGTTTTAGAGGCACCGGCTGCNANTGA 
CACATGTTrAACGGCCCCGGNCCTTNGCCGGNACCACCTAGGGCNAATTCACTCANTGGNGGGCG 
GTAC^ANGGGATCCCANCTCGGTCCAACTTTGC^^^AACATTGGNATANCTNGTTCC^GTGTGAAAT 
TGGT^TCCGTTCCAATTCCCACAANATTCNANCNCGAAC^^*AAA^n^NTAANCCTGGG^^^C^ 
AGGACTAC™CTTTATGNGTTNCCTACTTCCCTn^\A^ 
CCAOCCCGGGAAAGCTTTTGhriTNGGGCNTTnmTm 

SEQ ID NO: 750 GGTACGGGGGTCTTGAGCGCAGAAACACTTACTTTTCCCCCTACCCTGCTCCT 
CCTCCTCCACAGCCGTCTTTCTCTTTGCCTCAGCCACnTCCrTCCTTGGCCrCACCCTCCCCAGT^ 
ACTGAAGAAGGTAACCGGGTCCAGACCCACGCGGCGCCAGTTCTCCGGCGGGAAGGAAAACCGC 
GCAGAGAGGCAGCAATGAATGTGGATCACGAGGTTAACCTCTTAGTGGAGGAAATTCATCGTTTG 
GOrrCAAAAAATGCTGATGGAAAGTTAAGCGTGAAArrTGGGGTCCTCTTCCGTGATGATAAATGT 
GCCAACCrCTTTGAAGCATTGGTAGGAACTCTTAAAGCTGCAAAACGAAGGAAGATTGTAACATA 
TCXAGGAGAGCTGCTTCTACAAGGNGTTCATGATGATGTTGACATTATATTACTGCAAGAATAATG 
NGGGTTACATATCTTTATGTACCCCNGGCTCAATTGATNCCAATACTTGGNCCACGGAACAAGGTA 
NCCTAGGGATAACAACGCAATNCTNTTT^m4AAAGTCCATATTAACAATANGGGTr^AC^ 
CATGTTTGGATCAAGGA 

SEQ ID NO: 75 1 GGTACGCOGGAACnTGTAAGATGCAAAGAGGTTGGATCAAGTTTAAATGAC 
TGTGCTGCCCCTTTCACATCAAAGAACTACTGACAACGAAGGCCGCGCCTGCCTTTCCCATCTGTC 
TATCTATCTGGCTGGCAGGGAAQGAAAGAACTTGCATGTTGGTGAAGGAAGAAGTGGGGTGGAA 
GAAGTGGGGTGGGACGACAGTGAAATCTAGAGTAAAACCAAGCTGGCCCAAGGTGTCCTGCAGG 
CTGTAATGCAGTTTAATCAGAGTGCCA'rrrrrn i IGTrCAAATGATTTTAATTATrGGAATGCACA 
ATTTTTTTAATATGCAAATAAAAAGTTTAAAAACrrrm 
NTTTTT^ANTT^^T^^TATT^^TA^^^TGGTTC^ 

NCACANTTGGCGGCCTTNCn^OTGOOTCCNAGCrrGGGACCAAACTNGGGGNAATNATGNG 

GCTNGTTCCCTGGNGGAAAr^GTT^TCamx:NAATTCCCAAAANA^TCAANCCGGGA^n^ 

TINNACC 

SEQ ID NO: 752 CGAGGTACTGGATGTGGTTGCCCCCATrTGTGTGTGTGGTTGTGTGTGTGTGG 
TTGTGTGTl'GGTGGCCACAGCTGAGCCTCTGTCACCAGAGAAGGCTGAGGCCCCAATGGCACACC 
TCAGAAACCTACACCCCGAGGCTGGACGGCrGGACTCCTGAGCACAAGCTC CCTCTCGCACCC TTT 
GCCAGACAGTTTGTCTCCAATTTCAAACTGACX:TAAGGCTCTrACT CCTGG ATTTm 
CCTTCTCCCAGCCAGTCTTCGGGAGGGCATGATTAGAGAAGTGCTCCTTTGCTGATGGAGGAGGG 
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GACCTAAGGAAAGAAAGGTGGATCCAAGTGCCTTCTCTCTAArrGATCCCTCCCA(XC AAGT TTCT 
rrGCCTCTCTTrCTTm'ACCAAGGCATGGriTITACTCT 

ACTNGTNAATGCACCCCATAATGGACANTTTrAAGTCCAAGCCCTNTNAATAGCATTCAAAAN 
CTGGAATCCCAAGCACCCTCTGAAAANCn^IAGAAAATNGCCTTrCAACnTITGGG^ 
GTTGAACTTTTGGCTGTTCAAAGGTGCNNAATTNTCnTrGANGG^ 
T^n^AAAAGACTNGGACCT^mAAGGGGGGGGTCCr^T^ATTCC™GACNCCC^ 

SEQ ID NO: 753 ACGCGGGGGGTCTCATTGAACTCGCCTGCAGCTCTTGGGrrmTGTGGCTTC 
CTTCGTTATTGGAGCCAGGCCTACACCCCAGCAACCATGTCCAAGGGACCTGCAGTTGGTATTGAT 
CTTGGCACCACCTACTCTTGTGTGGGTGTmCCAGCACGGAAAAGTCXSAGATAATTGCCAATGAT 
CAGGGAAACCOAACCACTCCAAGCTATGTCGCCrTTACGGACACTGAACGGTTGATCGGTGATGC 
CGCAAAGAATCAAGTTGCAATGAACCCCACCAACACAGTTTTTGATGCCAAACGTCTGATTGGAC 
GCAGATTTGATGATGCTGCTTGTCCAGTCrGATAAAACATTGGCCCTTTATGGTGGTGAATGATG 
TGGCAGGCCCAAGGTCCAAGTTAGNATACANGGGANAGNCCAAAANGCTTCTATTCAAGAAGGA 
GGNGGCTTCTATGGTTTTGACAAAAATGAANGGAATTTGCNAAACCTNCnTTGGGA^ 
CNNATGCTTNGGTNACAGGNCCGCriWmAATGACTTTAAC^^ 

SEQ ID NO: 754 GGTACCGACCATAGAGCAAGAATCAAGATTCTGCTAACrCCTGCACAGCCCC 
GTCCTCTTCCnTCTGCTAGCCTGGCTAAATCTGCTCATTATITCAGAGGGGA/^ 
AAGAGTGATAAGGGCCCTACTACACTGGCTTTTTTAGGCTTAGAGACAGAAACIT^ 
CAGTAGTGGCTTCTAGCTCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTTCT^ 
CCCCTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATCTCCAGCCTATGAAACAGCTGGGTC 
TTTGGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTCTAGACCC 
CTTCAGCTTCTACACCCTrCTGCCCrCTCTCCATTGNCTGCACCCCACCCCAGCCACTCAAm 
hTITGGTTTTNCTTTGGCCATANGAANGGTTACCAGTAGAATCCnTGCTAGGGTG^ 
ACATTCCTITAATAAACCATGGGGACrGGGAATATATAGGCATGAACCCTGAGCCTGGGCCTGAA 

ACGGTTTTTTA 

SEQ ID NO: 755 aCGCGGGGGACGAACACOTGACGCGGTCGGGCGGACCACTGCAGACTGAGC 

ggtggaccgaattgggaccgctggcttataagcx5atcatgtttctccagtattacctcaacgagca 

gggagatcgagtctatacgctgaagaaatttgacccgatgggacaacagacctgctcagcccatc 

crgctcggttctccccagatgacaaatactctcgacaccgaatcaccatcaaga aacgcr rcaag 

gtgctcatgacccagcaaccgcgccctgtcctctgagggtcccttaaactgatgtc rmct gcca 

cctgttacccctcggagactccgtaaccaaactcttcggactgtgagccctgatgccl 1 1 1 igcca 

gccatactctttggcatccagtctctcgtggcgattgattat gctt gtgtgaggcaatcatggtgg 

catcacccataaanggaacacam-gactttttttrctcatattitaaat^^ 

aagataaatgat^cgnnnnnannnnn^i^in^mn^^^ 

ANGGGA 

SEQ ID NO: 756 Ac rrrrnTriTi - i ' i ' n ' i 'i' n 'i'i'i' i u 1 1 r GCTTTrcAAAGArnTACTAAATCAT 
tttttaaacaaaatatacattaaatctcagatttacagaatatagaaataatitatccaaaga^ 
ttgcatttaaaattcgtaatattgcacccaacagtatgtctitgacacatttgca™ 

CCCACAArrrGCAAAAACAGGAGAGAAATCTGAAACTAACAGAATTACACAAGCTAAGTTTTCTG 

TAAAAAAAGAAAAAACTTACAATTTmATrTACAAGTTAAGGAAAAGTrGTAAACGT^ 

TrTACrrCCACAGAATAAAAAGCCrrrACATrcmTrATCATACCTAGAAAATGAACATb^ 

TGNGATTACCTCATAGGGAATTCAACAGGACTGATATTGNGAACATTCACAGCCCAATGGTAAAA 

AACAGAATTCTCGAACTOTGGGAACTAGNGNGGAAAAGACNCCGAATGAAAAAAAGNCTGGAAA 

pj^r^jYj^QCTCTlGmAGmTCmAATACCTAAAGTnm^A^^ 

CANCTTAGGAAGGGGGTGCANAACXnTrTTTNGAATANACAGNGAAAAAAAGGGGGh^ 

CCCCGAAAAACTTGNTTTGGGGGCCAAANTNGTTCCAT 

SEQ ID NO: 757 GGTACTATAATGGTCCCCATCTTAAmGAAAGCOTTTGAGAATCTTTTAGGA 
CAAGCACTGACGAAGGCACTCGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGTGGCT 
ACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCATAAGAGAGAA 
GATTCCTITGAAAGCTTGGACTCTTTGGGCTCGAGGTCATTGACAAGCTGCTCCTCTGATATCAC 
TTGAGAGGGGGGCOTGAAGGTrnGAAAGTGACACAGATTCGGAATTTACATTCAAGATGCAGGA 
TTATAATAAAGATGATATGTCGTATCGAAGGATITCGGCTGTTGAGCCAAAGACTGCGTTACCCrr 
CAATCGTTTTTTACCCAACAAAAGTAGACAGCCATCCTATGTACCTGCCCNGGCGGGCCG 

SEQ ID NO: 758 GGTACTATAATGGTCCCCATCTTAATTrGAAAGCGTTTGAGAATCTTTTAGGA 
CAAGCACTGACGAAGGCACTCGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGTGGCT 
ACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCATAAGAGAGAA 
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GATTCCTTTGAAAGCrrGGACTCTTTGCK5CTCGAGGTCATTGACAAGCTGCTCCTCT 

TTGAGAGGGGGGCGTGAAGGTTTTGAAAGTGACACAGATTCGGAATTTACATTCAAGATGCAGGA 

TTATAATAAAGATGATATGTCGTATCGAAGGATITCGGCTGTTGAGCCAAAGACTGCGTrACCCTT 

CAATCX}TTTTTTACCCAACAAAAGTAGACAGCCATCCTATGTACCTGCCNGGCNGGCCGT^ 

CAATTCCANCCACTGGCGGCCGTTCTATGGTTC(>rACCTGGTACCAACTTGGGGNAAACATC 

AACrmGTTCCGGGGGAAAT>rmCCGNTACAArrCCCACAACTTCCANNCCGAACCrrA^ 

AAACCNGGGGGCCCAAGAGGGGGCNCAACTNCATTNN 

SEQ ID NO: 759 GGTA CrnTi - i - riTi 1 i 1 i 1 i rrn iTi 1 1 AG^r^ATTCTTGCTTAATCAT^TAAA 
ATTATTTCTGGATTrACCTACrCACCATATGAGCTGGGG'mCTGACCTCCCrrcCTC 
CCTCCTTGTCTGTTCCTTTTTCTAGCTCTGTTTCCCTGCATCTTC 

ACCCCTITCCCAGAGCTCTGTrCCCCTTTCTTrGCCACTGAAAGGAGGTTTGCTGTCCTT 

CGGACACCGTTTGTTATTAAATAATACCTCTGGACACATCTCCTTGrrGTAAATCAATAAGC^ 

GCAATACAGGGAAAGACATACGGATTGGAAAAGTTGT 

SEQ ID NO: 760 GGTACTTTrAATGGTGGGAATTTACAGTAGAAGCATCCTTTGCTGAGTT ATAC 
ArrCCmATCAATCTCTTTTGATACAACAriTAAAACAAGTAGCTrCAAGAAACC^ 
GAGGATAGTArrrCTAAATAGCATTCAAGAACAGAGTATTATTGCACAOATCTGAAGA TCAAA AA 
AAAGCTCAAGGAAATACAGATCGGAAGTGCTGATGAGTTATArrrATIXJAAAACCCAACTm 
GGAAGTGCTAAGATCAGTCACCCATGTGAATAAGAAGCCAGGAAAGGAAAGATGGGGAAGCCCA 
GATCACCAGGCTTCTATTAAGGAGGAAAGCAACAGAGGAAACAGTGAAGGGGAACAGAAGGGGG 
TAGCAAAAGTGTrACAGAAAAACCGGACTGGATAGACAAAACTGCAGAAGGGGTTGTTGGGGGA 
GAACTGAAAGGGAAACCAAATCCTGACATGTCTTAAGTNAAGAAGQNNGTT AAGAA AACAATnT 
TTATNGCCTTGCACArrCAANNCTTTAANTCCCC^^ 
AAAATTTTTTTGCCCTTCCGAAAAAAAAArnrr 

SEQ ID NO: 761 GGTACTGGGArrATATAGGCATGAGCCACTGAGCCTGGCCCAGAAGCGTTTr 
TarCAAAGGCCCTCAGTGAGATAAATTAGATTTGGCATCTCCTGTCCTGGGCCAGGGATCTCTCT^ 
CAAGAGCCCCTGCCCCTCTGrrGOAGGCACAGTnTAGAATAAGGAGGAGGAGGGAGAAGAGAA 
AATGTAAAGGAGGGAGATCnrrCCCAGGCCGCACCATITCTGTCACTCACATGGACCCAAGATAA 
AAGAATGGCCAAACX;CTCACAACCCCTGATGTrrGAAGAGTTCCAAGTTGAAGGGAAACAAAAAA 
GTGTTTGATGGTGCTAGAAAGGGGCTGCTCTCCAGAAAGCTAAAATTTAATTTCTTTm 
GTTCTGTACCTGCCCGGCNGGCGCTCG 

SEQ ID NO: 762 ACTrTTTTGTTTTATTCTTTCTCTAGCTTATCCCTGCACi^ 
TGAAAAACCACTXTCCrGCTTTCCATTGTTATAAATTCrAAGOT 

TGACTGAATCAATTACAATTTATGGGCTAGAGCCAAATAGGTTGAAGACAATCATCCANACAGAT 

CAATGGAATAGGAATTTCATTGGAAATGTANAACACTTTCCCAANAAT GOCATG ACTrTCT 

rrrGAGAAGAGTTTCATbn'GCTGGACCACATTTrAGCTITNAKrGr^ 

NAAATTTITANCTACANGNGGCCCNCACTTrrACGTNGCCTACAACCTGTAGGTm 

hWGTAATimrCCTTTGGTlSrrrCGCTrAAAGATATCTCCNCATTCGATGGGTACAbr^^ 

ACrATAGGGGGACCTCTTAKTACCnTrmGGTTATAANTAGNGTTTATNAANTTC^ 

T^ITC^^ITNAAmGTNATTTNTNN^nmAT^ 

TGCNTGAANATNTTACCNCTGNG^WC^^^CCCCCTCTNGG^™AANTCTm^ 

SEQ ID NO: 763 GGTACCATAGTCCXIAGCACTTGGCCCAGGGTCCTAGACTGCrGGGTAGGTCC 
TCAGAGGTATCTGAAGTCATGTCTACATTGATACAAAGAAACGTATGGCTCATCTGTAACAATACG 
TAGAGAAACACAATTAAGAACAATAAGGATCACAGAACTCGAATGGATCATTATAACTTCAAGAC 
TCAGTrrAAGTTACCAAGAAATGCACTOTAOCCCnTrrCTCACITrrATATTCAGATCAG^ 
CCTAACATGTTCTTAAATGACCACTTATGGTCATTTAAGTGGTCAATGGGAATATTGCAGC 
ACTGCTGGTTTGTTTAAACGT 

SEQ ID NO: 764 aCAATAGATGCAACGCCAAAATGAGATGAAAGAGAATTCAGAATAAAATTC 

cgtcccn'ggagcatatrccarrggctttctctgctcctgcitccgctggtatct^ 

gaaaatgatgatgottacagigactatatttagaagaatgacgagcagtatgagccagaacgagt 

atccgtaactgtgggtcgttccrrtactggtggttgccgggtaaagcatttggaacaactcttcx^ 

agagttggttggactgcgtgttcgccacaaacagtatcatggtcacaaaaacgaaggatgctgca 

aactttttctttggttcrgcaagtccgtgactcaattcttcactactctccccacgaa^ 

tagtgatgaaaatgctcccarrtgaagccanannhnttaanagaattgnccrcggccgcg 

CNNT 
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SEQ ID NO: 765 AC T lTllU' m ' ll ' r ri I' l 1 1 1 1 ril lU 1 IN ACTNGTTTGTTTGGATGCTOT 

GCAGACTCATGTCAGGAGGCTAGGAAGGGATTCCTTGGGGACCACTGGATGCTGGTAGTTAAATG 

CCAGGAGGCTGAATGGACCTGAAGATGGAGGAGACTCTGCAGTCnTGGTCAGCCACCCTTGGGT 

GOTGCCACCCTGCAC^^^^AGCAGGATTGATGGTCrrcTGGATITGTAGCTGTGACCGGTCATGGTC 

GAATGCTCGGTGGTTTGCACTGGAGAGGCCCACATGGTGGCNACTGAGGCCCTNTGGGGTGAGGT 

TGGCTCATGATAGCT^^'GAAAGTTGATGGCACAATTGAAACAAGGAGGTGNAAGTTTTGGAACTT 

TNCCAGGGTCCTTNCANCCCCXjAN>rrAAACCm"CCCAACAATTGCANCrr 

TTAGGTTTITmn^AGTTGGGgTTAGGGACTNGNGGGCAAANAAAAOT 

AANCCTTTGNNAGGAACCCTTTITnmTriTGGNGNA/^ 

CTANCCNGCTTGNAAGGNATTAAACCTTTThrmGNAANGNAATTTTCCN 

SEQ ID NO: 766 CGAGGTACTGTTACCCGGACATACCTATTCOTCACAAGTTCTGGTTCTTCAG 
TGCrGCTTACrATTTTGGTAACTGGGCCTTTCTTGGGOTATTTTTGATTGGA^ 
GTAAAGGGAAGAAATCGGTTArrGAAGGAGTAGATGAAGATTCAAGACATAAGTGATGATGAGC 
CCTCTGTCTATTCTGCTTGACAGCCTTCTGTCTTAAAGGTmATAATGCTGACTGAATATCTGNT^ 
TACATTrTTAAAGTATTAAACTAACArrAGGATTTGCTAACTAGCITrCATCAAAAAT^ 
GGCTATAAGACAACTATATTTTATTATATGTTTCTGAAGTAACCATTGNATCATAGATTAACATTTr 
NAATACCATAATTATGCTATGTTAATATAAGACTACCrGCCTnXjTGAQGGAATGTTTGTG CAAAA 
TTTTNCThrrTATNGNTAATAGNGGTTAAATTGAATAAAAAATCTTCCAG>^ 
GNCACTTTTGGAAACATAATAAATTTTTGGATThWNGCAAAAAAAA^^ 
CNCNGGCGGCCTTCTNAAGGNCAATTCCCCCCTNGCGGCCGTTTTTNrGNTCC3>^ 
TNAKmAGGGCTAANTTTCCNNNNAAANG 

SEQ ID NO: 767 ACTCTATAAATCTAGTGGAAACAnTCTGCACAAACTAGATTCTGGACACCA 
GTGTGCGGAAATGCrrCTGCTACATTTTTAGGGTTTGTCrACATTTTTTGGGCT 
TAAAGGAGTGCAGCAATAACTGCACTGTCTAAAAGmGTGCTTATTITCrTGTA AA^ 
TGCATATTGAAATTTTTGTTTATGATCTATGAATGTTTTTCTTAAAAm 

AGA'll'lTCTl l 'AATAAAATGCCAmGTGCAAGATTTCTCAAAGATTAGGT ATATA TTTAAATG^ 

AGAGAAAATATTTITATGGGAGAAAAATCAmGAACCATGAAATTTCATCTTTmAAT^ 

AGTACTTTmrmTTTITITrTm 

NCATGGNGGATATNCAAGTTGTCCCCGCACrrm'AGTTAAAAAACTTrCmGCCC^ 
TGGGNCCCTGNCCAAAATCAATTTCCTTCCATGGGAGAAATTTTTTNGAANTN^ 
NAATCNGGTATTCCACCTTTAANAACAATTGGNTNGNTCNGCCTTTGTAAAAAmGTAAT^^ 
AATTAAOTNTCNACTTTTTTTNGTCAAAAAG 

SEQ ID NO: 768 GGTACGCGGGGATTTGTGGTGAGATTCTCTCCCAGGCCACANGACAITTCCTG 
CTCGGAACCTTGTTrACTAATTTCCACTGCnTrAAGGCCCTGCACTGAA^ATGCAAG^ 
CCGGTGGTCGTTGTGACCCAACCTGGAGTCGOTCCCNOTCCGGTCCCCCAGA ACTC CAACTGGCA 
NACAGGCATGTGTGACTGTTTCANCGACTGCGGAGTm'GTCTXrrGTGGCACATnTGTTTrc 
GCCnTGGGTGTCAAGTTGCANCTGATATGAATGAATGCTGTCTGTGTGGAACAAGCCGTTGCAATG 
NGGACTCTCTACANGANCCGATATGGCANTCCTGGAACTATITGTGATGACTATATNGCAACTNTT 
TGCr>nTC™AATITslNACCTNCCCGGGCNGCCCGmAGGGCGAATT^ 
GTTA^^TGTGGGAT<XTAGCTCGGTTCCAAGCTTONC^mOTCAATGGTCCTANCTT^^ 
GTGTAAATNGTATNCGNTO4NAATrCCNCCNAThrriTITATCCCGNAh^ 
TTGGGGTGNCITAANGNGGGANGCTTCCITANATTATTTCGhrmGGGNAACTG>rrCTCC^ 
GTNGAAANCTT^JNTGTCAATCTmATATGANACGNC^^^IACGCN^ 

SEQ ID NO: 769 CGAGGTACTTTTAAGAAAAGTCCAATGTTACAAAATCAAATGCTTATATTCA 
OACTGGCACACrTTTTAAATAAAAACTCCATACACCTCAGACATATAGCACACATGGAGACAACT 
TACTAATTGTGTGTAAGTATGATACAATXSAATGAGACTGCCTGAAGTC TAGTA ATCAAAGCATGCC 
ATAAGGTGAATGATTGTGGTTAAACACAGCAAAATAATTGTCACAAAACTTTCAAGGCCTAACAA 
ATTAGAATTTrCCAATAAAAAATATATATTTmCAGATGTTAATAAGACATATCAGTAGAGACAA 
AATTAGGATTTTGAAGTAATGCAATAAAAAGATGTTGGAGGCAAAAAAAAAAAAAAAAAAAAGT 

SEQ ID NO: 770 GGTACin'ri-l un U"ri U - l l'i U " l " riU ' i ' i ' rii rill l TTCCGGGTTNGNCTGATT^ 
rrATrrAAAAAAATGGAAAAACAAAAGNGCATrmCATTCAATAAATGTTCCATCCrrATTTA 
TTTGTTGCCGAAAGNGAAGTCCATGACTTTANAATGATAGCAATTTATCAACCAAAGAATCCGTNT 
TCACACCGTITCAATAACTGCAGCAATrTCCTTOAACTGTCTGTAAA AArrN TOAAACTGTGGAAT 
CGTCATTTCAAAGCACTTGGTCTTTACTTGGCCTGAATGATCTGCCACTm 
TAAGGATACTTAAAANATCTGCAAGTGTm'GAGCTCACAGCCATACCCAGTTTCCACTGAAAATCT 
ACAAGCTGGTTOGNGACATCGGACTTAGCATCCAACGGNGGhn'CGCTGGACNCCTCCATG^ 
CGTm'GGGACTTGCGGCCGGACTNNAAACCCCCCGTCCTGCX;CGGCGGGCGGTCNAAAGGNGAAA 
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TCCACCNACTGGNGGGCGTTCtATGGAANCNAACTCGGGNCAAACTTGGNGGAANNATGGCAA^ 

CTNGTTCCTGGGGAAAAGGTOTCCCT(XAATTCCNCAAATACAANCCGGACCTAAAGN^ 

GGGGCCCANNAGGGACAACCCAA 

SEQ ID NO: 771 GGTACCGCTGAAGACACCCAGAATGAAGGAAAAAAGACAAAAAAGAATAAA 
ACAGCrmAGTAACGTTGGAAGAAAAATTAGTCAGCGAGTTATrCACTTATTTGATGAGAAGGGC 
AATGATTTGGGAAACATGCACCGAGCAAATGTGATTAGACTTATGGATGAGCGAGACCTGCGACT 
GGTrCAAAGGAACACCAGCACAGAACCTGCAGAGTATCAGCTCATGACAGGATTGCAGATCCrrCC 
AGGAGCGGCAGAGGCTGAGGGAGATGGAGAAGGCGAACCCCAAAACTGGACCAACCCTGAGAA 
AGGAACTGATTTTGTOTCAAATATTGGACAACATGATTTGGACACAAAGACTA^ 
CAGTGGATTAAGAAAAAACACCTAGTCCAGATTACCATAAAGAAAGGGAAAAATGTAGACCTGTC 
ANAAAATGAAATGGAGGAGATTTTCATCAATACTCCAACTATCCTGGAATACTCNNTCrC^ 
GCCCAGCITGTCANGAGGAAACCTTNATGGGGGTCOTCGGCCTTGGCCAAAATGGGGGAAGGCTT 
TAAGAACrCAGAAACCCGGAAGAACCCTTTACCAANNCCTGGAATGTTAGGGATNAAGTTTCCT 
CNAATTTATAAAAAACCTCrnTAAAA 

SEQ ID NO: 772 ACTGTATATCCATATGGCACATTTATGACTTTGTAATATGTAATTCATAATAC 
AGGnTAGGTGTGTGGTATGOAGCTAGGAAAACCAAAGTAGTAGGATATTATAGAAAAGA TCTG A 
TGTTAAGTATAAAGTCATATGCCTGATTTCCTCAAACCrrrrGTTTlTCCrCATGT 
TATTTTTATCACAAACCAAGATCTAACAGGGTTCTTTCTAGAGGATTATTAGATAAGT 
ATCATTAAGCACGGATCATGCCACrCATTCATGGTTGTTCTATGTTCCATGAACTCTAATAGCCCA 
ACTTATACATGGCACTCCAAGGGGATGCTTCAGCCAGAAAGTAAAGGGCTGAAAAAGTAGAACA 
ATACAAAAGCCCTCGTGTGGTGGGAACTGTGGCCTCACTCTTATTrGCCTTCATTCAAAACAGTTG 
GCCCTTTNCATGACGAGGATCTCTACAGGTAGGGTAAAATCTTTTCTGTGCTATCAGCCAGAAATA 
GTTTTGGGCTTGGATATGArrTNAAACANATTTGGCTGGCACCANOCAAAAC^ 
CTNTTCCAAACCCTTAGAATCTCCNCi ri ITl 1 AATANTTrCCGGACCTGGGCACNCCCTANGGGA 
ATCCANCCTGGGGCG 

SEQ ID NO: 773 GOTACGCGGGATCTGGGAOAATATTTAATGGAAAATCGCTTGGTTAAAACCT 
GACAanTTAACAGTGAACAGCGTTCTGAGTGTGGACGAGTAGCCAGTGAAGATAATGAATGTCG 
AATGTGACTGACTAGCAGCrrCATTTTGAATGAGGGTCGCTGTCTGCCCATTGATAGAGGCCAGAT 
TGTCTTGGAAGTTCCAAAGTTGCAACGATTTCTGGCTAGTGC CACG AGGTTTACrTGACT^ 
TGAAAAGCrGATAAGAAAACXATCCAGAAAAAAGCTCTTCGTTTTACAAACATGAAAATAAAACA 
TGTAATTTTCCAAAAAAAAAAAAAAAAhfNAAAAAAAAAGGT 

SEQ ID NO: 774 GGTACTTTATGAATTTGGGGTAGGTAAAGTTTGTATTTTATCTTAAACATGTT 
TTCTATGATGAAAAGGAACAAAATTGTAAAAAATGAGGATCTrCCCTCTAAAGGTTTCAAAGCOT 
TAGAGOACATGCAATTAAATGTTGTTACACCnGAACAATQAGCCTCTT GAGTT TGTAGGAAGGGC 
AGACCGGCTCCATTACCAACAACTITGGGGTAGAAAGCACAGCTCTCCTCTmACCCAGCACAAA 
TGCAATCCTGATTATAAAACTATTTGTGTTTCTAAATACAACCAAAGGAAATCrrAGAGAAACATA 
AATTAGAAACCTCTTrrATTAAGGGGAAACAACAAAAAAAGGTGCrrrriTAAAAAAAj^^ 
AAGAAAAGAAAAAGAAAAAACAAGCTGTAAAACCATGAAGTTAAAAGAGCTGGGTCTGGAGCAG 
ATGTGCATAATAACTAGTTAAGTGCTCCCAGCANGATCAGAAACAGCTTTGGGGGACCAGANGAA 
TATGGGTTGGNGbaGTTCANAAAAGCNCCAGTTATACTCITCCATNAAAATGATGGGCACAGTGT^ 
C 

SEQ ID NO: 775 GGTACCCAATGAGGAACCTAAAGTTGCAACAGCTTATAGACCCCCAGCTTTA 
AGAAATAAACCAATCACCAATTCCAAATTGCATGAAGAGGAACCACCTCAGAATATGAAACCACA 
ATCAGOAAACGATAAGCCATTATCAAAAACAGCrCTTAAAAATCAAAGGAAGCATGAAGCTAAG 
AAAGCTGCAAAGCAGGAAGCAAGAAGTGACAAGAGTCCAGATTTGGCACCTACTCCTGCCCCACA 
GAGCACACCACGAAACACTGTCTCTCAGTCAATTTCTGGGGACCCTGAGATAGACAAAAAATCAA 
GAACCTAAAGAAGAAAAAGa^rATCGNACAACTGAAAGAACAAACAGCAACTGGAAAACAG^^m 
GAANAAAA>n'CAGTTGGGAGAAAATTm'GAAAGAAACAGCCCTTCTTCAGGANGCT 
G0GAATTGGGGTTTTTTAAAAGATTCACCGGAAAGCANGGTTGmX3ACCAGAAATCAGTN 
CNCATTCTTm'GGTAAACCCmTGNGTNCCCAGAANNTTCCTTGNGCCCNCNC^^ 
TTTAAnTAAACC 

SEQ ID NO: 776 GGTACGCGGGGAGGCTCGGACCGGCCCGCGGAGCTGCTGCAGTCCTTCGCGC 
CCTCCTCGCCCTCCCCACCGACATCATGCTCCAGTTCCrGCTTGGATrTACACTGGGCAACAGTGG 
TTGGAATGTATClXiGCTCAGAACTATGATATACCAAACCTGGCTAAAACTTOAAGAAATTAAA>^ 
GGACTTGGATGCCAAGAAGAAACCCCCTAGTGCATGAGACTGCCTCCA GCACTGCCTT CAGGATA 
TACTGATTCTACTGCTCTTGAGGGCCTCGTNTACTATCTGAACCAAAAGCrm 



108 



wo 02/29086 



PCTmSOl/30732 



GCCTCACK:ACTTCTOTCrrrGCTAGACCCTGTGNrrTaGCTlTA^ 

AAirrGAGAACCTACCCGACATTTTCCAACATACTGACCTCTTTCCATAANCCCTTTCCACTGCATG 

GGAGGTrrAAAACTGGAANTATGGTGCrAGATTATTAAAACNATGACTTTTA>rrGANATT>^ 

ATTACATONn^CCNNAACTNThnsIAACANTNGNTGC^ 

SEQ ID NO: 777 GGTACCTTGGCTGGCATTATTGGAATGAGGTTCTACCACTCTGGAAAATrCAT 
GCCTGCAGGTrTAATTGCAGGTGCCAGTTTGCTGATGGTCGCCAAAGTTGGAGTTAGTATG'ITCAA 
CAGACCCCATTAGCAGAAGTCATGrrCCAGCTTAGACTGATGAAGAATTA AAAA TCTGCATCrrCC 
ACTATTTTCAATATATTAAGAGAAATAAGTGCAGCATTTTTGCATCTGACATTTTACCT 
AAGACACCAAACTTGGCAGAGAGGTGGAAAATCAGTCATGATTACAAACCTACAGAGGTGGCGA 
GTATGTAACACAAGAGCTTAATAAGACCCTCATAOAGCTTGATTCTTGTATATTGATGTTGNCTIT 
TCrrTCTGCATCTGTAGGTAAATCTCAAGGGTAAAATGGTANGGGTCAACTTTNAAGGCTCT 
CCCCAmCCCTGCTCTGAGGAACAGTGGGAAAAAAAGTCnTrrAAGAGATrTACNATATCNGN^^ 
NTTTGCTCTCrTAAANCCAANACTGCTTTGGAATTTTTTTAAGGGAA^ 

SEQ ID NO: 778 CGAGGTACAGGATAATATACTCAGATATTrrrAAAATAAACTACTTAATAAT 
AAOAAATTAGCCATACCACATTGNTCCArrrGCTACAAGAACAAATrGGCAATGAAGACTATTTA 
AAAGAAATGCTCANCTCTACAGAGGGTGGTGGCAGGCAACACTTTTCCATTACAGAATAACCTCr 
ATTCTTCCATGATACATATTCCTGTGGAAAAACTTGTCAGGGCCCAGGGATGAAAAATANAGCTTG 
NCCTAATTAGCTAACTGTAGGTTCACTTAACATCrn'GGGAAGGACCCAAAAAATCTGGCCArr^ 
TTTCTrAAACATCTGCAAGCTGCAAAAATrCCTTAGTCCTCAGCTATAGTITCrGCTAN^ 
NAGCrGGGACAGCrNCACTGTGACTCCTCCTCAGCTATGGGGTGGGTGCTAGTCATCANAGTCTGG 
GAATGTCTCATAAGAAGTAACCACGGGAGCCTTTGGATGCATAANAAGCCGATGCCCCAGGTGGG 
AAAATCCGGGTAGGAACCCCAGAATGCCAATGGNTCAGCACNGGGAACGGCCCGGCTGGCCCC 

SEQ ID NO: 779 GGTACCAACATTTATTAATCCTCACAACACCCCTGTGAGGTAGGTCAGTATGT 
CCnTAGAGTCGAGAACTGAGGCAGAGGTCAAGCAAACCrGCCCTGGGCCACAGAGCAGCAGAT 
GAAGGGCCTAGACCTGGATCCAGAAGCTAGGGCTCTCGGTCCAGCATTCATCCACTGGTGGACAT 
CACATGGGCTTATrnTACCAGCGAAGGTTACGTGAAGGACAAAACGCACTCAGCCAGCAACGGA 
AACTCAACAGTTCAAACAGCACTGGGGAACATGTCAGTTAAAGAGACGAAACGCTGACCAGCTCA 
TGAATGAGGCAAGACAACATGCGGCTGAGGAAGTGTGGAATCATCACGACTGGGGATTAGACCA 
AGAACGGGCGCTCAAGAGGGTTCAGGAAAATGTAAACAAACTAGGAACCCATCCAAGGGGTGAC 
AGGCCCAAATGCCTACGGCTCCAAATGGTAGAAAAAITAGAAAAAGATAGNGGAAAAATNCCAC 
CCCCCANCCITACOTTTAACTAGGAATGCCATAANGGGNTAAGGTCATTCCCCTTAAAAG 

SEQ ID NO: 780 ACGCGGGGGCTCAGAGCTCGGGGGCGGCGCTCAGAAAACATCTGGAGAAAA 
TGACCCATTGGTTTCATAGGAACCCATTAAAAGCCACAGCrcCTGTGTCTTTTAATTACT^ 
AGTCACTGGCCCTTCTGCrrCAAAAATATGCAATGACnTGAGGTCATCCAGGGCACGACTCOT 
ACTGTTCACTGATTTGAGCTGTAATCCAGAAATGATGAAGAATGCAGCAGATTCATATTTCTCACT 
TTTACAAGGTTTCATAAATTCTTTGGATQAATCTACCCAAGAAAGCAAQTTACGAT ATATTC AAAA 
rrTCAAGTGGACTGATACATTGCAAGGACAGG'rTCCAAGTGCCCAGCAGGATGCTGTTTITGAArr 
AATTTCCATGGGATITAATGTAGCrrrATGGTATACCAAATATGCTTCAAGACTGGCTGGAAAAGA 
AAATATAACAGAAGATCAAGCAAAGAAGTrCATCGAAGCCTAAAGAATGNAGCrGGGATTACAG 
GCNCCACCACTACACCCAGATAAATnTGNATTTTAATAGAGACGNG 

SEQ ID NO: 78 1 ACACTTTAATATCnTCCCACCAAAGGCGCGACAGCACTCTGCCAGATCITGA 
CTGGCCTCTGGTGGGCCATGGACGATGATCAACTGTCGTGGTTTCATCT GATT AATGAl i i i i i TAA 
TGGAATCCCCATCAGAGCGTCCrrCATAGTCTATGTAGGTAACCCGGGCTnTATTTCAATAGACT 
CTGTTGTAGAAATACATTrAGTAGGAACATCAGATAAATCCTGATCCATAGGrrCATCTCCATTTG 
TCAAACCAGATTCTAAmGCTTTTTTCTTCriTCAGTAGCTTGAAGCTCTGGCACT 
TGGrTTGATAATCTCTCCATATTCATCCCATTTAATTCTTTCTTCTGGGGCAGGAAACATAGGATAG 
GACTrrmGCCTGTTTGAAAAAACTTCCTrrACGACTGCCTTCACCTTTCATC 
TCGCTTATGAGCTOATGGCTGGNCAATATCTTCCT CAATA TCACTCTCATCACTGGNATCTATATCT 
GGCTCTIT^GACTGNTCAAGCTTTTTTGCAAGCTT^rrm 

SEQ ID NO: 782 GGTACTTTAAGAAAAAlTrAAGTGACAAATGTTAGAAGATGATGGATTGAAA 
AATATAAGTAATTTGGCATATrGACCCTATATAAAAAAGTGGGCATTrTGTAACATTCCTTCAGGA 
AGAATAGAATTGTATGCTTTTTTCTGCATGrn'GTGATCACTGTGAGTCTCAGATGATTA^^ 
TTTCAATGATrTTrCAAAATAAAAGTCAATCAGCATrGrrATTTATCATTAAGGTATTC 
GATTGCATCGTTGAATGTGTCTTTAATCCACTACAAAATGTGCnGTrATTGATAGGATCATGTTGT 
TAAATTGTAAATTTTCAAAATrAATTGATGTmAAAAATGTTGAGACCAAAGTAGTATAGAAGTA 
TGGACTTAATTTAATTGTAAAATTATTAGAGGTATTTGTTGTAGAGCrrTATATATTAAAGAGAGG 
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TATTGGTGCATATAAAAACAGTAATATTATTGGTATGCnTGGATTCrrGCATTTCAGGGTAAGTAA 
ACCCTGAAGAAATCTTATTTAAGTATAGCTACNGCTGCCGTAGCTT 

SEQ ID NO: 783 GGTACCAGGAAGTAGGACAAGTAATTTCAAAAATATAAAGGTGTTTGCTACT 
CAGATGAGGCCGCCCCTGACCITCTGGCCAGAGAGACATTGCTGCCAGCCAGCTCTGCCrrTCCCAT 
CATCTCCTTTCAGGACCGTCCCACACCTTTTACTTGCTCAGTGCTGTCTGAAGATGCAGTrGCT^ 
TGCAAACAACAGGAACACCAGTTAAACTAA'TTAGGAAAAGAGGGAGATTTCCAGGCCTGGGTAA 
CTATATACTGTGACCATTGGAGGTAGAGACAGGTCTCAACAGTTGGAACCAGGAACTCTGCTGTC 
AGGTTGTGAGTTTTGTTTCTCTTCCAGCrmCACrrGTGTGQGGGTCTTr^ 

TCl'ATCACATGGCAGCrGACCrCTCACGCTCCACTCTACAGCTTGGACACCCAGTAGACCCTGAAT 
TTCACTCTCTCTAAAAGGTTCTGAGGGCTCATCCTGGGCCAGGGGCCCTCCTGTGCACTGTTAGCT 
ATGGCCACGGAAGCCmCAAGCTGNCTGGAACTTCAGGTTGACCTGTT 

SEQ DO NO: 784 GGTAc rnTi - nTi ri 1 1 i m r v i n 1 1 1 1 iGGGAATAAGrrATAcmTAnTT 

TCANAAACAAAAATGACAAGNGGCAACTTGCCTTTGTAAAAGArrAAAGAGTATCAGAGTAATAA 

GCTATCKrrCATAGAGACAGAAGCCCGAGAAACACTATTATACAGCTTTCAAAGGAAGTCCTATT 

AAATAAATCTCACAATAATATACTTTAACTCAACAAAACAAAAACCACAAAAAATTATACAGAA^ 

TCATAAAATGGTATATGAATGAAAAAGAGAAATCmAAATCCATACACTAGAA GTTCT CTATTAA 

AATCAAGGAGGCTCAAATCTGJl'l'Ul-lU'rAAAAAATATTTATGGATATAATTGCGTTTTTCTAAATG 

TTTCATTAGTAATACTCAGTCITGTAAGTCrGGTTCCATAGTAATCTATAATGTAACTACTTCCATC 

TGAAGGTCCAACAATCTGCATTGAAATAAGAATGAAATCAATTAGGCTCCCAATTCCACAAAACC 

CTCAGGGCAAACTTTAACAAACCCAAAGCAGGGTATCCANGNAAA 

SEQ ID NO: 785 GGTACACTTAAGTTGAAGACACAACACTTGATCTGA AACA AGAAGTTTGTGC 
CTACTCAACAGCriTGAAAGAGCACTTCCCAACGCTGCTAGTAGTCTTrGTTTTCTTCAG 
CAGTGGTGTAAACATAGCTCACrGTAGTCTTGAATTCCTAGACTCAAGCAATCCTCCCACCTCAGC 
CTCCTGAATAGCTATGACTACAAATGTGCACCACCACACX:CAGCTAATTAAAAAAAAAAAAAAAT 
GTAGAGATGGGGTCTTGCTATGnGACCAGGCTGGTCTCAATCrCITGGCCrCAAACAGTOT 
ACCTrGGCCTrTCAAAGTGCTGGGATGACAAGCATGAGCCTCTGTG CTCAG CTCTTCTAATAGTTT 
TATAGTTTCAGGNCTTACATTTAAGTCGTrCATCCATATTGAGrrCATTTrTGTATGTGGNGACAGA 
CAGAAGNCAAGTirCATTCTrATGTGTATGAAAATCCAGTTTTCCTAGCACATTTATTGAAGAN^ 
TGGCTTTCCCCATAAATGrrCTGGGTCTTTrGTCAAANAhfNAKrGGNTGNAAAACCAAA 
TGACNCCTATCGGTTCATGGGCTAGGGGNGGGCTTAAGNCCACCCGCCN'lTrGArrrATACTTGGA 
GGnrrTGAAGCCGGNGOGGGGCCCTAdT 

SEQ ID NO: 786 GGTACTr N il - lUl - ril ' l ' ri ' l U'innMU-Ll-14'GGAATCATTTTAT<nTrAAT^^ 
GTAGTAATTrAAAATATGAGAAAAAAAAGTCAAATGTGTTCCCTTTATGGGTGATGCCACCATGAT 
TGCCTCACACAAGCATAATCAATCGCCACGAGAGACTGGATGCCAAAGAGTATGGCTGGCAAAAA 
GGCATCAGGGCrcACAGTCCGAAGAGTTTGGTTACGGAGTCrCCGAGGGGTAACAGGTGGCANAA 
AAGACATCA>nTrAAGGGACCCTCATAGGACAGGGCGCGGTTGCTGGGTCATGAGCACCTTGAAG 
CGTTTCTTGATGGTGATTCGGTGTCGAGAGTA'nTGTCATCTGGGGAGAACCGAGCAaGATGGGCT 
GAGCAGGTCTGTTGTCCCATCNGGTCAAATTTCTTCAGCGTATAAACTCGATCTCCCrGCTCGTTG 
AGGNAATACTGGAGAAACATOATCGOTATAAGCCACCGGNCCNATTNGGNCCACCGTTAATCTG 
NAGNGGTCCCCCCGNGTCTGCCCGGCGGCCGTm-AAAGGCGAATTCANCCCCTTGCGGCCGNANT 
NATGGANCCACCTNGTCCCANCTTGGNGNANTATGGCANACTNTTCCGGGGNAAATGTTCCNTCC 
ANTCNNCAANTNNACCGGAACAAAANTAAACCTGGG 

SEQ ID NO: 787 GGTACAGTrCCAAAAATAATTAATTmTAAGGGAAl n l i CAAGACAAAAG 
GCAAATATTrroTTTCAAACCTACAAGCAAArrCATTCCAAGAAGTGAATATAG TAGTAA AACATA 
AAGACTArrGCAATAAAAAGTTTCCAGGCCAGTGACTTGTrrAATTACATTCACATITr^ 
ACTTTGCCTITCAAAGCAGGAGTATTTrATAAAACTGAAATAAAACTTTAAArrGTCAAGCT 
CrrAAAGTTATGTAAAATCTGATTTAAGTTTATTrCATTrCCAAGGAAAGGACCCTGAA AGGA GGG 
GACAGGGAGCTTCCTAGGTCTGTTGATAAACTAACTGATGCTTTGTCAAGTCTTCCAAGTTTrC^^ 
TGTAAATAATACAACTAACAATGTAATTTTGCrGGmCTAATAGAGAATGAATGAATATATTA^^ 
GNATTACATrGGTrCATGATAAAACAGNTn-CCTGATACrGGGAAATCTTTCrCTCCTO 
GQ^AAANACCTCTAAATAATATmAACTGGAGGCCGGNGGCACTGGTGACCGTTTTGGCAATrn 
GNANAAATGATANATmCTGCCCGCGGCNCAAGGNGAATCACCCCTGGNGGCGTCTATGGACCAC 
CCGGNCCACTGNGGATATG 

SEQ ID NO: 788 ACAGCnTCTTCGTCCTCCATGCTAAGAGATGTAAAA GCTTA AGGGTCAAAC 
AATACCAATTGTATAGGCTTCAAAAACCATCTAAGTrAGGGCATTCTCTA GTTTT AOCTAAGATAC 
ACCTGGAACACTGACAAGTCATCACTTACATAGAATAATGTGAAGTAAAlTnrrGAAAAATAAA 
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TTTTAGTGGAACAATCCTGAAGGATAACACCAGAAGAATAGCAGGTrACCAGTAAGGTGTCAGCC 

AATTTGTTCCAGTCACrnTrGAATCCATGTTCTATAATCTAAAATTTATTCTCm 

GAGCTTCCTATCATGTCAGTATCTATGITATGAAGAAAAGGAGACTTANGTGAGATGTTTTTATTr 

ATCGCAACTGCTGCATTAATTGCCrAGGACCTCAACAGCTTCATGAAAGTCTGGGAAATGTTCATG 

CATAAGGGTATTGCCTTAGCTGACTTAAAATTGGCCCATCAATGGTCCTCGGNCGCGAACCCCCC 

SEQ ID NO: 789 ACGCGGGCAGCTAAGCTAAAGGAGAAATATGAAAAGGATATTGCTGCATATC 

gtgccaagggcaaaagtgaagcaggaaagaagggccctggcaggccaacaggctcaaagaaga 
agaacgaaccanaagatgaggaggaggaggaggaagaagaagatgaagatgaggaggaagagg 

ATXIAAGATGAAGAATAAATGGCTATCCTTTAATGATGCGTGTGGAATGTGTGTGTGTGCTCAGGC 
AATTATTrTGCTAAGAATGTGAATTCAAGTGCAGCTCAATACTAGCTTCAGTATAAAAACTGTACC 

SEQ ID NO: 790 ACGAAGCCATCGACAGCAGAGATGGAGCATCTTGTGCAGAGTTGGTGTCnr 
TAAACATCCTCATGTTGCAAACCCACGACTTCAAATGGCCrCTCCAGAGGAGAAGTGTCAACAAG 
TCTTGGAACCCCCTTATGATGAAATGTTTGCAGCTCATTTAAGGTGCACTTATGCAGTGGGGAATC 
ATGACTTCATAGAGGCATACAAGTGCCAGACCGTGATAGTCCAATCATTCTTGCGAGCATTCCAGG 
CCCACAAAGAAGAAAACTGGGCTCTGCCTGTCATGTATOCAGTAGCCGCTTGACCTTCGAGTGTTT 
GCCAATAATGCAGATCAACAGTTGGTAAAGAAAGGAAAAAGCAAAG1TGGGGACATGTTGGAAA 
AAGCAGCAGAGTTACTGATGAGCTGTTrCCGGGTCTGTGCCANCGACACCCGTGCTGGTATAGAG 
GACTCrrAAGAAGTXjGGGCATGCTTGTTCTGGTGAACCAGCTGGrrAAAATCTACnTCNAGAAC^ 
CAAACTCCATrrATGTAAACCCCTAATTTGAACAATTTGACAGCTCAAACCTGAAAGACGN™ 
GCNCTGCNCAGAGAGTAACATACAAATACTNrCCTTGGACCCAGGNTTTTTTTGNAACCAT™ 
AANCTGAGGATNCNTCGCCGCAAANGGGGAATCCAACNCTGGNGGCGTAT 

SEQ ID NO: 79 1 ACTAAAGGCmTGCATGAATTAGGAAGGAGAGTCTTGGGGCAGAAGCAATA 
GGGGACAACTGTGCTGGTGCrGTCTTTTGCAGGATGTGTTTACCAAAACAT CTAATGC AACTATTT 
TGAGACTTTACAGTTTGTAGTGTTAACCTCTn'AGAAAAAGAGCAGCCATCtnTrmAGG 
CTGTAATCATCCCCAGTTGATGAGGAGAAGCTCTrCTGTAGAGAAGAATGACAG CTGTG CTGGGG 
CAAGCGATTGACATACrGTAGCGGACGCAAGTACGCGGGATAGCATACrrraACATTTTAAACAT 
GATAGTCCATAACCATTITGAAATGCTGGGCAAACTACATGAAGrrATTTATAATTAATTC ACAG C 
TAATCAGGCATITrGAAAGCTTAATrGGATTCAAAAACCATAATGTTGGAATTTGGTAAAAT^ 
ATGTrGATTmACTGTGAAAAGGTTTTTATAAGATATACACACCCTAGrrrAATGTTGTGTOT 
TGTGGATTTACAGATTTACTACAGGTATTCTGAACCAGGAACACAATCAGGTTTCAGGCCAGTTTG 
ATACTGGCTGCCTTAATTCTAATATGANAGTAGGACATCATACTAAATGTTATGTCAGTGGGACTG 

SEQ ID NO: 792 ACTTTTGTATTTTGATATGGACAGmATTCATTTGCATACAGTTATTGACTTT 
TTCCCAGCTGATTAAAAGATAGTCAAGAAATTCTGCAATATAGCTGCCA AAATA GACAGCTACAT 
TTXTATGATATTGTCATCTTTTCTGTTITTITm 

TAGCCACAATAGGACATATAAAAGATTATAAATACAGAGCTTTATTATCCTGACGTCTTGGGTCTT 

TTAAGTATATACTTTTCTGAAAGGTATCCATTTTGTAGGCTTGGGTTCrrCATGAGCATACGAT^ 

rrATTTTTGCTGCTGTTCTCAACATCATCATTGCCTGCTGATGTGCCACGATGCTGCTCX:^ 

AGCAATAAGATTGTCTCTAATTTGAGCAGTAACATGATTGCAAGAGACCAAGTTTCACAQCTrGTA 

AAGrrCTGTATTTGGGATTCrrGCTTATrmCCGCCTGTGTTTTTCT 

AATTGAATCCAGTAGTTTTCTATGCTATTTGGTGGNGGATAAGCTACTGGAAGAAACrr 

GGGAAAAATANAANGGAAACTTGAATCATCTCTTGATTAAAANGGAATAAAGAAAGNAC 

SEQ ID NO: 793 GTACAAATGTGCATTAACAATTCAGTGACGTANCTGTGGATCrCTGGATGGCT 
ATGTAAGCTGTGAGAAAGTCCCCCACTGGCTTTGCACTTGCTGCGCACCAGAGGTGACCATCCAG 
GCAGTATCAACCTGAAGAGGGGGTGAATCCCAGCAGCTGCTCGATGGGCTTAAACCGCCACTCGT 
CAGCCTCCAGCTCTTCTACCAAACCAGCTAOTrrrrCCATCCGAGCAACTTGCTGATCATGTTrCAC 
CTGTTTAAGCGTGGATGCCACTTGATAGCTAAAAACAGATTCGCAGAGGAATCTCTCANAGCCAT 
CTTCTAACATCTCTCCAGCTCCTTTGGGGTAGGTGAAAAGGGCTTTCCGTGGTGGGGCAAGGTCTG 
AGACACTGAAACCAGATCCAGTGACAGAACITCCACTGACCCCACCAGCTGAGGAGGATX3ATGAT 
GAAGCAGCCATrrCCrCTCTAGCAGAGTTOTACTTCACTCAGCGGGCTTrAGCACm 
AGCCCCACCCCCAGGACCAGOCTGAGTGGCTGCGCTCACCACGACNAGGACCGCGGCGTGTrCCC 
CGCGT 

SEQ ID NO; 794 ACATCTGATTTACTGAATriTAAAGTCTGGGATGTTAGTGGGGAAGAGGCGA 
GGTGAGCATrGCGTGACGCCGAGGACTAGGCGGGGCGGOGACTGCACCTGGCTAGGCACCCCCAC 
CCTGGGCAACTTGCCCACGGACCCCAGGGCAGTGAGTAGTGACAGGAGGTAGCCCGGGGTGAGA 
CCTCTCACAGCAAGAAGATGGTGTGGTTGCTQGGGCCTCCCTGGAGAGTGTCGTCCCTGCGGCCCC 
TGGGAAGTGCTCCCTCACGACGGAAGGTTTCCTGTCAGTGCGGTCCCGQGGCCTGATAGTGGCGG 
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TGGGCX3GGTGGGGTCACGTGTCCrrCAAGGTCCTGAATGCCCAGCTCTCCCCCATTCCTCTGATTCC 
CAGTGGCTGCTAGCTGGACCCAGCTGGTGTCCTGGGCATGANANGCAGGGCCACCCGTCC 

SEO ID NO: 795 accatagtgattggggaattccagtttaaagctcataggaatgtgctggcct 

CCTTimTGAGTArmGGTGCGATCTACAGAAGCACTTCTGAGAACAATGTC^ 

gtcaggtgaaggctgatggatttcagaaactgttggagtttatatacacaggaactttaaatcto 

ACAGrrGGAATGTTAAAGAAATTCATCAGGCTGCTGACTATCTCAAAGTGGAAGAGGTGGTCACT 
AAATGCAAAATAAAGATGGAAGATTTTGCTITrATTGCTAATCCITOT 

ATTACTGGAAACATTGAATTGAATCAACAGACrrrGTCTTCTTACTCTGCGAGATTATAATAATCGA 
GAGAAATCAGAAGTATCTACAGATTTGATTCAGGCAAATCCTAAACAAGGCGCGTTAGCGAAAAA 

GTCATCTCAAATGA 

SEO ID NO- 796 acgcggggagagtccgtaaggagcagcttccaggatcctgagatccggagc 

AGC^^r^n^GTCGGAGCGGCTCCTCAAGAGTTACTGATCTATGAAATGGCAGAGAATGGAAAAAAT^ 

gtgaccagagacgtgtagcaatgaacaaggaacatcataatgoaaatttcacagacccctcttca 

GTGAATGAAAAGAAGAGGAGGGAGCGGGAAGAAAGGCAGAATATTGTCCTGTGGAGACAGCCGC 

TCATrACCTTGCAGTATTrTTCTCTGGAAATCCTTGTAATCITGAAGGA^^ 

GCATCGTCAAAGCATTGTGGTGTCTrrmACTGCrGCTTGCTGTGCTTV^ 

GAAGGAGTGCATCAACAQTATGTGCAACGTATAGAGAAACAGTTTCTTTTGTATGCCrrACTGGATA 

GGCITAGGAArmGTCTTCTGTTGGGCTTGGAACAGGGCTGCACACCTTTCTGC^ 

CACATATAGCCrCAGTTACATTAGCTGCTTATGAATGCAATrCAGTrAATTTTCCCGAACCACCCT 

ATCCTGATCAGATTATTTGTCCAGATGAAOANGGCACTGAANGAACCATTTaTrTGTGG 

ATCTCAAAA 

SEO ID NO- 797 ACl T lTlTn Tr n il 1 1 1111 I GGCAGGNGACrGAAACTTTAATGCACCTAGG 
GATTGACCACAACATTTACAAATCAGTGTTACTGCArrGGTTTTTCA^ 
AACriTATTTTAAAAGTTrCACAAAGTTAAATTTCCnTAACACrTC^ 
AGTCTATCAAAATGGTAATATTTACCTTT 

SEO ID NO' 798 ACATCTATCATTATCACAACATGCTT ATTTG ATGAAGCTAAAGAAAAGCCAG 
AAGACTAATATGGCTGGATGAGAATAGCATTTTAAAACArTTTCAGCAAAAT^ 
GTTTAGrrCCAAAACTOAATTrTATACAAGTGCTATTATTCCAATAGTTT^ 
TGATCrrGATCCATATACTGCATTCATATTATCCCAAAATAAAAGAGTrrTTNATTAATO^ 
NGGrrrCAAAAGNOTAAATATArrrCAAAAACCAATGACNGrrAAAAGNAGGGGTrCTAAATm 
TGCATGTCTTATTAAGCTTTGTTGNATAGACTAAAATTTTGGTTTCATAGACTCANAAATACTAAA 
TTGCCTAlTGGGTTTTGGGGCTrCTTGCATCAGCTCCTGCAACTGTCCAArrTGTTCATCAC 
GTCATTAAGTirrCrTCTGGGAGGAAGGTGCCAAAACATGCTNTTCCAGTTGAAT^ 
CCGTTAGCCTGTTGCAATAACGCTGTGCATACACCTCCCATTCTTGCAAGAAACGCTGTGCCTCGT 
NAGAACCAACGGGNCrrATGTCTCCTAAATTTCGTCTTTCACGTC 

SEO ID NO: 799 ACAAACTATGTATCTGAAACACHTCTAriTGGCAATTTTATAACAAATCA^ 
mAAAAAGAACAAAAGAGATTGCAGATTACTTCGCAGATACAGAATAAAGCAATTGATGAAGTG 
CTTAAGCAAAAGAAAACAACAAAAAAAGAAAACACACTGCrrTTCTTITrAAA^ 
rrGCTATAGATCAAATGGATAATACCCTTATTAAACAACCATTCCAOATNGTTTAATANAACAAGT 
OCrmATITGCNCnTCACTTAATTTTATAAGACTCATTTTCATGTATA^^ 
TTAACGAATAAAGTCCCTCATAATTmACACTTrrAAATTTm 
ATGTATCGTGGAACCTTTCCCATrn-GGAACCA^GGTTTTAATTCTATAT^ 
TrrAAAAAATrTAGTCTAAAAACTGCTGGTTTITATATCACTGTAGGTAAAGTGAAA^ 
CCAGGAGTATTH'CTGCAGTTTCACTGCATATAACCACATACTTTACAATGTCOT^ 
GGAAATAACTGATTTCTTGATCACTGTCAGAAATGAGTGCCATAATTCTATTATGGGTATAGGTCC 

SEQ ID NO: 800 ACTrCAGCTGCTGATATGAAAATTAGATTATrrACTTCAGATCTTCAGG^^ 
AAATGAATATAAGG'rmAGAGGGCCATACCGArrrCATTAATGGTTTGGTGTTrGATC^^ 
AGGCCAAGAAATTGCAAGTGTGAGTGACGATCACACCTGCAGGATTTGGAACTTGGAA^^ 
AAACAGCTCATTTTGTTCTTCATrCTCCTGGCATGAGTGTGTGCTGCCTTCTTGAGG^ 
OTATGGGTGCAGAGAANAATGGAACAATCCGGTTTTATGATCTITTGGCCCAACAGGCTAT^^ 
TCrCTTGAATCAGAACAAGTGCCATTAATGTCAGCACACTGGTGCTTAAAAAACAOT 
GGAGCCGTTGCAGGAAATGATTGGTTAATTrGGGATATTACTCGGTCCAGrrATCCTCAAW^^ 
AGACCTGTTCACATGGATCGAGCCTGCTTArrCAGGTGGTCCACAATTAGTGAAAATCTOm^ 
ACCACTGGTTATCCTGGCAAAATGGCAAGCCAGTTTCAAATTCATCArrTAGGACACCCTCAG^^ 
ATCCTCATGGGTTCTGTAGCCGTTGGATCTGGACTGGCCTGGCATCGACTCTTCCrrCTGTGTGT 
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SEQ ID NO: 801 ACnTrCTTrrTTTACAG'l-i-l- ri Tl 11 1 1 1 f i 'ACACACATATTATATACAACnTT 
AAAATAACTGCAOTCTCTAAAAGTGAGATATAAAATTGTGCAGCrATTTTAAAAGTTGT^ 
TATGTGTGTAAAAAAAAAACTGTAAAAAAGAAAGGACAAACAGGTrGTTTTGTTCTAGTTCT^^ 
TTCTTAAAAACCACTACATGGTTCAAAATTGGAATAACATTrGGGGACACrmGGrrACCTCCA^ 
AAAAGGTTTTAAAAAGAAAATGGGGTGGGATTGCCCATTTTGGATTAATTTTGGOT 
ATAGCTGNTAGAGTCTGGNrrGGTmGNTTTTACTCTCAAAATCATAGTAAAGATCTCTCAGTC^^ 
CTGGCTAAAGATTGAAGGAAGGCAAATCTATTrCTAATTATACATATATCAGTAAGGGATGATCTC 
AACATAATAGTAATGTGTATCrrrrGGTATCCAGTrrTATTmGGCCTTCTAAAA/^ 
AACACAGAACATTGCCATTTIGCTCTTGNAGGCCTCAAATATGAAAAGCTATTAGTCCATAGAGCC 
TAGOAAAAAAGAATTGGATTAATGGGCCTTTATTITGGNAACCCTTATAAATGCTGNANATATTA 

SEQ ID NO: 802 ACTGGATTCTATAGGTCCACCATTAAAAGCTGGTATGGGACACCAATTTATAC 
ACATAAGGGTTAGATTAAAATTrTAATTTTTrGGTTGATATTAAACrGAAATTrATATAACT 

tctgaatcttaaaaaaagtaaatgataaaaatttaatatagttaactgttcactgatatgtctatt 

CACTTCATCATAACCTATATATTTAATTAAAAATCAAATTATGAGTCTGCAAATCAGATGCTATCA 

AGCAAArrGCCATCCAGGGTCCATAArrcrrmATATTmATCrCAGATGAATATATACGAr^^ 

GTAAArmAATGrrCCAAATrGTTCTAAAAAAAAAAAATTATCAAAAGCTTCCAGTTAACAGTTG 

GCTAATrCATTTGCCCCCAACGAACTACCTGTTrGTGTTGTGAGGTAGCATCAAAGACTATGATCT 

TCTGTGACAGTAGTAGCCnTAATrCATACGCArrCCCTCTTCATAGGAAGAGTATGGCAACAAAAA 

GGGACAGATGAGTCCCTXTCATrAATCATTGACTarrGGGTTTTCATAGTATGTTAAATGCCCGATr 

TCAATmACAACAACAAAAAATCCAATAmATTCTGAAGGCATCGNTGTCCAAGGACANATTTA 

ACACTTC 

SEQ ID NO: 803 ACCTCACAACCAAAAGCAGTrAACTATGCCTGGCATACCACCCTGTCATGTG 
GGCAGATCACrGTTCCCATGCTGGGTAAGAGTCTCCAAGAAGGGAAGCCCTTATGCAGTAATTCT 
GTAACTATATTAAATTTGGTAATCTAACATTAAACTTTTTCGTGAGCAGTGAATATACACATGGTC 
ATATGAAAACrGCCCTGGAGACCGGGTTAATTATTAAATAAACTAAAAGGGGAGAAATGCTGATA 
GATAAAATTATGTCAATTCCAGGGTGTTCAATGGAATAAAGAAACAGTAGCAGCTGCTTCAAAAG 
TAGACTATGATCAGAAACCTCAGATGGTAACCTTTAAAAATTGTGGAATCCAGAGTCTCAACCTA 
AACCTACATAGGAGCTACAGCCAGGOAGTTCnTrCTAAGTTCCTCAAGTGATTCTGATGATTGGCC 
AAGCTGGAAACrrCTGAACTGGGGGAGAAAGCCAACCTANGTAAATGTAOAGACTTrrAGAATrC 
AACArnGATTATCATTTACrCCACATTITCCCCACrGTCTCACCTTCTCTATTm 
CAATAAAAGGGCTTGGCAAGTGAANGGAAAAAGTCANTAGAGTCCT 

SEQ ID NO: 804 ACTGGGAACAGGTGCn-GCCTTGCTATGGCCACGGTTTGAACTGATCCrGGA 
GATCAATGTTCAGAGCGTCCGAAGCACTGACCCCCAGCGCCTAGGGGGG1TGGATACTCGGCCCC 
ACTATATCACACGCCGCTATGCAGAGTTCTCCTCCGCTCTTGTCAOTATCAACCAGACAATTCCTA 
ATGAACGGACCATGCAATTGCTGGGACAGCTGCAGGTGGAGGTGGAGAATnTGTCCTCCGAGTG 
GCAGCTGAGTTCTCCrCAAGGAAGGAGCAGCTTGTOTTTCTGATCAACAACTATGACATGATGCT^ 
GGTGTGCTGATGGAGCGGGCTGCATATGACAGCAAAGAGGTTGAGAGCTTCCAGCAGCTGCrCAA 
TGCrCGGACACAGGAArrCArrGAAGAGTTGCrGTCTCCCCCTTTTGGGGGTTTAGTGGCAm 
NAAGGAGGCl'GAGGCTTTGATTGACGTGGACAGGCTTGAGCGACTTCGAGGGGAAGAAGCCCGG 
GTAACrCANCTGATCCGTGGCTrrGGTAGTTCCTGNAAATCATCANTNGAATCTCTGA 
TAATNGCGAGTrrrACCAAATCCAGAAATGGCACCCANTNr™TrrCAGGGANCGCTAACCCA>n^ 
TGAACCANGCTNTATAAT 

SEQ ID NO: 805 ACTGGArrCTATAGGTCCACCATTAAAAGCTGGTA'rGGGACACCAATTTATAC 
ACATAAGGGTTAGATTAAAATTITAATTriTrGGTTGATATTAAACTGAAAm 
TCTGAATCTTAAAAAAAGTAAATOATAAAAAmAATATAGTTAACTGTTCACTGATATGTCTATT 
CACTTCATCATAACCTATATATTrAATTAAAAATCAAATTATGAGTCTGCAAATCAGATGCTATCA 
AGCAAArrGCCATCCAGGGTCCATAATTCITITTATATTmATCTCAGATGAATATATAC^^ 
GTAAATmAATGTTCCAAATTGTTCTAAAAAAAAAAAAAAITATCAAAAGCTTCCAGTTAACAGT 
TGGCTAATTCATTTGCCCCCAACGAACTACCTGTrTGTGTTGTGAGGTAGCATCAAAGACTATGAT 
(nTCTGTGACAGTAGTAGCCTTAATTCATACCCATTCCCTCTTCATAGGAAGAGTATGGGACAACA 
AAAAGGGACAGATGAAGTCCCirrCATTAATCATTGACTCCCTGGNGTTTCATAGTATGGTAAATG 
CCTGATITCAATTTrACACCACCAAAAAATACCATTTTTATTCTGAAGGCAN^ 
NCANATTTA 

SEQ ID NO: 806 ACTATTGCTATTAGGGGGTCTGTmATAAATATrrTCTTATCATACrmATT 
ATAAACTrrrrrAGTATGAAATrrGCTTCAACTGrrACAAACAGAATCATIT 
CCACATAAGGAAGTTArrCCrGTAATrACTATnTTAAATAGTCTTCTTAACTGTGGGA^ 
CCCTNCa^CAGCACGCACACTCTTACTCrrCCrGTNATGANGCTNAATGCTNrrCC^ 
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CAGTCANCATTCTGCCCATT 

SEQ ID NO: 807 ACCATATGGTAATGCTGCCTGTCnTrCTGAGGTTGACrilTATGCCATGTCrrT 
CCTAAGTGTGTAAGAATTTTTCTGTTTGCTTCACATTTGACTGAGAATC^ 
GCCCCTGTCCTGTGCCACTAAAGGAACTCGAACTmCATCACrrAGAGATTTCAGAG^ 
AAAAACAGTTCTAATCAATAAGCAAGCAATTCAAGAAAAATAGAATTAATCAGGCAATGGCTGCA 
ACATGTCCTATCrmAATCTATTITCTTATTAAGCTTGGACATTGACAATAGAACCAGAAGCnTG 
GCTGGATCAAAACATTCTCCATAGGCCTGGAGTTTCATpAGGGTCTATTClTITGTTGTTGTrGm 
TGGTTTTITGrnTTTGTGGG l ill 1 li 1 1 1 f 1 1 1 I GAGACGGAGTCTTGTTCTGTTGCCCAGGCTGG 
AGTGCAATGGTOCAGTCTTGGTTCACTGCAACCTCTGCCTCCCAGGTTCA AACA ATT CTCCT GCCT 
CANCCGCCAAGTACTGGGATTACAGGTGCATGCCACGATGCCTGGCTATTTTTTGNATTTTAAGTA 
NAGGGNGGGGTTTCACCATGTNGGCCAG 

SEQ ID NO: 808 A CTiTii i i iTiTmTn 11 1 1 i M ir GGNiTn-rmTiTnTrn-n iTL I'lNA 

ANACNCAAAANCAAACAAGTrrATNTTAAAAAATACCAAGTATCACATCCATANCCTATTCCTAA 

TTNTAATTACTTCTTAAAGCAAamTGNCTTTO 

brrOTCTCAAACTGATTGCTTACANTTTATAATACAAACmTI^ 

GCAGTTTAACTTAAAATGCATANACAATTACATACATTAAATTrGNCATTACTGGCAGTGAAAAAT 

CACAATTrCAAGAAGAAAATTACAACAAACTCATAGAGCACACATAA NAAG TCAAGTTCACTTAA 

TGGCAACAAGTNAhTTATCTGCTGATCANTGACAGGC^rNTGAAAGTGCCnTrATCCAGGGGTAAAG 

TGAAAAGGCAAACrm'CCTGTTCmGCAACTrGAATTGTGCNCTNATCAACACTACAAT^^ 

AAAAChrmAAATTCTTAAAAAAAAAAAAAAAANGCTTGC 

SEQ ID NO: 809 aCGCXjGGGTGGGGGGGGTCCTGGTCTTTGGCTTCTCGACTCGGTCCTGTTTCO 

acagcgaacatgtcgcggcctgtcagaaataggaaggttgttgattactcacagtttcaggaatc 
tgatgatgtanatgaagattatggaagagattcgggccctcccactaagaaaattcaatcatctc 
cccgagaagctaaaaataagaggcgatctggaaagaattcacaggaagatagtgaggactcaga 
agacaaagatgtgaanaccaagaaggatgattctcactcagcagaggatagtgaagatgaaaaa 
gaagatcataaaaatgtgcgccaacaacggcaggcngcatctaaagcagcttctaaacagagag 
agatgctcatggaagatgtgggcagtgaggaagaacaagaagaggaggatgaggcaccattcx;a 
ggagaattcccggcatgcnatgaanatitcctaattggaaoatgatgacgatagtnactatggca 

GTTCCGAAAAAAAAA 

SEQ ID NO : 8 1 0 ACCGCGGGATTTAAAGCATTTGTTCCAATAAAATAAATAGAGGGGAAACITG 
GATGCTAAAATTACATGAATAGGGAATCTTCCTGGCACTTAAGTGGTTCTATGGTATTGGAAAAAT 
GGATGTTCCCAGAAAGAArrACnTTTTCCTCrTATTTTTACTGCCATTGNCGAC^ 
ATTmATATATTQAATCTGAGTTCTTTTTTGACTTTT^^ 

TAAAAGAGAGAATTAGAAAATATTAAATCCTGCATGNAATATATCTGOTGCATCTTAATTGGACC 
AACNTCCCATTTATTTATOTAAAACTATACCGTTACCTCTTAATTCCATCCAAAGAANATAC^ 
TGAAACAGAAGTGTACCTTGGCCGCGACCCGCTAAGGGGAAATTCCCCACACTGGCGGGGCGhfTA 
CTANTG 

SEQ ID NO : 8 1 1 ACAGAATGGTATTTGTGTATGTGTGTGGGCTTANAGATTCACAAGTAAATATT 
CCTTTGGTGAAGGAATTTCAATAAAAACATCTATCAAGTOTCAGCGGTGAGTGTG'nTACACCACA 
GAAATTGGCAAATTGACAAATCAGAGTTTGArmGTITNTNNGTrmAC^ 
TTACCAGCATCCACTAAAGATrNCGGTrrACAAATAAAANOOTCTCGNTTnGAGC 
CTACTTTTTAANATTNTTCNT 

SEQ ID NO: 812 ACTGAAAAGTTGCCACTnTrATITAGTAAGAAAACAAACATTCTGGCTCACT 
AGAGTTCAGAAAAGTAATAATITGAGCCAAAGGAAmGAATTAAGAAAATAGAAACTAGGTTTC 
ATGTATTrAAAAAATAGGAAATAAAATAGAAACTCAAATGCCATGAAGTTAT CTTCC TCTrCCTTA 
TATCCCCTAAGTTTGGGTTGCAAATAACTITCCAATTCCTAATAACCTAAATTATTTTGA^ 
TTTTCAGTGAAATGATGAATGTTTGAATGmGTTTGGTAATCAACATAACA NTGCT NAC 
CCCACTTAGATTTITrAACTTTTAAAAGTCANCGTGGTTTTGATAATTrGATATm 
ACATACACACACACACACCCCACACACAAGCACATGTTAATAAACCTGATAGGATGGAGTGAGCA 
AAArrGTmCANGGAAGATGGCCCATriTANAAAAAGACTTGCCCAGATGATCTTCCATAT^ 
TCCCCCAATGGNATTTrrCNCTCATGCCCAATCCTCCTCrrGTrACCTGTGGCTATGGT^ 
C^T^AACAAGC^mCAr^GGGTCCNAGTATTTCANANACTTTTTAA 
CGGTTTGTGCTGT 



SEQ ID NO: 8 1 3 ACAAACTATGTATCTGAAACACTTCTAmGGCAATTTTATAACAAATCAAAT 
TTTAAAAAGAACAAAAGAGATTGCAGATTACTTCGCAGATACAGAATAAAGCAATrGATGAAGTG 
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CTTAAGCAAAAGAAAACACATTNAANNGAAAACACACTGCrmCTTTrrAAAAATAA^ 

TTGCTATAGATCAAATGGATAATACCCTTATTAAACAACCATTCCAGAATGTCTTATAANTAGCAT 

GTGCTrrrATTTGCACTTCACTTAA>nTATAAGACTCATTm 

ATOGTrAACGAATAAAGNCCCTCATAATTTTACACrmAAATT^^ 

ATTrAhrrGTTCGTGGAACCTTTTCCATTTTGGAACCAAAGGTGm'AATTCNATATGN 

NATNCCTTAAAAAATGTAGTGT 

SEQ ID NO- 8 14 actatatagttttaaaagaatgttgtcccaccaactattcatccaagcaaaga 

ATTGTAACTATNAATAAAGTCTCAGTTACACTITrGCCTTrATCACATAATAT rCA^^ 

TGCGCAGGTCCAAGAATANAGCTGCTCAAAATCTTrGTGGNAGTTTCCTTTAG'l'l-riU-lGTAACCCT 

GAGGCATATGTTCCAGAGAACAGGGATATTTGACTGGTCCAGNGACCTNGGTGATCATAGTCATA 

ATTGAAGAGATGCCTATGGGCATGCrrAAATCNCAArrGCCAAhlTGATATNGNTG m'GNA TTATTT 

TCACbnTCTTGGATCTATGNATGTAGrrGGNATAACAAATATTTAAATAGNTNTTATT^ 

ATNrAAAAAAAATCATTACTCTGGGCKirmn'CCCChns^ 

AATTCAT 

SEO ID NO: 8 1 5 ACATTTITGAATAGACCTCAAAAATACTTCATTCTGCTGCrGTrC^^ 

TTTTAAACCTCTCTGCAGTAGGACACrGAAAACAGCAAGAACTTCGGGGTGAACACCCGCTGATC 

CTTTAACAAGOATrrCTGGCAGGAAACTCACAAAAAGGAGAACTGAAAATTTAGACATACAGTTG 

GCCATTGTAAAAAACATCAGTTCCTCTCATACATTCCAAGTAAACCAAGTAAAATAAGTGTTGGG 

AAGTACACTTGCATAAAAGAATTTAAGGGAGTGATAGCTCTTTCTGTTCTGCCATTCCCAACATTC 

CTGGGGGGAAAGGAGACTCAATGAGTAATACTAmCACTGAGCCCAAGATGGAAACTTGGTTO 

ACCTAAACATCTGATTAATATAGCTAGCTGArrTCTTAAAAATTCGTTGCATTGAANGATAT^^ 

CATGTCTGTAACACCTGGCAATACTTGNTTGNATTGGATTCTGGATATTrCTTGCAGCrTGACTACG 

TGTAATTTGGGCCAGATCAGCTTTGCAGTAAAATTATGCTGCATCCTCGTGGCAAAATTCTI>^^ 

CTrAGNGAATGGTACCAAACCCCTTTArrGCTGGCTTAAGAAAGTGAAAGATTGGTGTAl 1 l^i 1 1 

TAAAACATTTTCAATCA 

SEQ ID NO: 8 1 6 actgttggttaaatgacaatttatgtggattttgcatgtaatacacagtgaga 

CACAGTAATTTTATCrAAATTACAGTGCAGTTTAGTTAATCTATrAATACTGACTCAGTGTCTGCCT 

rrAAATATAAATGATATGTTGAAAACTTAAGGAAGCAAATGCTACATATATGCAATATAAAATAG 

TAATGTGATGCTGATGCTGTTAACCAAAGGGCAGAATAAATAAGCAAAAT GCCA AAAGGGGTCTT 

AATTGAAATGAAAATTTAATTTTGTTTITAAAATATTGNTTATCm 

GTAAGTTmTTANAAGACAArmCATAACTrGATAAATrATAGTTrGTTTGTTAG 

TCTTAAAAGATGTAAATAGATGACAAACGATGTAAATAATmGGNACAGCTTAAAATGTTATAC 

mGAAACCACNTCATGAAAGCNGAATTTGGTTGNGTr^TGTT^mTNCTCTAT^ 

TACT , 

SEQ ID NO- 817 ACCCCAACmGCTGGACCTCATGCAGCTTTAGCTAATAAAAGri lui 1 1 AAG 
GCAGATAAAGTTACAATGCTGTGGAATAAAAAAGCTACTGCTGTGTTGGTAATAGCTAGCACAGA 
TGTTCACAAGACAGGAGCTTCCTACTATGGAGAACAAACTCTACACTACArrGCAACAAATGGAG 
AAAGTGCTGTAGTGCAATTACCAAAAAATGGCCCCATTTATGATGTAGTrTGGAATTCTAGrrCTA 
CTGAGrnTGTGCTGTATATGGTTTTATGCCTGCCAAAGCGACAATTITCAACTTGAAATGTGATCC 
TGTATTT<3ACTrrGGAACTGGTCCTCGTAATGCAGCCTACTATACjCCCTCATGGACATATATTAGT 
ATTAGCTGGArrrGGAAATCTGANGGGCAAATGGAAGTGTGGATGTGAAAACTACAACTTATTTC 
TAAACCGGGGCn^GATCrAATATITGGTTGNG(XCGATGGNGACAimTrACAGCTCTGGCTCCA 
GTTACGGTATATGATACAAATNG 

SEQ ID NO: 8 1 8 acagaatggtatttgtgtatgtgtgtgggcttagagattcacaagtaaatatt 

CCnTGGTGAAGGAATTTCAATAAAAACATCTATCAAGTGTCA GCGGTG AGTGTGTTTACACCACA 

GAAATrGGCAAATTGACAAATCAGAGTTTGTTrrTGTTITmGTTTTITACTTrCCATAAAGTTCG 

TTTACCAGCATACCACTAGAGATTTCGGTTTACAAATAAAAGCCATCrrGGTTTGAGCAAGACTAT 

GCAACTATGAAAATGTTCGTTTAAAAAAATCTTCATGATCCTTTTGTAAATACAAGGTGGTrGCCA 

AGCTTGTTAGTTTTGTTTATTrrATTGATAGATGTAAAATATTATTGTAACTTATTTGGATAAAGTN 

rrCAAAAGAACANNAGCTTACAATGAGGGAGGNTmGATrmTGCTANGTGAAAATTGCAATTC^ 

AAJWTCIXSCmCTCTCTGGNATGCANAAGAGAAN^^ 

GACTGNTTCNATTCTCGANCGNNACTTACTCCTTTTITn'GGANTAAAC 

SEQ ID NO: 819 ACATTGGAGAAGCTGTGCAGCAGCATCCTTTTCTGTGGTGGGCAGGGCAGGA 
GATGAACCATAGGAGCCAAAAGTCAGACAAACAGAAGAAGGCACACCAAGCCTGAACCCTCCGG 
ACAACAGCAGAGTTACCAGCTGAGGGATGTCCCTGGAGGTTTCTGACCCATGAGAGGCCCCCrCA 
CCCTCCTTCACCCTCCTCCrACCACCAAGCTCTCCGGCAGTCATGGACTTATTCCTCCCCATmAC 



115 



wo 02/29086 



PCT/USO 1/30732 



TGGACACCTGmrrCAGGTCANTCTGTCACCTGTATCTTGGNGGATClCTACCCCTACT^ 

GGTGGGATGTGG^m:GGATGGACAGCCANAT^CTACTACATGGCA^rIT^GACAAACACCTCCTTG 

AAACCATNTThTITITGTNGGTANNTCmAANAANCT^^ 

GATNATnTCCACTG 

SEQ ID NO: 820 ACrCAGGGGCATCATGTTGCTGCAGAGGCrACACTTTCCAGAAGTTTTCrCCT 
CXjCTGTGATCCrCGCACACCGGOGGCACTCGGAGGACTGGAAGCACTGTTTGTGAAAGCAAGCCC 
TGCACGCTGAACATCTTCTACATGrrcCTGTCTGAAATGGGAAGATGACAGTCGTATTCT^ 
ArrCACAAATAAAGCCCTTTCCrrGACACAGCTCACAGCCAGCCACATGTGCAAGGGAAGCTTTC 
AGAATGTCCITGAGTAAGGGTGCCAGCAGCCCnTCTrGATCCTGACCAGGTCCTCAAGGGAGAA 
CAGGTGGAGCrCATCAGCAAGTGTCCCGGCACCTGCTCGAACTCCTTTAATGCCTGTAGCAAACCT 
ACAGGTCTTCAACAACTTCTTGNTTGGAAAACTGTCCTGAATTCCTCACriTG^ 
CCATACAGGaTGGCGATGClmCAAATTAAAATGG^^^TGTCCAT^TCTGCAACACT^TGGAAAA 
TTGTNACGAAAA 

SEQ ID NO: 821 ACAAATCmGOCCTTTCTCTTGACATmCGTATGTCAAAAAGCAAAAAACC 
TTCATGTATirCAATCn'AGTGATTACTTTrrGCACCATAATTTGTTT^ 
ACmCAGTATCrGTAAAAGGTATTTAATCCTAAAACATACrTACCTAGAGAATAATTAAA^ 
ATTCAATACAATCTAGTATCTATTAGGAAATTAAGAGrrATCACTTCTAAAAGTCATTTGAAAGTC 
AATGATGTTATCTGGTCAATGGCAGGAAATGGGAACTGGAACCAATATAANACrr ATGGGGAT TT 
CCTCACGGAGACAAAAAAAGATATTCCTTTATGTTGmAAAAAGTGGCAGCTGCrcrrrCrnAT 
TCCATTITAATCAATGAGTATTGATTCAAGGTTTCCTTrCTATTmCCTTATGATAAGGTTOT 
OGAGCTTATTCAACAACCAATAGCATAGAAAAACTACTGOATTCAATTGATCATCAGGGAATAAG 
TTCTCAAAAAAACACAGGCNGGAAAAATAAGCAAGAATCCCAATTCAGAACITTACAAGCTGNGG 
AACTTGGTCTCTTGCATCATGGTACTGGTCAAATTAANANACAATOTATrGNTGNCTATTGGAGC 
NCCATCGNGGCCCATCAGCAG 

SEQ ID NO: 822 A C! ITl ITl ITl ' l ' l n i 1 i 11 1 i 1 1 I CAAAATGAGACTTGGA GTTTAATT AAAAA 
CAGAACAGGGATNCOTTAAACAAACAAACAAAAATTACTTTTCTGATTATCAA'rrri "1 Til GANAC 
TCAAAGCATCCCCAAAACATTGGANATCCAGCTTATTCCTGANACATCAACCATCACAAAAGGTTT 
TCACTCTGAACTATTCACAriTI>JGTGGCANAAAACANAACAAAGTTCTGCANA^ 
CTTTCTAAAATATATTCACAAACAGGGTNTmCATAGTCAAAAOAAAAACAAACCAGGril^ 
rrrGGCCAAATGGGCCTGrTACTCTCCCCTGGGATCrGAriTCTTAATAAAAAAGTTCA GGGCA CC 
AAATCCAACCAGAAATTCCCAGNACCCCAGNGGCTACTTAACTATGAGGGGATGGATGCrrTTGT 
CrTTrCTATTGAGGGGAATCATTCTCCCGGGATTTTNTGCTGCTCAACAGCCCCA GGACAG GTANG 
TGGGAANGNNGGGTGAAATGCAAAACCAAAAGGGTCCAAAAAAANAATGAGGGTTmTGNACA 
AACCATANCAAGGNAAAATNGGGCCANTTTTNCAAACCNCCCCnTNAAACTCAAAACTNCCW 
CAAAACTTNGNGGGGAAANGG 

SEQ ID NO: 823 ACrATCTCCTGCAGCTCAATATAAACAACGATGTATTrCTTTGTAGC AGTATG 
TGGAGTCAGAATGGGTTTTCCAATCTGATTrAATCCCTGAGCAGCTATGrrAAAAGrrAACl-llii 
AAAAAAGCCCAATAAAAAACTAAATACTGTATl'AAAACTCTACCATAATGTTACAGG GATAAACT 
TATTTCTGTACCCGATITrATrrCCAGTTTTCATCCGAATCTACTGTGGAATGGGATAA^^ 
TGTTrCArrGCX:AGGAATTAATCCTCAGCCrCCGGAGTAGCTGGGGACrATNAGCGCTTACTA^ 
AGCTTCAGGGCrAGAATAGCCTTTCTGTGCAAGTGGAAAAAACCTCANGNAmCCCAAATCGGT 
AGCTGCAGAGGGGCTCTATCCTACCAGAGATCCCGCGTACGACGTCNCATCTGGGAAAGAATCTA 
AAGTTGGTGANCTTCACCAACCTGGAGCATTCAATGGGGAATGCACTCATACNNACTGGGCATGA 
ACCNCCTGGGANACATGACCAGTGAANAAAGTGGNTGNCriTGATGANTTNCCTGGAGAGTTCCC 
ACCCCGGGGCAGAGAAAATATCACATTTTAGGCCAACCCXrrAATCGGATNTrGGCCTGAATCTGN 
GGGACTGGOANANAANAAANGGG 

SEQ ID NO: 824 A cnTriTiTriTiTn 1 1 i rrrn rrriTi - r rn iTi-rn n i iggggatataaa 
ctatttattaacanacaaggnctacanacttattmitrrrggacacacccacggngcxigccacgg 
gggccagnggtnttggogngctggcctcggacncaaaggccccaaaagngacncanccctttat 
gggcccgaatc^tnttnagncgctccaggn^r^tnacggagct^gntgnccaaaccattg gctang 
acctggntgnattrtccatnctrrananccttntggctgggcaaaaancaaatctgggnnt^^ 
tcccgggr^att^n^tacctta^r^^gggnaanccaacaaagggaatg^m^ataaoggaaagg^^^^ 
aaaaagntaaaagggaac 

seq id no: 825 acggaotgtgcaaaatgaagagaattattcagccrgttcctaaaagcgata 

AACrCTGGGATCTTCTCAAATGCGCCATATTTGTAAGCTTGAATAATATATTCTGAG 
GGTTGGAGTGAAAAAACCTGAGTGCGAAGTTACAGGATTGGGACGCAGCAGCATACTGACCTAGA 
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GATTCAGCATATCGGGTCAAAAGATAACCAATGGTATCATGCTGGATATGCTTAGCATCGAGGCr 

GGAATAAAGATCCACCACTGGrrCAAATGCCCCAGCATACAGTAGATTCGAACAAGCAGCAATTT 

GAACTGAGCATTGGAAGGGCrmGGGTrAATCCCTOTCCAACAAAGTCAGGGGCCTGCCACAC 

AAGTGGTCTCATCACCrGTTTCCCTCCATACArrCAATAAGCCCTTGGACAGCAAGGAGACAGTAA 

TAKrCANAAAATTGCAATTCTrGrmCAAACAAGGTTTCCCAAATTCC^ 

SEQ ID NO: 826 ACGCGGGGCCCTTACGGCGCCGGAGAGATGGCGGAGrrGGACATCGGGCAG 
CACTGCCAGOTGGAGCATTCK:CGGCAGCGAGATTTTCTTCCATTTGTGTGTGATGATrGTrCAGGA 
ATATTirGCCrrGAACACAGAAGCAGGGAGTCTCATGGTTGTCCTGAGGTGACrGTAATCAATGAG 
AOACTGAAGACAGATCAACATACATCTTACCCAT GCTCTT TCAAAGACTGTGCTGAGAGAGAACT 
TGTGGCAGTTATATGTCCITATTGTGAGAAAGAATITmGCCTGAAACACCCGNCATTNANTCAN 
ATCOTGGANGTGGAAAAATTGNAAATTCCCAAAGCCTTGAATGGGNTGCCACTNT^IGAAAOT 
AAAANACAATTATTTGATTTCCAAGNCAGGOAGAANACANCAANTrAACCATGGGAAANGTGCC 
NAAAAANTNGTGAAAACAAm'GNNNAGGGTTGCATrcANTNAAA^^ 
GGNGTTAATCTmACCCCAGACAGAAAGAATITCCITITmNGGTITrm 
AAAGAAAAAANCAAACCTATGTTCTTITGCCCCNTTNGNGCmTrGGNAAGGCCNT^ 
GGrrTTnTGCAAGGCTTTAAAAT 

SEQ ID NO: 827 ACTCCAGGAAGATGCCATCTTGCACrCAGAAGATAGTTTAAGGAAGATGGCA 
ATAATAACAACACATCrrCAATACCAGCAAGAAGCTATTCAGAAGAATGTTGAACAGTCATCGGA 
TCTACAGGACCAGTTGAATCATCTGTrGAAATAGAATGACATTAACrCAGAGGAGATACGTGTTTr 
ATTTGTGATAGCAAATTCCTAAATGAACATTAGGCAAOTOGTATCATTATCAGGCCAGCTGCAGCC 
TCTTGCCnGACCTGCATTCCTAGAAmcnTrGriXKn'GNAATTCTrGGArrAAGT 
TTCATTTTGNAATTTTGCTAATCATCAACAAATTCACTTGCATGACGTTACTGCCAAATATGA^ 
GCAGTTGAATTATTATGAGTGATTGTGGCAGANGTrrGTGCCATGGNGAAAACTrrGATGGTTGNC 
TGGGGTCATTGGATCCATCTTmAAATGACCTTACCATGAGTCTGGrrGNCAAACCTAAATATCTT 
TGGTTGAATTTAAAATGGGACrCTTArrGGTGNAGTTCANGNCTTCATTGCTTAAAAAATTGNNAQ 
AAATCTGCCATAAGAAAATTTrGmCCTGCNGGAATAAAGAGGAAGTAACAGGGAATCCCATAT 
TGGTCATATTGGGTNTTGG 

SEQ ID NO: 828 AC rrri ' l 1 1 1 ' l 1 1 li 1 U U 1 1 1 i U i I CTNAGTATTCAAGACTTTAATA'nTATG 
GNGTATCACATAAAAAACAAAGTCATATACrnTGCATTAATCAAAAAATAGCAAATCCATATAA 
TGGCAAAATCAGGAAAAAAATTNTAGTATTTCCACAAAATACATAATGTCTTACAGATGArrATGT 
GAACTTTAAATGTCrGCAGCCCTACANAGCTTrrGTTGCCAATTGAAAAACAAAAAAAT CC^ 
ACAGGATGTTCAAAAAGCCTAATTCATAAAAAGACAATTTArrCCATG'nTAATATAGNGrri 11 1 
AGGATGGTAACATAAGTCATGCAACAGCTCTGTAAAACAAAACAAAACAAGAAACTACGATGTC 
GGCTGCGGGTTAAATAAAAGGAAAACCNCNCATACAAAAAAAAATGTAAGGAATGGITAGTGGT 
GCTGCCAATTAAAAAAAAAACTGGAAATNATTTTACCCCCCAAAAGTGATimGGAAAAACTOT^ 
TGGAATCTNAACATNGGACTTGGGTTGNAGNCATCTTrrGGGAAANTATAAGTGAAAGNGGTTGG 
GGACCTCCTGNGOTrCCATnTTAAAAAAAAATTGGT 

SEQ ID NO: 829 GGTACTGCTAGCTGGAAGACACATAGTGOATCTGTATGGCGTGTGACATGGG 
CCCATCCTGAATTTGGGCAGGTTTTGGCTTCCTGTTCTmGACCGAACAGCTG^ 
AAATAGTAGGAGAATCAAATGATAAACTGCGAGGACAGAGCCACrGGGTTAAAAGGACAACTCT 
GGTGGATAGCAGAACATCTGTTACTGATGTGAAGTTrGCTCCCAAGCACATGGGTCTTATGTTAGC 
AACCTGTTCCGCAGATGGTATAGTAAGAATCTATGAGGCACCAGATGTTATGAATCTCAGCCAGT 
GGTCTTrGCAGCATGAGATCTCATGTAAGCTAAGCTGTAGTTGTArrrCTTGGAACCXn^ 
CTCGTGCTCATTCCCCCATGATCGCCCGTAGGAAGTGATGACAGTAGCCCXAACGCAATGGCCAA 
GGTTCAGATTTITGAATATAATGAAAACACCAGOAAATATGCCAAAAGCTGAA ACTC TTATGACA 
AGTCACTGATCCTGrrCATTGATAmGCATrCGCTCCCAAA'mGGGGAAGAACmO^^ 
ACCATTACCACCAAAGAWGANAATTTTCNTTTAAGCCG 

SEQ ID NO: 830 GGTNCAAGTGATTGTGACAAATGACGTAAAAATGGCATTCATG ATGTCT GAA 
ACAAGCCTAAATAGAArrCAAGATTAGACTAAATGArmCACAAAGCACATTCAAGGTTTTACAT 
TCTATGATTGAAAAAAATirrrraAAAACrTTTrATTTCATT 

TAACTTTGGGAATGAATAAAGTGGAATGGTAACTITCCAGTGGTTCAGAATTGAATTAGACrrCTT 

GTGACTGTGATGTn'GGrrTOCATTGAAATATATGAAGTGAGATGTCATATCCTGAATATAGTTT 

TCTTCCCCAATrACTTGATAGCATGTCTGTCAGCCAGTAAAGATTAAGAACAGAGTTTCTCTAAAT 

TCCrCCGATrATTCCACTAAGGCACATTAAAATACTTAATTrrGGGAAACCAGACATCACAGA'm 

CTCCATGAAGTCCTAAATCTTCTTTAAAGTCAGAATANGNATCTT AGTTA CTGACAGTATTCAGGN 

TTTTTTCTNCCTTGGGGGATATGTCAri>ICATCAGNGAAAAAAAATTTT^^ 
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AGGGrrCTNGNGATACATATrNNCAATCCT 

SEQ ID NO: 83 1 GGTACOTCTITAGTAGAGACTGGGmCACCATGTTGGCCAGGATGGTCTCT 
ATCTCCTGACCTTGTGATCTGCCTGCCTCAGCTTCCCAAAGTGCTGAGATGACAG GTGTGAGC CAT 
CAGACCCAGCATTTTITrrmAATrTAAATITAAATTTT^ 

TGTTITGrrGrrGTTGTTGTTGTTGTTGTTGTTmGAGACAGTCTTGGCTCTGTCACCCAGGC^ 

AGTGCAGTGGCATGATCTCTGCAACCrCTACCTCCCAGGTTCAA GCAATrCTT G TGCCT CAGCCTC 

CCAAGTAACTGGGACTACAGGTGCACGCTACCACACCTGGCTGArrTTITTTATGTTrrAGTAGAG 

ACAGGGTTrCAACCATGTTGCCCAGGTTGGTCTCAAACTCCTGAGCTCAGGCA^ 

GGCCTCCAAAGTGCTAGGArrACAGGTATGAGCCCCCACCCAGCTATTTmcriTCGTmAATT^ 

AAAGTGGGGGGGGCTAATTGGTATCCTGhrrGACTCGAACTCCGOACTAACGAATCrTGGTTC^ 

CCT 

SEQ ID NO: 832 ggtaccggaagaagcagctggcaaagcagctccctgcacatgaccaggacc 
crrcaaagtgccatgagttgtctcccagagaggtgaaggagatggagcagntgtgaagaaatat 
aagagcgaagctctgggagtaggagatgtcaaacttccctgtgagatggatgcccaaggccccaa 

ACAAATGAACATTCCTGGAGGGGATAGAAGCACCCCAGCAGCAGTGGGGGCCATGGAGGACAAA 

tctgctgagcacaaaagaactcaatattcctgctattgctgcaaactgagtat gaaa gaaggtga 

CCCAGCX:ATCTATGCCGAAAGGGCTGGCTATGATAAACTGTGGCACCCACTTGTnTGTCTGCAGC 

ACCTGCCATGAACTCCTGGTTGACATGATTTATTmCGAAGAATQAGAAGCTATACTGTGGCAGA 

CATTACTGTGACAGCGAGAAACCCCGATGTGCTGGCTGTGACGAGCTGATATTCAGCAATGAGTA 

TACCCCGCCGAAAANCANAAArrGGCCCTGNAACACTTCTGTGOTTTGACTGNGATAACANTCTrA 

CTGGGGAGAAATCCTGATGGCAATGACAAC 

SEQ ID NO: 833 gGTACGCGGGGGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGC 
GGTTGCTATCGCTrCGCAGAACCrACTCAGGCAGCCAGCrGAOAAGAGTTGAGGGAAAGTGCTGC 
TGCTGGGTCTGCAGACGCGATGGATAACGTGTAGCCGAAAATAAAACATCGCCr CITCTGCn TCA 
GTGTGAAAGGCCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACui rrrilA TCA 
TCGCACAAGCCCCTGAACCATATATTGrrATCACTGGATrrGAAGTCA(XGTrATCrrTATTTTTC^^ 
ACTTTTATATGT 

SEQ ID NO: 834 ACAAAOATTGGTAGCTTTTATArrriTITAAAAATGCTATACTAAGAGAi^^ 
ACAAAAGACCACAACAATATTCCAAArrATAGGTTGAGAGAATGTGACTATGAAGAAAGTATTCr 
AACCAACTAAAAAAAATATTGAAACCACTrrrGATTGAAGCAAAA TGAA TAATGCTAGATTTAAA 
AACAGNGTGAAATCACACTITGGTCTGTAAACATATTTAGCTTrGCITrTCArrCA^ 
AAACTTATTTCCCGCGTACC 

SEQ ID NO: 835 GGTACTATTTATrrCCTCAAGTGCTTCCATGGGGGAAAAAATAAAAGTCTAAT 
ATGCCAGAGAAATCATCATTGAACCAATAAGACACAGTAACATAATTCTAGTAACCTACTTCTCA 
ATGAACACACATCTGAGAAAAAAACCGCCAGTATTTTATTCTCATGGAAAAACAGAACAAACCCA 
CAAGTTGGAGTCACGGAGATAAAATACAGATGAAATGGAAAACGGTCTGTTGTCATGAACTCTCA 
CTTTCAAATACCATTTTATATGGAAGTTACriTACrrGCGGGGCAAACAGAAGGC^^ 
TCTTACTTTTGGAAAATGOAOAATCAAAAATTTGCTAATCAACAAACAAAAAAAGGAGGGAAACT 
CCTTGGTAAAGCTCrrACAAACATAATTATCATTrATATrrrACCAATAAAAGATACTAGGGTAGAA 
AAAACAGATGGGTAGAAACTGGNGCCCAACCAAAGTGAAAGCirmGGGCCTTCTOTAAC^ 
TCTGTTCTTTANAAAACCCAGTTrCTAAANAAAGATTCCThrrGAATTC 

SEQ ID NO: 836 acgtgaaagacgaatttaggagacataagaccgttggttctgacgaggcaca 

GCOTTTCTTOCAAGAATGGGAGGTGTATGCAACAGCCTrATTGCAACAGGCrAACGA AAACA GAC 

AAAArrCAACTGGAAAAGCATGTrrTGGCACCTTCCTCCCAGAAGAAAAACTTAATGACrrTCGTG 

ATGAACAAArrGGACAGTrGCAGGAGCTGATGCAAGAAGCCACAAAACCCAATAGGCNATTTAGT 

ATrrCTGAGTCTATGAAACCAAAATTTTAGTCTATACAACAAAGOTAATAAGACATGCAAAAATT 

TAGAACCCCrACTTTAACTGTCATTGGTTTTTGAAATATATTTAAGCTrTGAA^ 

ATGAAATACTCTTirATTTTGGATATTATGATTGCAGTATAl^GGG ATCAA GATCACTAGTGGACA 

ATTTGAAAAAAACTATrGGAATAATAGCACTTGGTTrAAAATrCAGTr™ 

ATTTCTNGAATTTTTGCCTGAAATGGTTTmAAAATGGCn^ 

SEQ ID NO: 837 ACOCGGGGGCTCrrrCTAAGATGGCTGCCGCTA(XGGTGCGGTGGCAGCCTCG 
GCCGCCTCGGGTCAGGCGGAAGGTAAAAAGATCACCGATCTGCGGGTCATCGATCTGAAGTCCGA 
GCTGAAGCGGCGGAACTTANACATCACCGGAGTCAAGACCGTGCTCATCTCCCGACTCAAGCAGG 
CTATTGAAGAGGAAGGAGGCGATCCAGATAATATTGAATTAACTGrrrCAACTGAT ACTC CAAAC 
AAGAAACCAACTAAAGGCAAAGGTTGTTACGAGTTATGAACATG'nTTAAAATATATTTTGGTTAT 
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ATAACTTGCCTCTGCmTTGTGCCTCTTGCCCTGTTATATAGTGCTACTCTNTGCTATm 
NANTCTGCCTTTTTGGGTCAAGGAAGAAAAGTTCTGTTGCCANGCTTTGGGTGAC^ 
AGATmNGCTGAACTGANTTAAAAANTGAGACANTTCTTTATGrmCCGAAAAAr^^ 
AGT 

SEQ ID NO : 83 8 ACCAAGCACTGGGTAAGGCACTnTGTGGAGCATTAGACAGTAACCC TCAAG 
GAGCTAGAGAACCGGATGGGAGACATGAGCGGTAATTAACTCACTTGTTCCCCAGAGTTTCTIT^ 
GTTTTGAr r r i 'Cr L n-lU'CTGTGACTrATTTTCCn'AiTri'ClUl'CCT^ 

AACTAATATAAACACCTGGAAATTACAAGGAAAAAAAATTCTTCCTCTAATAACTrrCCAAAm 

TGGAATATTTATITGTAATAGCAGNTATCAGTTATGCTTATATAGCATTAAAAA 

ACTACACACACAACCACAGTGTGGGTCTAATCATGGAGATATCAGNAATTTTTANTAACNTGAAN 

TrTGNAGGACATTTNTTITGTTACATGTrmiCAAACTGimTNAAATCTO 

N ' lTn - n I'H 1 11 U 1 IGAATGGAGT 

SEQ CD NO: 839 ACGCGGGGCTGTGACTTAAATCCArnTCACrTAGAGAAATAGAAACACAAG 
GAAACCTTTGGACGCrrCATAACTGCTGGGAAAGGGGTATTATCAATTGTGTTTGAGAGTCAAACT 
ATAAATTACrrCCCAAGGrrAGTTCTACCTATGCCCAGGAATGAACAAGGACAGCTrAAA GGTTA 
GAAGCAAGATGGAGTCArrTGGGTCTGATCTCmCACTGTCATAATrrCCTCAGTTACAATT^^ 
TAAAGGTGGTTTCAAATGCmGCTGACCTCCCATAAACAAGGATGTGCCAATTGTAACTTCAGTT 
CTGCAATTCAAGTCTGCTCCCAAACTAAAGTCCATTTGAATTGCATACTGATTGATAATGTCArrA 
CGAGTTTATCTTTCCATGTGCAGAGCAAAGACAAGAGGAGATCAGCCATTCCrrCACCTACCCAGA 
GACGNTNGCATTGGAAAGGAGCCGTATCrrCTAATATGGTTTrGCAAATGTAAAATGTAAAATCTA 
TGCNCGCTCCAGGANGGGAAAAAAAAAANAAAAAAAhfNTCCTCGNCGNGACCAC 

SEQ ID NO: 840 actatttatttcctcaagtgcttccatgggggaaaaaataaaagtctaatatg 

CCAGAGAAATCATCATTGAACCAATAAGACACAGTAACATAATTCTAGTAACCTACITCTCAATG 

AACACACATCTGAGAAAAAAACOGCCAGTATnTATTCTCATGGAAAAACAGAACAAACCCACAA 

GrrGGAGTCACGGAGATAAAATACAGATGAAATGGAAAACGGTCTGTTGTCATGAACTCTCACTT 

TCAAATACCATmATATGGAAGTrACTTTACTGCGGGGCAAACAGAAGGCCATGCTGGAGTCTCT 

TACrrrTGGAAAATGGAGAATCAAAAATTTGCTAATCAACAAACAAAAAAAGOAGGGAAACTCCT 

TCGTAAAGCTCTACAAACATAATTATACATTTATATTrrACCAATAAAAGATAGCTAGGGTAGAAA 

AAACAGATGGTTAGAAACTGGTGCCAAACCAAAAGTGAAAGCTTTGGTGCCrrCTCTAAACTCCT 

ATCCTGrn'CITrAGAAAACACCAGrmCTAAAGAAAGATTACTCTGAA TTCACC AGGG'rrCTAT 

CCCCCAATTCATCCCTCCCTTTCACCCCCAAGACACXJAAAGGCCATGTAGri-l-l-liGTCCGGCACC 

ACTTGGGAAGGGGCTGGCTCA 

SEQ ID NO: 841 ACTGTCACAGAACTTTTACATACATTCTCAGTCCTAGTTGTGAAAGGCCTAAA 
GAGAAAGAAACrCAAriTGCAGTCCAACACAAAGGGGGGAATnCTAAAATAAATAATCCAAGAG 
TrrmGCArnriTAAATrAATTTTrCATrTTTTrAAAATA 

CAAAAAATGTCGTTGAAGAATAATNTATTAAAACTGTGGAAAANAAGGAAAAAGACACGTCACA 
AACTTTrAAGATTAATATGAAGATCATAATTTAACATAAAGNGAATATATTCTATGGATTTGCCNT 
CCCGATAANTATGAACAATACr 

SEQ ID NO* 842 ACTGGGACGATTCCGCGGAGCCGGGCAGAGGTnTAGGGGAATGATTAACAA 
AGGCGTCCGAAGAAATCGTTGTTGGAAGGTGACCAAGGTGGAAAGAGACGrrGCTTTGGCOT 
AAGTAAGAAGAGAGAGGGAATAGCCTGAAGGAGTAACACTAAATTTAAAATGACACTTTTTTACC 
AACCAGCGAAAGCAGATGTTCAAAGGGGATATTGGCCAGAGTCTGTCACACTAAGATGAGAAATG 
TCCTn-CrrCCTGAAGGTGTCTGATGTGTAAAAATATGATATACTTTGTGCTGTrrCCr^ 
TTTGCATArrArrCTGAAACAACATrAACTAGTTACTTTGCGTCATTGAAGGTATGCACrrCCCCrC 
TATGTrAGGAGTGAATAAAATTAAAAArAGATCCTTATAACAAAOAAAGGCAGATAGAATGATTA 
AAAATGACCAAAACATGTTAGAAACAGTCTCTCAGGTGTATGCAGATGGTAATTACAAAAAIA^ 
TTITCAAAAATOATCrrCTGTQTCATGrrTCTGGGAACAAGTCAAGATGAATGAGTTrGATTm 
AGCAGAAGTAGTNTGTGTTGGTGTCATCCATGAATACCACCAAAAAAAAAAAAAAAAANAAAAA 
NTCCTNGCCGNGACCACC 

SEQ ID NO: 843 ACT rrn - lTt ri 1 i U l U 1 1 1 1 1 1 1 1 1 1 1 ANAAAGGATGACTnTAnTCCATCC 
TGAATGATTCACACCATTATITAAACATCrcAAAAATCCTGAAATAATTTAAACTGAAGGCACAG 
AACAAACCAAAATATTTAACTATCANAACTAAAAATGAGAAAATCCAAATAGTrCTATAGTAACA 
ATAAATTATGAACAAGTITCCGTCACCAAATATCACTCTGACCAAAAATGACTGTCrrTTGTCATA 
AAAGCTACAGCTTAAGCTGATTCCAAGATITCTATAAAAATGAGAGTGAAGAAAi 1 iL-i ici i iCA 
AAATACTCATTATGCCACCAGGTTCAATGTAAGTAmTGTATATAACAAAGTAGCAGTCAGGATA 
TTTGTTGATGGATGGCTACTCCCCAAGAAATGACACATTClTACAAACTrTAAAAAAATAGCAAAG 
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TTCKjTTACAAAATTCTATTNGGGGAGCAGGGAAAAAACnTGTCCATGTCAATTAGAATATCAAAG 

CArrTCAAAAATCAAACTCNTTCCAAAGCCAATCATCACACAACAGTAATATACT^ 

TCCACTGNATCrTGTTATATGTTGATCATACAGNGTAACTAAGGGGNTGGAACAAGTCAGTANCA 

ATGATTGTNGCTGTAT 

SEO ID NO- 844 A CriTl 1 i 1 11 l - nTLri i i ll 1 i 1 ANAnTOCCTGCTGCTGCTAGGAGGAGGCC 
TAGTAGTGGGGTGAGGCTTGGATTAGCGTITANAAGGGCTATTTGTTGTGGGTCTCATGAGrrGGA 
GTGTANGATAAATCATGCTNAGGCNAGGATGAAACCGATATCNCCGNTNCGGGNTGTATAGGAr^ 
GGCTTGNAATGGCTTNCTOGAGTNGGGNATCTGCTCTGGNCTTATCATTNATATTGATA^^ 
GTGATATrmCCrrACGNCOTCCATGCAAATTACTCAC7>INCCATAGCGAAGCACACGCNTrAA 
rrGAAGTGAGAAGTAAANAAACNCrmiGTAACCNGCTCNGATTmCITA 

SEO ID NO- 845 CGCGTCXjAGGTACCCCTGGCAGAGCATTTGCAGATTAAAGAAGCAnTGAGA 
AAGAAGTTGGAATCATAAAAGCCAGCITGAGAGAAAAGGAAGAAGAAAGCCAAAACAJ^^ 
AAGAAGTCTCCAAACTTCAGTCGGAGGTTCAGAATACTAAACAAGCATrAAAAAAATT^^GA^ 
AGAGAGGTAGrrGACTTGTCTAAATATAAAGCAACAAAAAGTGATTTGGAGACACAGATITCTAG 
OTAAATGAAANArrGGCCAATCTGAhrrAGAAAGTTGANGAAGTATGTGAGGAAGTTTGC^TO 
ATAAANGAAGGAAATTTCTGCANANNGATGNNANGGAATNACTGCATTTCAGCATTGANCAANA 
AAr^AAGGATCAGAAGGAAC^fNTGTGTAAGTCCTT^JCACCATCNCNGAGTTAC^AAGAATACAA^ 

ATCTGNTTAAC 

SEO ID NO* 846 acgcgggggagttgtcctgcgccggtgttcccacgtgcggcctgaacctgag 

CGCATAATGTNATGAGGAGATGGGAGCACTNNTGArrCGCGGTATCAGGAATrn^AACCTANAGA 
ACCGAGCGGAACGGGAANTCAGCATGATGAANCCCTCTGTCGCTCCCAGACACCCCTCTA^^ 
AGCCrrCrcCGAGAGCANATTAGTCGTGAGTGTCTATCCANANGTTAAAGGAGAls^^^ 
AANATGAATAGCTGClTGTCGNGTCTAAAAGATGTGTATGTNOATTCCANAAATCCTGNGTTnTAC 

TTG 

SEO ID NO: 847 ACCCTQACCCATGAACACCTGGCCATGACCTrrGACTGCrGTrACTGTCCAC^ 
TCCCCCGTGCCAGGAAGCTATrrCCAAAGAACCTATCGTOATOAAAAATrrATATTGGA^^ 
AAACGCCTATTCCCATAAAGAAAACCITCAATTAAATCAGGAGACAGAAGCCATAAAGGv^ 
CTGTrGTArmAAAGCrAATGGTGGAGGGGCmGGTGGAAAACACAACCACTGGGATTAGOT 
AGACACACAGACGTOAAGAGGCTTGCAGAAGAGACTGGCGTCCATATCATATCTGGAGCCGGG^ 
mATGTGGATGCAACTCACTCCTCAGAGACCAGGGCCATGTCAGT^^^ 
TATGAATGAAATTCTCbrrGGAGCTGATGGANCCAGTNTCAAGTGTGGCATmTCGAGAAATrG 

TGTTCTGGGCTTTGATOAGAGTGAAANAA 

SEO ID NO: 848 TCGCGGCGAGGTAClTl"rn'l l'l'J'l 1 ITArnTl-i-l 1 1 1 1 1 1 1 IGCnTTATAGG 
TATCTATCTAATAAAAGriTATrTGTGTATGTGCAATGCATAACTCTATCTTAGATATG^^ 
CAGGATGAAAATACTTTCTTGCAACTACTrrATGCTTATGAAAGGTGTGAACTrGCAA^ 
TGTCnTAAACCCAAGrrGACAGNGCCCTCTCAAAACrmCATAAATAATGACCTAATTNCAm 
AAAAATGGTrrCANCAAATATGAAAATAGAAAGNCCCGTATITGCCATTTGTAATATGAGAAAAA 
AAAGATGATACATTCCTNTACAAAAAAAAGTGGGTTTAGANAACAGTTCTGGTAGTA'nTCCATG 
GTAAAGAATCNAAAGATCTAATGAC^GCCCCCTTGCTCmGGAAGGACAGGGAATTCAT^^ 
TTrrrcGATNGCrrAGNAACTCCANGGOAmCAArrGGGGGNCAATACTGGCCCNGNNTCA'rc 
TirAArmmTNNTKGGGNCTTATAAAATCAAAACrrm 
GGCCTNTGGCCAGTCTAT 

SEO ID NO- 849 cCGCGGCGAGOTNCCC^T^mGAATTTGAAGTGAANGATCCTGAGCTGGA^^ 
a^CAGGGAGATGACATGGTmTGATGATCCGGAGGCTGGGGAGATGACATCAGAAAAO^^ 
CAAACrGCrCCAAAAAAGAAGAAAAATAANGGGAAAAAAGGGTTGGAGCCTlCrCAGAG^^ 
CTGCCANGGTGCCCAAAAAAGCGAAGACATNGATTCCTGAANTTCATGATNCNAAm'a^G^^ 
TTTGGNCCATITATNCCGNGATCCNACATCATTCCCTATAATGATCrGCCCCGACTGGAGC^^ 
CrrCANGATCCAAATGTCGCTGNCGTTCNTGGNAAACCAATTCAGGGGTGAT€^ 
CNGATCCANGTTACCTATGGGAOTGCNGGAGCnTNCACANNCNCCAAmCTmAT^^ 
ATNCAANCTGATTGCCANACNTGQAAATGNTGCTGTNATTTGNAAAATGCNGACCT 

SEO ID NO: 850 ACnXJTGAAACCACCAACrrCAGTTGCCTCAGACTCCAGTAATACAACC^^^^ 
CCACCATGAAACCTACAGCGGCATCTAATACAACAACACCAGGGATGGTCTCA^^^^ 
TCTACCACC-TTAAAGTCTACACCCAAAACAACAAGTCrrrCACAGAACACA^^ 
ATCCACAATGACCGTAA(XCACAATAGTTCAGTGACATCTOCrGCTrCATCAGT/^^ 
AACTATGCA-rrCTGAAGCAAAOAAAGGATCAAAAmGATACTGGGAGCm 
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NATrAACGCTGGGANTTTTATCThmKmACATOGATGCCAAATGGNTm 
TCGGGGTTCNAACCATAAATNAACATGATGCCirmTAANOAATCCATGGACCAAGGATGGAT 

SEQ ID NO: 85 1 CCGCGGCGAGGTNCCCITNGNGAATATGAGGAGTATATTACTAAACTnTCA 
ACrACCACAAAGTTOTCCTATGAATACAGGAGTGGAGGCTGGAGAGACTGCCTGTAAACTAGCT 
CGTAAGTGGGGCTATACCGTGAAAAATAAAGGGAAAAAAGGGTTGOAGCCTTCTCAGAGCACTGC 
TGCCANGGTGCCCAjVAAAAGCTGAAGACATANGA'irCCTGAANTrCANGATCCNAANTCCGATGG 
TTNGGNCCATTTATNCCOGGATCCNACATrCATTCCCTATAATGATCTGCCCCGACTGGA GCGTGC 
TOTCANGATCCAAATGNGGCTGNGTTCNTGGNAAACCAATTCAAGGGTGAATCAAGCGri 1 1 1 iG 
TTCNGATCANG1TACCTATGGGANTGCNGGAGCTT 

SEQ ID NO: 852 CCGCGGCGAGOTNCCCTTNIWGAATTTGAAGTGAANGATCCTGAGCTGGAGG 
CCCAGGGAGATGACATGGTfTGTGATGATCCGGAGGCTGGGGAGATGACATCAGAAAACCTGGTC 
CAAACTGCTCCAAAJWW^GAAGAAAAATAANGGGAAAAAAGGGTTGGAGCCTTCTCAGAGCACTG 
CTGCCANGGTGCCCAAAAAAGCGAAGACATNGATTCCTGAA^fTTCATGATNCNAANTCNGATGTG 
TTTGGNCCATITATNCCGNGATCCNACATCATTCCCTATAATGATCTGCCCCGACTGG AGCGTG CT 
CrrCANGATCCAAATGTGGCTGNCGTTCNTGGNAAACCAArrCAGGGGT GATCA AGCNxrriiGTT 
CNGATCCANGr^ACCTATGGGANTGCNGGAGCrITNCACANNCNCCAA^r^C^^T^ATGCT 
ATNCAANCTGATTGCCANACNTGGAAATGNTGCTGTNATTTGNAAAATGCNGACCT^ 
CTCTNGAAAGGCCTTCNGGGGGCTrTNCTrGGrrTGA 

SEQ ID NO: 853 ACGCGGGAGGATrGTTCCACrAAAATTrATrnTCAAAAAATTTACTTCACAT 
TATrCTATGTAAGTGATGACTTGTCAOTGrrCCAGGTGTATCTTAGCTAAAACTAGAGAATGCCCT 
AACTTAGATGGTTTTlGAAGCCTATACAATTGGTATTGriTGACCCTrAAGCT^ 
CATGGAGGACGAAGAAAGCTGT 

SEQ ID NO: 854 ACTATTTATTrCCrCAAGTGCTTCCATGGGGGAAAAAATAAAAGTCTAATATG 
CCAGAGAAATCATCATTGAACrAATAAGACACAGTAACATAATTCTAGTAACCTACrTCTCAATG 
AACACACATCTGAOAAAAAAACCGCCAGTArnTATrCTCATGGAAAAACAGAACAAACCCACAA 
OITGGAGTCACGGAGATAAAATACAGATGAAATGGAAAACGGTCTGTTGTCATOAACrCTCACTT 
TCAAATACCATTTTATATGGAAGTrCTTItn'GCGGGGCAAACAGAAGGCCATGCTGGAGTCTCTrA 
CrrrTGGAAAATGGAGAATCAAAAATTrGCTAATCAACAAACAAAAAAAGGAGGGAAACTCCTTC 
GTAAAGCTCTACAAACATAATTATACAITTATATTTTCCAATAAAAGATAGCTANGGGTAGAAAA 
AACAQATGGTTAGAAiXTGGTGCCAAACCAAAGTGAAAGCTrrGNGCCrrCTCT AACCT CCTATCC 
TGTTTNrrrANAAACCCCAhrrrTTTTAAANAAAGATTNCT^ 
ATTNTTCCCTCCTrrCCCCCCCANAACACNAAAGGGCCnTrr^^ 
GGGCCTGGGCCTCANGGGCC 

SEQ ID NO: 855 ACTACTGTTAATATCTCTAAGAACAAAACACATTGAACATCCITCCAGAAAG 
TCrmGAGGGAGGACCTATACCCATAATAGAATTATGGCACTCATTTCTGACAGTGATCAAGAAAT 
CAGTrATTrCCrrACTGTTGGAAGGACATrGTAAAGTATGTGGTrATATGCAGT GAAAC TGCAGAA 
AATACTCCTGGTTGAGGAGTTTrCACmACTACAGTGATATAAAAACCAGCAGTTTTTACACTAA 
ATTTTrrAAAGAAATATTAGACAAAAATATAGAATrAAAACCTTTGGTTCCAAAATGGGAAAGGT 
TCCACGATACATAAATCATTTCTCATITGCrrTAAAAAA'nTAAAAGTGTAAAAATrATGAGAGAC 
TrTATrCGTTAACAATGGGGGTAAAGAGCTATATACATGAAAATGAGTCTTATAAAATrAAGTGA 
AGTGCAAATAAAAGCACrGCrACTATAAGACATTCrGGAATGGTrGTTAAT AAGGGGA TTATCCA 
TITGATCTATAGCAATGTGATTTTATTmAAAAAGAAAAGCAGGGGGTmCT^^ 
TmCTTTrGCTTAANCCCTrCATCAATTGCTrrATCTGGATCTGCGAA 
GTTCmTAAAATT 

SEQ ID NO: 856 ACCCATCCCAACTCTCAAATCGTTTGG JUMUl-]-l 1 iATCTTGATT GAGAT CCTC 
TTCTCACTATGCTAGTGGTGGAGATATTGACAAAATCCTATTTCTITCAAAGAG GAACnTrT CACA 
CCGAAAAAAGAGCATGGAATTATTTTATATTGTrATAAAAATCCCAGATGCAAA ill 11 1 i AATGC 
CAATTArrAGAGCrrCTGGGGAAAAAGTATAGTrCACGGAAATAAAACrATGTTCTTrCAGGGTTG 
GGTGGATAGGTGGCTGCTAGGGTGCTGGCTCCTGGCGGCTTTGCCATCCATGANGCAAGGGCTGG 
GAACACAGTGCTTrGCCTATGGTAOATCCATGTGAATGCAGGAAGCCAGCTCTTCAGTCTTGGAGA 
TGAnTCTGCTACAATTCTGTANAAAGATTAAGGATGG CAGAG TAAAAGQTACCAAGAATGCCAG 
QATGrrrrTCTTGGCa3TAGGANGTCCAAATTACTTrNCrriTrrGATGAAAGAGm 
TCCATCICrCrGGCTTCAAAAATCTCTGCCATTTTAACATCCTGNGAAATA^ 
GTATITAGTrTAACATTACCCACACCrTANAAATAATAGGTNAAAATCGCTTGCCTACrCTTC 

AGATGATCAAGTCAT 
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SEQ ID NO: 857 ACTGGTCSCA'CCTCCTACATATCAAGGAAAAGCAAAACACAGAATAATrTAAT 
ATGCTGAATAAAATTTOTTTACACCAGATACTATACATCATACTGATGGTTGTCCAGTATGAATTT 
TAAGGGTATTATGTTAGATTCTGCAAAATATATTCCTATTATTCACAAGTGAGGAGTCAAAGTCCA 
ACTATTCAAATGGCCATAAACAAAAATGTTGCAQAAGGTATAGACCATTAAAAATAAAAAAOTCA 
GGAGTGGGGCAGCTGACCCCTTAGGAGCCTCANGAATTCCTTTTAATGCAAGATAGATGGCAAGA 
GCTGGCrrmGGTTAAGTCAGCAGTCGGAAACCCATCAGGGAAAAGTTTTCAGNTTCAACTrGGT 
AAGGCAATGACTTCTTNAAC 

SEQ ID NO: 858 ACTTGAAGGAGAACAGTTTACATCGGGCGTTAGCCACCTrGCAGGAGGAGAC 
TACTGTGTCTCTGAATACTGTGGACAGCArTGAGAGTTTTGTGGCTOACATTAACAGTGGCCATTG 
GGATACTGTGTTGCAGGCTATACAGTCTCTGAAATrGCCAGACAAAACCCTCATrGACCTCTATGA 
ACAGGTTGTrCTGGAATTGATAGAGCTCCGTGAATTGGGTGCTGCCAGGTCACTnTGAGACAGAC 
TGATCCCATGATCATGTTAAAACAAACACAGCCAGAGCTATATATTCATCTGGAGAACCTTTTGGC 
CAGGTCTTACTTTGATCCTCXJTGAGGCATACCCAGATGGAAGTAGCAAAGAAAAGAGAAGAGCAG 
CAATTGCCCAGOCCTTAGCTGGCGAAGTCAOTGTGGTGCCTCCATCTCGTCTCATGGCATTGCTGG 
GACAGGCACTGAAGTGGCAGCAGCATCAGGGATTGCrrCCTCCTGGTATGACCATAGATTTGTTTC 
GAGGCAAGGCAGCTTGTCAAAGATGTGGAAGAAGAAAAGTTTCCTACACAACTGAGCAGGCATAT 
TAAGTrrGGGTCAGAAATCACATTGTGGAGTGTGCTCGATTTCTCO^NATXjGTCAGTATTTTGGTC 
ACTGGGTCTGTTGATGGA 

SEQ ID NO: 859 ACCTGCCTrGAAATTTAAATGTCTAAGGAAAATGGGAGATGATTAAGAGTTO 
GTGTGGCCTAATTCACACGAAAATGTATGCATTACATCCTGCTCCirrCTAGTTGACAGGAA^ 
AGCTGCTGTGGGGAAAGGAGGGATAAATACTGAAGGGATTTACTAAACAAATGTCCATCACAGAN 
TTTCCl'lU'lUUUlU'rriNGAGANATATTCTGGCTCTCGTCACCCANGCTGGAATGAANTGGTATGAT 
CTCAGTTGATGNGCANCCTCCACCNNCTANGTNCANGCNATTCTNATGCCTCACCTTTNAACNGNT 
GGAACTATANGCO^ATGCTACCKrGCCAGGCTANTITITATATATATANTAAAGNCGGGTNGTGGC 
NA^^^G^^mGCCAGANATG^^TNGAACTCNTGGCCTAAGATNAATCTGCCCACCGTNACCTCCXn'^ 
AGTGCTG 

SEQ ID NO: 860 actctaagtcagoaaaaattaagacgaccaatagttgcctgtgaacttggca 

GACTTTATAACAAAGATGCCGTCATTGAATTrCTCTTrGGACAAATCTTGCAGAAAAAGCTCTrTG 
GGAAAGCACATTCTCACATTTAAAGCNNTTAAAAATGGTGACAAACCTGAlWCm 
CCTGCCTGGGAAGGGGATAAAGGAACCACTAAAGGTGACAAGCACAATGACCTCCACCGGGCCC 
NTTNATTITGCCCGTTGTNGGCCmGGAAATTNACCGGCCAACNCAGGTCTTGmc^ 

SEQ ID NO: 86 1 ACl"lll"l"ilUUU'lM'l44-lUUl'l-ri'l41'lCCCTTTAAAAGAATTTATTAAGCCTGTT 
ATACCACACAGTNTGrnTATACACTGACATACANCTCCNTATTAAGATAAAGCAAAGACAAAAA 
AGTTTmXITNTTAGAAACAAGATNCNCCNCCAhrn'AT^^ 
TTTTA^ITmGACAAAGCATTNAAAAAACATITGCAAACT AGTT TTAA 
NCCAANAC^^^NACNGCCCNAAATGGTTTATTANGTTGNATTTTAACAACCTTTO 
GNAACCGGGGGAACCTGNA 

SEQ ID NO: 862 ACACAAATGCATGAGTATGTTTATACAGTG7TAGACTGATGTGAATTTGCATT 
TGTTACATTACATTGCCAGCGCATATCATTTAGCAAGTTGGCATTAACATTTATGCTITAArrAAAT 
GCCAGTATACCTATOTGTGCAGCAGTAAAAAATTAGTGAGAAAAAGCAACTTTTTGTCAC^ 
GAAATATTTTGTCITATTAAGTGTTCTTGGCACATGTATATTACTAAAGTANATAATTCCAATGAG 
AAATACrCCAGATTATTGGTATAAAATTAATITACAATGTCCCrGATATTGACTACTCrrAAAAAA 
AtXAAACAAAACTCGTATCTGATGTAACTTTGCCAATArmAAAAGCCAAAATATTCIOT 
CAAATITGTITGmCAAGGACAGGTTACCTTGCCTGGTAAACCTrCCNAACAGAAATATACTATCT 
ATCTTIXjGGmGGTTriTGGTTTTITGGTTGGTlXKjATTAAAAGG^ 

NGATGTATTGGCGCATCT^AANTNAC^GGNANCTCA^^CTCCNGGGTCAAGGATTCTCCGCTAACTT 
CTGAGTACTNGAATTAANGNGCNCCCCCCNCCCCCGGTAATTTrGGGTnTANCAAAACAGGGTTC 
CCNCNNTGGCCAGG 

SEQ ID NO: 863 ACAAAGGCTGCrrAAGGCAGTGCAGCCCCTTCTCAAAGTCAGCATGTCAATG 
AGAGACTGCTTGATAOTGTCCTTCGGAAAGCTATGTTTGCCAACCAGCrTGATGCCCGAAAATCT 
GCAGTTGCTGGG 1 1 1 1 i GCTGCTCCTGAAGAACrrrAAAGTnTAGGCAGCCTGTCATCCTCrCAGT 
GGCAGTCAGTCTCTCAGGTCAAGTCAGGTTCATGTGGATGTCACAGCCATTACAATTCTGTCGCCA 
TGAAACCTTtTGCCTTGAATCATGGATAGTTGAGGAGATCTrAAACCCANCAGCTGATGTCCACTA 
TGCTTATGAAGGGrrrATATGTCTTCNAAGAACTCTAACTGCTATTTCAAGCATG 
mTACAGTrAAACAATTCTTGACCCAAACCGACTGNTGCCTCrm'GAAAATAAAAACCT^ 
GACCAAGGANATAAANCTCTTACAAAACCACTGGATATCTGCTGNGTGWAITCAACATTGGTTGG 
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CC^GGTATAAAAATACAGC^^'ACCCTACCC^^^IGGAAAGGAGGAAAAGGAGANGAAAAGCr^rTT 
CCAAAACCTAATTATTmGGAGNCNTTCTAATNNAATGATTAAAA 

SEQ ID NO: 864 A criM - riu - iu - iu - iM - iiMM - i ' iu - rrrrri - rin - i AAAAATTAAATCCAAATTTrATTA 

AGGATTTCAGGTTACATACTrcAAATTTCTAGAATGGAATGGAATCATTITGGAACTGGA^ 

GCATAAACACTGACGTCCCTTAAAACrrCAATTITATAAAAAAAArrCTTCTGCAAACCACATCC 

CnTATGTAACAAGACTAGGTATTATCTACACCTTCACTTTGGCAA TAGC TATTTCCT 

AAAAANATGATTTrGCTACTTCAGTTCArmAAAATGGGArrCTATCTITGAAGTO 

CTCATTTCXiATGAACCATNGTTAAAAAAAAAAAGCNCATAKrGOTAATCAAAGCNAAGGGAATCT 

TTTTGAAAATNAANAAAAAAAGCnT^TTATTATTGGGrrAANGNTTTCC^ 

NAGOAQANGAATGOATNGCCCT^CAAACAAT^mm'CACATTTCAAGACCCTAACACTACT^ 

AAAAGGGAGTTTTCrrcaWTAAAAAACAANTCNAATOTATTTT^ 

TNCCTTTATTANAATTCAANTAATNCCCCTCCTTTGNAAAAAAAAGTCNACCCNAANTAAC^^ 
TTNCCCCGGGTNTTG 



SEQ ID NO: 865 ACCATGAAGAAGAATAAATGAGGGTAAGGGGCTAGTGTGATAGGGAGAGGG 
GTGGGATGCCATCATATTTAGGGGTGGTTGGGAACTGTCTACTGCTTTAGCATTTOTGTCT^ 
TTTCTCTCCTTTGGTrATATACCTTGCGTATrcCGCACATTGATAAAG 1' 11 CI' riCTTACAGAAGTTC 
TGATATTGAATTAAGGAATGGGGTCACTACTrAAGACTTTATCATTTCAGCTAC ACATAA AAGG 
TCTCTCCCCTATGGATTTTGCTAATGGTTGAtjTGATACCTAAAGGCCTTGTTGCATTTm 
GGGGTTTCTXTn'CCGNTTTGAATTCTCTCATGGGTACCAAAACTCTTTTA^ 
TTAAATGTGTATGGCTCTTTCCCAATATGAATTTTTTTOATGTAACTAGCT^ 
AACCTimGCTCTNCTGCATTATAGGGGTrrGGGCa^CTGANAATAATrGATGCCTGAT^ 
CAATCATNACCAAANACr^IXrmCTT^mTrNATATAAAAAATGAAGNGA^ 
ACCCXnSfNAATAACTTTACnriTCCTACCTTTACANGCTTGCTGGGAAT^ 
GNGAACCTGG 

SEQ ID NO: 866 A Crril ' l - ri 1 ri - rri - l - ri ' ri ri 1 IMIUN AAATAAGGNCTCACTCTGTCATCCAGG 
CTTCAGTGCXlGCAGTGTGATCATAGTrCTGTAACTTAAACTCCTGGCCTCAAGCAATCCTCCTGCC 
TCAGCCTCCCAAAGa}CTGGTATTACAGATGTGAGCCACCAAGCCCA GCCTAAACAAGCA rrTCT 
CTATTAAACCTTrrrCANAAAGACTGNATGAATTAGCCCAAAAGTGNCr J"l ■] Tim -ri'l'lOAAAAT 
GGAAGTC^CCCCTCTGG^TCCCAGCTNGAATGGAAATGGCG^^^^AAT^^CGG^^rTNN^•GGAAATC^C 
AACCCTNCTAAGGTCAAACAATTTCTCCTGGCTNAACCCTCCGAhrrAANCTGGGACTA;^ 
CCACACNANACCCCGGNTTAArmGTATTrrrANNAAAAAAAAGGGr™ACATAm 
TANTCTNAACCTCCGGACTTTGNGAACCNNCCTGGCCTAAACCTNCCAAAGTNGKTGGGA 
GGGGGNGAGCCACTTGGNCCANOINAAAGTGACTTTTTGTAAAAAAATTATTTTTAC^ 
AAGATAANAATAmxriTGCAATCANAAATTTTThn^AGAAATCAACAANAN^ 
GACCnTOmTAAAAAN 

SEQ ID NO: 867 ACATAGGGTCCTGTrACACCAGTTTTAGGATAAAGAAACTGOAAGAATTCCT 
CAGGGATCAGTCCATAACGAACTTTTCCTCCGTATTCAGGAAGAGGTGGGTACGCGGGGGAGGCA 
TTGAGGCACCAGCGCAGGGGCrTCTGCTGNNGGGGCAGGCGGAGCTTGAGAAACCGNAGATAAQ 
TTTTTTTCTCTTTGAAANATAGAGANTAATCCACTCTrAAAAAATATAGCCATA^ 
ATTTGCTTAACraTTAATTTTTACCGTATTTTAATAGCrrAAGAAm 

TAAAAAAAGTACKTGAAGAAAGGAAANATTAAAGGGTirrrAAACATGACGGAGGTTGAGAATA 

AACCTCTTCCTGGAhrrAAAAAATGTTTTNAAAGAAAATTGAAAGAAGGC^ 

TTCCAATAGAAGGCCATGCTTTTAAATAAAATGAAGGNGCn'AAACAGCTTAAGTTAAm 

GTTGGAGGTGATAAAAAAAAAAAAAAAAAAAAAAAAAGTNCTCGNCCGCACCCCNCTANGGGCA 

ATTTCAAC<XACTGGCNGCCX3GTTCCTAGGGGATCCaACCTCGGNCCCAANCnTrGGGGTArr^ 

GGCCTACCTGGTTCCTGGNGNGAA 

SEQ ID NO: 868 ACCTGTCCCATrCCTAAAAGGATTTGTGGGCAATGCTGGCACTrGGTGGCCAG 
GAGAATCTTCIXjACCCCACTCTCCCTCCTCTTCAGTCCTGAAGACCCCAAGAACCCAGTTAGGATC 
CCCTGGCCAGAGGTCTCTGTGACTGCCrCTGGACTCAACACGTGCAGCAACTTGGGAAGAATTTGA 
GCCAGTCTCAAAAAACrrnWCCCCCAGAATGAGAACCAGTGACCCCAAGCNAGGAAGGGCTGG 
GGAATCTGGAANGGGAAGANAGGGGGGTCCAANGGGACCCTGTTGGCTTAAGCCNTTGATNACC 
AG 

SEQ ID NO: 869 acccactgctattgcctaagggtgtagtcctgaaactgaagccagttgccga 

CCGTTTCCCCAAGAAGGCTTGGAGACAGAAGCGTTCATCAGTOrrGAAACCCCTCCT^ 

CAGCCCXTCTCTCCAGCCCAGCrrCAACCCTGGGAAAACACCAGCCCAATCAACTCAT^ 

CCCCTCa>AGCAAAATG0TGCTCCGGATTCCTO\CCCAATACAGCCAGCCACTGTrTTACAGACAG 
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TTCCAGGTGTCCXrrCCACTGGGGCn'CAAGTGGGAGGTGAGAGTTTTGAGT CTCC TGCAGCACTGNC 

TGCTATGCCCCCTGAGGCCAGGACAAAGCTTCCCTCTGTTGAGTCCCAGACTTTGCTCTCTT^ 

CCCTGTGCCCAAGGTNATGATGCCCTNCrCrGCCTCTTTCATGTrmGAAAGCCATAT^ 

NGAGACCCTCNAAAAGAAAGGGAAGCCANGGCTTTCTGTTGTTTANAACCTGCCCCTGrrATCAC 

CCrrWATCTGTATC^JTTACTGGTCCCGCTACCACTGGTGAANAATTGTGACCCTTGGNCG 

SEQ ID NO: 870 ACAGGCTTAAATCTATGTCATTTACACTCACntSAATCATCAACCTNA^ 

CCTGTTCCCTCGGTGTAGCGGGTATNATCTGCTTCCTCATCATCATCATTGACCAGGTCAGGACGA 

AATIX^TAACACrrrACGACCACATGATCACTAGTGCTTTCCCTGC^ 

TCX:ATAT^^TG^^■CAAGTTTATCAATCTmcrrGNCTTITCC^^ 

CTAGAGTGATmGGTAACATNTTGGACCTAGGGCANANCGCTCTCTCrCAAATTAAATCrrCT 
TGAAAA™c>rrCNIWTTNTCrCTNCnnTCTTmATCTA]mTCAAC^^^ 

GTGCATGACGAATACATGCNTAATATCACCCTCTITCANG GGCATA CCC CTAAAC ACI ^CArr ANT 

TGGATGTriTTCAATCAG>rrATCAGNAAATGCTTGGCCCA>rrrmANAGTrTT^ 

TTCC^WCTAACCCGTN^^mTNGGWTAACTANCTATT^Ca^CTT^^ 

SEQ ID NO: 87 1 ACTTACCTTCACCTGAAGAGCGTAACTATGAATn'CTGCAGT GGTAT AAGGAT 
GACrrATGTGCATTGGCATrrAAAGrrCTCCATGACAAGCAGCGAGGACCACTGGTrnTATGCGC 
AmACrCAGGCACrATAAAACCCCAGTTGGCrArrCATAATATTAATGGAAACTGCACGGAGAG 
AATAAGTCGTCTGCrmGCCGTTTGCTGACCAACATGTAGAAATCCCTrCATTGACTGCTG^ 
CATTGCTirGACTGTTGGGmAAACATACTGCCACTGGAGACACCArrGTCTCATCCAAGTCCAG 
TGCATTAGCTGCAGCTCGTAGAGCCGAACGGOAOGGAGAAAA GAAG CACAGACAAAACAATGAA 
GCAGAGAGACrnTATTGGCTGGAGTGGAGATTCCAGAACCTG'l'l'l ICl'lCTGTACC 

SEQ ID NO: 872 ACnTATGAACTTTATGrrGCTGTrTACrTCCCTTTTCTGATTTTT^ 

ATACTTTGACGAAATATGATGGGCATACATTGGCCTAGACAGGAAAAATGCTGCATCATGGGGTG 

TATGAAATTCATTACAGAGTACGCTGGGCAGAAGCAGAATGTCAGAGACAACAATTACCTGTGAC 

TTTTGGGAATAAACAAAAAGTTCTAGGAAAAGCACTrrCCTTAATCCGGrrCCCACT^ 

TGAGGAATTTGCAGCAGGTCCTGCTCAATCTGGAATTTrGTCAGATCGTGAAGTGGTAAACC^^ 

TCTTCATTTTACTGTCAACCCTAAACCCCGAGTTGAATACATTGACCCGACCAAGATGCTGTCTCA 

GGGGAAAGGAATGCTGCATCAATAGATTCCAGCAAGTAGAAAGCCCGCTGGGGITACAGTGGGA 

CGAAGTOATCGAATCAGGCAAATATAAATTATCAAAATrGACTGAAGAAGAAAAG CCTGA ATAG 

ACCAATrACCATGQAAGGAATrAAAAGKrATCAAAGAACTGTCCTTCCAANGNGGGTTTTACAGA 

TTCACAGTTAATANAAGGACTCTATAGTTGGATTTGGCTGNATGGACTATCATGGCCTACGATATC 

AAGGGATATCNGATCATTGAAT 

SEQ ID NO: 873 ACGCGGGGGQACAACCTGGCCATCCAGACCCGGGGTGGCCCAGAAAAGCAT 
GAAGTAACTGGCTGGGTGCTGGTATCTCCTCTAAGTAAGGAAGATGCTGGAGAATATGAGTGCCA 
TGCATCCAATTCCCAAGGACAGGCTTCAGCATCAGCAAAAATTACAGTGGTTGATGCCTTACATGA 
AATACCAGTGAAAAAAGGTGAAGGTGCCGAGCTATAAACCTCCAGAATATTATTAGTCrGCATGG 
TTAAAAGTAGTCATGGATAACTACATTACCTGTTCTTGCCTAATAAGTTTCTTTTAAT<XAAT^ 
TAACACTTrAGTTATATTCACTGGTrTTACACAGAGAAATACAAAATAAAGATCACACATCAAOAC 
TATCTACAAAAATXTATTATATATrTACAGAAGAAAAGCATGCATATCATTAAACAAATAAAATCr 
TTTTATCCCAAAAAAAAAAAAAAAAAAAAGAANGTCrrTGGCCGCGAACACNCTTANGGCG 
TCACACACTGGCNGGNCGTTCTAAGTGATCCGANCTCGGGACCCANCTTGGNGTAATCATGGGCA 
TAGCTGNTirNCTGNGNGAAAATGrnATTCCGNTTACAArrTCNCACAACATTCCANCCCGGAA 
AAAGNTNAAACCTGGGGNGCTAATGAG 

SEQ ID NO: 874 ACGCXjGGGGGTrCTTGGCnTGACAGCITCAAAGAATGGACAGTGATAAGTT 
AAAAGAAAirn'GTATATTGTCAAGGAAAGGGTCTTAAATCCGAGTCAAGTCCCTrCCTrGGGGTA 
AAAAATGTATTCTTAAAGCATTCTGATGTTAAAAAGAAAACTTAAGTTATCTAACCAAAAC AGAC 
GCAAGATrrrGrrrCTGCAGACTACTTGGCAATCAAAAGTGATCATAAATTTAGGnATCAGTm 
CAGAAAGTrGCTTTGTGAGAAAATTTrGTTAGATATATrCTCCrAAGCATGCrrm 
TTTCAGCCATTGCCACTGAATCAGATGTTAAAAATGAAGGGAAAATTGAGTGTGCACACACACAA 
CTGTTGTACC 

SEQ ID NO: 875 actgtccatatcttttgtatttacttcaaaggattctgoatcagcagtataaa 

TAAGATTCTCAGCATCTGCTrrACAAATGGTGTTAGCTACATGTCGACACAGCATCrrrAGCCAGT 

TTTCTTTTGGAAGTTCATCTGATGTCATCTGGAAACroAGTAGCACATTTGCCTGCTC^ 

CCTCACAAGCAAGGCAAAAGCATTATGGCAATCTTCTGTCTtrrCITATGTCCAATACCrrOT 

TGAAAAAGAGGCATTAGGTGAATATGCTTAAGAGAAGCTGGGGGTCGGGrrTGGCCATGAGGACT 

CCTAAAAGTGCCAATAACCTTGTGCCGTmcrrcCTATCrcrrAGGCAATCATTGAAGAGGAA^ 
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AGTTACnTGlTCTCCTCTGTCACAGGGGGTGCTCACXrrAGAGAAATTGTTTCAACTCGCT 

SEQ ID NO: 876 ACGCTGGATAGCCTCCAGGCCAGAAAGAGAGAGTAGCGCGAGCACAGCTAA 
GGCCACGGAGCGAGACATCTCGGCCCGAATGCTGTCAGCTTCAGGAATCCCCGCXjTACAGATGAA 
TCGGAATTTGTCTAATGAGGAGTTAACAAAATCAAAG(XATCTGCTCCACCCAATGAAAAAGGAA 
CCAGTGATTTACTTGCTrQGGACCCCCTATrroGACCATCTCTTGATTCATCllC'ri^ 

acttcatcatcatcagccaggcccacaactcctctttctgtaggcaccattgtcccacctccgagg 

cctgcttccagaccaaagcitacttcaggcaaactcagtgggattaatgaaatacccaggccattc 

agcccacctgtaacttccaacaccagcccacctcctgctgcaccattagcccgggcagaaagttct 

tcttctatctcatcatctgcrtcattgagtgctgccaatactccaacagtaggtgtgtcacggggtc 

ccagccctgtcagccritgqaaatcaggataccitacctgtggcagtgccctracaga^ 

gccl'a(ntraaaggagcagatccccagtgtattgngaanatactggngatatgacaatgtcntttc 

aagtggaattattat 

seq id no: 877 acgcggggagtgnttctggoatgogaaccacgccgcttcccagtctctgtgc 

GAGGCGTGAAGCGCGGACCTTTCAACAAGGGCirrATrAATrCTCACGCTGCGGCCCTGGAAAGC 

GATGGAGGTGGCGGCTAATTACTCCCTACGGGTGAAGAGACCTCTGTTGGATCCCCGCTTCGAGG 

GTTACAAGCTCrCTCTTGAGCCGCTGCCTTGTTACCAGCTGGAGCTTGACGCAGCTGTGGCAGAGG 

TAAAACTTCGAGATGATCAATATACACTGGAACACATGCATGCTTTTGGAATGTATAATTACCTGC 

ACTGTGATTCATGGTATCAAGACAGTGTCTACTATATTGATACCCrTGGAAGAATrATGAATrrAA 

CAGTAATGCTGGACACTGCCTTANGAAAACCACGAGAGGTGTTTCGACTTCCTACAGAm 

GCATGTGACAACCCGCTTTGNCATCTATCCATTTCTCATCTTCTACCTGGGTTACCnTGTCAGAAG^ 

GAAC^XJGAAGA^rcT^^^GTCA^^GNAACAAGGGGAACCGTGGNAAAAACCGCTTNTGAAJ^^ 

GGAGAATATGTTTAATGAAA 

SEQ ID NO: 878 ACAACCACCACTCCTGTTCCTTCCATnTTTCTGGCCTAOTGTCACTGCCAGOT 
CCnTCTGCCACTCCTACCGCAGCCACTCCTACCCCAGGACXrrACACCACGGTCCACTCTTGGTTCC 
AGTGAAGCATH'GCTrcrACTrCrcCACCTTrCACTAGOTCCCCTmCCACC 
CTTCTACX:AGCAACCCAAATTCrGCrrCATTGTCATCAGTTTTTGCAGGGCrCCC^ 
AACCAACATCCCAAGGCCTATCCAACCCGACTCCTGTAATTGCTGGTGGCTCTACTCCCAGCGTTG 
CCGGTCrACTrGGTGTGAACAGTCCTCTTTTGTCTGCGTTAAAAGGTTTTCTGACA 
CAATTTAATCAACTCCrCTGCTTTATCCTCTGCTGTCACAAAGTGGGGCTGGCnTCACTATCT^^ 
^TACTCT^CANAACTTTGACTCTIKrGGTITANCCCCTNACAAGTGGCT^^ 

CTACCCCANAGAGGACTTCAATCCAAGGGTGGCCCTT^^TCCAAGCCTGTCGTTNTCCGGGGNTAA 
CTCAACTT 

SEQ ID NO: 879 ACCrrTTrjCTAGCATAGCCTOGGAAOAAOTCACTGAAGGAOATTTAACTGAA 
GGTAATACAGCTGAGGAArrrGCTCCAGAAACACrrrCGCTGGGTTrAGGCTrCTrrGGA^ 
mTTTACTTCAGAAGTAGAAGGACGTmGATAAAAATGAACTTTCACTCATACCTGCCATAGA^ 
AGTGTTGGTCCCCCTTGGTAATGTGATAATCCTGTATCCTGGGTCAAGTCAAACAGTAACTCTGAT 
GACTTATTAGACACAACTCGTGATGAGmGTTATATAACCAGGTTTAGAAAAAGGCTGAACAAT 
AACAGAACTATCTGAAGATCTAGACTCAGGATGAAAATTTATTGGTGAGGCAGGCACTGGATTTO 
ATGTTGGATTCGTGTAGTGTTGACTTGTAGGCTTGAATGCAGCAGTAATACCTCGACTGAG TGCCC 
ATTCrrrAATCCCOTGAATATGAGCTGGGGTTAACTCTGTTCAAAATATIT GAAAT 
ACmGATATCTCGCCCACAAAGATTGGCTCATATCATCGTCATCCTCAAATTCrrriACl'rri INTG 
GCATGGTT 

SEQ ID NO: 880 GGTACTAATCTCTCTGAATrrGTCATGCGGAAAATTGGAGACTTGGCTTGTGC 
TAACATrCAGCATCTGAGTAGTCGCTCCTTAGTGAATATTGTTAAAATGTrCCGTTTCACTC^ 
GATCACATCAATrrCATGAAGCAGArTGGAGAGATAGCTCCTCAGCGAATTCCTTCCCTGGGAGTT 
CAAGGTGTCATGCACCTGACTCTTrACIXjCrCGGCCTTACGCrrCCT 

GCAGTGGCTGCGTCTTTGCCTCCTAGAGTGGCACACTGTCGAAGT AAAG ATGTTGCCAAGATTCTG 
TGGTCATTTGGAACTCTGAATTATAAGCCACCCAATGCAGAAGAATTTTACTCCAGCCTGATAAGT 
GAGATTCACAGAAAGATGCCTGAATTCAACCAGT 

SEQ ID NO: 881 ACTGAGGAAGACACCATrCCTTGACGGTGTCTAAGAAGCCAGGTGGATGTGT 
GTGGTGGCTCCAGTGGGTGTrrCTACTCTGCCAGTGAGAGGCAGTCCCCTAGAAACTCTTCAGGCG 
TAATGGAAAATCAGCTCAAATGAGATCAGGCCCCCCCAGGGTCCACCCACAGAGCACTACAGAGC 
CTCrGAAAGACCATAGCACCAAGCGAGCCCCTTCAGATTCX^CCCACTGTCCATCGGAAGATGCTCC 
AGAGTGGCTAGAGGGCATCTAAGGGCTCCAGCATGGCATATCCATGCCCACGGTGCTGTGTCCAT 
GATCTGAGTGATAGCTGCACTGCTGCCTGGGATTGCAGCTGAGGTGGGAGTGGAGAATGGTTCCC 
AGGAAGACAGTTCCACCTCTAAGGTCCGAAAATGTTCCCTTTACCCTGGAGTGGGAGTGAGGGGT 
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CATACACCAAAGGTATTTrCCCTCACCAGTCTAGGCATGACTGGCn'CTGAAA AA'n x:CAGCACAC 

CrCCTCGAACCTCATTGTCAGCAGAOAGGGCCCATCTGTTGTCTGTAACATGCCTTTCACATQrrCC 

ACCTTCrraCCATGTTCCAGCTGCrrCTTCCAACCTGGAAGGCCGTCrrCCCOT 

CTCAGGCTTTGGAGAAACTTNCTTAACGTCACCTTCTTCATTGAGCCTTTT^ 

CTTTTCTAACCCTTCCnTCCCCAACCCTTAATGNATNAAATrGCTr^ 

AmTGAAITGGAACCGGTAATT 

SEQ ID NO: 882 CCGGGCAGGACi'i r i ' l 'l- l ' ri - i - i - lU ' i " rri - ri " ri ' ] ' n 'lNGGNTAAAAGGANAATrG 
TO^riTATrmGGTTTTGTTTGTTTAQlTTTTAATAGGATTGCTC^ 

ACAACACAGGGAGACAAGCNTAAAAGCCGGAAGGGGAGGTAGCTGCAAAAGGCAGGAAAAAGG 

TGAGGCGGGCTGGTCCAGGAGGNGGCCANNATTNNGGGGTGAGGCAATGGGAGTAGCAAAAGCA 

GATTCTGGATGCrngTTNACAGTAAANCCTGTAGGACTTGCTGATGGATCAAATGTGGGTGTGANA 

AGTCAGGCAOOACTCCCAAGGTTTGGCCCTGNGGTGATGTTTAATATTAAGTGTCCNC TTGAT TGA 

AGGATTNACAGAATTNTTCCTAGGNGNGTCNNGAGGGTGTTC<XAAAAGAAATTAAC^^ 

TCAAGNGGGACCCGGAAAAGGCr^ACCCNCCCTCNATTTGGGTG GGCCCC AT^^TATCGGCT^C^ 

AAAGNGGCTTAAAAAAAACANANrrThrrGAGNNTTTTGGNNTTCATTT^ 

NTNACCCCC 

SEQ ID NO: 883 ACAGACAAGGTCTATAGAATGTGGTAAAAACTTGACTGCAACACAAGGCTTA 
TAAAATAGTAAGATAGTAAAATAGCrrATGAAGAAACTACAGAGATTTAAAATTGTGCATGACTC 
ATTTCAGCAGCAAAATAAG/ACTCCTAACrGAACAGAAATrmCTACCrAGCAATGTTATTCT^ 
TAAAATAGTTACCTATTAAAACTGTGAAGAGTAAAGAAAACTAAAGCCAATTTATTATAGTCACA 
CAAGTGATTATACTAAAAATTArTATAAAGGTTATAATTTTATAATGTATTTACCTGTCCTGATATA 
TAGCTATAACCCAATATATGAAAATCTCAAAAATTAAGACATCATCATACAGAAGGCAGGATTCC 
TTAAACTGAGATCCCTGATCCATCrmAATATTTCAATTTGCACACATAAAACAATGCCC^^ 
TACC 

SEQ ID NO: 884 GGTACCTCCAAACAGAGATGGAAGCTACACTGCAGTTCCCAATACTACTTCA 
GCATAGAGCAAAAATGTGAAGCCAATTAACAGAGAAATCATT TrrGGC ATTATTAGGCAATCAAA 
GGGGrrAACTAAAGTGAACTGTGGTTCAGAAATTGAGAAATTCl'1'l'r 1'C1^1^L 1 IGAATAAAAAAAG 
GAGATGAAAAACTTCCACTTCTTCTCAGTGGTTACTGTAGAAGATGTCT CTTT^ 
mCTACATTTTAAATGAGATTCAGGCTATCTTAGGGAATGAGCATTTGTCTTITC^^ 
GTCTACCCCAAGAATAGTTOCATTOATGAAGATTTTCTATATTTTTTCATATCTAGCTAT GCTA T^ 
CCTCATGAAAGTCCAAGACTTTTTATGACTGTGGTAATTITAGAATATACATGAATGATCTT^ 
AGTCACAATTTTGCCATATCGTTAAAAAAACnTATTCCCGGTTCATAGCCTCTGNATTAGCCCTC^ 
CTGGNCTATCCTAATCC^^^AGATTAGAAAGAAGAAATCTGCT^NTGGNOCCCTN 
AGA 

SEQ ID NO: 885 ACCC M -i-l-l-r i Ur r ri-]C'r j 4 i ' i - ri l l J U lU - l U AAGTATTGTTAACAATCCTTrGG 
AAGTCACTACTGGTCTrrGTGTGCTGCTTTTTAATAATTGAGTTATm 

CTATTGCCTGGACTAAAATTTAmCCTAATCrrCTGATGACCAAGAAAGGAAAAATTAAG T^ 

AGATGTGAGATGAAATATAGCCAOTGAATATGCATACTGATTCrGAATGAAAGGAATTAACTnT 

CAGTCAAGAAACAGTCTGCATGCAGTAAATTGAATTTTTCCTGCAACrGGA ATGArr rG^ 

CTKnTrGAACACTGCCCTTTCTCCAGTAAGAACACTAATGATTTQCTAATAT^^ 

TGmTTTITAATTAGTrAAGCTCAGACTTCCrCTrATrTTTTATCCT 

AATGATATATCAGTACC 

SEQ ID NO: 886 AC llU - r il' ril - lUU ' t - riU - rri - l - l ' ri ' rr AGGGTrATAAAAGCCCTTTTATAAAGCC 
ATTmAAACAAAACAAAANAAAAGTrXACAAAAGAAAAAGAGATACAGAAAAAGAATNACTTG 
CTTCATOTGTCCCAAAAAGAGAAAAAAATNAAGGGGACAATGCCNACATGCTCAACAATAAAGG 
CTrCrrTTNCTTATTTTTTTAATACAAAANACAAGCNATGG 

GGAGCNGACNCNCAGTCCTNGAAACChnTNAATAAAAGCAAAGCAGGAGTTTGTTTm 

CTATNCANATGC^^^ACAAGAGACTGGGATTTGTAAAAATTNAGTGGTCNCAAAAOACCATN ACAC 

NAT^CTACCAATGCATGTTGCAT^r^GTAATTNCCGAACATGGTCAACAAANATC^ T^ 

GCCCCCTTTNATTTTN>nTrAAATGAANAAAACCTrrAAAANAAAGh^ 

AACrrarrrAATANCTANGNNGNGNCACNGCCTAANGGCGAATTmATCACACTTGCGG^ 

TCTANTGGNTNCCACNTCNGNC 

SEQ ID NO: 887 ACCAAGGAGCTCTTCrrrATTTATTTCCATATGGCCCTCAGCAGCnTCATCTG 
GAGAAGCGAANCCAGGATAlTNCTTTCTGATCAGCTATTITCCCATATATACTCTrCCATGTrn' 
r^CANNC^f^AC^^^^NNNCCCATNCITCANNNTTCCAAGACTN^ 

ACGCTC^mATCGGNATTCTGCCTCCT^AAAGGGCACCTATCTGT^T^3ATGGTGAGCACATOT 
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GGTCTTGCANTATCGCTCTCCTACAAGGTTAATrAATTmTTrrTTNCGTG 

CWGOAAAGCrCNCTCTrAAGACTACTACTCGTGGCTCTGGGGTTCCGAACAGATQGT<XTGAT^ 

ACATAATCAGTGCTGN^fNAT^•CAAr^GGAAAATGCIT^CTTACAr^^CTN 

TGGCCCTCAATGCAAAAATNTTTANAGGCTKCACATTGGCTOTATA^ 

GCC^AANA^^'GNNAAT^mTAAAAATTCGGGAAr^^CCTCTTGGC 

SEQ ID NO: 888 ACTTTA- lTn - l ' lNTI i ' l 11 111 tN TTTmNGNNGTrAATTCACTTTTAATAGTAT 
AAACCTCATTTAGGTAGTAGTArrAAGCCACAACANTAATGCCNCATtGAAACAGCATTTAATAA 
AATGCATAAAGCTNATTCATGCACTCGAATACTCTNTTOTACAAACACAA C^ 
AAGACTGAAGACATTGATrAGATATAAAATTCAGTrTAAAAAGAACATGCl IT 1 4 ri' F AAATGCCN 
TNNCATAAACAAAGNGATTTNACAGGGANAAAAAAGCTGTrrAAAGCNGNNNhT^^ 
TrTAAACCTGGCATTAAAATGTAATGGCAAAACCA^TATTTTrrATh^ 
CANAAACGGNCCCAAGAAAA^^s^^AACNCCX:AAANTTTTCT^'GATOG 
GCTGGTAATGCAGTTACATTTTAACANAGAAGTTCAACTTCAGGTAAAAAC^ 
GGCCGTGACCACCmTNGGGNGAATTrCAA.CNCCTrNGGNGGCG'l"l"rCri"l'NGNG 

SEQ ID NO: 889 ACTnTirnTTTTTTrTTTITn^ 

CAATAAAATGCTCTCAAGTCCTTTGAATGTTCCAACAAATTCAAAACTTCATTTTCT^ 

CATAAATGCGAACTACCTGTTCGCATTGGTAACCTGCTGCTGTATTTCATGTCTTAACGGCTATTTT 

GAGGTTCATTAACAACATAGAAAGCCTTGAACTGTATAACCAGCTAGATTCCTTAATAATTAGTCA 

CTAGAGACAGCCCAAAGACAAATATTGGGCAGGAAATCAGTTCrCACTGAGCC CGGTrrC CATGT 

AAAATCTCTGTTGTGGTGGGCATAGGTGGCACCATCTAAAGAAAAGAGGTCTTGTTrmGm 

AAAAGTTTGTGGGGAGGAAAGACATCTGTGTATCACTTCAAAATATTGATTTACTGCTAA ACATC A 

CTCTGAATTTATGATGTGGATCTAACrTCATACATTTATCGGCATTGTCCAAAATATTTATTC^ 

ATGCGAAAAGNCNTTAATNTTCNAATGAAGGGNCNCATTA 

SEQ ID NO: 890 GAAGAAGCCAAAAAGAAACGAAATAGATGCGGAGCCGCCAGCTAAGCGGCA 
CGCCACAGCAGAGGAGGTGGAGGAAGAAGAGAGGGACCGGATCCCAGGCCCCGTTTGCAAGGGA 
AAGTGGAAAAATAAGGAACGGATTCTCATCrri'lCllCCAGAGGAATAAATTTTAGAACAAGACA 
TTTAATGCAGGACrrGAGAATGTTGATGCCTCATTCTAAAGCAGATACrAAAATGGAT CGTA AGG 
ATAAGCTATTTGTGATTAACGAGGTITGTGAAATGAAGAACTGTAATAAATGCATCTATTTrGAAG 
CTAAGAAAAAACAGGATCTCTATATGTGGCriTCAAAAlTCACCTCCGGGACCATCT^ 
NCCTTGTCAAAAATATTCATCCCCTCGCrrGACCrGAAGAATGACTTGGAACCIXjGT^ 
TTTCGGCCCTTTTGGCTmGGACCTmCTrrGGl^AATAACNCA>^ 
(XTTTTATTTCNAATmrrATNCCTCGGCCGGACNCCCTTAAGGCGAA^ 

GGTCnTATGGTTCCCACTTCGNCCCACCrrGGGGAACANGGCCAACNGTGTTCTTGGGGNAATGGT 

rrc 

SEQ ID NO: 891 GGTACCAGCACCAGCCCCTCTGAAAGGAAAAAGTGTAGTCATGACTGTCCAT 
CTCTTTTCAAAGCTTCCAGTCTTTGAAGCAGTGCGTTTCCAAATGCITCTCGGTAGGCGG 
GTGTATCCAACAAGGAGTAGACTGTTTCATGGTAGGGAGTCTGTAAATGATCATC TACCT GGTCAA 
AAGCATAGCCTACCACCTTGAGCCCTGCTTCAGTGAGTTCTAGGCAATATCTGrn'CTTTCC^ 
TrCCACATTGATATAGGCCACATCATCCGCACACCGCAGGCrnrCGAGACAAACATGTTGTTAAC 
AAGCAAAGAGAACATCATrrACAACTGCn-CAGCTTCGAGCCTCATGTCTITCATGTCAGTTCCTT 
CAAAACCGTTCAGCTCTGAACCTTCTTCAAATCCTGACATACTG>rrTAAGCTNCATC 
CrGTTTCATIXnTGATGATGCAGAAAGACAAGAATCCAGTACCTGGCCGGCGGGCGNTCNAAAGG 
CGAATTCACACACTGGCGGGCGTCTA^^'GGATCCGGCTCGGACCAACr^GGNGTAAACATGGCA^ 
AGrrGTrCCGNGGNAATGGTNTCCX3TACAA'rrCCCCCAAANACAGN 

SEQ ID NO: 892 GGTACTACGTCAGCAATTrCTCCAAACAGCTGCTCGACAGCATATGGCACCA 
GCCCATirrCAATTTGCTGAGCATCGGCXAAAGCCTGTATGCGAAAGCCAAGQAOCTGGACAGAG 
TGAAGGAAATTCAGGAGCAGCTOTCCATATCAAGAAGCTGTTGAAGACCTGTAGGTrTGCTAAC 
AGTGCATTAAAGGAGTTCGAGCAGGTGCCGGGACACTTGACTGATGAGCTCCACCTGTTCTCCCTT 
GAGGACCTGGTCAGGATCAAGAAAGGGCTGCTGGCACCCTTACrCAAGGACAT TCTG AAAGCTTC 
Cn'GCACATGTGGCTGGCTGTGAGCTGTGTCAAGGAAAGGGCTTTATTTGTGAATrrrGCCA 
ACGACTGTCATCTTCCCATrrCAGACAGCAACATGTAAGAAAGATGTTCAACCGTGCCAGGGCTTT 
GCTnTACAAACAAGTGCTTTCAGTCCCTCCGAGTGCCCCCGGTGTGCGAAGGATCACAAGCGAN 
GAGAAAACmcrrGGAAGGGGGGGCCTCTGCAGCAACATGATGCCCTGAGTACCTTGCCCGGCCG 
GCCGTTCGAAA 

SEQ ID NO: 893 GGTACGCGGGGCATGCGCCGTTTCTCTGCATGGTGTGCGTTCTCGTTCTAGCT 
GCGGCCGCAGGAGCTGTGGCGGTTTTCCTAATCCTGCGAATATGGGTAGTOCTTCOTTCCATGGAC 
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GTTACGCCCCGGGAGTCrCTCAGTATCTTGGTAGTGGCTGGGTCCGGTGGGCATACCACTGAGATC 

CTGAGGCTGCTTGGGAGCTTGTCCAATGCCTACrrCACCTAGACATrATGTCATTGCrGACACTGAT 

GAAATGAGTGCCAATAAAATAAATTCrrTTGAACTAGATCGAGCTGATAGAGACrCTAGTAACAT 

GTATACCAAATACrACATTCACCGAATTCCAAGAAGCCGGGAGGTTCAGCAGTCCTGGCCCTCCA 

CCGTrrTCACCACCTTGCACTTCATGTGGCrcrCCTTrCCCTAATrCACAGGGTGAACC 

TGGTGTGTAACNGACCAAGAACATGTGTCCTATCTGNGNATCTGCCCTTCTCCTTGGGATACTANG 

AATAAAGAAAGTGATCATTGCTACGrrGAAACATCTOCCGGTANAAACGTATCCATGTCCGGAAA 

GATCTGGTTCATCTmAAAATACTTCATTGGTCAAGGGCGGTNTTNAA 

SEQ ID NO: 894 acacaatcagataacaaggtttcaagctctagtcaaagacaagatgatatta 

TACCTAAATAACAArrrCTAAGGAACTTOAATAACAATCTCTAAGGACTCrrGTCACTATATGAAG 

AAOTAAGTTTATGACTCCACACTCTTAATTAAAAGGACAGCCACTTAATCAArmCTGCAGTGA 

ATTATTAGACTAACTTCCTAGCATCTGGAGATTACAGTATGTTCCACCTTGAAGCn'ATCACAATCA 

AGACCATTACCTGACATGCATTCAACAAATATTCGATGACAAACAAAGTAT AAGGTT AATTGTTAC 

AGAAAGTTAGGTCTTCTAACACCTAAGTGTTAAGTGGACTCAATGTTGAACCTTTTTGAGTGCCAA 

CATGACACTCAAAGGAAATGCTCATGGAACATTTCAGATTTCAGATTTGGGATGCCCAACTGGTA 

AGTATAATGCAAATATTCCAAACTrTGAAAAAAATTATGAAATCTGAAATACTTCTTGTCCCAAGC 

ArnTTGGAAAANGOATACTCAACCTGTCCTCGGNCGCGACACCTAA 

SEQ ID NO: 895 GGTACCrTGCAGCCAAGGGAAAACTGAAGAGCCAAAACACCAAGCCTTATCT 
AAAATCCAAGAATAATTGCCAGAATCAACCACCTTCTAAATCTACTATTAGACCCAAAAATGATG 
TTACCAACCATOTTOTTTTGCCTGTCAAACCTAAAAGGTCCATCAGCATTAAACTCCAGCCCAGAC 
CACCTAATACTGCAGGGTCCCAGAAGCCGAAGTTGGAGCCACCAAAACTTCTGGGCAAAAGGCTG 
ACTTCAGAATGTGTITCrTCTAACCCATACTCrrAAGCOTCTAGCAAGAGT^^ 
GCTGGATCGTCCACAACAGGAGAACTGTCAAGAAAACCTGTGGGGNCACTTAATATAGAGCAATT 
GAAAACTCAAAGCAGCAGTTAACAGATCAAGGAAATGGNAAATGTATAGACrTTATGAATAATAT 
CATGTTGAAAACGAATCTTTGGATACTTTCrrAAAGAACCAACCAAGAGAACTTGNTTCG'n^^ 
TAACAGAACCTGAGAGGAAGNCCGATCTTAANTNTlTACCCGAAGTAAGCCAAAACTNGNl'l'l"!"!' 
TTATTAACCCANAACAGTTTAGTTCTAACAAACCTTGGCNAAAGTTANTTAATAGGGNGGTT^ 
AA 

SEQ ID NO: 896 CGAGGTACTNCTTCXAAATGACGAATrrrCTGCrCCAAATAATGGGACAAAG 
GGCATCATCACATGGACACAGTCAATTTCACAGTGATGGACTCACACAGGTGGATGATCTGAGGC 
ACAAGCTGTGATrGGTCTTAAGAAAATGAAAGGTAGAGTTGGGTTCAAAATCCACATCTCACGAT 
TTCTGGTCCTTATTCATCCAAAAAGTTGAACTGCTTCATATCAGNGTATTAGACACATTTTA^ 
TTCTCAAGAGTTGCTCTGGGTGCAGCATGATAAGGAAAACCAGGTATTTGGCTTGATAGCAGATAT 
TCAAGGTATAGGTGCCNAGCTTATGGTAAAGCTTTTACCCGTAGGATATCTGAGATCTGNTGCCAC 
AAAGAOGACTATGGNCCTATGATGACTACTATAGTTCAGTTTCAACCTTTTTGGGGATT^ 
TCATAACCTTTCCTTCTCGTTNCATTm'AAGACAGGGAATTTAAANGArrCACTCAAGAAG^ 
CCCANTAACCAGGNGATGGTCATTCCATCTGNAATGGGTrCCTAAATAAACTGGCTrCCCCTTTCA 
ACAACTCTTGNCCNCCACAhrrCATTNGCTTTTCCAANTGGAACAAGTCT^ 
TGTTTTGGNNG 

SEQ ID NO: 897 GOTACTNOGGGGACrrAAGATGGCGGCGTTTGCACGGAGTGCAATCACTGCG 
TCCTTACGGGGGTTGCAAGGCGTCCGAAGTATGAGTCCACTAACAAAAGTCCAGAAACTCGCCAG 
TTAATAGTATTGTGTCTCriTCAAAATATCGGAGAATAATrrCTTTCTCGCTGATCGCC^ 
CTGACGAAGCTTGGAAGTTGCAGAAGGTTGGAAGTGCAATGGCGCGATCTCGGCTGACTGCAACC 
TCCACCTCCTGGATTCGGGCTATTCTCCTGCCTCAGCCTGCCGAGTAGCTGGGATTACAGGCATGT 
GTCACCGCACOCAGCTAATTTrGTATTTTITAATAAAAGATGGGGGTTTCTTCATGGT^^ 
GCTGGGTCriTGAACTCCTGACCTTAGGGTGGATNCGGCCCACCCTNAAACTTCCAAAArGTGNTT 
GGGATrrCAAGGCGKrAAGCNACCCGCNCCCCGGGCCAGAACAArrANTTTAA ACnT CCTACCC 
TTTGGGAGAAAGGCAAAAANACITTGATGAAAATCrrTNNTAANGAC™ 
ANTATATNTTAAArrANrrAATTTCCnTNCCmGGCGGGCNNCTNAAANGGGGAAA^ 
NTGGGGGCTNTTTATNGGNNCCAACC 

SEQ ID NO: 898 ggtacgcngggaagatgggaagtaagagtcacatatcaaaactaccctccac 

TTTATTCCCTGAGCGAGGGTTTATGAAGTATAAAGGGGTGGGAGCCCCGAGGTGAGCGGGAACGG 
TGCTGCTTTATTTGAAATGTTTTCTTACCTCATTCTGTGCCC 

CTGGCTTGGCCCTGTGTTCCrCCTGTCCCCTGCTCCACTGCCTATCTGGTGCCCCAGGTGCTGCrrG 

CCACTCCAOCTOTCAGArrGAACAGTrrCAATirAACTCTTAATGCn'C CTGCT TCCGAA^ 

AATTTCTTrmCTTGGCCTCTGGTlTniTrrrCTTTCrr^^ 

AGAAAATCriTCTTATGGCTTCCTTTGGTGAAAATGGAAATGGAAAAAAACTTAA1"lU'I1"lGGATr 
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TNAAAAGGNAGGGCATGCCTATAACArITCTTCTATATAAGAACTGC^^TG^TO 

GGTNGGGCCTACCTCATNAAGOGGTGGATNTTTCCCCCTCCKmAATCmGCGGGTTGGACCT^ 

TTGAAAAACC^^AAGGTTGTTGCCCCCCNGGGGANGGAGTGGGCCTTGCCNTT^OT 

SEQ ID NO: 899 ACATGGCTCCATGGAGGrrCrcCAGTCGGTGlTGCTGCTGCTGTmCGAGCC 
TTATCTCGTCTGTGCrCCrCATAAGTGTCGTCATCAGTGGGCAGCTCATAGCGGCACAAGGGACAG 
GAATrrGTCTTGCTTAGCCAGGGCAGAATGCAGCTGGAATGGAAAAGGTGATGGCAAGGCATCTC 
AATGGCAGTCTCCTCCTCCAGCCTGAGAGCCTCTGATGACTGTCCTGGGGAGGTTCTCAACCACAG 
TCTTGGCAGCTGGTGGAGGCAGGTGGTGGTCCCAATCTACTACCAACCCCAAGTCTTCAAAGTCCA 
TCCTATTGAAAAGTGACCTTGCGAGCTCCAGCAGCATGTTGGTTCGCGTCTCCTGCTCAGGGTCCG 
ACGGCTCGCAGTCGTGTTCATCGAAATAGGACGCCATGGCTGCCAGCCTTCTGACCCCCGCGTACC 

SEQ ID NO: 900 GGTACGCGGGGGGGTCGGTTTCCGCGGTGGCCATGACTGCGGCCGTGTTCTT 
CGGCTGCGCCrrCATTGCCTTCGGGCCTGCGCTCGCXCTrrATGTCTTCACCATCGCCACCGAGCCG 
TTGCGTATCATCTTCCrrCATCGCCGGAGCnTrCTTCTGGTTGGTGTCrCTACTGAm 
TTGGTTCATGGCAAGAGTCATTATTGACAACAAAGATGGACCAACACAGAAATATCTGCTGATCTT 
TGGAGCGmGTCrCTGTCTATATCCAAGAAATGTTCCGATrTGCATATTATAAACTCTTAAAAAA 
AGCCAGTGAAGGTTTGAAGAGTATAAACCCAGGTGAGACAGCACCCTCrATGCGACTGCTGGCCT 
ATGrrTCTGGCTTGGGCTrrGGAATCATGAGTGGAGTATTTTCCTTTGTGAATACCCTATCT^ 
CTTGGGGCCAGGCACAATGGGCATTC^TGGAGATCTNCrCAATCTTTCT^ 

GCTGGCATTATCTTGCTGCATGTATTCTNGGGCATGGATTrrrGATGKTGTGANAAAAAAANT^^ 

GCATCTTCCirmOTTTTTNACCACCTNTOGGGTAAGCCAAACTTCA^ 

ACCGCGTCGCATnTAAACNTGGGCTCANG 

SEQ ID NO; 901 CGAGGTACAGCAGCTTGGGAGTTCATTGCTGGTCTGGGACTATTTGCTTGGGC 
ACGTGTATAATGGCrOTGAACAGCGTrOACGTrTTGArrGGCACGAGACCrGCGTCGTGATOTAAT 
GCCAATATnTCACCTGATCATTCATAACGATCATGGTGCGGGTGAAGAGATCATGATGAGTAATT 
ATGCCAGTAAACCACTGGGTGGCAGAGTCTTGTCTATATACTCrCACTCTGTATCCATTTAAGGAA 
TAAGGACCTGGTAGGGCTGAAAGGCACTATCITCAGTTAAAAAATCCAGTrGCTT ATCTA CAAGG 
AATrCTACTGCAGTGACTGAACTGGGTATATTTCTTTCAACCAAGAGGTTTGAAAGTrrrG^ 
AGAATTTTAAAGTATCmAAATCTCAAAAATCCACAACAAGGCTTTA^ 
GCTGAAATTTCTATTTTGGTCTGGAAGAATATCTAmGNAAAAACCTCriTAGTCTC 
CANGCCAATNGNTNTCAAACCAACANTrbrrGAACTCCTTGCCGCNGNGTCANTGTm 
AAGCCNGACCCCCAACXjAAGCCGGNCTTGTACCTAACAAGNTCCACCAGCCAAAGACAANTTnT 
GTTGGNTTGGTCANCCNGGCANGATAAGGATACACTCCTTITrCATAANCCTG 

SEQ ID NO; 902 ACCACATGTCCACCAGGAAGGAGCTGAGQTCATGCTGAGnTTAGGCAACXjC 
ACTGAGGGAGTCCACAATATCAAGTTrrGTGTGCTTCAGCAGCTCCCCCCGAAGGTAGGGGTCCrr 
GAGAATTTTTATATTCCTGCTATCCTTGAAGAACTTGCAGAAGCTGCTCGGGGCTGAGTCCT 
ATTGAAGGGAATGCrCAGGGAGAAGCAGGCCrCATCXjTGGTAAGCACTAAGGAGACCCTGTCGAT 
CTCCAGAGTCATAGATCAAGTAATACTGCTGCAGGAATTGCAGGAACAGATTCTTCAACATCTCA 
GATCCAAAGAAGCTTCCrrACAGGTTGGTAACCTCTTGTQGGOTCANTACX:CGQQGGGGACCCCG 
GCTGGAAGGCANGGCATCACrATGGACAACCrcGCAGGATCCACCTNAAACACGTCATNGACTCG 
TNCTGGAAAAGTTCCAGAACAGNCCTACONGGNGhrrTTNATTGAGGCCNATGGGAGAAGGAAm 
GAAAAGNCCCCrmATGACAAAGCGCCTTCAAAATTGATCa^AGGAGAATCrGACTTGtmGGTr 
CCANCAATTrmGATAAGAGCGTNTCAAAAAAAGGCCAGACTITAAATAGGNCTGNT^^ 
AAAAAAAAAAAAAAACCTCGCC 

SEQ ID NO; 903 ACTGCTGACATCCAAAACTATGTCCCrnTAGGGTCTACTCGGAGAAAATTGC 
GGCATTCAAAAGTCAGGTGACCAGGGTAGCCACATTTTITACAGCCTGCTCTGACACTGTCCTTGT 
TGCAACCntjGGACTGCCATGACGGTGGTAAGAGGGGTAACTCGAG CCTCT GGCTTTCGAAAAGGC 
GOTGCTrCX:CGCCAGTTGTGAGAACAAGGCACAGTCAAAGCGGCGTmCCTrCCCCX:AGCCCC» 
CGTACCrCGGCCGGACCACGC 

SEQ ID NO: 904 ACTrrCTGrrcmGGCACATTTTGCCCAGCGGATGCAACTTCTATCCTCAGTC 
CAGTTCATATATCTCAGGCAGTGATCCATmGTATCAGCCAGTTrCCCrrGTTAGGGCCGCTAGAC 
CCGGCTGGCAGCCAAGAGCAGCACTCTCCCCACCTTCAGCANAACTTCTCAGCTCATGTGTGTGTT 
TTTAAAAGCAGTAGGCAGCTTCCATCANCAATGGAAGTGTCCCCCACTNCAG 

SEQ ID NO: 905 ACAAAGTGACATACAATTGGAAATCCATTTITGTrGTAAAGACATTGTTITrc 
AGACrrrrCAGATCAArrAGAAAAATGTCATTGCTITAAAATCATAGCTGrrCTGTrTAAGAGGAA 
TTGAATTrAAAAATGAGAGAGTATTAAAACTCATGTGGCAGTATCCTGGTCTrAATCAGGTATTGC 
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AGTGTTCAAACAAGGTATGGACACATCAACATATrCACCTTTrGGAAGGA CTAGTGACTTGGTTAA 

AAACTAATGAGTGTTTGCOTGTGGTGTTTCTATGCGAGCACCCATTCTCAA ri l riilliiii mil 

TrrTm'Grn'GAAGCCA'nGATTGATArrCTTGTTTCAGTGTCTGGTrCCCTATAAAGTAATATATA 

rnCCTCTnAAGCTCTGNGCCTNAATATGCAATCACriTGATATAANNTrAAATAT 

CCAAACTGGNAAATNGGATimTTTTCAAAACrrTrNTGGGGAAACATATGGGGGAT^^ 

TTATGGGnrrNTGTTGCCGGGGATNGGGTTG 

SEQ IDNO: 906 ACTGAAGATTATTGCTTCTAGGGCATTTrTAAACAGCACCATTGTATTGTTGA 
ATGTTTATGTAACTGATGGCTTTTCTATAATGTAATTTTTGAATGTTCAGGTGTTACAT^ 
TTTAACirrrAAAAAACCATCrrCrGATCCCTTTTATrGTCTGGGCCATACA^^ 
GTGCCAACATTTAArnrrmAAATGOAACATTTGCAGTTITCCATATTGGTACAAAAG^ 
TGATCGATATTAAATOGTArKJAAAACAAAATGGACTAAAAAGCAAATACTACTCTATGTrGGGG 
TGGAAGTGGOAGGAAAAGANTGACTC 

SEQ ID NO: 907 A CiTn - rnriiTrri - ir i 1 mm i gaaacggantcccactcttgtcgcccaa 

CTAl^GTGTAGTGGCACAATCTCGGCTCCCCACAACCTCTGACTCCAGGGTTCAGGT GATTAT nT 

GCCnx:ANCCTCCCAAGCAGCTGGGATTATANACACCCGCCAACACGCCAGGCTAATGTTnTGTAT 

TrrTAATAAANATGGGGTTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATC 

CGCCGCCTNGGCCTCCCAAAGTGCTGGGATTACAGACGTGAGCCACCACrCCCGGCCCATAAAAG 

GNTrrmGCTGGATAATITGTAACTmCTAATTGGGAAAAAAT TCCTATTAA TCAC^ 

T^TIT^^nrGAArmGGNGr^ATT^GAT^ATA TACAAA GGAGACi■r^i'i 1 IGANATAACACTCA 

AATNG>fT N ' lTIC ' l - l i INTTTNGAAAAATTTTATTTTTTATNTGAAAATAACAGTTG 

SEQ ID NO: 908 ACAAAGGCTGCTTAAGGCAGTGCAGCCCCTTCTCAAAGTCAGCATGTCAATG 
ANAGACTGCTrGATACTTGTCCTTCNGAAAGCTATGmGCCAACCAGCTTGATGCCCGAAAATCT 
GCAGTTGCTGGGrrriTGCTOCTCCTGAAGAACTrrAAAGTTrrAGGCAGCCrGTCATCCTC^ 
GCAGTCA^^"CT^r^AACTGTCNGT^WGGTTCATGTGNATGTACACAGa^AT^ACAN^ 

SEQ ID NO: 909 ATCCACCTCCGAAACCCCrCGGCGGCGTTCrTCTGTGTGGCCCGCCTGCNGGA 
TTTTAANCTTGACTTTGGCAATTOCNANGGCNAANCANNNCAAACTTGGCGTG 
CNATTT^™AAA^^rCCTGN^^^ATNAATTGTGGGGAGTA^r^ATGGAAAATGAA 
AATNCTCTGGATGAGCNANAAGGGGTTAAAAGNGGAATGNCTGTTGTAATATAAGTTAAAGTrGC 
AACTCATGANGGAAAAGAAATAACCTGTCGAANTNATCTGATGACAAATNACGAAAGTGCTCCCN 
CATGCCCACAGTATAAAAAGATrATrNGCATGGGTGCAAAACAAAATGGTTTGCCGCTNGAGTAT 
CNTGATAAArmAAAGCAATACATCCCTATGACTTACAGGAATAGGTCTCTAATAAA'nTGANAN 
TCATCANAAAGGGGGANACNCANACTTTT^ATAAAANACNAGATATTTCTAAGGGGT^^^^C 
ACCTNATATTAAAATATTTNTAACACTTGA 

SEQ ID NO: 910 GTACTGACTrAAATTrcGAArrrACTAATTACTGGGGATACTn'AGNGAGTCr 
GCATATGTGTATTATTAATACATGrrAAACCATACTGCAATATAACAAAAAATATACTGACArrrC 
TCTTNCANAGAGTAATGACTGTATTCAAAGTCTGAGGGAATGACAAAACGGGATGCACATCTAAC 
ACrGATACACGGTTCTTCANAAAAGACTAGmCAGCTGTTrCCAGG'nTACATAAGATGATGGAA 
GCAGTCTCTAATATGTTAATCAAGAAAATATATGCAATTGCCAGCTACTACATAT ACAG AAAATA 
AAGATOGTGAGTCAAACAAAACATACAAACTTGGTAACrTCTCACCC TCCAG ATArnTGAACATA 
TTTAAAAAArnGAAGATAGGATATCACTGTGATCTTAAAAAGAGAACn^ 
ATTGGAATAGGAAAACTGAGCrrGCTAAAACCAAAAGTANTNAAACAGTTTmC TGATGT 
ACTCrrACTAAAACCa^AArrTAAAACTACCANCTAAATITATCTAAGTGTNGAcirr 
CATTCTnCAAACCTCCTrrATAATATTTTAAAATITCCTAANCKrAATO 
CCCNT 

SEQ ID NO: 911 ACGCGGGGTGGGGGGGGTCCTGGTCTTTGGCTrCTCGACTCGGTCCTGTTTCG 
ACAGCGAACATCTajCGGCCrGTCAGAAATAGGAAGGTrGTTGATTACTCACAGTTTCAGGAATC 
TGATGATGCANATGAAGATTATGGAAGAOATTCGGGCCCTCCCACTAAGAAAArrCGATCATCTC 
CCCGAGAAGCTAAAAATAAGAGGCGATCTGGAAAGAATTCACAGGAAGATAGTGAGGACTCATA 
AGACAAANATGTGAAGACCAAGAAGGATGATTCTCACTCANCAGAGGATAGTGAAGATGAAAAA 
AAAANATCATAAAAATGTGCGCCAACAACGGCAGGCGGCATCTAAAGCAGCTTCTAAACAGAGA 
ANAGATGCTCATGGAANATGTGGGCAGTGAGGAANAACAAGAANANGAGGATGAGGCACCATTC 
CAGGANAAAGATTCCGGCANCGATGAACATTTCCTATGGAANATGATGACCGATAGTGACTATGG 
CAGTTCNAAAAANAAAACAAAAAGATGGTTAAAAAGTCCAAACCTGAANNAAAAAAAAAAT^ 
ANAAGTCCCTCGGCCCNNANCACCCTAAGGCG 
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SEQ ID NO: 912 ACATAGAGAAOAAAArrTGGTnTAGCAAATGACAGAGCCTTCAAAAATATr 
TTTCGAATAATGTGAATCAACCGAAAACTGGGGGCAAGGCAGAGGACAGGTTTrCTCAGGTTAAG 
AGAAAAACGAAATrrrAAAAACTTTAAAAAATACTGATAAATTCGGATCAAATITGGGGGAATAA 
AAAATATTANAGCAAAGGAGTTTGCTGGTTGTGTCATTAriTAATGATCAAAGTATAGCATTGTAT 
GCCrTATTACAGACTTGTrGACTATAGGCrTAATGTAAAAAGGAATCriTGCCCAGATGTNGCT 
TAAGGAAAAAAGGGTTTTTAATAANAAATGACCrnTGATTAGTATGGTGCCNAGTCACAGGOCN 
TATTTTCCTGAmNTTGGGGTGATGNCAAAAGGTAT 

SEQ ID NO: 9 1 3 ACGCGGGCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACC 
TGAGTGCTACTCCCTATTTGCATCTGTTTTGATAAATOATOTTOACACCCTCCACCQA ATTCT AAOT 
GGAATCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCArrANCCTTGTCr nTGG CC 
ATGATGTTTACCrrCAGATTCATCACCACCCTTCTGGTrCACATTTTCATTrCATKjGrrAr^^ 
ATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAA 
TTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGGTTTGCTATCGTATCCACAAN GCATC 
ACGGCAGTGCTGCTCGTCTTGGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGC T^ 
CCAAATNACAAATAAAGCATCAGCAGTGCTOCTTCCTrGCTGTTCCACNCACTGGNGGACATTIT 
GCCATCCTATTTrOTCTGOGNCCTCTGGOTGGCTGGGCrGNTGANCC TGGO AACTGNAGGAGCTG 
CCCATGTTATGGAAGGCGGCCAAAGTNGAATATAAAGCCavrrrCGGGCmCGGGAACCT^ 

SEQ ID NO: 9 1 4 ACCAGCACATGAAGCXrCTTCTACAAAATTCCTGACGGACrGGGAATAAAAAT 
TCCTAGTGACAGCCCACTCOTCTCAGGCAGGTGTGATTGTTTGAAATCTCTCCCAATATTGAOAT 
GAAACCTGCTTCCCTGTAACTTCCCTGTAATTCTGTGGGTCCCTTGTAGCCACAGAGAAGGCAGCA 
ATCAGTAGGGGAAGTGCTATAAAAATATACTATCCCGGCCAAGCGTGGTGGCTCATGTCTGTAAT 
CCCAGCAOTTGGGAGGCCAAGGCGGGCAGATCACnTTAGGTCAGGAGTrGGAAACCAGCCTGGC 
CAACATANTGAAACCCCGTCTCGCTGGGTGTGGGTGGCTCATGCCTGTAATCCCACCACnTGGGA 
GGCTGAGGTGGGTGGATCACAATGTCAAANAGATGGANACAATCCTGCCAACATGGTGAAACCCC 
ATCTCTACTAAAAATACAAAAATTANTNGGGCGTGGTGGCGTGTGCCTTTNANCCCACTACTTGNG 
AAGGCTGANOCANGANAATTTGCTrGACCCrTGGGNGGCANNAAGGTTGCANTTGAACCNA^ 
TCACANCACTGGACTCCAAGCCTGGGTGACANATTGAAACTCC 

SEQ ID NO: 9 1 5 Ac riiu - i ' ri i ' i ' iu ' i ' ri ' rj ' rJU 'i ^ ' r i' i -r rt - i ArmGAGTATTGTnTATTAACCAAA 

ACACAAAACCAAAATGAAAACTGGCrrANAATATAAAATTCTCATTTTTCAAAGTGAAAGm 

ANATACTANCTAAAGTTGATAACrTAAATAGNGGTAAAAGTAAATAACT TAAAATTATG GCACCA 

ATCViCAAGAAGAAACAGAAAACAGGGGGACTAGGGATTCGGTGGTANACTTTTACTTTA AAAT AN 

AGCTATGCAGCANATTCTCCATGACTrGGCTrACATGCAGTATGTCCrATGGAAGTAAAATnTCA 

AATACCTCCATCACCTTCAGTAACTCATATTAATAAAGTAAAAGCCANGTN TATAA AACAANAAC 

CCAGTTrATAAACATATACATCAATTGGATCCOCAGGTAAACCAAAAGGAGTTrTTAAAAATATCC 

CCACCATACCrmAAGACAACTrrrCCCCITCCCTACAGTAGGAAGTACCACCATTCAAA™ 

TAGCTATAAAAAAGTAAAAGGGGCCNCAACCTTTTrrAAAATTAAATGCCACCA 

SEQ ID NO: 916 ACGCGGGGGCCATGGCAGCATCTTCCCTGACGQTCACCTTAGGGCGGCTGGC 
GTCCGCGTGCAGCCACAGCATOn'GAGACCrrcGGGGCCCGGAGCAGCCrCCCmGGTCTGCTrC 
TCGAAGGTTCAArrCACAGAGCACTTCATATCTACCAGGATATGTTCCTAAAACATCCCTGAGTTC 
ACCACCrrGGCCAGAAGrTGrrCTGCCAGACCCAGrrGAGGAGACCAGACACCATGCAGAGGTCG 
TGAAAGAAGGTGAATGAGATGATCGTCACGGGGCAGTATGGCANGCTCTTTGCGTGGTGCACTTT 
TCCACCGCX:A^^^GGAAGGNGACCTCTGAANACTNATCT^AAT^GGAAATOAACTANACTTGCGTG 
NNGAGAGAGAATCCACTGGAAAAGNCCTGCTGGTTGGGGCAGACACrrCACGCTGTTGGCAAGCC 
ACTCTCGAAAGGATCTTGTCGANTAATNCACAN(l\rrOAAAACAANTCITGGCCAAAATC^^ 
AATCAGGAAAGGAAAACTCAAAANAAAAAAT 

SEQ ID NO: 9 1 7 A ClTi nTnU ' i n i ri l ' ri I t ' l l 1 1 1 IGAACACAAGGGTCAGTITCTTCA^ 
ATGAGCAGTCANAACAGGANATGCTTAGGAAGAAATCGTGGCTGGTGCCTNTTCTCCATGCTCAT 
CCCATACCCCAGTGACAGGATACCGClTCCCTGAAGTrAAAAACATGCACCACACTTCCGGTAAA 
GGCTGGAGCCACAOAGGGCACCTAGCAAGCTTGCTTTCAAG GTCCCATCAACAA ATGAAAAGGGC 
AGCCGAGGACAGATTGACAGAAACCCCACTATGTTGCriTlUl-i ri 1 1 1 U 1 rUCCTGAGGCAGAN 
TCTTGCTTThmGCrCANGCTGGANTGCAATGGCACAATCnT^AACTCACTGAACCT^ 
AGTTCANCAATTCTCCrcCTTGCCnrCCGANACTGGATACAGCCCTGCACATGCCTGGTAATTrTG 
irmAm'ANAACGGATTTCNCAmGCCAGCTGTmACTGTGACCTAGGTACCCCTGCTACCTCC 
AAATGCTGGNTA 

SEQ ID NO: 9 1 8 ACAACCrrCAAACATTCavGrmTATAAAAAAGGGGCACACAATCGTGGTTT 
TGATCCCCrmGTTmGGACAAATGTTTCTACAAATACAGATrCAGCAAACCCAAAGGCTGCAA 
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ATTTACTrcCACAGTAATCTGCCAGCCCATTTACTCXACTTAATCCAGCTGAAC^ 

CCAAATGTCCATGGTCATTAGCAATCATAGCAGGTAGAAAGGCrrrATAAGT CCATAA \TGTGCIT 

TGAAArrCACATCAAATGACnTIGCATAAGCTCATCTGGACA GTCA AGGAACrrJ'i'IGCCTOTTA 

CGATTCGGCATTGTTGATTAGGATGGAAACATCGCCGACl^CrilllAACCTGGTCGGCTACTCTAT 

ACACrCCTTCCTTTGGCTGCAATCGCANGTATAGCGTGCACTCTrGTGGCTCCAGOT 

CrrACATGTTCTATrCCCTCCTATTGATTCCAGAAACAAAACAGATCCAGOjGCAAAC^ 

ANACCTTCCAGTCCTTC 

SEQ ID NO: 9 1 9 acaagccacaaaaatgctcacacaagcggtggttogaacaagcatacaaca 

CAAATATTCAAGCrnTGTCTGATGCACAGCCTATGAGCAAATTCAGCCCATGGTrrGAATCAACA 

TGCCAGCTACTATrrGGCTGCTCCACAGTTAGCTCCAGACTAOAArrATTTAAAAAAACAAAAGCA 

AAAACAAGAATGTTGGTCCAGTAGCTCCTAAACCrGTGAAGACTATGACrACTGCrGTGTCTAGTG 

TGTAGmCTATTCCAGCTnGTTTAGGCTTCCATGAAACTAATTTCCTTACm 

ATAATCAGTTCATGGAAGACCATTCCTTCmCCrrCTAAGTATGGTTGAGGACAAAGAAAAGGAG 

AAAAATCTTTGCTGAAATCAAAATGGAACTTGCTCATNAAAAAAGAAAGTGTGTGTCAGGGGAAC 

GCANATATCTATCCTATTTCTTATCTCTACCAGTGAGGAAATGCAAAGGGCAGTGGGTCATGANAA 

SEQ ID NO: 920 ACAAAAGAGCCCCTCTACGAGAAGGACAGCTCTGTTGCAGCCAGATTTCAGC 
GCATOAGGGAAGAATTTOATAAAATrGGAATGAGGAGGACTGTAGAA GGGGT TCTGATrGTACAG 
TTCATATCCCAGTTCTAGAATCAGTTCATmCTAAGGAGTCCTGGTrCCTTnArrGGAA^ 
ATCTCGGCACCAGGTGTGCTCCCATTCTAGTTGTTITCTGACCACATAACTGCrAACAAAGATGCT 
TCACTCTQGCTACACTGATGTGAACmGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAATGA 
GCTCCTACATTACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAATATGCAAAGAAT 
TTTCAGTATGATOAAATAAGACAANCATCAGGCTIKTGACTGTAACCAACACAATATAGTTATAC 
AGATCTGTAGAAGATCCTAAAATAAGAAGTCATTTGCNGGGGTATCAGGGANATCTGNTGTATTC 
GCTTTGCCGCTACANAACACATG 

SEQ ID NO: 921 ACGCGGGGCCCGAGCTTGGAACrrCGTrATCCGCGATGCGTTTCCTGGCAGCT 
ACATrCCTGCTCCTGGCGCTCAGCACCGCTGCCrAGGCCGAACCGGTGCAGTTCAAGGACTGCGG 
TTCTGTGGATGGAGTTATAAAGGAAGTGAATGTGAGCCCATGCCCCACCCAACCCTGCCAGCTGA 
GCAAAGGACAGTCrrACAGCGTCAATGTCACCTTCACCAGCAATArrCAGTCTAAAAGCAGCAAO 
GCCGTGGTGCATGGCATCCTGATGGGCGTCCCAGTTCCCTTTCCATmCTGACCTGATGGTTGNAA 
QAGTGGAATTAACTGCCTATCCAAAAAGACAANACTATNACTACCTGAATAACTAC CAATGA AAA 
NCGAATTTCCmANAAACTGGNNGTGGANTNGCACTrNAGNTNNCA^^ 
TNGAAATTCNATTNCNGGGGGAANTNAGGTTNATmCGGNAAGNNCCmAAAACGGTTCN^ 
AGAGG 

SEQ ID NO: 922 ACATAAGCCTTGATATTCCATTrTGTGGCrGGTCCAAGGGGCAGCCTAACTCA 
TCTTACAAGACGGACTCTAGGGACAGGTAGGTTGGATCCTCATTCCTGACAGTGC ATTT GTCTGAC 
ACACGGGACTACCTCCCTAATGTATGCi 1 1 1 C I i 1 GAGAAAAAAAAAGTAACAQA ! I i l tJ i i TACA 
TCT^GGCACCTrGTAAAG l ^ l ^ ^n ^ ^ ^ l 1 1 i 1 ATACAAAAGTTCAATAGrnTGACACTCCCCATTGTTA 
ATCACTACTTCACTGATAAACrrGGAAAAGTGTGACCCTOGAATTTCATCATGCAAAATATTTACT 
GCAGCAGGAGAAAACArnTTTAAACAACArri-ll'1 1 1 ICTTTTCAAATGTATGAACTTGTTTAAGA 
TAGCCAGGAAGGCAGTGGTAGGATAAACACAAGGGATAGGAATGTATCAAAAAACAGATTAACA 
CACACGCACGCGCGCACACACACACACACACACACACACAAAAACCTGT 

SEQ ID NO: 923 ACCTTTCATTGCAGGATrTCrGCTTAATATAACAAGCAAAAACAAACAACTG 
AAAAAATATAAACCAAAGCAAACCAAACCCCCCGCTCAACTACAAATGTCAATATTGAATGAAGC 
ATTAAAAGACAAACATAAAGTAACTTCAGCrnTATCTAGCAATGCAGAATGAATACTAAAATTA 
GTOGCAAAAAAACAAACAACAAACAACAAACAAAACAAAACAAACAAACAAAAAATCCCACCA 
ATCrrCATGGGTAAACTrrCCTGCTCAGGGATGTAAGCTGACTCTAGACAAGATATTGT 

SEQ ID NO: 924 ACTGCCGAATGTGTTTTCCATGACATACTTCGTAAGTCCAATAAGACTp^ATT 
CTGTAGGAACAACTGCTITGTTrAAATAGTGGCTCCAAAAGCTCTCrrGGAT TAGGG CC^ 
TCCTTTTCrrCTTCCrCATCCCCACTTGTCACAAGGGGAAGTATGCATTrATA' 
AGTTGTCATGANGACATAATNATCTTCmATATAANACTNCNGNTGTGGGCANGAGAGAACTCG 
GTGCCGGGCCAGNTNACTCGNAAAGGOATGTCCTCCCTGAGTTGATGAAGGGCTCGGCCOCCTGC 
GGACGCCTCCNGGAGGCCNNATAANACCCTGTAACTCCCGGCCCGGCCCGGGACCCANACTCCGT 

SEQ ID NO: 925 acgcgggggcagtcacgggggagcgaggcctgctgggcttggcaacgaggg 
actcggcctcggaggcgacccanaccacacagacactgggtcaaggagtaagcaqaggataaac 
aactggaaggagagcaagcacaaagtcatcatggcttcagcgtctgctcgtggaaaccaagataa 
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AGATGCCCATTTTCCACCACCAAGCAAGCAGAGCCTGTTGNTTTGTCCAAAATCANAACTGCACAT 
CCACANAGCANAGATCTCAAANATTATGCCANAATGTChmGAATAAAGTTTCTGNAAGAGAGCTC 
TGCCTTTTrCTCTTGAAACATGCTTGNAC 

SEQ ID NO: 926 GTACGCGGGGTTGAAAAATGGCGACTGTGGCAGAGTTGAAGGCTGT TTrAA A 
GGACACXriTCGAAAAAAAGGGGGTATTAGGGCATiTAAAANCAAGGATCCW 
mX}CCCTAAATGATNACCTGATAACCCKrANCCCGNGGTCNCCATGGTAGG CACGGCNACTAC^ 
TCAAAAGTTGATAGGGCAAAC>m'CAAATGGGTCGTCCCCNCCCCCGC GTAC riiiririlNri^ 
TTmrrAGGGNCTrrCAATrrTrrATTTAAATGCCTT^ 

N^^^AGGAACATGCrmGNCATGGAATGATCT^CNACrGCATAAGACAACATGGTNGAGGTCTGAG 

ATTTTACACAGAGTCATTCAAAAAGTTGCTTGTGGACTACTATGTGAAAGATACAAGACACAAAT 

GTAACirCTTGAGGACAAAAAAGGATTTTTGTGTGGATGAGGGCTCGGCTACAGC^^ 

OOAGCNGCTCAAAAAGAAATGTITCACTICrGCTCTGCTmTGATCCCAATrCAAATGCNTG 

AGTGAAACCCGTGNAGGCNCAAAAGGGTTTGGAAACrCNCAAANGTCCTGGTNGGTGAACA^ 

ACCTTGCCAAGATT 

SEQ ID NO: 927 ACCTAGCTTCTGATGTATGCAAAACACTGCAGGAAGAGAGAATGAAAGAAA 
GAAAAGCTCTAGCTATAGAAGGAATTTTAAAATCAAGGAGAAGAGCAAGGCCCCC GCGTA CTTIN 
ri ITl - l - l ' l 1 1 1 1 1 U 1 U 1 I GGGAGTTGGANAGTATAACCCTGACAGCATCTAAC AAGm TGGGTG 
GGAGCnTAATCTGAATAATCATATATTnTAAAACTAGGGACAATmCTTCTATmC 
ATmTCTACCACCACCTATTTATTCCTATTTATTrGAAAGTGTGACCTCCTTAAATTCT^ 
TCrCTTAATTTTTCTCCCATATTTrcCAAACTGTTCTGTATTGNGGG 

AAACTACTTGATCATCAGCTTGrrCTTATATTCTGAACTCTATTATATGCCTCTGAA CATCA TGT^ 

ATCTCAGGTTATACCTACTTGArrGAAGGAATAAAAAAAAAGGAAAACTGCn'CAGTTTTCAA^ 

AGGATTTAACAACCCCTTTGAACAGAATTTrTAACATTITCATTCAAATATrATOTrA 

CNNTNAAAGAATTTAAAGAAAAATGGANACNATGCTCATGAATGACrrTGANAATGGATm^^ 

C 

SEQ ID NO: 928 A CITl i ' lTl Tn ' rnTi - l ' l ' l I i l N GGAACACAAGQGTCAGnTCrrCAATTCATG 
ANCAGTCANAACAGGANATGCITAGGAAGGAATCGTGGCTGGTGCCTCTTCTCCATGCTCATCCC 
ATACCCCAGTGACAGQATACCGCrrCCCTGAAGTrAAAAACATGCACCACACTTCX:GGTAAAGGC 
IGGAGCCACANAGGGCACCTANCAAGCn-GCTTr CAAGGTCCCATCAACAAA TGAAAAGGGCAGC 
CGAGGACANATTGACANAAACCCCACTATGTTGCl'l-ri'Cl-ri'rri-l l'l l l I I 1 ICCTGAGGCAAANT 
CTTGCTCTGTrGCCCAGGCTGGAGTGCAATGGCACAATbrrAAACTCAKrGCAAOT 
AGrrCAAGCAATThnx:CTGCCTrGGCCTCCCGAGTANCTGG GATTA CAGGCNCCTGCCACCATC 
TGGC^AA^T^IT^^^ATTTTNAGTANAANACGGAN^T^AANC^mT^GGCC^ 
TGTTACCTCAGGTNAACCACXrrTCCTNAACCTCCCAAAATGCT 
NGCCCCCNANCXrCAAAhrrmAAAAAAAGGTGAAAAGCTTrmC 

SEQ ID NO- 929 actggtgcacctcctacatatcaaggaaaagcaaaacacagaataatttaat 

ATGCTGAATAAAAmGTTTACACCAGATACTATACATCATACTGATGGTTGTCCAGTATGAATTT 

tAAOGGTATTATGTTAGATTCTOCAAAATATATTCCTATTATTCACAAGTGAGGAGTCAAAGTCCA 

ACTATTCAAATGGCCATAAACAAAAATGTTGCAGAAGGT ATAGA CCATTAAAAATAAAAAAGTCA 

GGAGTGGGGCAGCTGACCCCrrAGGAGCCTCAGGAATTCCnTTAATGCAAGATAGATGGCAAGA 

GCTGGCnTTTrGGTTAAGTCAGCCAGTrCGGAAACCCATCAGGGAGAAGTTATCAGGTGTCAACrr 

GTAAGGCAGATGACATTCATCAAAAGCATCTTAAGAAGTANNANTGATCACAGAANGGAAAGCA 

TrCANGATTTNCACTANCNACAAATTGCATTAACCATOTCTAGGCCATITGACAGNTGm'ATTCTr 

AACGCTCCATANNCOTGAATCACANGGCCAAATGAGCCGACTAACTCTNACAT 

SEQ ID NO- 930 ACTCGATGTGTAATGAAACCTGAAATAATAAGATAATAAGAAAAGC AATAA T 
TTTCTAAAGCTGTGCTGTCGGTGATACAGAGATGATACTCAAATTATAATAAAACTCrrCATm 
TGAAn'ATAGAAGCrACrriTTTATAAAGCCArrJ"l"Jl"l-lAGGGAAACTAAGGAGTGACATAGAACT 
GATGAATGAGCAAAAGTAAGTmGCTGGATTTTTGTAGAACTCTGGACGTTGAGGATTCATTATO 
CTGTGGTTAACTTTAAATATTmGAATTCCAAATATCTGAATTAATGAGCCTTGTCm 
TCTOCCATrGTGCAACATCGGTGGATTrrCTAAAAATAATGTAAATGTCTTCTATTAAATGTTGAG 
TGCAATAAAATACAGAAGAATTCTCAANANATAAAm-AATANTAANAAAAGTNCCTGCC:NGG<^ 
GGCCGTCGAAAGGGCG 

SBQ IDNO: 931 ACATTAOAAAACTACTTGTGACATTATTrCTAAGTGCAGGAGAGCAGCTCCT 
GGTGGGAGAGTAATGAAGrreTTTGTCATAGTGTATGCCAAGGATTTACAGCACTCTAGAATnTC 
ACAACTCTTCCATGTTAGTGAATGACATrAGGTAAATGTrGTArrrGCCTACrCrCAGCTATCAGAT 
GTGGGATTTCAAACTAGAAACAAATAAAGCATACACTCAACCAACAAGGOTAGTGCAAGTGCTAA 
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GTAAACAGAATGACATGGA.'*TAAArrCATA(nTGCa\ACTAGTGAATGCTrCAGCAGGGAGACCC 

TGATmATGATGTCTGTGACTTGCATCCCTGrrCTCAAATGAATACTGTCACTGCTCX^ATCT^ 

TACAATGACCGGAAAAGGCTGCTGGGGTGAGGGCTGANTGAATGAGGAAGATGACAGGAACTCA 

TTrrCCAAACAAGTGGTTCAATTTATCTGTCACATCTGTATATAGTGTAAAGTCTGATTTGT^ 

AGATCTCAAACCTCTCCANAAAAAGCAGGTC 

SEQ ID NO: 932 ACGCGGGGGGGAACTGCAATTGGTGGCTTTGAAGGCGCGGCGAGCGGGAAC 
AGCTCTTGAGGAGTGAGACTGCAGGAGATGTGGGCCGTCCrAAAGAGATGGATGAGACTGTTGCT 
GAGTTCATCAAGAGGACCATCTrcAAAATCCCCATGAATGAACTGACAACAATCCTGAAGGCCTO 
GGATTTTTTGTCTGAAAATCAACTGCAGACTGTAAATTTCCGACAGAGAAAGGAATCTGTAGrrCA 
GCACTTGATCCATCTGTGTGAGGAAAAGCGTGCAAGTATCAGTGATGCTGCCCTGrrAOACATCAT 
TTATATGCAATTTCATCAGCACCAGAAAGTTTGGGATGTnTTCAGATGAGTAAAGGACCAGGTGA 
AGATGrrGACCTTTnGATATGAAACAATITAAAAATTCGTrCAAGAAAATTCrTCA^ 
AAAAAATGTGACAGTCAGCTTCAGANAAACTGAGGANAATGCAGTCTGGATTCNAATTGCCTGGG 
GAACACANT 

SEQ ID NO: 933 Ac ri " i " n ' ri ' rn - n ' i I ' l Ti 1 1 1 i i 1 1 n gaacccagttacaagaaacaggtctga 

CTTTCrrGCAAAGATTCrGCrrCCTCCTCAAAGrrCAATCTCTCTGCTGATCACCGC 

TCTTCTATGGTGCCTAANAACTTTCCAAACTACCAAGCCCCTAANACTGTGGTITGGArrc 

AGCCCAGGTGCCrcCCTGGCAACACACAGGCCTTCGTCCTCTATTTCCTTGTTC 

ATCATTTTGTTGCATCCCCTCANAGCCTCCCrrAGGAGCGAATATGCANAATTCTCnTGAATG^ 

CTTGAAGTTCTCATTGCACAAGGGGTAAATGAGGGGGTTCAGCGTGGAGTTGATGTANCCCACCC 

AGATGGTGAACATGTGCAAATGTTCATTGCAACAGTTCTTGCAGAAGGCAATGACCATGAAGAAG 

ATGAAATAAGGGATCCANCAGAGGATGAAGGCTGCCATGATAAAACCCAACTGTTrGGCGGCCTT 

CCTTTCGNGGTTCATGTGCAACCCAAATACATACTGTCTTGAATGCGAGCGGAGCCTNTTCCAAGT 

AAAACTTGATGTAATCCAGGCCTGGGTTANACCCACTCCTNAmTGCChrrrGCCTGGGGCTG 

TTGGG 

SEQ ID NO: 934 ACCGAGCACTTTATTCAGTGCATAGCTTTAAGCCAGTGTrGGATTCACTAAGT 
GGACAGTCAGTCTCCCAGCTCTCTGCCTTCCCCAAAAGGGTCGTAGTAGGTCACCCrrCTACAGCA 
GCTAACTAGAGTCCTAACTAATGGGATCCAGCAGGGCCATTTCTCCAGAGGGCCAGTATCCTATTA 
GGAGACTCTrGGAArrCTTAGGTTCTACTCAAGAGTGGAAGGACCAATCACCTCTGATATrCTGTG 
GAAGGrmGGGGTCAAATTCTGCCCTCTGCATTCTGTGCAACTTGTATAAAAGTCAAGTTAGTAT 
TACATGAATTTGGGGTAGGGTTAGTGCTTTGAAAAAATGTTGAACCGGCTGGGCGCGGNGGCTCA 
CGTCTGTAATCCCACACnTGGGAGGCCGAGGCGGGTGGATCATGAGGTCAGGAGTTCGAGACCA 
GCCTGGCCAACATAGTGAAACCCCATCTCTGCTAAAGATATAAAAAAATTACCTGGCGTGGTGGC 
GCACGCCTGTAATCCCCACTCTCGGGANGCTGANGCAGGANAAATTGCTTAACCT 

SEQ ID NO; 935 ACCCTGGCATTGCTGACAGGATGCAGAAGGAGATCA CAGCCCrrGGTCCCCAG 
CACCATGAAGATCAAGATTATTGCTCCCCCAGAGCGGAAGTACl-l'ITn' imTri-lTri'l-l' 1' i'l'l 11' 
GGGGATAAACArmATTTCAAmAAGGTAGTAGCAATACAAAATAAGTTTTGATAATTATAAAG 
TANAGACAATGAAAAATCCCACCCTATTTTTANAATTTTAAAATATTGAGCAT CATTCT^ 
TGGGrrmATGTAAAGNGGGAATrrACATGACACCAGTATCTTGGTAATTCACl'ril'rilGAATGC 
TACATATCCCAATCACCATATATTAAATTCTCTATTTTCnCAAAGTGCTATTG(XACAAGAGACTG 
AACGATACAGCA^rmAAAAAAAGACCCACAAACTAAATTGTAATGTCn"CAATATCTCAATTACT 
GTTTTTTAAAAATAAAAACntjCANTAGTGAATTrCm^ 

ATAACCTCAACAATTTTTATTCrrAGGCATGANCATNTTCATGCTAAGGGNAGTCTATACTT/^^ 
GCNTrATT^rIT^^ATAAAACCTGGGGCACAAAT^mTAAAATaSI^ 

SEQ ID NO: 93 6 ACATGAGATCAAACTGTATACAGAGCTGCCATAACGTATA AGTTAAGCC ATG 
TATATACATTAATACATACATATATGTTCCAATAACATGTTGAAGArrrCCCAGGTTTrrr^ 
TAAGTTATTGCTAGACATATCAAAACACAGTATAAAACTGGTCATCGCAACACCCTTCTGCAGTAA 
TGATGAGAGTGGGCTAGGAATGTGATGAAAGGCACAGAAtrCATAAGACnTGAGTAAGTAGATCC 
AAATTTGTTATCACTAATGGCTCAAACAATX3TGGAGCCACTGATTTTCTGAAGTGAATATAAGA^ 
CAGTAQTTAATAGTATTTATCCAGGCTTCTAGTATAAGACATTGT 

SEQ ID NO: 93 7 ACGCGGGGGAAAATCACTGTTTAGTCrrCTOGAGGCTATOATITTTGCCTTAC 
TCCCAAAGCCACGGAAGAACGTTGCTGGTGAAATAGTCCTCATCACAGGTGCTGGAAGTGGACTC 
GOAAGGCTCTTAGCCrTGCAGTTrGCCCGGCTGGGATCTGTTCTTGTrCTCTGGGATATCA^ 
GAGGGGAATGAGGAAACATGTAAGATGGCTCGGGAAGCTGGAGCCACAAGAGTGCACGCCTATA 
CCTGCGATTGCAGCCAAAAGGAAGGAGTGTATAGAGTAGCCCGACCAGGTTAAAAAAGAAGTCG 
GCGATTTTCTATCCTAATCAACAATGCCGGAATCGTAACAGGCAAAAAGTTCCTTGACTGTCCAGA 



134 



wo 02/29086 



PCT/USOl/30732 



TGAGCTTATGGAAAAGTCATnGATGTGAATrrCAAAGCACATTTATGGACTTATAAAGCCrrrCT 

ACCTGCTATGArrGCTAATGACCATGGACAmGGTTrGCATTTCAAAGTTCAGCrGGATTAAGTG 

GAGTAAATGGGCTGGCAGATlACTGTGCAAAGTAAATTTGCAGCCrrTGGGrrTGCTGAAAT^^ 

ATTn-GTAAAAACATTTGTCCAAAAACAAAAGGGGGATCAAAACCACGATTGNGTGCCCC^^ 

TATAAAACIXjGA 

SEQ ID NO- 938 CGCGGCGAGGTACCCAAGCCAGAGCTGAGACATGGCTCCCCAGATGGACTG 
GCTGTGGTCAGGAAAGQCCTOTAGAGGGAGCTGAGGGCTCAGAAAAT ACCTG CTCCGGGTGCCTG 
GGCTCAAGrrCTCATTCCATTTCrrrcATGCCACTGGCCACTGTATCTGCTTITOTAAAAAOT 

aaaagttatacatcaggtatctctagtcancttcctccgctgccacctctttgccacagatc^ 

ATCACTGTGAATGGTGGTGACCAGGTTGGGCANGGCAGGGGCTNGTCCTAAGTGCCTTAACCCAG 

GCCCCTNGOCCATNANACCTTNAAACAGNAATTTTCNCmCrimTAATNCAm 

AACTGGGGGGTANTGTmATACANAATAGCTNGATTAGAAAGGAA 

SEQ ID NO: 939 CGCGGCGAGGTACAGTGACCTGCAGAACTTAGCCAAGAQTCTGGOTCTCCGG 
CK:CAACCTGAGGGCAACCA.*.GTrGTTAAAAGCCTTGAAAGGCrACATrAAACATGAGGCNNG^ 
AGGAAATGAGAATCAGGATGAAAGTCAAACirCTGCATCCTCTTGrGATGAGACTGAGATACAGA 
TCAGCAACCAGNAANAAGCl'GAGAGACAAGCCACTTGGCCATGTCACCAAAACAAGGANAAGGT 
Ga^AGACTOTCCGTGTGGACCCTGACTCACAGAATCATGATAAGCAGGAAAGCCAGGATCTNAAA 
GCTTCTGCAAAAGTrCCITCrCCACCATACGAGCCCAAGAANCrrGTNAATGCTNTTCCTrAAO 

SEQ ID NO- 940 ACTGTCTCTCCCCAGAAGGCCnTCAAGGTrAACACACAACAKrGCCCT 

CTTGATrACTGGCCTGGGCTTACAAAGGCGAATACTCAGACAGAATCAGGAGGCACTCACGCITA 
ATGAAGGGGCAGGGATCTTGCAGCCAGCAGATTGCCTCGCCTATATTCCTmCCCTOGGACTC^ 
CTGAATTGTCTTITGGGTGTGGNTAGCTGCNAATNNCANCATACArmCTTrATATO 
AANATNGATTNCTTATTGCA 

SEQ ID NO: 941 actcatcactctgtccatacgcgatcacaatatcctctagttcttccatcaca 

GTCTGCGCACATTTGGTCATCAGCTGGAGAGCACGGNTGTCATTGGGTTTTGCAAAGTTGTGC^ 

tcatcaaaccnatggaaattcoggccgtccagccgcactaccacccagcngtotgccagggcagg 

TGTNGTNAGC^^■CTAAGCCCTNACOTACCTCGGCCNGTGACCACTCTANGGGCNAT 

SEQ ID NO- 942 acgcggggacagcgcgcggaagaaaaaccagcaagaaggcggcgggggaa 

GATGGCGGTCCrGGGGTAGAGTTTGCAAGCrrTCTGACTAGGCTAOTCOAGTAACTATTCGGGTCA 

TGGCGTCAAACTCAACTAAGThnrrCCTGGCATATGCCGGCTATGGCNAACAGGAACTGGATGCC 

AACTCTGCCCTrATGGAATrGGACAAAGOCCTNAGATCTGNCAAACTTGGCKAACAGTGTGAAGC 

ArrrGTTCGCTITCCCAGACTTTTTCAGAAGTATCCmTCCCTATTCrrATCAATT 

AGrrAGCTOATGTTNCAOANTrcGAAATAAmCCTNAGGCTATGTGTTCTTAAATrACCCAAC^ 

GT^GATAAACNT^GGANGANATT^^^AAGNTGATTAATTTGTGAANACAATTT^^^ 

AATONCNrGGGGNAANACCAmCCCTCGATTrrGKITCrGGATNATm 

TTAATAT 

SEO ID NO: 943 rrAAAAAACrrCGAAAGTCACAGACACAGAATTTAGGAAGCTGAAGGCTGAG 
AGTCrCCCrTCTCACTTAATCCATGCrrrATmGCATTCCTCACAGGTAAGGAGGCAGTC^ 
ATGCTGTNGACCAANACCAGCCCX:ACGGAGCTGATCTmAAAAAAATGGAATTrAClXrrGGCATA 
CrCCrATGTATCATACCTTTCCAANGNCAAATCCCAATAGACCTGNANGTGCAANTTrGGGCAATG 

ATCCA 

SEQ ID NO: 944 ACCrCTTCCTGTrCGAATGGGTTATCCAGTAAAAAAGGGCOTGCCCATGCAA 
AAGQAOGGAAATCTAGAACrmAAAGATTCCCAATmCTGCATTTGACTCCTGT^^ 
AAGCACTGTGAAGCCCrrAAAAGATrmGCACTTGAGTGGGCCAGCCGCACTGGACAAGTGACG 
AAGAAATGTGAAAAAGCArriTCCAATTrGAAATTGACAGCACTTGATTATGrrTCATCAAGAACC 
ATCTGTTCGGAACCCCAGAACNCNAArmATrChrmAAANTAAANCTITCCAGrrrG^ 
ATTNANCACNCNAANAAAAAAAATTAAmAAACTTrn-AGGAANAACeAATACTTC 
NATTGTGCrrACCATCNAAANAAANANGTCCCTTTAAGGAGGGGCNAAATTACCATTATNCCAGN 
GT^OTCTACTAAC^nSIT^fN^mACC^TNAGTCTTTGGAATNCTG 

GACTNAAGCCNGACATTGGGAANAArrTNmTGGGGAAAAAWGCTTATCNCAAAANAAAATAT 

TCTTGGAAACTJCrrCTrCCANATTAAAAGCTNCTTGANAAAAATATGNGAAArrAAAT^ 

AGCTCCT 

SEQ ID NO: 945 ACTGGAAACTAAATCATATrrCrTCCCTCCAAArrrCACCCATrCCTGACm 
AATCAATTGCAGAAATGCAGGTGTGTTACrnGTTGATCAATAACTTTOGAACAArrATGGATC^ 
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TTCTATGGTCACTCTGAATrn'CATGTCArrAATCACATAAAAATTGATA ATACC rCATTC^ 

ACAATATGATmATTTTGCCAAAQGCAAGACACCTATAGTTG AGCTG TATTTTGGGGGA 

GAGGAAGGACTTCTATCTTATCTCAACAAAAAACTGGCCAGTATTmGTTAATGTAAAGCn'CCT 

TTTCTTTCTAAAAAATAGTAACAAAATTATTTTTCATTGGCCTATrCrG^ 

CATTACATTAATTTTTAATCTTAGTrrCTGATAAACACAAGCCATTCCTATCAAAAATATTAm 

TTCAGTCAAATTTTACCAAAhrrAACAAAAGACCATTTATTNTCCGTTTTT^^ 

NATGATTTTTTGACANGCTTGTTNCCTCGTCCGTATAAAATITmCCAATCA^ 

C^^^ACTCTGGGCATATTTrmGNGAAGGTTATACACATTTGAANAACCCITAAA^ 

SEQ ID NO: 946 ACGCGGCjGAAATCAGCAAACTGGGAATrrCTGGTGACATAGACCTCACCAGT 
GCTTCATATACCATOATATAATCTOAAAOGGGC AGATr AAAAAAAAAAOAATCTAAACCrrACAT 
GTGTAAAGGTTTCATGTTCACTGTGAGTGAAAATnTTACATTCATCAATATCrCrCrTGTAAGTCA 
TCTACTTAATAAATATTACAGTGAAAAAAAAAAAAAAAAAAAAAAAANTANAATAAGAAAATAN 
AACAC 

SEQ ID NO: 947 ACTAAGGGGACAATACACCAAATTrGTrGAGTTTACAATCAAGTCTACTAAG 
GTTGGACTTCCTTATCAGTTrGGCGAGTCCCAOGGCAGAATAATCATCCATCTACAGGTCTCT 
TCCTCTCCCTCCACAGCAGTGGAGAGCATCCCAGTGTTTGGGGCACrGTGTrCCTCrTCGTCCCTGC 
ACCAGACCCTGGAAGCCTrGGCCAGAGACCTCACCAAACTCGACTTGCGGCGCTGGGCCAGCTTC 
ATGGATGCTGGAGTGGAGCACGATGACGTAGCAGAGCTGCTGCAGGAGCTACAAAGCCTGGCCCA 
GTGCTACCAGGGTGGTGACAGCCTCGTGGACTAAAGTTCCCAGTGTGGGAGAAAGGAGCTAGTTT 
GCAATAAAAACAGCTGGATGCAGGAGCrCAOTGTCrrCATGCAGAGGAGCTCAATGTCGCGGGAC 
TAGCTACACCAACATATGCACriTrTACATTrAGAAAACACTGTGATTAGACCACAGAACAATTAA 
TATGTGCCATCANACCAAAAAAAAGTNNGAGAAAGGGAGCTGAACTCXOTCTTCGATGCTAr™ 
CAGAAGGACATTNTGTAAAAGTNNTNNATAAAAGACCTTGNATTGATGCC^ 
CCCCTGGG 

SEQ ID NO: 948 AC inn - rin - lU4 " l - ] ' lM4 - illM - i - lU - J - ri - iU - l 'GGATATTACACCATAGGTTTTATTAA 
CGATAAATGTTTGCATTACrmAAAAGCTTAGCTCITACTAAGCATTCmAACA 
ANCAAGAAATCATTTGCCATACGGAAACTATATTCACAAACAAGACITTAATCCAATATTGAAAG 
CTAAAGAATTAGAAAAAATACAAAACACTGCTATGAGTC AATT GAACTOCTATCArrOAATrrGCT 
GCATTTAGAATGACATAAACATACTGAACATAAAAACAATmATGGATTTATTCTATAAGACTAG 
CArTAAGAATOACATACAAmrGTGATTTCCmAAAAATAATTTTrrACAACANAATCCATITGA 
ACAAAGGGT Cl 1] 1 11 1 J 1 CCCCTCATrTGAGGGGAAGACAATCTATGTTrCCCAAACAGATC CTCC 
TTTCATACTAAAATAGCAAACTGNGGCCTCCATCTCCTmCCCAGATGCTACTTATAGATGACTTr 
GCATAATAACrrAATTAAGAATTACTrrrrCTGGTAACAGTGTCAACGGCCATAAATAATCAGm 
TTAAAAAACAAACATCAAAGTGCCAAATm'AAAAAAACTTCCTrrAAAAGAATTACC 
G 

SEQ ID NO: 949 ACACATCAAGTCAGAATGGGCTAGCCCATCAGGGAAGCAGCGGTAGAAGAA 
ATCTGGGCGTGGCCTCCCTACGATCAGTTTTATTGTGTTGGTAAAGACGCCATTCAGAGCCAGGGC 
AAGGCTGGCAGCCAGGCAGGCTTGTCTGCTGTCTCTTGTOTCTGCCITAAACATCGGCTrGGTGGG 
GAAATACrCCGCCTCCACGTAGGGGTTCCGGTAGAGCCACATCTCCTCCGGCTGGATGAGTCTCTG 
GAACGGGGGGAQCAGCTCCGTCACCAGGAAOGCCGCGAACAGCGCGAGCCGCACGCCCACCCCC 
GCGT 

SEQ ID NO: 950 ACTTCCAGCCAACCTCGTGAGCCAGGCGCCCC AGATAG GCAAACnTrCTTC 
AGGCTTCAGACGCACGACCTTGAGGGCAGCAGGAACCACCATCCGCTTlTrCTCGCACGGTCCGC 
CAGAAGATGCGGCTGGGGGCCCGGAAGTGGTAGGGGCCTCGGGAAGGGTTGGTGTTCATCCGCTT 
GCGGANGAAANCCAAGTNCCTTGGCCCNNACCACGCNT 

SEQ ID NO: 95 1 ACGCGGGGGCCTTACAGTTGCTGAGAGGAGGCGAGAGGCGGGGGCGCTAGG 

gccgagatcatgtctgactgggagaggntccttggcagcagaggacgctaggtttgggatgaaa 

gaagctgggcagatgcaaaatctggagagcgcgagggccgggcggtcagtcagcacccagactg 

gcagcatcaccggtcagataccaaggctttctaaagtcaaccttticactctgctcagcc^ 

tggagctctttccagcanaagcccagcggcaaaaatctcagaaaaatgaagaagggaaaagcat 

ggacccttaggagataatgaaagagaagacccagagtntctactgacaaaaagacangtaaaga 

aaaactggtcttgtggtgggtgaaaaacattaaaattgkrgggmrcnactgtt^ 

tttacattccnggoccaaattgctttn'atnaaacattoggttaagggctgaaaaact^ 

TAKrnTrCCANAAAACCATGTTTTGCTrrGrrTGAAAAATGATTGGAAAATGGAT^ 
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SEQ ID NO: 952 A CTri ' l ' JTrrJ 'l' l ' n ' l 11 1 i I'lTlTn-UTi'l 1 1'AAATGGCCCANGCTGCCTTCCT 
GTCCTCCCATCCTGCATCCTGCTTTCrGTTCCCTGGATACCGTAGGATGGTTTTArrrCAGTTCA 
CNCAAATTANTCTGGACACTGNGGAGTCATAACAANAGTGGGATGGAGGTTCCAGGGCANTCATT 
TTCr^TGGAGGGAAGCTTNAAAA^^^ANGTTAAANCAATGCCCANAAAANCCACTG^^ 
CCATCCCCAGCAATGANCTAAAAACCAACCTCAGCCAAAAAGAGGGTCITTGATCACAGAGGGTC 
ACNNAGTGGGATGTGAT^AGTGGGTTTGGGGATGOAONCCCA^r^TAAACCAAANGGCAAAGGGA 
TCC^^WTA^™CNGGCTTAANGGGAATGGGGTrNAAAATTAAGNGGTTTGGGGGACr^ 
GGGGANCCCCAANGNNGAGGAAAACr^AAAAANG^^^GNNNTGGCNAAANNTAC^^^^G^ 
ANCCTGCOTGCrGKTAC^JNCTANCCCCCATTTATGCCTTTGNTAAAT 

SEQ ID NO: 953 ACAGTTGAAAGCAGAGTGTAACAAGGGATATGTCAAGGTAAAGCAGGTAGG 
AGTCAATCCCACCAGCATTGACTCAGTCGTAATTGGGAAGGACCAAGAGGTGAAGCTGCAGCCrG 
GCCAGGTTCTCCACATGGTGAATGAACTTTATCCATATATTGTAGAGTTTGAGGAAGAGGCAAAG 
AACCCTGGCCTGGAAACACACAGGAAGAGAAAGAGATCAGGCAACAGTGATTCTATAGAAAGGG 
ATGCTGCTCAGGAAGCTGAGGCTGGGACAGGGCTGGAACCTGGGAGCAACrCTGGCCAATGCTCT 
GTGCX;CCTAAAGAAGGGAAAAGATGCACCTATCAAAAAGGAATCCCTGGGCCACTGGAGTCAAN 
GCTTGAAGATTTCTATGCAGQACCCCAAAATGCAGGTrTACAAAGATGAGCAGGTGGTGGTGATA 
AANGATAAATACCCAAAGGCCCChTITACCArrGGCTGGTCTTACCGTGGACCrcCATTTCCAGTCT 
GAAGGCTGTGGCCNGGGAKACCTTGAACTCTTAAGCATATGCACACTTGT GNGGG GAAAANGTGA 
NTGGTANATTTTNCTGGGTNCAANAAACrCCGmTCCAATTGGCCTNCC^ 

SEQ ID NO: 954 ACCTAATGAAAAGATCTCCAAGAGGTTTGTCTCAITCTCCTrGGGCTGTAAAA 
AAGATTAATCCTATATGTAATGATCATTATCGAAGTGTGTATCAAAAGAGACTAATGGATGAAGC 
TAAGATmGAAAAGCCTTCATCATCCAAACATTGTTGGTTATCGTGCTTrrACTGAAGCCAATGA 
TGGCAGTCTGTGTCTTGCTATGGAATATGGAGGTGAAAAGTCTCTAAATGACTTAATAGAAGAAC 
GATATAAAOCCAGCCAAQATCCTTITCCAGCAGCCCn'AATTTTAAAAGITGCTTTGAATAT^ 
GAGGGTTAAAGTATCTGCACCAAGAAAAGAAACTGCTTCATGGAGACATAAAGTCTNCNAATGTr 
GTAATTAAAGGCGATCTTGAAACAATTAAAATCTGTGATGTAKGAGTCTCTCTACCNCTGGATGAA 
AATATGACTGNGACCTGACCNTGNGGCITGTTACATTGGCCCANACCCATGGGAAAC CCAAA NAA 
GCNGTGGANGANAATGGTGTTATTCTGACNAGGCNNACATAraGCCrrTGGNC^ 
NAAAATGATGAACTTTTNTCNTTCCCACACATTTAAhn^TACAGATGATGATGNATGATGAAANA 
AAACmTGNNGAANA 

SEQ ID NO: 955 ACAACTCTTGCTAATGGAATGCTATAATGCACAAGGTCAAGGA'ITTAATAAA 
TTCTAAAAGTGTCTACATATATCAGTGATAACTGTATTATTAGAAATATAAATGTATAGAAATATA 
AAGTATATGGTATTAAAAACAGACCTTGCTAATATAAACATATATAAAGTATGTCACrrCTCCTGT 
AATAACAGCATAAAGATCGATCTACAGTTTGCCCTTCGCCTGGCACTCrrAAACCACTCCTCCAAT 
GGTCAATGTTGACCTTGAATCAACAGCCGCTGAACCCAGGAGACCCCCACAGATGTGTAGATTCA 
GCACCTAGAGGGCCCCCCTACCCTCTTGTGCTGTGTGTTCCCATGACTCCAGAAATAATTAATCGC 
AACTTGCArTTTAANGTCCACAGGCAAGTITGAAATCTAACCTANAAAAAGTGNANGCANAGGCA 
AAATACGCGGGAATTTGTTATAAAAGCAACAAGATITNCTTAAAATGCTrCCAG'rTCAAAGT CAA 
AATTAAGOTGACATNAAGGTCCCACCANCrmACAGAAGTTGGGGATGTTTTGNTGNTO 
NAAAAAAGAAANAATCTNCAATAAACATGTNNATTTGAAAAAAAATCNTGNGl^AACTT^^ 
ACCCATCCCCAA 

SEQ ID NO: 956 TGCTGAAGCrrCACAGGGCGGCCAAACTAACTCGCTGATTnTGCAAGACCA 
CAGTGTAAAGGTCGGATGTCCACCrGAAGAAGGGGTGGGTGCAACTCTCrGGGTGCTGCACACAC 
CATGACCANCCTGGGCATGCATCACCCCAGCTCCCATCCATTCACACTGGTTGCCTTNGTGAGGTC 
CATTTT^fAGAGGGCTTTCATAGGCC^IT^^AATGAAAAAAAAAAATATCTGGTCTAC^ 
AAGATNCATACACCTCCTATTTAT 

SEQ ID NO: 957 ACCAGCACATGAAGCCCTTCTACAAAATrCCTGACGGACTGGGAATAAAAAT 
TCCTAGTGACAGCCCACTCCTTCTCAGGCAGGTGTGATTGTTTGAAATCTCTCCCAATATTGAGAT 
GAAACCTGCrrcCCTGTAACnCCCTGTAATTCTGTGGGTCCCTTGTAGCCACAGAGAAGGCAGCA 
ATCAGTAGGGGAAGTGCTATAAAAATATACTATCCCGGCCAAGCGTGGTGGCTCATGTCTGTAAT 
CCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCACTTTAGGTCAGGAGTTGGAGAC CAGC CTGGC 
CAACATAATGAAACCCCGTCTCGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACrTTGGGAG 
GCTGANGTGGGTGGATCACAAGGTCAAGAGATGGAGACAATCCTGGCCAACATGGTGAAACCCC 
ATCTrmn'AAAAATACAAAAATTAGTTTGGGCGTGGTGGCGTGTGCCTGrrAANCCCANCTTCNTN 
GGANGCTGAAGCATGAGAATTGCnTGAACCTGNGAGGCAGANGGTTGCAANNGANCCCATNATC 
ACACCANCTTGAANTTCACCTGGGT 
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SEQ ID NO: 95 8 accacaaagggacccaaattcagcggtctgtgcctacaaacttcattaataa 

CTGCTTGCAGATTGGCAGCTATCTGGTCACTIXSACATATCCV^TGrrGCTATm 
AGTTCTCCCrrrCITCATCTACOTAATTTCATGTCCATTm 

TCACGGCCAATGGAATCn'GAAAAmAATTCCATGCACTGAAAAATGTGAATTATTTAAAOGOTA 

AGAAATAGAATGAAGACTGTGCCACTGAAGAGACATCACTCCCACCATGCTCAAGCAAGTCTTCC 

ATANATGAATCAATGTGAGGAACTCATTACATAOAGAGCACGGCATACCCTCAGCATACATTATT 

CCrAGGATTATCTCAGCAGTTGTTCAGTTGGCTCANGAGCAC TACAGC TATAAGTGAAAGCCAACr 

ACCCATAGGCCAGTTTTAAAAma^AGAAAAGCTTTGGATATTrmATTCA^^ 

CGATTAAAGTrCAGGCATTAmcrGGAACAGCTACGTAAAAAGTCTGCAACAATTT CATC ATCCC 

AAATCnrCTGTATCAGAACTAGTGCCrCCTGCNAATGCANCnTCATTTTCTTCCGCTAT^ 

SEQ ID NO: 959 actgoagctatactgacatcagtaatagtgacacccctggatgttgttaaaa 
ttagactccaagcccaaaacaacccactccccaaaggaaaatgrntgtatatagtaatggactc 
atggatcatctatgtgtctgtgaagagggaggcaacaaactatggtataagaagccaggaaattt 
ccagggaacattgagtgatggcagttcctgccacagttatttatttracctgctatgatcaa^ 
gtgctcttctgagatctaagttacgagaaaatgaaacctgcataccaattgttgcctggaattgta 
nccanatttggtgcam^iacrgtgataagtccactagaattgattagaaccnacatgcnatcc^ 

AGAA N r JT l i C 1 r A CmTGGAACTGCATt^GATTTGTCAGCATGAAAAGTATCTTGANNATGGTNG 
GAmCCCrrraGGAAGNGGCCTGGGCTCmACTGTrrCTrATAGATGTNCCTCNG 
CTCCTAAGGGCGA>nTCCAACACNCTGGCrGNCGTTACTAGNGGATCCCACTNCGOTNCNANCm 
GGAGNAATCATGGGGACAT 

SEQ ID NO: 960 ACGCGGGATrAAAAATrrCTTGTATTTCITGTGCATTAATCTGACGATA^ 
CCCTGTATATTATGrrCATTrAGCTGTTTGTAATTmGrrAATTAGATCAGGTTGTCTQCArTO 
GGTGTAAGTGAACATCATCACAGTTATCCTGAGTTGAGTTTAAGCCAAAT ACATG CATAGAAAAG 
GGTCTrCCTATTAATGGAAGAAGOTAATOrnAGGATGTGTATTATTTCAGTmGTATGm 
TTTATTAAATAAAGTGTTITI AAAACC AN AT 

SEQ ID NO: 961 acaaatcx:agcaaaactggaacagaacagggtaagaaagcacatagcaact 

GCTTTTTCAACAGTAAATmGACTATTCCCCTAAATATCTACCTAGTGGm 

GGTCTTTGGCrrTAAAAAGAAAGTCAAAAAATl'rTGACITTTAATGACrrGCACAAT^ 

CCAGTGTTTACCTGGGTATTTTITGTCACCTGTAGTTTACATTCCCTGCTACrGTTAAAGAAACAG^ 

ATTCTACAGTATNCAATTCTGTATATTGTCrmAAGGTTmCAATCANGACTCACTACTACT 

TGAGGAGTCCGTTGACATAATCTCTACATCATCTTCA'rri'l'Cl''riATTATGCCCAGGAGGTTCCAAC 

AAAAAAGTCACTACTATGATTTGGTGGTAACATATTTCATCGACATGTCATTTTGACTGCCAATGG 

TTACTGTCGGAGCCAAAGGACNTCTGGCAATCTTCCTGCTGCAANGTGCNTrnrCCTGGTAi^ 

CNATTCACGCCNnTAGTGGA 

SEQ ID NO: 962 ACGCGGGATATACACrGGAACACATGCATGCmTGOAATGTATAATrACCT 
GCACTGTGATTCATGGTATCAAGACAGTGTCtACTATATTGATACCCTTGGAAGAATTATGAATTr 
AACAGTAATGCTGGACACTGCCTTAGGAAAACCACGAGAGGTGTTTCGACrrCCTACAGATTTGA 
CAGCATGTGACAACCGTCrrrGTGCATCTATCCATTTCTCATCTTCTACCTGGGTTACCnTGTC^^ 
TGGAACTGOAAGATTGTATOTCArrGGAACAGGTGAACNGTGGAAATAGCGCrrCTGAAAAATGG 
GAGATTATGTTrAATGAAGAACTTGGGGATCCTTTTATTATAATTCACAGTATCTCACTGCT 
GCTGAAGAACATTCTATAGCTACCCTACTTCTTCGAATAGAGAAAGAGGAArrGGATATGAAAGG 
AAGTGGTTTCTATGGTTCCTCTGGAGGGGGGTCACTATCAGG^fNANAAAAATCAAGATAATNAAA 
AAATTTGAAAATOm'AANCNGTGATATTTNrNCGTTGGAAAGTCAAGTGC^ 
ATTGGACCCTGATGGAAAATGOTCTAATTGANTrOhfbrraCOTACAGNGTCT™ 
AG 

SEQ ID NO: 963 ACTTTGACTTACTAGGGTGATTCAAAGITTCAGGAAAAAGAAAATT CCCAG T 
ATC^^^rmcr^AATCT^ATTAAACCCAAACATAAGAATGCCA AAAAA TACAGAGCTCACATm 
TGGCATACATTTCCAAATTTTTAATGCCTCCCTGACAGGTGAATTTTAAGGATAAAAAAAGCANAC 
NCTTNCAAAACATTCCnTGTGATGAA>WANAAAAAGCCCTGGATAANTGGCCAGCTNCACTGGAT 
TTTTGTCTAAATTCTTNC:ATTAAC>rrTTCAGTmCA 

SEQ ID NO: 964 ACAG-nXjAGGAACTCAGTAACCAGATATrATCTGCACGGAGTTGGTTGCAAC 
AGGAACAAGAACGOATAOAAAAAGAGCTTTTACAGAAAATTGATCAGCriTCCrrGATTGTT^^ 
GAAAACAGTGGAGCCAGTGAAAGGGATATGGAGAAGAAGCTCAGCCAGATGTCAGCCAGGCTTG 
ACAAAATAGAAGAGGGTCAAAAGAAGACTTTTGATGGTCAGAGAACAAGGCAAGAAGAGGAGAA 
GATCCACGGGCGAATCACCAAGCTGGAGTTACAGATGAACCAGAACATCAAGGAAATGAAAGCA 
GAAGTTAATGCTGGGmACAGCCGTCTATGAAAGCATAGGATCCCTNANGCAAGTTCrCGAGGC 



138 



wo 02/29086 



PCT/USOl/30732 



CAAGATGAAGCTGGCNGGGACCAGCrACANAAGCAAATC CANC TGATGCAKAATCCANAMCCCX: 

CATGTGAANGGAGCTGGGACAAGOTCXCTAAAAGACANGTTTTGCCNGTAQGGGCNTAGGAGCC 

GGGTACNCThrrGTrGCCAAGGCCTTGNTTGCATTNAGGATrGTCCATCCATNGGGTGCATN^^^ 

AANAAATm-GTTTTNATGGGNCCTAAATGKTTACCNTNGGGAT^^ 

NNNTATNAAAATCT 

SEQ ID NO: 965 ACCTATITCTAAACAATGATTNAAAGTCTNTATCCCCTAAGCGGAGOTGTrGT 
NN^^^CTCCCTAATCTATCACCTGCACTACTTGAGAAAAT^^'AAAGTGTTTCTAN^ 
CnrCTTGAGCGATCTAATGTITCTrGTAATATrGATGANCCTACTAATNATCCTGCTGTN^^ 
TAACGCTTAATGAATAAAATGGCNCT 

SEQ ID NO: 966 ACGCGGGGACTTGTAAGGAGGAGAGAAGTCAGCCTGGCAGAGAGACTCTGA 
AATGAGGGATTAGAGGTOTTCAAGGAGCAAGAGCTTCAGCCTGAAGACAAGGGAGCAGTCCCTG 
AAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCTCrrGGCCTCCAACTTGTGGGCTACATCC^^ 
GCCTTCTGGGGCirrTGGGCACACTGGrrcCCATGCTGCTCCCCAGCTGGAAA^ 
TCGGTGa:AGCATTGTGACAGCAGTTGGCTTCTCCAA>n^GGCCrNTGGATGGAATGTGCCACACA 
CANCNCAOGCATCACCCAGTGTGACATCTATAGCACCCTTTTGGGCCTGCCCGCTGACATCCAGGC 
TGCCAAGGCCATGATGGTGACATCCAATGCAATCTCCTCCCTGGCCrGCATTATCrCTGGTGGTGG 
GCATGAATATGCACAGTfmTNNCCANGAATCCCNNAGCCAAAAACAGAWGGTCGGNAGC^ 
TGGAGTTCTTnTCATTCTTGGGANGGCCTCCTGAGATTCATTACrrNTNCCr^ 

SEQ ID NO: 967 ACAGCGGGATTAAAAATrrCTTGTATTNCTTGTGCATTAATCTGACGATAATT 
TCCCTGTATATTATAGTTCAmANCTGTTTGTAATT^raGTTAATTAGATCAGGNTGTC^GN^ 
GITGGTGTAAGTGAACATNATCACANTTATCCTGACTTNAGTTTAAGCCAAATACATGCATANAAA 
AGGGTCTTCCTATTAATGGAA 

SEQ ID NO: 968 GGTACTTAAAGATGGGATGGAGTTCTAAAGTGCrnTATAATACAATATAATT 
GrrAAAGGCAAGGGTTGACTCmGTmATTrrGACATGGCATGTCCTGAAATAAATATTC 
AATATGGCAAAAAA 

SEQ ID NO: 969 A CnTlTf rn ' iTn ' l ' n 1 1 1 l IGCTGTCAAAACGTn'A'n'GCAAAATGGAGTCT 
TANAACAAAGGAAAGCAAAGAAAAGTTCACATCAAAATGAAATGTATGACACCAACTTGGATTTC 
TGAATACACTGNGGACTTTGTGCTGGAATATCAAATTCCAACTATAAAATCAGTAACTGAAATAGT 
CrrACCACACAAGAGTAAAATTrAATCnCCATACAAAC ATTATA CAAGATATrrGGCATAGGACr 
TGCTCAGAATAACATTGCAATAGAATAATTGAGAAAAAGTT TTTGT TAAAAAAACAATAAAGNGA 
TAATGCCATOTATTCAAAGCTCACACTCAAAOAATATAAATATTrrCTATTAm 

SEQ ID NO; 970 GGTACCCCAGTTOTTGrrAGTGGGGACTATGATACTGTAATAA TATTT TTA^ 
AATTTACATCAAGAGAGGCAGTCATTCACGATGGTmGTGCCAGCTCTTmAGGGTITra 
ACATTANAGATATrrAGAACATATTACCCTGTGACTrACGTAGGAAACXTrAATATGCKlAGTATCT 
GGCACTTGAArrCCTGCTTITArrGCTGGAGGTCCACATCTGTGGTrGACCTCTGTTATTGTTGAAA 
AAAANNAAAATGAGGANAACGTTCTCCrrCTNCGATGCCTGTGAGGAGCCACCAACATTTGANGC 
TATGGANCTCArrGGTAAACCNAAACCCTACNWGAGATTGGNGAACNAGTCTNTTATAAGTGTG 
AAAAAGGATACTNCTATATACCTACTATTGCCNCCa^CTArrTGNGATCGGAANCATACATGGCT 
ACCTNTTTANANGACCCCTGCAATATAGAAACATGTGCATATNTACGGGNTCT 

SEQ ID NO: 97 1 ACAACGCTCACCCTGAACATGAATTAGCTCTGACACATCATAAACCAAATTA 
OOTCAATACAAGCTTATCTAAAGAGATATrAATACTITCITCAATTAAACATGCTTGGAGG^ 
ATGATAGAAAAAAAAAATCATGCTTCCTAAAATGCTGTATATGAGAAAGTAACTT GGCI IIIAAA 
GAAAAATATCrCTITAAATACATGGTTTAATAAATTACArrCCAATrGACTCAACATrrTGGTTGA 
CTGCACTGAAATTOTGTAATTACTATQTrmOCTCCCAGAAATACITCAGTTTGCTC 
ACACTCTCATGGAAAGTTmCAAAGTTGATGTCCTCCACAT^r^^CAAGTCACA TTAAG GCTGAAC 
AAGCGTGCACATATTCCCCAAATTATCACCATAACAGTTTArnTCCAAACGTATT^ 
AAAAAAAAAATNCACGCAAATAGTTGTCTCTAAGTCTTGTTCCCATAGCTC 

SEQ ID NO: 972 ggtacaagcaacaaataaaaaatagataaattggaatttatggaagttaga 

AAATTaGTOCATCCAAAGACATGATCAAAAAOTGAAAACTCAGGACAATGGCAGAAAATATTTG 

caaatcctatgtataagagtctagtatgcaaaatatataaagaacactgaaaacaacaaaaacca 
aacagcacaattcaaaaatgggcaaaggacctaaatagaaattitccaaagaagacctacaaatt 
gcgaagaagcacatgaaaagatgcntgacgttactagtcaccagagaaataaaaatcatgagat 
atgactttatactaaccagaataaaaatgtaagagaataacaagtgttggtgaggctgtggagaa 

ATTGGAACICrrGTACArrGAAACAACACCTCTTACGATAAAAGGAAAAAGTGTGAAAATATGTG 
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TTCCAGGAAAGAAAAAAGCACAGAACAAAGAGGTGAAGAAAAAGACTTTAGAGTCAAAG 



SEQ ID NO: 973 acggggaacaggacaaagaaaaggaacgagaaaaagacagatccaaagag 
atagatoaaaaaagaaagaaggataaaaaatccagaacaccacccanoagttacaatgcatcgc 
gaagatctcgtagttccagcagggaaaggcgtattgaggaggagcaggagttcttccagatcgcc 
aagancatnnnaaaccataananggaaattttctatatctccgtcccccagnagcagaaataa^ 
angattaaaagagaganaaagaangggaccacatnatgaaaa 

seq id no: 974 ggta cl - l - iuc4 i ctgtgcctactcagtaaaattgagaaaataatttrtgccca 

CTTCTATAAAAATATCA(XCCAriTGCTACCTAGCATATGAGAGAGACAAGACTAGCCTGTCACGT 

TTGTCTCTAAAAAGAAAAATGATAGGCCTGGTGGAGAGCAAGCTATATGGAAACATCAGCCTrTC 

CirrrCTAAGATACGTCACTCCAATTCrrCCTTAAATATAAACTTCCTCCTTC 

CrTAGGAGGCACTATAATAACmATGTTCCCATGOAACATAGGGTTTTTCAAGAATTTCTTAAAA 

TAAAAAAGGTTTGAAAGCTATGGAGTAAACTCAGCAGCTGTGATATTCCGTTCAATCTACCACAG 

ATOTCAAACGGTCACATTTTACAQAAGTCAAAGGAAAACAAAACCACAGTTCTGCACTCTTCCAC 

AGGGCAAGGAATTGATCATATTCTATTrrCCTTTCAGGAATTCATAAAATCCC 

SEQ ID NO: 975 GGTACGCGGGGGTGCCCGGTTCATCCAAGGCGCAAGATGGCGCTGCTTTTTG 
CACOTTCTTTGCGCTTGTOCCGCTOGGGAGCCAAACXjATTGOGAGTTGCCrCCACAGAGGCCCAG 
AGAGGCGTNAGTTTCAAACTGGAAGAAAAAACCGCCCACANCAGCCTGGCAOTGTrrAAGAANA 
hnsiATACCNGGTGTCAAATATGGCTTTGGTGGGATTGGANCCCACCAAGGNGGNCTTGAATGTGGA 
GCGCTTCNNGGAGTGGGCATTG>rraCTGGCANACNCAGCGNTCACCNNrrGGNN^ 
AAGTGACAGTGAAGCGCTCCCANCA>rrTCa3GATTTGAGTGGCATANGTGNACATGTTCCNGGAT 
TNNTTCATTONTGTTGANGANCAGATCTTGGGGTGTTCACCT^ATTNCCC^r^ 
AAATGNNCTATTTGCGATTANTTNAAGGGTTAAAAAC?miSITNAAAAGCAATTTGTO . 
NGTTTNTT 

SEQ ID NO: 976 GGTACAOAAGATAAACTTTGCTATTTCCTTTGGCGAATCATCCAGGATACCCT 
TGGACATAAAGTAGTTCACTCCCrCAlXTrGGG'rrGGCArrAAAGGTGAGGCTGCCTTCATCCAGCT 
GCATATACAATmCTAAAAGAAAATCCTAAAGGTGGGTTCTTATTGGATATGGGAACAGTQACCC 
CAAGTGGATTTGCACTUCCCTrGCCAGAGAAGTTCATCATrCGCAAGGTCCTGCCA^ 
AGCCAAGCAAAGGTCAGTTGCATTCAGGTAGGACAAGATGGTAAAGCTTAGCTCAGGAGGCAAC 
ATTTCCAAATrAATGAATCCrrCCraTTCTTTCOATrrCCTrc 

TGCCTCCTTGGACrrGrrTACGATGATTGGTGTTAGAAATGTTGCTCGCAGCCATrCTCCrcCT 
CTCrCTGGGGAAGTAGCCrrGCTCACTGTAGCCTIXnTGTrGC 

SEQ ID NO: 977 ACAGATACGCTGTCCCATACATCAGGATCAAATTATTAGTTTCAGTTTCACAT 
TGTAGGAATTTAAGA'rrrrilUU^'lU'nTAACACAGGAAATAATCTCATCATTTCCAAAGATGTCTT 
CATGTCCCATCAATGACATGCTACCAGACATATCAGATTCCACAGGATAATGGGCNCCAAGCTAC 
CCAAGTAGATGTTTCTGGTATTCTAGACrGCCGTrCATGCrrGrrTCCTAAAGTATACrTAAAAGTr 
TCAAATACAGTTTCACTTANAAACTGCAACCCTCCAAGTAATGTTATGrrTACTTAGGTATTAATG 
TTATGrrTACTTAGGTATGTATCAGAGGCAATAATTTCCAAAGCAGATCTT AGAAT ATAACCAATT 
TGTTAGATAACrAACTrTATCTCTATCACATCTGTTrACAAGCAAAGTATTACmG^ 
TTTCATCTTCAKGGGCTGGGAATCGGGGGCAACAAAACCAG 



SEQ ID NO: 978 GGTACTCGCCCCTTTTGGAAGACGCACGCCCA AGGC ACCCGTGTGTGCACAC 
GTATCCGTGTGTGGACCrrcTGTCCACAOGTGTGTGTTTTCAAAAGCCmCTCACCACATG^^ 
ATCACAAGTGTrCAAAATCTAGGCACACTmCAGAAGTTATITCTGGAAAAAATGTTGAATrCT 
ATAACAGCAATAAAAGGAAAATACCGCCTCAGCCAACTAATANAAAAAANNNAAAAAAAAAANA 
AAAGGTCCTATOTGACTATCATTGATGCCCCANGACACAGAGACTrTATCAAAAACATGATTACA 
GGGACATCTCAGGCTGACTGTGCCTGTCCTGATrGTTGCTGCTGGTGTTTGGNGAATTTGAAACrG 
GTATTCTrCAANAATGGGCAGACCCGANAGCATGCCCTTmGGTn'ACACACCTGGGTNGTGAAA 
CAACTAATTGTCNGTrWrrAAACAAANTNGATITCCANCTGAGCCACCCTACAGCC 

SEQ ID NO: 979 GGTACATCTTTGGCrrGTGAAAACCAAACATCTTTTCrrCTGGGCAATAGTAG 
GCCTGGTCTGAAGTCTTAGGTCAArrAGCTGTCCACTCAGATCrrGTCAACrrCTC 
AGGGTTGAAGCTGTTTGGTGAATCAGAGACAGGGATCTrrGGTGGGAGTCATGGGCCATTTCAGG 
AAGTTGTGCCTTCCAAAAGCAGCCCGTTCCACGCGCAATAATGTCCCCGAGGTGCGGAGTGCACG 
CCAGGCCAGTCCCCCTCAAGCGCTCCCTCCTCTCAACTGCCTTGTCTCCCGCOT 

SEQ ID NO: 980 AcnTNiiTiTn'iTi'i'n'i'i'i'i'rii i I'l 1 1 iagattanaataaaaattta'1'i i vi 

GTAAANAATTATArmGTATTTGCAAAAGCTGAAAATGCTCATAAAAATTACCANCCCANAGCTT 
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GGA^^"CCACCGGATCCACCACG^■GANACAAAAGAGT^rrGTCACT^^^TTT^^ 

GaOCCrmCTANACCTTGGATOTGTmCGAGGGAGCTGATACTCTTCAAGCAATANCCAGCCGA 

GGNGGNGGACCTGGrrrCCCTGGATCTGCACCTGAAGGCTGTCCTTGGCCCCAGGGGCAGGATTG 

ACGGTGGTGCTAGCCTGGCATNGCTGCTGAAGGATGGCAGCCACTGAGTATGGGGTNCAAACCAT 

NGCCTrCAAAGTTNCGGACCACGGTCACCtrrrTArrAAACNCTCrrTTTTQCT^ 

ATTGGACANATTNTTCCTTTrmT^ANAATGGGGCThmTOT 

SEQ ID NO: 98 1 ACGCGGGGAGAAATTAGACACCTTCCCACrGGGGACACAGAGACAGGAAGT 
AGCAAGAAGGGAGATGCCAA.GTGACAATCACCAGGAAGATGCCTCrCTCTAGTGACCTGGGTAGT 
TTGCACGGrrrGGCTGGAAACCACAGTCCCCCCATCTCTGCCAGAACCCCrCCATGTGGGCCACTG 
TCCTCAGACAGCTCCTGGAGCTnLQTGGATAAGCACTGGAATOGCTCCX}GCTCCCTCCTCCTC^ 
AGAAGrrTCTCGGAAAGTTTGAAGCAAAAACTGGTCAGAGTGCrGGAGGAAAACCTCATTTTGTC 
AGAAAAAATTCAACAGTrGGAGGAAGGTGCTTGCCATCTCAATrGTGAGTGGGGCAACAGTCACA 
TACTTATGATGATCTTCTGCACAAAAACCAACAGCTGACCATGCAGGTGGCTrGCCTGAACCAGG 
AGCTTGOCCACTTGAAAAAGCTGGAGAANACAGTTGCCArrCTCCATGAAGTCAGAAAT 

SEQ ID NO: 982 GGTA CriU - l - iU44 - lU - li ri J - rrrri ' l lTr GAATGTCCCAGGCTGCCTTCCTGTCC 
TCCCATCCTGCATCCTGCTTrCTGTTCCCTGGATACCGTAGGATGGTTTTATrTCAGTTCATGCACA 
AATTAGTCTGGACACTGTGGAGTCATAACAAGAGTGGGATGGAGGTTCCAAGGCCAATCATTTTC 
TTTGGAGGGAAGCTTGAGAAGTAGGTTAGAGCAATGCCCACAAAAGCCACTGCTTCTCCTTCCCAT 
CCCCAGCAATGAGCTAAGAGCCAACCTCAGOCAGAGAGAGGGTCTTTGATCACAGAGGGTCAGTC 
AGTGGGATGTGATTAGTOGGTTTGGOGATGGAGTCCCAGGTAAACCAGAGGGCAAAGGGATCCTC 
TGAGTCCTGGCTTAAGGGAATGGGGTTTCAAGTTGANGGTTGGGGACTrAAGGGCAGGGGGANCA 
NCAAGGTGAGGAAAACANAAAGGCTGGCATGGCGAAGCATCCTTGGC 

SEQ ID NO: 983 ACTrCCTCTAACATAGGTGGTCAGCATACTCTTTCACTCCTTATCCAATTTTAT 
CAAAAATAGAGATTAGGGCAATATGACATAACAGCATTAAGCACTAAACCATTTGACAGGGATTC 
CTTTCAAAAGTGAAAACCAGTGTACC 

SEQ ID NO: 984 A Crrn ' l - i - l ' i ' in ' n ' iTn - i 'l' l ' U i 'l i G CATATTAAAAAAATTOTGCATTCCAATA 
ATTAAAATCATTTGAACAAAAAAAAAATGGCACTCTGATTAAACTGCATT ACAG CCTGCAGGACA 
CCrrGGGCCAGCTTGGTmACTCTANATn'CACTGTCGTCCCACCCCACTT^^ 
TTNOTCACCAACATGCAAGTTCTTTCCTTCCCTGCCAGCCAGATAGATAGACAGATGGGAAAGGC 
AGGCGCGGCCrTCGTTGTCAGTAGTTCrrrGATGTGAAAGGGGCAGCACAGTCATTTAAACTTGAT 
CCAACCTCmGCATCTTACAAAGTTAAACAGCTAAAAGAAGTAAAATAAGAAGGCAATGCTTGT 
GGAATGTCCCCGGGGAAAAAGGATTCTGGATGAAATTGAAGAACATACATCAAAATCTAl'CCTrA 
CCTGATGCCGAATCAOATGAAGATGAAGAnTTAAAGACCG 

SEQ ID NO: 985 GCGTGGCGCGGCCXjAGGTACGCGGGGAGGAAGGAAATTGACGAACACGTOA 

cgcggtcgggcggaccactgcagactgagcggtggaccgaattgggaccgctggcttataagcga 

tcatgtttctccagtattacctcaacgagcaggoagatcgagtctatacgctgaanaaatttgacc 

cgatgggacaacagacctgctcagcccatcctgctcggttctccccagatgacaaatactctcgac 

accgaatcaccatcaagaaacgcttcaaggtgctcatgacccancaaccgcnccctgtcctctga 

gggtcccrtaaactgatgtctmctgccacctgttacccctcggagactccgtaaccaaactctit 

ggactgtgagccctgatgcctitrraccagccatactctttgggcatncaagtc rctctggcgaa t 

GATTATCrrGTGTGAAGCAATCATGGTGGCATCCCCATAAAGGGACACATTTGACrri'ri r I'l l NAT 

ATmAAATTNCTTCNCGAATNTTAAAGATAAATGTTTCrCNGNAmANCCTNCCN™ 

NTTCCTCCCCGGGGGCCTTTAA 

SEQ ID NO: 986 GTTCCTGAATGCTAAACTGCCTGGCTCCCAGCTTTTTCATTAAACl-i-i i CAQG 
GTCTTGGTTTCTITATCTGTAAAATGACAGAGTTGGACCAGTTAACTTrAATGGCCATCCT^ 
CCACACAAGTrGATAAAArnATCTGTTCAGCAAAGAGATTGAACAAAAAAGCACGTTAGTAATA 
TGAAGACAGGAAAACGAATGAAAGTCTAACACAlAACTCATATrcATTTACTTrATrTCTGrrAG^ 
TTITACACTCTGAAAATTTCACCTCArn'AGrrrGTACArrATAGCAAAGTGGTATrr^^ 
ATGCAAGGTTAGTTCAACATCTGACAATCAACCAATAGATTACCAACATAATAAAAACAATAAAA 
GTAGAATAGAATAAATATTAATAGAATAAAAAAATCACAGGATC ATCTC AATAGATGCAGAAAA 
AATATTTGACAAAATCAAATACCCCTGCTIXrrAAAAAGAAAAATATTTTNGGA^ 
ATACCTTGATAAAGGGCTTTATTTAAAAAAAATCCAGCTAACCGTATACrrATrGGTAAAGACrGG 
AAAGCrmTNCAAOTT 

SEQ ID NO; 987 tgtacgcggggacagacacttgtaaggaggagagaagtcagcctggcagao 

AGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGCTTCAGCCTGAAGACAAGGGAG 
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CAGTCCCTGAAGACCKnTCTACTGAGAGGTCTGCCATGGCCTCTCTrGGCCTCCAACTrGTGGGCT 

ACATCCTAGGTCTrCTGGGGCrriTGGGCACACTGGTTGCCATGCTGCTCCCCAGCTGGAAAACA^ 

GTTCTTATGTCGCrrGCCAGCATTGTGACAGCAGTTGGCTTCTCCAAGGGCCTCTGGATGGAATGTG 

CCACACACAGCACAGGCATCACX:CAGTATGACATCTATAGCACCCnTGGGCCTGCCCGCTAACAT 

CCAGGCTGOCCAGGCCATGAATGGTGACATCCAGTGCAATCrCCTCCCTGGNCTGCATTATCTCrG 

TGGTGGGCATGAAGAATGCACAGTCmTGCCAGGAATNCCGAACCAAANACAGAATGGCGGTAG 

CANGGTGGAGTCTTTrrATTCCrrGGAGGCCTCCTGGGATrAATCCrGTGCNGGAAT^ 

ANCTCNNGGACTTTA 

SEQ ID NO: 988 GGTACGTTTTTTCCCCATATAGGACATACTTCGAAGTAACAAAATAAI"l'ri"l"r 
TAATCXjACTGTAGTGTTTCTTCTTCC3CGTTAGAAGTAGTTTCITTT^ 

GGCAATGGTTCCCAAATTATITATGAGATCAGCITnKjTCATGCCAATGCCTGTGTCTACCAAAGT 

CAGGGTACGCGGGGArrCTTTCATAAAAGATAATTATGGACTAAATCAGGATCrAOAATCAGAGT 

CAGlTAATCCTATTTrATCCCCTAATCAATTmAAAAGATAACATGGCATATATGTGCACATCTCA 

GCAAACATGTAAAGTACCAAATACAATCrGTAAATAATGGGCCATTTCTTCCCAAGCCCACAGCC 

AAGATGAAGTGTTAGGCTAAATTCAGAGCCCTGGCTCTTNCrcAGATGAAGTGGAAAACCTGCGC 

ATGACANGGCCGGTTCCariTCTCAAAGGKrririTCCCTCATC^ 

GGGTCACGGCCCNANAAAGCNCAACATGGGTTCGAANTCCAAAAAAAOCCGTTGGTGGGCCCAA 
NAAAAANA 

SEQ ID NO: 989 GGTACTGGCTTGAACAAAAmGTnTGTGTGTrAGAGT TATAA ATC ATTAA T 
CTTTATrTCGGGTXlGTTTACCTTTATGCCAGTTCCmATATTTAAAT^^ 
AATGTCTrrATAGAl'l'IClM'i'AAATTTCCTTATAGAACCATTAATAGAAAATC ATrA CATTTAA 
ATACCITACAGCAAAAGCATCCAAATAAOTATAGGGTTTATGTCCTrATTIT^ 
ACXjAATGAACACAGTGGTGGAATTTCTGAAGGGAAGTGATGAAATTATATTTArnrCAGTGGGCA 
CTTITCCATrrrACrACTGTACAAGATGTCCAAATATrGCGAAGATCTATTTC 
AAACAAGCACTTGAATCACATCCACTTGAACCAGGCAGGGCTITGCCATCCCXXZAATGACCTC 
AAAGAAAAATACTCATAAAAAACAAGCNGCTGAAACCTGAAGTTGAAAAAAAACAGCTGGAAGC 
TTTGAGAACATGATGGAACTGGAGAATTGCCTCCCACAACATTTAAAGGCGATATGAAAAGAGAT 
CGAAG 

SEQ ID NO: 990 GGTACAAAGrrGTCCAGTCITAGGTGCTGAGGCCAGTATGGGTGCAAGGGGC 
CCX}COTTGGTAGTCATGTCTTTGTGGGCT0ATGGCTGCGTGTGTATAGGCAGGAAGTTAAAAAA^ 
AAAAAAAAAAAGGAACAGAAAAACCAAGGCGCGCGACGGCAGCGGC CTCCCACG CTGGCTCCGG 
GCTCTCTTTAGGCTGGGTCTCGCCCCCGCGTACTTTAKrrri'llUU-llU'rrmTT mCCA TAT^^ 
TCAGCAGCTT^CAT^^'GGANAAGCCGmCCAGGATATTT^mTNTGATGAGCI^^ 
TACT>m'CCATGTCGGCrrCANTCnTACTTTITTCCCATNCT^ 

ACACTGrrAGTAGATACACTGCATAATCGNAATmGCCTCCTTAAAGGGGCACCl UUUUl'l TnTG 

ATCGNTAACCCCATTTGGGGNCTrGCAGTATCGCTCrCCCANCAAGrrAAAANAAA'rri-llXi-r 

GNNGGNACATTTTAAATTAAACNGGGNAAGCTTTrCTNTTA 

SEQ ID NO : 99 1 GGTACTCAGAGCTGTGGAGTAGGGGTTATGAGCTCCAGTArnTCAGCCCAG 
CAGCCACATCCATAAAGAGCAGCCTGCCCAACrCTCCCCGGATGrrrCAAGGCCAAGCCTCCACT 
GGAGACAGCAGCAGCAACATTCCCTTCGTGGTCCACAACCACAGCGCCTACCGTGTCCAAAGTGC 
CroAGTCATmCCTTCTCACTrGATTGNCrrcrTrrcrrTAG 

TCTGCCAGCTCTAGTTTCCTCTTOTTTCTmAAATGCAGCTAAACTGAATCTTGTGGTCATGATC 

TAGGAGGGCAAAAGGGGTATTCCATGATCTACrOCCCATCTGTAGGCnrCTTCTCCAACTAAAAAG 

CAGGGGAGGAATTTTGCCAGCCXGAGAGCTTGCCCnTrGCCCITCACATAAAAAGTCTG TTC 

AACCCGANACrrGGGTnTTGATTTCCCr(>GTGGCTCCAACTGGTTCCNAAAT^ 

CTTCOTATGCWGNGTTACACCTCAATmACCTTACAANATTTAAA 

SEQ ID NO: 992 acaactcitgctaatggaatoctataatgcacaaggtcaaggatttaataaa 

TTCTAAAAtKGTCTACATATATCAGTGATAACTGTATTATTAGAAATATAAATGTATAGAAATATA 

AAGTATATGGTATTAAAAACAGACCTTGCTAATATAAACATATATAAAGTATGTCACTTCrCCTGT 

AATAACAGCATAAAGATCGATCrACAGmQCCCrrCGCCTGGCACrcrrAAACCACTCCTCCAAT 

GGTCAATGTTGACCTTGAATCAACAGCCOCTGAACCCAGGAGACCCCACAGATGTGTAGArrCAG 

CACCTAGAGGGCCC<XCTACrCTCTGTGCTGTGTGTTCCCATGACTCCAGAAATAArrAATCGCAA 

CTTGCATTATTAAAGTCXrCAGGCAAGTTTGAAATCTAACTAGAAAA AAGTA NCAGCCAANGCAAA 

ATACCGCNGGAATTTGGTAGAAAAACCACCAAGAATTCTTAAAATGCTTTCC^ 

TTAAGGGGAANTNAGNNCCNCCCCXmrCC 
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SEQ ID NO: 993 GGTA Cn ' n - ITJTi I ' l l 1 J 1 1 1 1 1 IN GAGOnTCAACTTAACATrTATTOCACAA 
CrCACirCAGGCCACTGCTGACATCCCAAAACACAGCCTAGTAACACACAACTTCACCrrCAAGG 
ACTGCAGAOATAGACCAACTATATATAAATAATCTCGTGGAAACGTAATTTGGGTGAGAAAAATA 
TGACXjCTAGAGACTITGCXiATGAATATrrTGCCTAAGTTCAGAAGOT 
CAATCTNAAATrTCACCCAATAACCGCGGCCCTTAGCACCACACCITCCATCATCT n^ 
CTGrrcriXK:CCTTTACAGm'GCAGCTCTCTCAACrrCCCTTGCCTCT^ 
CCTACAACTITGNGGCGGGACCAriTrAANAAAAGCCATTTCAA AGGCGTT NA^ 
NCCAAGGCATTTACTCAAAAAATTTATTCCAACAGGGATCnTCAATTTTITAAA 
TTITGCATAAAATTTAAACCAAAGCnTAANATNNGGNAGTNGNAAAT TTAACT ^ 
TAAACCITAAATTCmTITNAAAAANTTrATTACAGGGAGTAC^ 
CTGNGGNA 

SEQ ID NO: 994 ACNAGCATGTTTTTATAGAACAATGTGCTCACTTTGAGAAATGAGAAACATG 
AATTGCAAAAOAATATCrCATATCCTCCATGmCAATCTGAAGCAGCACACAAGGCTACrrATTT 
TCrCCATTAATGCCCAGATAGAAATTAAATAACCTTCTGAACCAAAGTGGGGAAATCCAAAACTG 
TTGAGATGTrrGTCATATTATGATAGAACAAGGTAACTCTGGTAGGCATTTGGAATm/ACAGTAN 
GAGAGTGATGACTATACOTAAATGGTAAGTCAATAAATTATGNTGGCTGAGTATCACTTATCTAA 
AAACTGCTTATAACCTGGTAGAAAAGTaGCNArrrATGTAANATTrAAAGAAATGTGTTNAAA^ 
CAGGAATGCAACACCTGAAAGTCTGNTTGCCTNACTACCATTrmAAAGGGCCTTCCTTTCC^ 
TCACCCTTAATGCTCAANAAGANGCNCATTTTAAAAACTATGCTGGGTGATGTAAAAACACNCGA 
AAAAAACCirrCCCTGGTTAAAATTGCATTAAGNTTCTO^ 
ACCACTTAAGAAATAAATGCCTCTNnTTCCAAANAAAAAANCCAhrmCTT^^ 
NCCTITTTAACTTANNCGANGNANCTITnT 

SEQ ID NO: 995 ACGCGGGGGCCAGGGACTCGGGTGCCTGGG GCAGA CGAGGCCGGCTTCTCC 
GCGOACAGCTAGGGAGAGTGTCCTGGGTGTCAGCCAGAACATGTCTTTCAACC TGCAA TCATCAA 
AGAAACTGTTCATTTTCTTAGGAAAATCACTGTTTAGTCTTCTGGAGGCTACGAT^^ 
CCCAAAGCCACGGAAGAACGTTGCTGGTGAAATAGTCCTCATCACAGGTGCTGGAAGTGGACrCG 
GAAGGCTCTTAGCCTTGCAGTITGCCCGGCrGGGATCTGTTCTTGTTCTCTGG^ 
AGGGGAATGAGGAAACATGTAAGATGGCTTCGGGAAGCTGGAGCCACAAAAAGTGCACGCCTAT 
CCTGCGATTGCACCAAAAAGGAAGGAGTGTATAGAAGTAGCCGACCAGGTTAAA AAAA GAAATC 
CGGCGATGTITNCATTCTAATCAACAATrGCCGGAATCGTAACANGCAAAAAGTNCTTTGACTGNC 
CANATGAACrrATTGNAAAGTCTITGATGTGAATrrCAAAANCCAATTTATGGACOTATAA^ 
TCTANCTGKTATTATTGCTAATGACCATGGCCTTTTGGTTGCTTTrCA^ 
TAAATGGGTNGCAAATATTGGCANGNAATTGC 

SEQ ID NO: 996 ACGCGGGGTTGGCAACGAGGGACTCGGCCTCGOAGGCGACCCAGACCACAC 
AGACACTGGGTCAAGGAGTAAGCAGAGGATAAACAACTGGAAGGAGAGCAAGCACAAAGTCATC 
ATGGCTrCAGCGTCTGCTCGTGGAAACCAAGATAAAGATGCCCATrTTCCACCACX;AAGCAAGCA 
GAGCCTGTTGTrrTGTCCAAAATCAAAACTCCACATCCAC AGAGCA GAGATCTCAAAGATTATGC 
GAGAATGTCAGGATGAAAGTTTNTGGAAGAGAGCTCTGCCTTTITCTCTTGTAA^^ 
CCCAGGGANTAGTCTACCAAGGrrATrrGGCAGCrAArrCTAGATTTNGATCATTGCCCAAAGTGN 
ACTTGCTGGTCTCTmGGAATTTGGCCTTGGAAA GGTAT CATACATANGAAGTATGCCAGAAGTAA 
ATCCATITriTrGAANATCACriTNCGNGGGGCTTGTTITGGTCCACAGCATA^ 
CTTANCTGGGANGGATGCAAAATTAAGCATGGATTTAGTGAGAANGGANCrrTNACCnTAAN^ 
OTAAATCGQGTTTGGArrrcAAAGTmAAACTrGAATmATCCTGGCCGGACNC^^ 
TNNACNTrCGCGGNGTANTANGNGT 

SEQ ID NO: 997 ACTAGGTGCTGCAATGCAAAGGGTrATGACAAAACTGTCTGTAAATGTAGGA 
TCTGAATTGGTCrrTGATAGTmCCrGATTTGAGAAAGAAGTCTCTArrrGGCT 
TAAGAGAATGGACAAGGTCTGCTTATGTCCTTCTGGCAGCCTAGCATAGCrrrrGCCTCCCTCAAA 
TCAGTTATAAGTCAAAACAAATAAGGCACATTTTTTAAAAAAATTCCCCCCTTTAATTGACC>^ 
TAAAGCCATGACATTTCATTTGGTAACCTGTrrAGAATTATAAAAATCATTTCATTTGGCCACCCAT 
ACTGCCCAAGACAAAACTTNCAGACAATTCTGATGCCATCCAAGTTrGNTCTTACAAACTGCATAT 
TAAAAAAAAAAAAAAAAAATCTTCACC^^^CTAAATGTOATGTGCTCAAONGCGAACCTATTAAAN 
CTGGACGTCNATTTATTGTNCNTGACAAGCCCTCXjAlsrrATTCAAGGCCITr 
GCCANNCCATGGTC^TNGGOATATGCTTTCr^CCAGGAAATTAAACCTCAANGC^m 
TAACA>nTmGGCACX:GGCWAAANCTGGAGGAC>ICTrGGmTrrGGTAAAAC^ 
CGNGAACNCCTTANGGNGAATTTAAACACTGGGGGG 

SEQ ID NO: 998 GCGTGGNCQCGGCGAGGTACTTCNCAAGCAAGCCCCTATGA1TTGTCACTAT 
AGATGGAACCCTGACTTCTGCCCCATCCCTrCCTGCCCAACCrrAGAACCXAGGCCTCAAGTCm 
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CCCCACCCCTTTCnTGTTCTTCCAAGAAGCAGATGCCCAGTTGCTCAGGCAGCAGCCKjTAGAGACT 

TGAATCTGCCCACCAGTCACAAOGOjGGTCACAGATTCCTCITCCTCTCTTCTCCTCGTO 

ACCCTCCACCAATGTGCCTCAGCCTGTGTGCTGTGTGGCAACAGCATTCrGGrrCCCACTGC 

ATCrCCCACCACrCTGCTGGGATCTGCAGNGGCAANGGAGTGGGGGTTGTGTAAAGGGGAAGTCA 

TCTmGAGATCCAGATAGACATGGGTTGNGNACTTTACGTCCAOATGGGGAAGCATTCNTTCCTG 

NAACCCTTAAAATAATCATGCANCCTim'CAGAa^JGAtGCCATNGGTCCAAGGGCTrAAGGNGGA 

GGAAQCAAANNNGOCAAGCCCrGGCTrNNCONGGANCTCTAACTT^ 

AGAATTGGGGTTTTmTCANGCCTNGGGGGCCAATTGAANTGGGTTCTT^ 

AAACGTTCCAANCCTGAGGAAATTCAGGGGCCNCCrGGTCTGANGGGGNNCTAC 

SEQ ID NO; 999 ACNTGGCrrGCAGrrTOTCAAAAAACATTGACATCACAGAACACATAGATTT 
TGCCACCCCTATACAGCAGCCAGCAATGGAGCCIXnTrGCAATGGCAATCTCCCCACGAGTATGCA 
TACCCTGOACCACTTGCATGGGGnTCCAACCGAGCCAGCCTGCACTACACAGGGGAGAGTCAGT 
TAACAGAGGTATTACAAAATCTCGGCAAAGACCAATATCCACAACAGTCGCTTGAACAGATTGGC 
ACCCGAATTGCCAAAGTTrTGGAAAAGAACCAGACGTCCTGGGTCCTCTCCAGCATGGCAGCCCT 
NTAKTGGAGGGTGAAAGGCCAAGGAAAGAAGCAATCGACTGCCTCCGCCAQQCTCTGCACrATOC 
GCNACACCAGATGAAGGATGTOCCCCTGATTAGCCTGGCCACATNTTGGACAAI'GCNCAACrNTO 
GAATGACGCCGNCrTAGTAACCACCATGGCAQTAGANANTCGCACCACACnTGNTGGGAACCAC 
TTTACTCTGGGCAATNGCTACGTGGCAATGGAAGAATTGNAAAA>n^CTNGGGTGGGATGAAATC 
ACKITGAACnTrAGCCGNGTTGCCCACCAAAACCGATNCAACCriTCAGGNAOT 

GGACGGGcrnmAAGGACTTTicrrrrnTT 

SEQ ID NO: 1000 ACTATTTAATATATITCTCCATGAACTTTrrGTGAAATrCAGATCGCAGTGTGT 
CATTTACAAATCTTrTGTCTTTCTTCTGGTCATCTACACCTTTTGC 

ATCATCCCACCTTCTTTTAACTTTGAAGTTGGCCTGAGGCTGGGATG GGCCA GTGAGATTAAGGAG 

AGGGTTTCCGCTCAGAATGTTTTCCATAajAATCCTCTCTTCTTCAGCrrm 

TGGCCTOCTCrrCAGCTCTITCTrrTn'AATTTTT^ 

ATCATCACmCTl'CTTCAAAATCrrTCATOTCCrrCATCTGGTAAGAGGGGTC^ 

TTGGCGGCANGAATCrGGNCTTACCCGTGGCTTTTTTGA CACTrGAA GAAGAAG GTGN ATGGTCTC 

NGGGTNGACNANTCCTAl'l I'l l1'lCm■GGCAGAAC^^^'ll"l'rrl■iU'l^^TCCACT^mTTCTGAAGC 

ACGGTACCAACCTTTTNAGGGCCTrCCGANGAAGCCGGCTGGATTTATCTTTGGTGAA/^ 

GGTITIXriTGAm'CTGmTraAAAGTGGTTAAAAACNTTTCC^ 

GTGGCCNGTTNNNTGT 

SEQ ID NO: 1 001 ACGCGGGGGATCAAGmAAATGACTGTGCTGCCCCTTTCACATCAAAGAAC 
TACTGACAACGAAGGCCGCGCCTGCCmCCCATCTGTCTATCTATCTGGCTGGCAGGGAAGGAAA 
GAACTTGCATGTTGGTGAAGGAAGAAGTGGGGTGGAAGAAGTGGGGTGGGACGACAGTGAAATC 
TAGAGTAAAACCAAGCTGGCCCAAGGTGTCCTGCAGGCTGTAATGCAOTTTAATCAQAGTGCCAT 
TnriTrrrmGTTCAAATGATTITAATTATTGGAATGCACAArn^^ 
TTTAAAAACTTAAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCTNGGNCGCGA 
CGAATTCAACACACTGGCNGGCCGTACTAANGGATCCNAACTNGGACCCAACTTGGNGTAATNAT 
NGGCATAN>mjGTTTCTGNGNGNAAATGNTATCCCTTACKATrrcNCACAACATACC^ 
CCOTAANGGTNAAACCTTGGGGGCTAATGAAGNGAmACTCNATTAATTGGGTOGCCTAC^ 
CCTTTCAAACNGGNAAACTGCNGGCCACTTGArTAAGAATCGCCACCCCNGGGNAGNGGGTGGGN 
TNGGGCTTTTCNTrCTT 

SEQ ID NO: 1002 CGCGGCCGAGGTACGTTGTTTrrGTTnTGTAri'lTl-rri'Cri 1 IG AAAG GGTT 
TGTTAATITITCTAATTTTACCAAAGriTGCAGCCrATACCTC AATAAAA CAGGGATATmAA^ 
ACATACCTGCAGACAAACTGGAGCAATGTTATITrrAAAGGGTTTTm 

TrATTAATGTATTAGGGAAGAATGAGACAArmGTGTAGGCrrriTCTAAAGTCCAGTACAAACG 

AGTCCTGGCCTTGTCTGTGGAGACGGATTACACCTTCCCACTrGCTGAAAAGGTCAAGGCCTTCTr 

GGCTGATCCATCTGCCrrrGTGGCTGCTGCCCCTGTGGCTGCTGCCACCACAGCTGCTCCTGCTGCr 

GCTGCAGCCCCAGCTAAAGTTGAAACCAANGAAAAGTCNGAGGAGTCNGACCAAGGATATGGGA 

TTTGGGCTCTTTGACTAATCCCCAAAAACCACCCACrrAACCNGGTTTATTTGCNAAAC/^ 

ANAAAGCTTACTTNTTTAAAAANCCAAAAAAAAAAAAAAAAuAAAAAAAAAGTNCTTO 

GCCCTTCAAAAGGNA 

SEQ ID NO: 1003 ACTGACACATCX:AAAGCATGAGTGTGTCAGAAATCCCTTGTCTA'rrCCTGTCT 
GTATAAAGTGTTrCATTATGACCAGATCTCTGATTGTATGGTCACTAGGTATGCAA TCACG CATrC 
AAAQAGGCTCTITACACXATCACTGTGATrGCTCrGAGAGrrGAGGGACTATTGGGCTTTAm 
ACAAACCAAACTTTrAGCCTGAAACCAACTITATGCCACrAAGTCATAGCCTCAGTTGTCCCAGTT 
ArnGTCXn'OCTGAAAATGCCTGAAACATCAGACAGACATrGCTTGCTTTACCCAAACTGATCAAA 
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ATCTTTAGGAGCACAAATGAATrrmAGTCTGAAATACCAAATAATGAATTGGTATAC 
GGAATCACACATOTTATCTTAAACCCACCATCATACCTAAGCriTrGOCCAAACCTCrrAT/^ 
TATCTAGCTGAACn'ATTTGGCATTrCAATGGGGAACAGGTTCTAAACCTAAAAAGGGGNCANGN 
TTGTTTTAAANAAATrrN'rilUCl'rrANGGCCCTNGGAChrrmA^ 

SEQ ID NO: 1004 acgcgggggtgcgccggtggcgggactctggggaaaatggctgcgtcttcga 

GTGGTGAGAAGGAGAAGGAGCGGCTGGGAGGCGGTTTGGGAGTGGCGGGTGGTAACAGCACACG 

AGAGCGGCTGCTGTCTGCGCTTGAGGACTrGGAGGTCCCGTCTAGGGAACrTATAGAAATGCTGG 

CAATTTCAAGAuAACCAAAAGTTGTTACAGGCTGGAGAGGAAAACCAGGTCCTGGAGTTGTTAATT 

CACCGAGATGGGGAATTTCAAGAACTAATGAAATTGGCACTTAATCAGGGAAAAATTCATCATGA 

AATGCAAGTTTTAGAAAAAGAAGTAGAGAAGAGAGACAGTGATATTCAACAGCTCAAAAACAGC 

TAAAGGAAACANAACAAATACTGGCACAGCTGNTTACNAAGCGAA GGAG AAACTCAAGTCCAAT 

AGAAAAAGCCAGAAAAGGGGCTm>ITCCTm"GAAGAAATAATrAANTITGC^ 

GCCAAGGAAATGCTGGATTGNGCCTCCACTGACCTGGGGTTCCANGGGANC^CCGGAAACCCTT 

NCCACTTGAATTN 

SEQ ID NO: 1005 OOTACACAAGCCATCAATTACCACTCTCTGTCCTGGCCTCCATGAAAATCCAA 
TTGGTCTTAAAGCCTGTCTAAGCAGAGAGCAAGCAGACCACATCATCGTGTTTCAACTTGAAATTC 
ACCCCCATCCCAGCACCCACACCCTTCCCCACACACACACACACACACACACCAAAAAAAAAAAA 
ACAACAA i ri - 1 Vn - 1 V ' l i 1 1 AAGAAGAAATGGCCATTAAGGTCTGGGCCAGTTGTAAAGGGTTGGN 
TTAATCCTGCAGCC^VGCACTTTTAAAATAGCTACTGGCTTCACAGCAGGCCAGTTGGAACI^^ 
AGCATGACAGGTAGAGAACAQAGGATOTGCTAAGAACCTGGCCTTGTTGTGGGCCTCGAGGCCTC 
CATGCATGCTTACACACCAACATAAGTATACTTrCCCCTTCCTCCTCCTAAGCAArrGGTCTNC^ 
GCCTTTAAAOCCrrGNACACTCAACTTTITNAATGGTATTITANAGATGGACAAA 
CTATGCAAGANAGCA lTl ' l i 1 1 IN TAAAAACTAAGGNCNAAAGNAAANANANGGCCTTTGT 

SEQ ID NO: 1006 ACGCGGGGGTGCGCCGGTGGCGGGACTCTGGGGAAAATGGCTGCGTCTTCGA 
GTGGTGAGAAGGAGAAGGAGCGGCTGGGAGGCGGTTTGGGAGTGGCGGGTGGTAACAGCACACG 
AGAGCGGCTGCTGTCTGCGCITGAGGACTrGGAGGTCCCGTCTAGGGAACTrATAGAAATGCTGG 
CAATTTCAAGAAACCAAAAGTTGTTACAGGCTGGAGAGGAAAACCAGGTCCTGGAGTTGT^ 
CACCGAGATGGGGAATTTCAAGAACTAATGAAATTGGCACTTAATCAGGGAAAAATTCATCATGA 
AATGCAAGTTTTAGAAAAAGAAGTAGAGAAGAGAGACAGTGATATTCAGCAGCTACAAAAACAG 
CTAAAOGAAGCAAGAACAAATCTGGCAACAGCTTGTTTACCAAGCGAAG GAQA AACTCAAGTCA 
ATTAGAAAAAGCAAGAAAANGGGCTATTTTCTOTGAAAAAATAATmAAGT^ 
AGNGCAAGAATGCTTGATTGTGCTTCACTGACCTTCGGTNCANGGGACNNCCGGANACCCT^ 
CAACTTANTTAAA 

SEQ ID NO: 1 007 acatccggcgagtagctggcxsgtcccgggtgctgctggttagtgtgctctga 

GGOAGGGTCCGAGCCAGCCNGCTGTnTGCCGGAGGAGCCCCTCAGGCCGTAGTAAGCArTAATA 
ATGTCrrrCATCmXjAGTGGATCTACAATGGOTCAGCAGTGTGCrCCAGrrCCTAGGACTGTATT 
NNT^OTT^^T N ^ i ^ l ^ ^i ^ I ^ l ^ ^l ^ I ^ i ITl 1 j " I CGTANAGATGGGGTTTCACN ATGTT GGCTAGG ATGGTCTCG 
ATCTCrGGTCANAGTCrrTTr^GTAAATATCCTTGGAAANAAGCA^ 

AATGCTTTAAGGAAAAAACAAAACAACTGCAAGTCTTCTGAAATGAAAAAACTCACCAGGGCT 

riTTNAAAACAA<XCCAACCAGCACTNNAATTATOATGCCCACAGGGGCCCCACTGANA^ 

GAAAAAGTTNC^L^TCNAAAACTTGGGATGCTCTTGACTATGGAAATATTGCNGCCCGANCCCAA 

GTAAANACCAAACAAGCm-AGGNCCCGTANTATITGGGGGGATrrrGGCAANAAAAAAAC^ 

NGGGTGhrrTNGGGATTCCATTGATCCCCCAAAATCTTCCGGGATGGGTAAAAGCCCANGGCCNGA 

AAAGTTANGGTCCTCCCAAAGGAAAAAAATTTGGGGGGAANATTGGG 

SEQ ID NO: 1008 GCGTGGGNCGCGGGCCGAGGTACTACrmCIT^ACrrnTCTGGTTAGCCAGA 
ATGTTCCATTAAGAAACAATAAAAGTrGTATAGTTCTCTAAGATGAAAGATTAGTATATTCAATGG 
CTArrATATTAACCAOTAGTGAACATACAACAAAAACTATCCTTATATTAATTGACTGAAGTTAT 
AACATAAGAAATAAGTTACACTACTACTTTGTCATTCACTTAATACTTACAAGATTACTCAAGi^ 
TCAAAATGGCrrCrCATrcCTTGACGTTTGTTGTCCAAATACITCACATTTCAATCT 
CAAAGAGGAAAAAATCCAAACATACmcarrACCTAAAAATATTAAAGAAGGCTAAAAGGCATT 
AGGAATrnTTTAAACCTTGAAAAACAGNGGTCACITCCAACATGTATTTAAAGTTC 
TTCTATOTGNGGCATAACrrCAATITAAAGTCATCATAAGTTATATTAAATGGGTCTTGGCTAGCT 
GGTGNCTATTrCCCAAAAGAGANTAAAGGGAGAACCGAGGNGGGCCCTCCTGTGAACNATGGTG 
GNAAAGGATGAGGTGAKGGGATCTCANCCCGGGGGNGAACCAGTAAAArnTNriTTAGGCCNGC 
NCNGGGNTCACNCCGGGNAATCCACCCTTTGGANGCCCNOGGGTGACNCTGNGGCOGAGTCANA 
CNC 
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SEQ ID NO: 1009 ACACTCTCATAGAGATAGAGAAGATCTAAAAAGTTGAGACTACTC AATCCAG 
TTAACAACAGCAGGAGCACTAGAGTTTGTTCATTTATTCrCTCTGTAAAACVAGCTGTGCl-l 1 1 1 f 
CTTCTGCCTTTAAAATGCCACOCGTGTATrcAAACCATGGCCACTTGATACrrATG^AGAATO 
GTGGGCTGATGCAAGCCCTTTAmAGGCnTAGTGTTGTGGGCACCAATGTCGAGC^ 
CTrGTGCTGTATGATTCTCACTGAAGAArrrCCrrrCAGCCAAGAAGCAGTGAGGTCTGGGA^^ 
TCAAAGTCATGTCTCTGAATATOTGTCCTTGACATGCAAGCTTTGCAAAACCNCATCCCGCTTAAG 
NGCGAGGCATNACCTTCTCCAAGTGGTTAAGTTCTTTTAACCCCCAAGATCATTCrrC GGNGAT ^ 
TATAAGTTTCATTCTACTTAAGGGATTGG>n'ANAAAACCAAGAAAGAACCCATTTAAA'in"ll"l'A 
TTrTGGAAATTTTArn'ATATGGATACTrAAAAGANGATTTTAAACCGGCGANCCTTAATTGG 
ACNGGGANGGACTGGATrrCTGGT ACAC TGGGGGGAA NAAAA ACCTCTTTAACGGATTTCNCT^ 
TTCCGGGACTTNCTTTTTAAANrrril'riAAAAACGGAAl'l'l'l'l'GG 

SEQ ID NO: 1010 CGAGGTACTACTITAGGAACAGTGTrGTAGATCCATTTAGAAAAAAGGAGAA 
TGATGCAGCAGTTAAAATCCAAAGCTGGTTTCGAGGATGTCAAGTTCGGGCATATATCAGGCATrr 
AAACAGGATTGTAACAATTATTCAAAAATGGTGGAGAAGTTTCTTAGGCAQAAAGCAATATCAAC 
TAACTGTGCAGGTAGCATATTATACTATGATGATGAATCTCTACAATGCAATGGCTGTCAGGATTC 
AGAGACGATGGCOAGGCTATAOGGTTCGGAAGT 

SEQ ID NO: 1011 ACAAATGTGCATTAACAAITCAGTGACGTAGCTGTGGATCTCTGGATGGCTAT 

gtaagctgtgagaaagtcccccactggctttgcactrgctgcgcaccagaggtgaccatccaggc 

agtatcaacctcaagagggggtgaatcccagcagctgctcgatoggcttaaaccgccactcgtca 

gcctccagctcttctaccaaaccagctagrntrccatccgagcaacttgctgatcatgtttcacct 

gtttaagcotgqatgccactrgatagctaaaaacagatrc gcag aggaatctctcagagccatctt 

ctaacatctctccagctccntrggggtaggtgaaaaagggcrmccgtggtggggcaaggnct 

GACACTGAAACCAGATCCAGTGACAGAACTrTCACTGCCCCCCACTGAGGAGGAT GATGA TGAAN 

CACC^^^TTTCTIm^AACAGAGTCTTACTTCACATGCCTGGCTGN^ 

TT^T^ITGAGACGGGGTTCCT^^TGTGN(XAGCrrGGNAA^n^lGGGCGA^^ 

CGCrrCCGGATmAACGAAmCTOCTTAAC CrTCC CAGAGCTGGGATANAAGGGTGCCCNCCTG^ 

CTGGTAATlTN>rriTTANAAANAANGGGrri'ClirNGGANGG 

SEQ ID NO: 1012 GGTA C l l 'I' rrn ' r i U -l U l U -i- l l U UU' r i riU GTCAATATTTATrGGCCGCCTATTA 
TGTGCAAGGCACTACACTAGOCGCrGGGGAAGATACAAAQATAAATCTGACAGACTGCCCTCAAA 
GAGCTTACAGTCTAGTATAGGAGCATACAGTCTCTGGAGAAGATATTITAAGTGTAACT AACCT NC 
CCCATCCCACCCCCACAAAAAAAAAGAAAAAACCTACTAGACITGGGrrCTTCCA^ 
GTTANCAGCTTCACGTAAAAAGCATAAATCTGAAAGTCTTTTAAAATGCATACll'l'lACTGGN 
CAAAATTNATrITCAT^AGAAAAAATGCTGAATATTTATTGCAA^^AAGAAAAAT^^ 
CACGGGGGCTCCCACCTGNT^rrcCCTAACAC^T^GGGAGGCCCAAGCNGGTGGATCC^OT 
GCAGGAGTTTGAGACCCCCCTGGCCACATANCAAAAACCCNTNTNTTmTGAAAAACAAAAA^ 
NCCNGQCCNTGGGGGGGGGCX:CCTGOATTCCACTmCC GANGGGTG NNGGGANGGAAATCCTTG 
AACCTTGGGGGGGNGGGTGCGNAAACCNAAAANNCCCCC14'rC i'l''l>fNCC CN GGNGTA NAANAA 
AmTCTirrmAAAAATANAAATTTTAANGGAANAAAAACTCCCCn^ 

SEQ ID NO: 1013 CGAGGTACTCCTTGACAGTrGATAOATTATATATTCrrCCATCCCTCAAACrr 
GCATTCCACTATATTTATrTTITGGCAAAAGATGAGCTGTATTTGTrTGAAATCT^ 
TCAATTGGATQTATCTGTTCAAATTTATrCCCACGTGACGTGGAAGTCCTTCGTTGG ATGTC ACAA 
CACTACAmAAGGTTGGTAAGGATGACTTGGAGGTCCATGGrTTTCATTACCAACATTTTAAGAT 
TCTGAATGTCGATGGAGTCTCACTGAAAGAGTCACCAAAGGNGCCTGCCTCCCCCCTGCT GGGAA 
AGTGNCAAKTGGAGACTGCCCCAAGGNGCTGAAAGAATCAATOGCAGGGGGTCTGGCTGCTrTTC 
ATCTCAmX3TGGGATGGGAGGGGNGGTCATGAACATmGATATACAATCTACTCTTGAAAATGG 
QACCCCAAGGGGTANCCATCACTTTCACTTATAATTNCCAAAAGAAGA CTACCACC NTGTCCTCCC 
AGAATCNAACWAGTTTCCTATTACnrmGGNGGGAANAGAArrACAAGTT^^ 
ACCCTTACAATNCCTAAATCTITnGGAGGGGGGGTICrTTTAm 
TmGimGGrnGTITmAASCTGTSCTSACmGrACmC 

SEQ ID NO: 1 014 CGAGGTACTrrGATrCCGTGCrCCTGGCCTITGGAAACCnXjCTGTTCCTGACG 
GGCCTGTCCCTCATCATTGGCCTGAGGAAGACCTTITGGTTCTrCrrCCAACGGCACAAACT^ 
GGAACCAGCTrCCTCCTGGGGGGTGTGGTTATCGTGCTCCTACGCTGGCCCCTCCTCGGCATGTrC 
CTGGAAACCTACGGATTCirCAGCCTCTTTAAGGGCTrrnrCCTGTCGCCrrCGG^^ 
TGTCTGCAACATCCCCTTCCTGGGTGCGCTGTCCGGAGACTTNAAGGCACTAGCTCGATGGTCT^ 
AAAACANAGATGAGCTCCTTGAACTTGGAlCArrGGTTGAGGGGGCTANGAGGGAGAATGGGGA 
ACCACCCNCTCAGNCCCCTGCACTGACTCNCTCCCGACATAATCCGGACCrNCCAAGTTCCAAAN 
GAAGGAATGAACTGAGCAACTGACGTCAA>rrCCCAAA>TOACITANGANGCTGCCAAGAAAC^ 
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ANATTCCAAACCCAAGGAGACTGGCCTGGGCTGGNAITCACACNCTCACl J ri'l-n iATNGGANGG 
AAAAGNGANANTAATTNCCAATTGGGCGTGGGGAAAAA 

SEQ ED NO: 1 0 1 5 GGTACAGCAGAGACCTTCCTGCTrmACTGGGGACTCCAGATTTTCCCCAAA 
CTTGCTTCrGTTGAGATTTITCCCTCACCTrGCCTCTCAGGCACAATAAATATAGTTATACC^ 
CCATCAAA 

SEQ ID NO: 1016 ACTTTCTGTTCTITGGCACATITTGCCCAGCGaATGCAACTTCT^ 

CAGTTCATATATCreAGGCAGTGATCCATmGTATCAGCCAGTITCCCTTGTTAGGGCCGCT ACCC 

GGCTGGCAGCCAAGAGCAGCACrCTCCCCACCTTCAGCAGAACTTCTCAGCTCATOTGTGTGTT^ 

TAAAAGCAGTAGGCAGCTTCCATCAGCAATGGAAGTGTCCCCCACrCCAGTGGTGGAGGTGCAGT 

CGCTGGACCTGAGQAAGTGGGGAAGCATTACAAACCCTCCTGCAGTAAGAATGGCAGAG'T TGGGA 

AGGATCTTCTCTCGCAGGCTGTTGGCCCAACTCCTCTTTTCTGTGTATGAGGGGCATCTTA^ 

TATGGCGTrrrCAAAGCAGCATATGAGGATTCTTANCATGCAGCTAANTGACAACCCCCATGGCCT 

ATGATGNAATGr^TGCTCr^GGACTAACC^r^CTTGATAAATAGGAGNGNATGTATCT^ANGGTC^TT 

GANAACACT^^T^^TITIT^m^"AAAANGCCT^^■GAAJ^ 

NCNNNGNAANOOANGGGNCTGG 

SEQ ID NO: 1017 GGTACTCAAAGACGAATCATGAAAAAGAAAAAAAACTTTATTTCAAACAGOT 
TCAGTGATATATGTGTGTGCTACAGCAAAGGCTGGTTGTGGCAAAGTTTCATTTCAAACTGTATGA 
TGTGGGCTGGGCAAGGTGGCTCACGCCTGTAATCCCAGCACmOTGAGGCCGAOGTGGGCTGAT 
CACCXTTGAGGTCAGGAGAGACX^GGCCTAGCCAACATGCTGAAACCCCGTCTCTACTAATAATACA 
AAAATTAGCCAAGTOTGOTGOCGCGCACCTGCAATCTCAGCTACTCGGGAGGCTGAGGCAGGAGA 
ATCGCTTGAACCCGGGGGGTAGAGGTTGCAGATCACGCCACTGCACTCCAGCCTGGGTGACAGAG 
CCAGACTCAAAAACAAAACAAAATAAAACAAACAAAAAAACAGAACTGCATGATGTATA ATTTT 
GACATTATGTGGGAATGTTTAAClTCTGCCCAAATGTAGATTCAATCCAACATTATGCGAArrirf 
ATATTAATriTAGTCCTAAGrrTCATAACCCAAAAAAAAAAAAAAAAAAAGGAATACCCTACCGT 
TCCTCTAAGGGACGAACTGGOAATAATTCCCAAATTO 

SEQ ID NO: 1 0 1 8 A C J Tr J- l - l ' ! J - nTrrn TJ Ti I l l t AATCAATATTTATTGGGCGCCTATTATGTG 
CAAGGCACTACACTAGGCGCTGGGGAAGATACAAAGATAAA TCTG ACAGACTGCCCTCAAAGAG 
CrrACAGTCrAQTATAGGAOCATACAGTCTCTGGANAANATATnTAAOTGTAAC TAACC TCCCCC 
ATCCCACCCCCACAAAAAAAAGAAAAAACXTACTANACTTGGTrrCTT CCArr TACi i 1 l AGTTTA 
GCAGCTTCACGTAAAAAGCATAAATCTGAAAGTCTTTrAAAATGCATACTTTTACTGGTAAACAAA 
ATTCArmCATTAGAAAAAATGCrGAATATrrATTGCAATTAANAAAAATCTTCAGCCGGGCAC 
GTGGCTCACACCTGTTATCCTAACACTTTGGGAGGCCGAGGCAGGTGGATCACTTGAGGTCAGGA 
ATTTGANACCAGCCTGCCAACATANCAAAACCCCGTCTNNCTGAAAATACAAAAArrACCGGACG 
TGGTGGCGGGCGCCTGTAATCCCANCTACTCCGGAGCTGANGCAGGAAAATCGCTTGACCTGGGG 
OGmGGAGGTTCAATAAGCCAAAATCGCCCCCTTOTTITACCCTGGGGCrAANACCAAATCTCC 
GNTTCAAAAAAATANA 

SEQ ID NO: 1 0 1 9 ACATATATCAATCTCCCTTGCTTGTCTrTAAGAAAGGGCCGTTCATAGC ATTT 
GOCACAAACCCTCTATriXn"GTrGCAmGCATGATrTTAAATAAGAAGGAAAATAAACATT^ 
TTATrrCATGCnX;CrAAGTTrCTGGGCAGGGACATGCCTTACTCTTTrAGAAACCAA^ 
GACATCrGACTGCATTTTTCTGTrGGTCCGAACrrCTAAACAAACACTCATAAAGTAAGTrTAAAC 
AATTTGGAGATGTATGAGGAAAAAGTCTTGTTCTGTTCAGTTCAGACrnG TTAAAAAA AAAAAA 
AANNNNAAANGGAAAAANTGCrCATTTCACATGTCCATGATCTTCCATGGATTTTT^ 
TrrGAAGTrTGATTAAAGGGACAAAAAANAANAGGCOGCAAGTTTrCCTATCTCTTTGGAGNGm 
CGCTCAGGAAATTTTGCTCATCAAAGTTCANCrACATrCCNAGCGGACAATNAAGGCAAACTGGG 
OTGCNCC 

SEQ ID NO: 1 020 ACTTANCTGTATTrTCAAATAAGTAATCTTCCCCCCTTTrOTAGGACrTTAAA 
ACTAGGCATCAATGAACCTGTITnCCrrATrATGCCTGGAATNTAGTCATGATATCATGATACCTrG 
ACrCAnCCATCATATTTCA.\aAGGATTNAOAGTGCTANAAATTATTrrGGTATCCTGTAACACAC 
GGCAACACTGGTCCTTGGGCCTATNATGACCCACAGATGACTCATTATAGAGTTCATTGCTGATTT 
NTAANTTANTAATTGAATCnTATGATA 

SEQ ID NO: 102 1 ACAAAACACCATACTTGGGGCTATATGCGAnTCAGGTTGGATAAACGAGTC 
ATGTTGAATGACAAAAAGTTAGACTGGGGAGATTTATGGAGAGAAGCAAGCACCTGTGATTTGTT 
GGmGGCGTTACAATAAACACTGTAAGTGAAAATGAGTCACAGGCAGTTAGAATGGCGrrAAGG 
AAACCArrTAGGTCTAAAAATCAGGACrGAAGTGATCAACAGrrAACAGATGAAGCCATGAGAAC 
TGGCACNCTCTGGAAAGGCTGGTGCANAGAGCAGGACGGTGAGCCTGANTGAGCCTGGAGTTAGT 
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CAACAGATTCCTCAACAGAAAAGGANTGAGGAGCANGGTCAAAGAAATGAAGGTGAATTTGGGG 
GGTGGT 

SEQ ID NO: 1 022 ACATTCCACAAGCArrcCCTTCTTATTITACTTCTTTTAGCTGm 

AAGATGCAAAGAGGTTGGATCAAGTTTAAATGACTGTGCTGCCCCTTTCACATCAAAGAACTACT 
GACAACGAAGGCCGCGCCTGCXnrrCCCATCTGTCTATCTATCTGGCTGGCANGGAAGGAAAGAA 
CTTGCATGTTGGTGAAGGAAGAAGTGGGGTGGAAGAAGTGGGGTGGGCCACTGTGAAANTCTAG 
ATATAAACCA 

SEQ ID NO: 1023 ACTACACCACITITCXrrCACCAACCCCCATCCTATTCTTGAGTTGCAGGATAC 
ACTTGCTCTCTGGAAGTGTGTCCITACCCrrCTGCAGAGTGAGGAGCAAGCrGTTAGAGATGCAGC 
CACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCT 
GCCAGGTGGATGCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTOTGTGATCTGCrCCAGCAGT 
GGGACCAGTTGGCCCCTGGACTGCCCATCCTGCTGGGATGGCTGTTGGGAGAGAGTGATGACCTC 
TTGCX^TGTGTQGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTrNAAAAA 

SEQ ID NOr 1 024 ACAAATTrGCCACAGGTTGAACACTTAATTTGTGTTCCTTAAAAAATAATGCT 
GTGCTTAGTTTATTGCCAGGAAACTTCrrGCTTGATGCTTTGGTCTGTT^ 

CAAAGAATATTCCGGCCTTTACTTCAGAGTTTCrrCCAAAGATACATATGCATnTGGATCACAGT 
TCTGGAGGTTGAAATGTCCACGATCAGAGTTCCTGCATGGTCTGGTTCTCGTGACGACnTITCCTG 
GCnTGCAGATTGGGGCCTTGCTGTATCCTCACGTGGCATAAAGAGAGCrCTTGTCTCTTCATTCCCT 
TGNAAAGGCTCCAATTCCATCATGAGTTCCCCCCGCGT 

SEQ ID NO: 1025 GTACTCTATTCGTATTANGAAAGAGAGGCTAGATACCAAACATCACGAGATC 
ATANAGATGGAANACTGTATANTAACCAAAAACCAAAACAGTGATATTTCCAGAACnTrOT 
TNGTTGTAAAACTGCATGAAAG^NAAGACCATTTAANrrAGGTGAATTATTANAAAT^^ 
GGGAAAG^fNNANCTAAATTNTCTCr^AAAAT^TrCCCCAAAGATAAT^INCCTOT 
NGAGAAATGNrrTNATCAAANGTGGGGTNCTTANTTNACCNTGACTATGTT^ 
TAGT 

SEQ ID NO: 1026 ACGCGGGGTTCCTCGGCrcGATITAAGGrrGCCGCTAGCCGCCTGGGAATTTA 
AGGGACCCACACTACCTTCCCGAAGTTGAAGGCAAGCGGTGATTGTTTGTAGACGGCGCTTTGTCA 
TGGGACCTGTGCGGTTGGGAATATTGCTTTTCCnillU"l'lGGCCGTGCACGAGGCTTGGGCTGGGA 
TGTTOAAGGAGGAGOACOATGACACAGAACGCTnjCCCAGCAAATGCGAAGGTATTTGAAGGGG 
GTAGCCCCrATAGGCATCGCX:CGGCCACACCTCCTTCTTCTAGGCCGGACATCCTAGCCTGTATTG 
GGTGGGTCACCCGACCTCCTTGAAGGCTTGAGGGAGGTTCTGGTGGAGCTTGGCTGAA'nTGGATG 
GGGTCAAGGAANCGAACGTGTAGTGCCACCTCAAACTGCATCCj^GAGAGAGCCTGCATTTTm 
GCTGGCTTGGAACCCCCCGCCCCCAATGGATTGTGGGAATTGTAGTTCANGGACCATCGGCCCGG 
GACTGa;AACTTITANAGCATCTTCAGTGAGTGTrTACCTCATGTCACCAGACCTTAATCTGCA^ 
AGCTTGCANGAACCGGGTGTGANGCTCAAGCCTTGNGGACATTCCGTTGCCTTACCTACATTANAC 
CCTCCCAA 

SEQ ID NO: 1027 ACATACACAAAAAAGTTACTGGAATGCTCGGAATAAGATTGTTrTTCTGTTGT 
CATnTrG CM ' rril ' i ' lA CAAGGTTTTTTTTCTCCTTTGAGATTAT/^ 

GTAAAGTCAGAAGTAGGACAGAGAACGCTCCGAAGGCTGGTTTGGTCATCCGAGATCATTAAAAA 

TGGCTGACCCTAACAATATGTACl-lTn I'l '1 1'l'l'l I'l 1 J 11 1 rriTm-I GGGTGGATTGAACANAAT 

TTATTGGCTGTCnTrGAGTGTCrrrGGTATGGCmGGCAGGGCTGTCTGGGTTCCTCCGCm 

TGrrmGGGCrXiCTGCTGCAGCCTTrAAGGCTCTTCITCGCTTCTTCAGC^^ 

AAACCCGAATGCACANAGCCTTCTTGGTCACNAATCGGCGGNGGGCTCTTCGGTAATACAGAGGG 

GGGAAGGTGTNCACTGTTCCAAAAGATCTCAATAAATTACTGGCTTTGGTAGGTGGAATANGTTCT 

TTGGGTGGGACTTGNGGAGCTNCTGGmAAAAGGNTTCTGTCNANTGTT 

SEQ ID NO: 1028 ACAGTCCCTCrCCTATAAGCAAGAAGCTCTCGTGTGCTAGTGTCAAAAGCCA 
AGGCAGACCGTCCTCCTGCCCTGCTGGGATGGCTGTCACTGGCTGTGCTTGTGGCTATGGCTGTGG 
TTCGTGGGATGTTCAGCTGGAAACCACCTGCCACTGCCAGTGCAGTGTGGTGGACTGGACCACTGC 
CCOCTGCrOCCACCTGACCTGACAGGOAGGAGGCTGAGAACTCAOrnTGTGACCATGACAGTAA 
TGAAACCAGGGTCCCAACCAAGAAATCTAACTCAAACATCCCACTTCATTTGTTCCATTCCTGAT^ 
CTTGGGTAATAAAGACAAACnTGT 

SEQ ID NO:. 1 029 ACAGGAAATTGACTTAGCACTTTCCCTGTTmCTATTGCATAA'rri'l'rri'rri 
AACCCAAAGATAl'llU-rrrrGCTGAGCa'GCCCAGTATTCACTGTTCACAACTTTGATTACTGGCTA 
CAAGAAATATTTTCTTGCCTTCCCCAAATCCCATACTCCCCAGAATCTGCTGGCAAAGTGAGCCCT 
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GGTA Cn i' iTn ' i ' n ri ' lTli ' l m i ni l ACrrCACAGCAAGATTrATTAATACATCCAAAANAAAG 

AAAGAAAAATOATmOTCACACTAXm'ACAGATAATACATACAGNGTTTCATACACATATTACAT 

CAGTTTTTACACAAGAAAAATATACATAAAAATGTAAACrmGTOTACAAAAACOT 

ATTAAACAAGGACCAGATAACAAAAGGAGACCAATTCATTATTATITCATCAAArr^^ 

GTAATACAGTACCACCTGCTCrCATGAACTCTrAAATCTGAATAATOTTCANATAATTTTCATAGG 

NGGAAGTGGCAAAACATACAACCAATTGTCCCTCCTCCCCAAGTCC ACATGCCA AAATGTTTGCA 

TATAAACTGACAAGANQAACCCANAAGGGGACrAAATACCACTATGCrrirriCriCACTAAAAA 

ATAG 

SEQ ID NO: 1030 ACACATGCACTTTGTGCrnTACACACACAAAAGTATACTGTAATCCACTGAG 
AATAACCrCAGCTGGGTCTGrrrCCTGGGTTATGTTATATCTTGTAAAAAACAAAACAAAACAAi^ 
CCAAAAAAAAGTrrGGATTrGGCrGGGTCTCTGCTCCCCACTTTCTGAATCAAAATGCCCAACT^ 
GTGGTrCTGCTGCAAOTCCATGAGCAAAGACGACTCAGAGGGGTGGGCAGGTTTGGTATCACCAG 
AACAGGAGCATCTACCATGGAAACACCACCrrCTCTGTGGCCCTCATTGTCAGAGGGGCAGAGTT 
CCCGAGGGATGTGTCCTCTGGGACTGTGTGGTCCTTAAAGTAAAGGTATCeTAAAATGGTCAGAA 
CATGCAATTTCCTTTCAAAGGCAGTTCAGTGCATTGGCTCAAGGAGGGTGCAAGCCCAGGATOAA 
GTGGAGTCCTGGGGAGGGCCGCACTCCTGAAGCCACCAAGCAGAGCGAGGAGCTCCGGGGGGTC 
TTCTCTGTCTTCCATCCTGCGTCTCAGTTCTCCCGGACCCTTGTCCTACTTCACAAGACAGANGCAT 
TanrrCCAAAOJITCCATTrGAATGTCGCTCTAACATGANCCGCCCACGAAATGGACT^ 
ACCITGTG 

SEQ ID NO: 1 03 1 ACTGTATAACATCrromATTATTTAATGrrTTCTAAAATAAAAAATGrrAGT 
GGTmCCAAATGGCCTAATAAAAACAATTATTTGTAAATAAAAACACTGTTAGTAATAATCAAAA 
AAAAAAAAAAAAAAAAAAAAAANGTNCAGTCAATAACACAATAGAAAAATCTATATACAGTGTG 
TTAAAATGGAGTATGGAGGAAAGGCAATAAATAGGTCATAGTANCGAAGTAATTACAAnTAGTA 
AATTAACACTGGAATGATGGATGTGCAGATGATGATGTGCAAGTAGAAATACTTGTGTGCAAAAA 
AAAAAAAAAAAAAAAAAAAAGGTCC^CAAAGACNANTC^^^GAAAAANAAAAAAANCTr^AT^ 
AAACAGG™AGTGAT>mn"GTGTGTGCTACANCAAAGGCTGGTrGTGGNAAA>^ 
ACTGTATQATGTGGGCTGGGCAAGGTGGNTCACGCCTNTATCCAAGCA>mTNNGAGGCCNAGGT 
NGGGCTGATCACCCTGAGGTCAGGANAGACCGGCCTACCAACNTGCTGAAACCCCmTTTACTTA 
TTATNCAAAAATTANCCAAGTGTGGTGGCCCCCCCTGTANTCTCAN>n'ACTCGGAAGCTNAGGCA 
GGANAATCTNTT 

SEQ ID NO: 1032 ACi r i T iTn Tinrv n 1 1 iti ri i 1 1 n inggngaaaaatacttatttcatgt 

GTTrAAAAATACATTmAGGGTGGGCCCTGCAGGAGGANACAGGCCGTCCACATNTCCrTCCCAA 

TAGTGTGTCCAGGTCGTrrCCAAAATCTGTGGGTCCCTCACAGCTTCTGATGACAGCTGCTAATGC 

CAnTGCTGAGGAACAAGGATGGGGAGGATGGCGAGGGCCTGGCCCCCAGGGCGGCCAC ACCAA 

AAGGTCGGANAAAGGCCCAAGGCGGATGCCACNCCCAGCAGTGGTGACTGCCCCCCACTCCTTTT 

CTGAGTCTATCAGCATTGTITGGTTTTCTIXnTGrrGCrmAAATCCTCGAGC^ 

AAGCANATGGTANAGGGTATTCTATOTCTGTAAATOGCCTNTCACCTGCCGGTGCAGGGGAGGC 

TGCCACAGGCrCANAACANArn*CCATGGCCTCCCCTrTrGACCGCTCTCCACTGTCCCCTGGGGC 

TTCTGCTGAGCTGGGCTTTGCTmCTTTGGCTTATNAAACCGGAAGTCCGGGGG^ 

NACCACCTTANGGGCG 

SEQ ID NO: 1033 ACGCGGGGAGGCTTGAGGGAAGCATGGAGGTCCATGGCAAGCCCCAGGCTA 
GCCCGAGTTOTTCGTCGCCCACCCGGGATTCCrCAGGAGTCCCAGTGTCCAAGGAGCTGCTGACG 

gcgggaagcgacggccgcggaggtatatgggacaggttgctcatcaactcccaacctaagtccag 
aaagaccrccactctrcaaacagttcggatagagaggagtcccttattggaccaggt 

SEQ ID NO: 1034 ACTGAAGCAGCATATCAATCCCAATAAGACArrGGACCCriTTGAAACCATG 
CTGAAGTCATTATTAAGGTATCAATCTGGTGGTGGCAGTGTGAGTGAAAACCACATGAGGAAAAA 
ATTGTATGAAAATGGTGTGACTGATTCTCTGAAGAGTAACTITGCCCrCCTCCTAAAGCm 
AGAATrATTAGATAAATGGCTCTCCTACCCAGAOACCCAQCACGTGCC CCTCAGCCAGCATATGCT 
TGGTTTTGCrATGAAGTCTGTTACACAGATGGTAATGGGTAGTACTTNNTl 1 1 11 1 1 1 1 1 i 1 1 1 i m 
TTTTTTTTAGTANANACAAAGTCXJACCATGnrrGTCCAGGCTCCACACTT^^ 
CAAACATNTTAATACATACTTATTAAATGCTTACnTrTTTTAACATTTAATATTrA 
GGOATGGAGTTTCACTCTTGITGCCTAGGCTGGAGTGCCAGTGGGCCCGCGT 

SEQ ID NO: 1035 ACTAGGCCTATCAAGAAAAGCTGTGAACAGCAACATCATAGACACAAAAGG 
GTCATTCCTGGGTGTCCTCACCCAAGAAAAAGATGGAGGCAAGTTAACACAAGATTITTTTTTAAA 

OATACACTAAAATcccGCGTACi-rrnrrrrn-nrr 
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SEQ ID NO: 1036 A cri ' ii ' i rii rii iu i i nr mTGCTAAAGCGTTTAriTCCArrcAATrcAcrrAA 

CTGArrGCTCTGCTTGGGTCn-CTGTGTTGGTTCACACKJACKiGAGTCrGTGGCCAGATCCCT^ 

GAGGGOOTCCCCATGGCCAAGACCTTTAGGCCCAGCAAAGGTCTGGTGGGGGGAAGGAGGTGGG 

ANA 

SEQ ID NO: 1037 ACCAGCrTCTAAAGCCATGATGCCATAGGTCCATITGrrGATGAAATTCCTAC 
CCACTGTCCTCGGGCATCTGACTCTGGTCTCTGCACTGGCATCAAGAGAACGCTCC^ 
AAGGCrAACACCTTACAGGGTAACACTGTAACACTGGCCCTGGAGCCAGGTGCrrrTCTCCATO 
AACITCCACCTTGGTAGCTCAGCCGACATAGACAACACACAAAGCGCAGCTCCGCACTrCTGTCCT 
TATCTTCACACAGTGACATCCACACCAGGTGGCCAAACAGAAOAGAAGGCAGAGGCCCACCAAO 
AGCTGATGCTGCGCAGTCCTTGGGGGATCATCCTCCGGTCTCACTGGGGACGAACCCAGGTTCTGG 
AGCCTCTCCCCTGACAGACAGCrrGTCACCGGCACTTATGGGTCCTCTGGGATTTCAGACAATACC 
CAACTTCTGTAGGTTCAGAAAGTGCmCAAGCAGGCAAGTGGCACCCACACCCGGTGGGACACA 
CCrCCTGOGTCCGAAACCACTCCATCATGTGGCTTGGTGTGGCCACCGTGCCCACAGGTCAGGCAG 
AAATTGGACGATCCCCCGCACAGCCACGTTGACAGATGGGCACACTGGAACONTGAAAGCCTrrO 
CANATGGCACAAC 

SEQ ID NO: 1 038 ACNCGGGGGTAATATGGTNNAAGANAACCCATATAG ACAGCTGCCTTGTAAC 
TGTCATGGAAGCATGCCTGGAAAGACAGCAATNGAACTTGGACCTCTGTGOTCAAhrmCCCTTTA 
CAATACTGGATTCCTCAAAAGAATGCTAT™GAATCTKITCACCATNGGimGGATGACAT^ 
ACCCTA^TAANGAC>^™ATCT^^^GAA^^^CN0AGTGTAC0aK30GCOT 

SEQ ID NO: 1039 ACmTTAA rrn - l l 1 1 1 1 1 1 i 1 1 1 1 1 I GGATTTTTAGTAGAGACOGAGrnTAC 
OTOTTGGCCAGGCTAGTCTCGAACTCGTGACCTCAGGTGATCCGTCrrGCCrrCAGCATCCCAAAGT 
GCTGGGATTACAGGCGTTAGCCACCATGCAGCCCCTTrCAAGCCTTTTAACATCATGTCACCT^ 
CAATGAGCAGTTGCTCCCATTACCCAGTGAACTCACCrTCTGATGGGACAGTTACCCTAGTTGGGG 
CTTCTCAGCCTTGAGAGATGTGCAGAGCAGGGACCTGTCCAGGGCAGAGCAGCCAGACAGCGTGG 
AAATAATCCAAACAGAGAAGGCATrCAAGAACTTGGCCTCTGGrrGACrrAACACACTACrrC 
GTATATGGCAGOTCTOTATAATAAACTCCrcCCTGTCTAGTCITCAGCCTCATAGAAAGCAAGA^ 
GCAGGTGGGCGTATGTTGCAAAACAATTCTTGGGTAGAACAAGGGGGACATr rGATGGGTITGGG 
TmTTCAGTAANAAAGATGAAAGTCCTGGAAGrnTAGGGTAAAAGTGGGTACl I H 1 H 1 H 1 1 1 
TmrnTTAAAACTCNNGGAAGGTNTTTrTTNTAG^ 
NTGG 

SEQ ID NO* 1040 ACAGTAATCCTGTGAGAAAGACAGGACAGAAACCACTGTGCCTATnTACAG 
ATACGAAAACTGAGACACAGGTAAAGGGGCnTGTCTGTAGTCCQATAGCTAGCAGATGGCTGGAG 
CCAAGACTGAGGCTCGTTCTTCAATGCTGAGCCAGGGCTCCTTCrGCTGCACCACAAGAACGCTA 
GACCACTCGCCACCAGCCTTCTCArrCCXn'CTrCXTCCATTCTAATCAmCTAGCTGGCTGGCCT 
CACAGAGCATAGGAAAACAGCCAAGGCCGGGCACGGTGGCTCATGCCTGTAATCTCAACACTCTG 
GGAGGCCGAGCCGGGTGGATAACCTGAGGTCAGGAATTCGAGACCAGCCTGCCAACATGGTAAA 
ACCrCATCTCTACTAAAAATATAAAAATTAGCCAGOCATGGTGGCGCACACCTGTAATCCCAGCT 
ACTCAAGANGCTTGAGGCAGGANAATTGCrrAAATCTGGGAGGCGGAAGTTGCAGTGAGCCAAN 
ATCGCGCCACTTGAACTTCCAGCCrrAGGCACAAGAGCAAAAACTTCCArrCTNCAAAAAAAAGA 
AAGGGGAAAAACCAGGGGCCAGGTNACCCATrn3GGGGAAAAGAAGCCCCACTrTAGGAAATCC 
TGGGGATTGTTAAGTOT 

SEQ ID NO: 1041 ACTACTAGTGGACTCAAGTGATATAAAAAAATAAAAAATAAAATACTTCCAT 
ATACAGTGACTGGACTCTTAATACAATGGAAAAAATACmGGTGAAACACTGACATTrAGTGAG 
CCCCTATCGTGAGCCTITOOOGTTGQAAAGGGGAGACGGGGAGTGAGTGGAAAG ATAA CrAAGA 
CrCAGACCTCATTACAGAGTCATTCACAATCCATGTTCCCACGTAGAAGGCACACCnTGTOTAGC 
TGTOCACAGTrAATAATAGGGGCAAGATGTGTTATAGCAGAArrACCAAATGCrATAAAAGTGGC 
ATGAAACAAAGAAGrrCTrCTAATTGGGGTAACGGGAOTCAAACTTACAGAATTTGGGAGAGAGC 
ANTAACArrTAGCrrGAATATTTTAATTrTAAACACATTTAAAGCTTCTA GACTC AC^ 
GTGCTCAAAGATGTITGANOGTrGTCAAGCAAAGCTGTrCAACTTCTGCATTTTCTGTG 
GCACAATGCCCACTCAGATTTTCTNCTANGTCCTGAATGACAGCTTATCTAOICANAATGGAAAC^ 
CCANATTTGTGACANCACATGGACANATGCTGCAATCANAACAATTCTGGATGTAAACACTGGTA 

GGCTGCACTGGTC 

SEQ ID NO: 1042 ACGCGAATTCGAGAAAAAGTTCAGTGGGAAGCATGTCGTCTTTATCGCTCAG 
AGGAGAATTCTGCCTAAGCCAACTCOAAAAAAAAAAAAAAAAAAAAAAAAAAAG GTNCTTC AAA 
TGTCATTGTAACAATACTCTGATCAAGGTGAGGTTGTGTTCCTGGCACATOCTGAAUlJirriCTGC 
AAAOTATrCCAGAGAGACTATCAAAGGCATGAAGATCTAATACATTAGAGAGATCAACTCTTGA 
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TCTTTGTGTTCTAANATITGTCCTAGTCCATTTATGGATCrAATTGTCGGTOT 

mCITCTGGAACCAAGGTCTGTmATTAATGGANAATGGCATCTGCCAAAGCAACITGCCTAGT 

ANCTTTCATTGATTAAGGCCATCANGGTTTCT^m'CTGGr^AAATGGTT^ 

ATN(XACCAATAGGT^A^™AAT^^CCCOTCAAAGAA^^AAAAT^T^CACr^rc^ 

CATTNAACCCCAAAACCATTANNAAANCCCCCCTTGCANCCCAGGGGGTC^ 

CAGGGGGGGNCCTTmAAAACCANKTTmKGTNAAATTCCAAAAATT^ 

GGCCAT 

SEQ ID NO: 1043 ACOAGCGGCGCGCCATGGAGTTACTGAAGGTCTCCAAGGACAAACGGGCCCr 
CAAATrrATCAAGAAAAGGGTGGGGACGCACATCCGCGCCAAGAGGAAGCGGGAGGAGCTGAGC 
AACGTACTTGCACA0OAAOTGTTGGCGCnTGTrGCATTCGTTGCTGCTa:AAGTTAA^ 
ATTGGAGCTCATCTCAGCACAGTGCTTGTTCCCACCCATGGACTTGCCAGACCAGGATOGT 

SEQ ID NO: 1044 ACmCTGTTCTTTGGCACATTITGCCCAGCGGATGCAACTTCTATCCT^^ 
CAGTTCATATATCTCAGG<>OTGATCCATTTTGTATCAGCCAGTTTCCCrTGTTAGGGCC^ 
CCGGCTGGCAGCCAAGAGCAGCACTCTCCCCACCTTCAGCAGAACrrCTCAGCTCATGTGTGTGTT 
TnAAAAGCAGTAGGCAGCTTCCATCAGCAATGGAAGTGTCCCCCACTCCAGTGGTOGAGGTGCA 
GTCGCTGGACCTGAGGAAGTGGGGAAGCATTACAAACCCTCCTGCAGTAAGAATGGCAGAGTTGG 
GAAGGATCTTCTCTCGCAGGCTGTTGGCCCAACTCCTCTTrrCTGTGTATGAGGGCATCTTAGTm 
TTATGGCGTTTTCAAAGCAGCATATGAGGATTCTTAGCATGCAGCTAAGTGACAACCCCCATGGCC 
TATGATGTAATGrrTGClXrrGGACTAGCCATCTrGATAAATAGGAGTGTATGTATCTTAGGTrCrTr 
GTAGAACAGCTA*1"I"1U17U' I W CTCATTAAAAAAGGCCrCTGAAAGCCCTCTNAAAATGGACCTCAC 
AAAGGCAACCAGTGTGAATGGATGGGAGCTGGGGTGCCTGCATGCCCAGGCTGGTCNTGGGGTGG 
GCAC 

SEQ ID NO: 1045 ACAGCTTIXnTCGTCCTCCATGCTAAGAGATGTAAAAGCITAAGGGTCAAAC 

aatacx:aattgtataggcttcaaaaaccatctaagttagggcattctctagt^ 

acctggaacactgacaagtcatcacrracatagaataatgtgaagtaaattitrtgaaaaataaa 

ttttagtggaacaatcctgaaggataacaccagaagaatagcaggtraccagtaaggtgtcagcc 

AATrTGTTCCAGTCACrriTGAATCCATGTTCrATAATCTAAAATTTATTCnXr^ 
OAGCTrCCTATCATGTCAGTATCTATGrrATGAAGAAAAGGAGACTTAGGTOAGATGTnTTATTT 
ATCGCAACTGCTGCATrAATTGCCTAGGACCTCAACAGCTTCATGAAAGTCTGGGAAATGTrCATG 
CATAAGGTTATTGCCCGCGT 

SEQ ID NO: 1046 ACCAGTAAAAACTTAAAGGCACAAATTCTCCrrGAAGACCTTCTCCCrnTTAT 
GTGGCCCCATATTTTATGTTGCrrrATCTTTGAAATTTTGCATGAAAA GGAA ATGAATG 
ATGAAATTGTCCTTrAGAGCATGATTACTrGTTCCCATGGACAAATATTrTTCT^ 
CTGGCCTGAAACACGGGAAACCAGAGTCAAAAGTTATCTCCCrrCTCCXn'GTGATGCCTTGAGA'nT 
TTTTCTGCGTTGTTTAATGCCTGAAATCCAAGTCTrCXnX:CATG GGAAAA TACTGTTATA 
ATTCTAGATGAGTAACAAAGATCTnTrAGGCCTTCATTTTATGi-riin-lCllAACTGTTATATTATG 
ATTGTGACATAGATTATACTACTACrAAmTrGGATGTTTCAAAAGGTCAAGAAGTAAAAGATGT 
TAGAAAGCAATGAGTGAGTCCrmGATITrTAACTTATTCCCCATOTCCCTATACTTCGTGTGOT 
TTC C l 11 1 I T l i 1 1 1 ' i GAGACGGAGGCTCACTCCGTCACCTAGGCTAAAGT 

SEQ ID NO: 1 047 ACCCCACAGCTCCCACACTGTCATCCCCCAGCCAAGGGCCATCXCTAGAAAA 
ACTGGTTTACTGmxn"AAGGAAACCATrGTCTATAGCCCTTAGCC^ 

CAGGTCAGGTG<XCrrANAGTGAGGCAGGGGCrrCAGCCAAAGTrGTGATCGCAGCrTCrGAGGCA 
GTTCONAGTGGAGTCAOAGTCCQCTGCCACCTGAGCTTTCCACCAGATCTTCmCCTTTCC^ 
GCTTrCCTCAAOCrGCAGGCTTGATCCCATCCCACAAATATGAGAGAATrCTGGAAAGTGCCCTGA 
GAAATGGGTTCTGGGTTTTTrrTTCATTmArr^^ 

NATCCTGCATCXn'GCrmCTGTTCCCTGGATACCGTAGGATGGTmATITCAGT^ 

TAGTCTGGACACnXSTGGAGTNATAACAAGAGTGGGATGGAGGTTCCAGGGCCAATCANlU'I'Cl'l'r 

GGAGGGAAGCTTOAGAAOTAGGTrAAAGCAATGCCCAAAAAAGCCACTGCTThn^CTTNCCA 

CACAATGAACTAANACCAACCTAACTAGAAAGAGGGTCITTGATCAAANAGGGTCAANCAGTGG 

GATGTGATTAGTGGGTTNGGGGATGGAGTNCCAGGTAAACCAAAAGG 

SEQ ID NO: 1 048 ACGCOOGGGGCAACGAGGAGGGCTOCOAGGCCATCAGCTTCCTCCTGTCCCT 
CATCGACAGGCTGGTCCTCTACTGCGGGAGCCGGCTGGGCAAATACTACXjTCAAGGAGAGGTCTA 

aggcaatgotggcttgctatccgggaaatggaacaggttatgttcgccacgtggacaaccccaac 
ggtgatggtcgctgcatcacctgcatctactatctgaacaagaattgggatgccaagctacatggt 
gggatcctgcggataritccagaggggaaatcarrcatagcagatgtggagccx;atttttgacag 
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TOCTATGACTGTCTGGT 

SEQ ID NO: 1049 ACAGATGTGTATGGGAAACCCCAACCCCTATATATTGTAAATAGATGGGCTG 
CKJCTAAACATTGTTGCCGTTTCATACrrCTACCAACTCAGCrmACACAATAAAGCTCTACT^ 
CTGOATAAAAAAAAAAAAAAAAAAAAAAGTACGCGGGGATCCCrGACTCGGGGTCGCCTTTGGA 
GCANACAGGAGGCAATGGCCACCATGGAGAACAAGGTGATCTGCGCCCTGGTCCTGGTGTCCATG 
CTGGCCCTCXjGCACCCTGGCCGAGGCCCAAACAGAGACGT 

SEO ID NO: 1050 ACTACTrGATOTTTATATCCAATTCCmCCATCATTrGATTGAATCTm 
GTCTCrrGCTTTTGTTCCTCTAAAAACTGGTCTGCTAACTTmA^^ 

CAATTCCTrCATCTGCrrGTCCACACTGAAAACrn*CATATTCrrAACCTCTTCACAAGGT^^ 

TTCrrCCTATAGTCCTGAAGCTGGTCTTCTrGTCOAAGAAATTOTGACGAAGCTCAGCAAGTrGCT 

CCmAAATGATGTATGTCTGCCATCTGATACGATATGTTGTTCTGTAGTnAACCTTQTCATCCTC 

GCATCrCTTCCCOAGOCCCTCTCTGGCCTGCAGCCGGCTGCTGAGGCGGCCGTAGTCGGCCrCOT 

CTGGTCGATCTGTTTCTTCCCGCGT 

SEO ID NO- 1 05 1 ACTCTGTrGTAATGGGAAAACATTAATATCTGCTrCTTCTGACACG ACAGT/^ 
AAGTATGGAATGCACACAAGGGArnTGCATGTCAACATTAAGGACACATAAAGATrACGTAAAG 
GCCrrAGCATATGCCAAGGATAAAGAACTAGTAGCATCAGCTGGGTrGGACAGACAAATATT 
TrGGGATGTGAATACTCTAACAGCATTGACTGCCTCAAATAACACTGTCACAACriCriCllTAAG 
TGGAAACAAAGATTCCATTTATAGCCTGGCCATGAATCAACTGGGAACAATCATTGTATCAGGGT 
CCACTGAAAAGGTGTTACGGGTATGGGATCCAAGAACATGTGCAAAACTAATGAAGCTTAAAGGG 
CACACGGATAATGTGAAGGCATTOCTATTAAACAQAGATGGCACGCAATGCCTGTCAGGCAGTTC 
TGATGGGACAATTCGCCnTGGTCCCrrGGCCAGCAGAGATGTATAGCAACATACCOAOTCCATGA 
TGAAGGTGTTTGGGCGCTOCAAGTCAATGATGCCTTCACACATGTGTATTCTGGTGGAAGGGACA 
GGAAGATTTATTGT 

SEQ ID NO- 1052 ACAAAAGAATACAATATGATTTGTCAAAAAACATATAAAAAGACA GCrGCT C 
TTCCTCyUATACATGAGCrAATGATAAAAGACirmCATGTTAATGTCTCCAAGrrCTTUl 1 1 1 1 A 
CATAAAAAAGAACATTATGGTGGCAAATGTGAATTATCCrrn-AATATTGAACATTATATTCrnT 
AAAATCCATCCAGATCAAATGCAATAATmCrrnTAACTCAACAACTGATGCTACCAAACGTGG 
ACTCAATATACrrGTTAAAACGTGTAAAGCGTGTCTCTAG TCTrC AAAGCnTCAGGTGAAGAGAG 
GTGCrrTTrCTTGATGCAAATCTCAAGGCAGAGAAAATCATmAAAGCTTATAAAAAGTGGACAG 
AGAAATATrAAAAACTTCTCTGAAATATACAAATATGTGTAATTATTAAAATTGAAGACAGTAAC 
ATCAGrrGCAAGTGCTTGGAAG'rCrGCCTGACCrmGAGTTTCrACATTTTCTTCAGTra 
GAGCACCCAGTGCTAGTAAAACATTCAGGCTGGGAAAGGAGATATAGNCTrCATTACTGNTnTT 
AAGAAAGAAAA^TCC^ATG^^^mGGCAGTCr^CAOTATCAAAGTATAGG^OT^CNATTACTAGAA 

ATTCCA 

SEO ID NO: 1053 A Crrrn - i - n - i t l n l n IM GGGGNNNCCATNAAAAAGCriTATITCCATTrGG 
TCCAAGGCTTGTrAGGATAGTTAAAAAAGCTGCCTATTGOCraGAGGGANAGGCTTAGGCAAAAC 
CCCTArrAOTrGCAAGGGGCCCITCAAAAOTCNCTGGGCTCAAAAGGCnCTT 
AGNG 

SEO ID NO: 1054 AC i - rnTi - n T i l i u i n i i 1 1 u i CArrcrAcn'iTcri'i attgtctggctaac 

TTACAAANATGCANATGTCTAGGGTAGTCTNTACCCTACX:ACTr ACACTA TCCrGATGACACANAT 

AGCAAAATGTGTCTGTTTACATAGTGCATGATATGAAAAAAAAGTTTTTCrrCCTCTACGGTCOT 

GACrATAAGGAGGGAAAAATTAATTrCATGCCAACArrmGGGGAACTrrAACAATCA^ 

TCTGCTACrAAAATAACAAAACrGGTATTACACTrrAAAATATAAAGACCTAACAGTrTTTACAAA 

TATGCAAATAATCTACTACrrAGACATAAAAAAAAGTTG AriTC ITl'l AAATCACAAAGTAAGGCC 

CArrGGArrAAACATTCTCCTGCmTACTAAATAAAATGCATAGTGAAATAAATACTGAACACTG 

AGnTAATACTCNATACATTCAATATAAAATAAGAGGNGAATGTAAA NNCTGN TACTGTGNATCA 

TrATCTGAAATOT^^'AAAAAACCNNTGTAGCT^G^^TAGGAAAAAATATITI^ 

TTNC 

SEO ID NO* 1055 accagaatactctcacattttatttcaaggcactctccagcaqaaagtataat 

AAGAATAAACTACmGTAAAATGTGCCATTTTTATAAAOACAACAGTGAGATCCAAAATGAGAC 
AAAGTAGAATTATACTGAATGTrCATTATCCCAAAQTGGCTrAATTTCATTTGCTTGCTGCAAAGA 
CAAAGACTGTAAAAAGGAGCAAAGAAAGTGTGCAAAGCACTTCCTTGGAAGACCTCGTCCACTAA 
CAGATCACAAGAAAAGCAAGTGCAGCCCCTCCACCGAAAGGAAGGCAGGCATGTAACATCGGAG 
ACTrCCAGTCTCCCCGACGTCTGCTCACGCIGGATGCTGGCACTGGCTTATCATOATGGAAGGAGA 
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SEO ID NO* 1056 ACCTCTCATTITGGCrrCTGCCCCTCrGGAAGATCCTCTCTGTATCCrrCATCCTG 
CGATGAGTGGTGGCAACGTGTGCCCTGAGCCCTATGCTAACGTGAGTGGmCTTTCAGTGTTCTC 
AGATTTTCCCCAGCTCAGTCCTCCCTCCTTTrCTGCAGCTTGGTCCrGGlilC i^ 
CCAACAAGCAATGATGGCTTCATCGTCCATrrCrrCCTCrrCTCCTGTCAGTACi ri 1 1 

SEO ID NO- 1057 acgcggggcaccgtggagagcagagcgcggcggctggaagctgctaagtca 

GAGCCGCGATGTTCCGGATTGAGGGCCTCQCGCCGAAGCTGGACCCGGAGGAGATGAAACGGAA 
GATGCNCGAGGATGTCATCTCCTCTATACGGAACTrrCTCATCTACGTGGCCCTCCTGCGAGTCAC 
TCCATTTATNTrAAAOAAArrGGACAGCNTATGAANACAGGACATCACATATGANTGCACGATAT 
TAAGAGCCTGGTTCAGTTTCTACTCCTCTT 

SEO ID NO: 1058 ACGCGGGACTCTAAAACTGGTGGriTCTTACTGAAGGTGTTCTCCATTTGA^ 
TTITATCTrcAAAGTATITATAAGTANTATCTrrAAGACATGACTTGrrAGT^ 
AGNCNGAAGAGTANCTCTCAAATITCTCnTAATGTAAATCACCTGGGAATCT^CTCAAGTTO 

GAAATTTTAAACCAC 

SEQ ID NO: 1059 ACAGAGAAACCACAGGTTGCCCTTTCCACAGCTGGATAGACrTATCCAAAAC 
GGCAGGATGGTTCTGTATTAATCTITrTGGAAAGCATGTCrGTATTAAGATTGCAAAACATACAGA 
TAGCTACCACAAATTAGGTCAAACGACTGATCAAGTTGTAACATCTGTGAGGTCAAATTCCAGTTA 
TAATAAAGTGCCTAGATACACArrrATACAACAGACCATAAGAGCTGAATTCTTrACAAATGTCTr 
TATGGGCATGTAAAATTGACTCTGCATrrCTGCATGTGTGCATrCACATAAGAGAGACCAGTCrGC 
ACTGAGTCATATATACTCCAACTTGAAAAAOTAAGTGTAACAACTGGTTAATCATGCAAGTCTGTT 
TGTAATATAACAATGACTGGTAAAACATGAATTCTCGCACAGTAGTAATAGGTGCACTCATTAAA 
AACACTACGGAAAAACACTGTATTTGCCCGCGT 

SEO ID NO- 1060 ACATACTGCTOAATTTAACTCAAAATATTTCAGGTAAGTGAAAGTGGTGCTTA 
ATGTAGACTATAGAATGACrrrCAGGTGTTTTCAACTGAAAGTATATATCCAGAACrGCATCOT 
TAGAAATACAAGTAAGACTTAGGATAAmGCCTTCAAAACAGTrrrCCTAATCTCAGCAGTATCC 
AGTGAGTGAAGAACACrrGACTGACTCTNGGGCCACCrCTGTrACTTACTGT 

SEO ID NO- 1061 ACATAAGTGGCTATCAGAGAAGCCAGCCGATATGGATTGGCCTGCACGACCC 
ACAGAAGAGGCAGCAOTGGCAGTGGArrGATGGGGCCATGTATCTGTACATGCrrrCTCT^^ 

CTGTc ri 'Ci-ni 1 i TGATerrTmAAATCTGATTTrCTarrrrrTCCCTCT 1 1 r 

CCAGTCTCTCGCn'CTCTTTAmCTCCCCATATTTCACTX:TGTTCTCTCAC<XCA^^ 

TGnTCTCTCXCTCCCTCTCrrCCCACCCCCTGCCrGGCCTrcCATATATCAAGCAGA 

CCrrATGCAGGGGCAGCCCTGa:ACCTGCCATAAAGrrTGATAGGCTAATGGCATmGTGGATATr 

GCCATGTC:ACAAGTa:AGGACAGCATCAAAAATAGCCXn-GATGTCTAAACCACTTCAGCTATCm 

TTTATITrrAAAATAAATACAlTCACATGCTmAAGAAACTATAAAAATATATAAAGTAAAAA 

TCrnXrrCTCACACTGCCTCCACCTCTCCTGGTCTACCGTTGTGCrrAGGGGAAACCATO^ 

GTirCTCCTGTGTCCTTCCAGAGTGTCTrTATGCAAATGAAAATTArrGNGATAATATAT 

SEO ID NO: 1062 ACAGACCAGTGAGTCTGGGGAATrGCGGTCTCCACCAAGATCTGTGGGTGCA 
OTGGCATGTrrGCTGCAGAAAAGGCCCCAGAATGGGCTGGCTTGAACTGGAAAAACACACm 
TCATCCCrmGGACCACGAGmmGAGAGCAAAGCATGTGTTTGATATrCCTTTGCTCACCCTC 
AGGCaTGTrraGCAAATTGCCTGGGATACAGAAAATAAGGACAAGGTCTGGGTGTAGTGGCTTA 
TGCCTGTAATCCCAGCACmGGGTGACCAAGGCAGGAGGATCTCrrGAGGCCAGGAGTrGCAGA 
CCAGCCTGGGTAACATAGTCAGACCrrGTCTCTGCAACAAAATTTAAAAATTAGCCAGACTTGGTG 
GTTCCCACITGCAATCCCAGCTATTraGGAOGCTGAGGCGAGAGGATCACTTGAGCGCAGGAATT 
TAAGGCTGCTGTGAGCTATGATTGTGCCACTGCACrCCAGCCTGGGTAACAGTGAGAGGCCTCATT 
TCAACAATAAAACCCAGOTGGGCCGGCGCGGTGGCTCATGCCmAATCCCAGCACTTTGGGA^ 
GCCAAGACGGGCAGATCACGAGGTCAGGANATAGAAACCATCCTGGTTAACACGQTGAAACCCT 

GTCTNTTCCAAA 

SEO ID NO: 1 063 ACNCTAAACAGTGGAmGAGTrCCANCGNTTATTCTTnTmui 1 1 1 1 iCANA 
TCACCATCrAAGnACATCTTTAGCTCAGGTCCATCCTTCCXJAGATCTNCTTCTrAG^ 
CCTGOTGCTGTCTGTGGTCAGGTGACCT^ACTCAGGAGCAGATAT^^•CCTTGGCCGCCATGG^^ 
TCATCCATCCACACGTGCCTOTAGCATTCCAGAGCrC^CrGNCCTrCTAGATGTGCOTCCCGCrTG 
OCTTCCAACGGCTTGTGCTCACTCTGTCTGCCAGGTATGAGAAGAACACGTAAGACCGCCACCAC 
ACrCACCCTCCCTCAAGGCCCTTOTGCCATAGGGGTOGCCACCCGACCTGCCCCCANAACTT^ 
ATACTGGANGCAA1TGCATA 
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SEQ ID NO: 1064 ACACTAATTTATrTCTTTAGTCTAGAAATAGTAAATrGTrr GCAAGTCACTAA 
TAATCATTAGATAAATTATmCITGGCCATAGCCGATAATmGTAATCAGTACJ-rri'ri-l'rriill 
TITTTTTrrCCAAGriTrTAAACTTrTTAmG 

ATCATTTGAACAAAAAAAAiVTGGCACTCTGATTAAACTGCATrACAGCCTGCAGGACACCTTGGG 

CCAGCTNGGTTTTACrCTANArrTCACTGTCXjTCCCACCCCACTTmTCCACCCCACT^ 

ACCAACATGCAAGTTCnTCCTTCCCTGCCAGCCAGATAGATAGACAGATGGGAAAGGCAGGCGC 

GGCCTTCGTTGTCAGTAGTTCTITGATGTGAAAGGGGCAGCACAGTCA'nTAAACrrTGATCCAACC 

TCTTTGCATCTTACAAAGTTAAACAGCTAAAAGAAGTAAAATAAGAAGGCAATGCTTGTGGAATG 

TACAGACCAGAGACAAAGCANGANAAGAAGCAGAGACTGTTGGCCCGGGCCGA 

SEQ ID NO: 1065 ACXirTTTTTCCTTmAATTAATACTAAATATTGTGATn^ 

AAAATGACCTGCTTGAAACTTTGATACATATTGGAATACATTATGTTAATAAACTrGTAGC^^ 
GTGAAAAAAAAAAAANAAAAAAAAAAAATAAANAAAAAATAAAAAANCGTANAAGNACTCCCC 
TA>riTAAAAAATAAAAAAAAAAAAAAATGGCCCCTnTNGCrAANGCTGGGA 

SEQ ID NO: 1066 A Cr J-lTl-l- Ii rT rn n 'i I' l 1 i in i i J AGGGGGACAGGAAGTANAATn^ATTG 
GTGAGTATTAANAGGGGGGCAGCACATTGGAAGCCCTCATGAGTOCAGGGCCCGCCACTTGTCCA 
NAGGOCCACGACTGGGGATGTACTrGTGACTACGTTTTTrCAAATATAGATAGATTTAAGCTGCrA 
ATTTTTITTrrAGTAATCACTACTATATCATGTCITrrACTCTGTTrAT^ 

AAGATATAGATATrAAACCTTGTGCTCATGCAACTrAGAGTAACATATACAGACAAATGATTGCAT 

GAGGCCATGTTTATATGTGTGACTAATAAGGCTTGTGATGATTAACATAATCCAGGTATGTCATTT 

CTGAAGANAATAGTCATCAAATTTATATCTCGAANATTTTAATT AAGGA ATTGCTTArrGTTGAOC 

TTANCAAArrAATAACACTATTTCTGGCACTAATTATmGAGGCCTITrAATACCACCT^ 

CCCCAAAACCCXrcrGCATTTGGGCACAATTTrAATTTAANAANAATTTNm 

ATATmAAATGCANACTTNAANCCAATATANAACATTrGGTrrGNGAAAATGGCATCATTCTGCT 

GGG 

SEQ ID NO: 1 067 AC J - iTn f l' l i rrnTl ' i - in i l l i l I QGGCATCAAAAAGCTTTATTTCCATrrG 
GTCCAAGGCTTGTTAGGATAGTTAAAAAAGCTGCCTATTGGCTGGAGGGANAGGCrTAGGCAAAA 
NCCCTATTACnT^GCAAGGGGCCCT^CAAAAGTCGC^GGGCTCAAAAGGCT^m'AATTCG 
GANAGTGANCCTTTCAAAAAGAAACTTCCCANCCCAACCTC 

SEQ ID NO: 1068 AC rri 'l Trn r i Trfl -l 1 1 r n 1 1 1 1 1 i CCA'ITKAATCTAAATGTTn' CTGCA ATT 
GTTTrrCCTTTAAACrmGCTTCAAAAACTCTTCATCTACrrG^^ 
TCTCGCATmCCCTGCCTTCrGATGGaTACCAGGGGAC CCTTT Nr nmT GGCT 
TCrGGATCAGTAATGTCCACATGCGGCTTCTGCAGNAAGGrmCTmriTGCTGATACTGAAACT 
CGGGGGGCl'CTGTrrNAAAAGGAATGAAAGGCCTNTGGTTITCAAAG<XCCCC^ 
ACCACTCGAGGAGTGACCmGAAACGTCAGCCTNATCATNAGATTCAAAATCAAAGGANCTGGA 
CAAGGATGAAAACAAAGGC 

SEQ ID NO: 1069 ACTGTCTTCTGGAAGCCGTATGGTTACTGTCAAATCATCTTCAGTCTGTrGCC 
AGTAATACAGAGGrrCTTrGATTrrCrCTGATATGTCTrCATCCATATTTTCTTCAAGAT^ 
AGCXn-GAACAAATGTGAAAGACnGTAGGATACAATCATTANACCArrrCCATCA GGCTCA ATAG 
CAGCATAATGTGGNACTGACTTrCCACGGAGAATATCACGCTTAATAATmC ATATTT TITATrAT 
CirGATTTTTCTTACTGATAGTOACCCACTCCAGANAAACATANAAACCACTTCCTmCAT^^ 
ANTrCCNCrnCTCTATTCNAAAGANGTAGNGGTANCTATANAAATGTTCrrCAAGCATTTAC^ 
TGAAAATACTG 

SEQ ID NO: 1070 ACAGAGGGTGCCCAGCAGGGTCTTCTACAGTGGCTGTTGAAGAGGCTGAAGG 
CAAAGATGCCrmGATGCCAACAAACTGTATTGCAGTGAAGTGCTGGCCATATTGCTCCAGGACA 
ATGATGAAAACAOGGAATrGCTTGGGGAGCTGGATGGAATCGATGTGCTTCTTCAGCAGTTATCC 
GTGTrrAAAAGACACAATCCCAGCACGGCTGAGGAGCAGGAGATGATGGAGAATCTGTTTGATTC 
CCTCTGCTCCTGTCTAATGCTTAGTTCCAATCGTGAGCGCTTCTGAAGGGCGAAGGGTtnrCACT^ 
ANAATCTCATGCTCATGGAAATNAATATCTTCCCGGAGCATT 

SEQ ID NO: 1071 ACTGCAACTGCCAGAACTTGGTATTGTAGCTGCTGCC CGCTG ACTAGCAGCTG 
GACTGATTTIXjAATAAAAATGAAAGCATTAAAGGGTrrCCCTACAAAACATTTTrcm 
TrrTGAAATGGCTATAAGCAGTTGACTTTCACCCTTGGAQAGCATCACACTrGTGTGAGGTrCAGTG 
ATTGTTGACCCTCCCCAGCCCCTCCTGCrrCTTTAAGTTATCTGTGTGCGTGCGCTTCCT 
TCTTTGCACGCTCATTrCTTmCTCTGACCCATGAGAAAGGAAAACTTACr^ 
ATAGTGTAATTATrCATTTATAGCATGTCAGGATAAATTAAAAAGAACATrrrGTCTGGAAATGCT 
GCCCGGGAGCCTATTGTQTAAAATGTAGGGArnTGNAANAATAACCTTGAAATrGNAAATTGAC 
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ACGTNGTTNGGTCAAATrGNGTCAAAGTrrAATTTGGTTNTGTTrcOT 

SEO ID NO- 1 072 ACAATTGTCTTTATTATCATATATGGTATGTATGTATGAI-ri CrnCCATTCCT 
CATATTTAGACTGTATATITATGTAGGTGTGAGTGATTGCCTGGTGCTTGCTTGTGCCA^^ 
GGCACCCTCCAACCCTGCCAACTTrrGTGGCCTCCCAAAGCATTCCTGTTACCAAAGAGGCTTCAA 
ACCTGACCCTCACTTCTCAGTGGACCCGAGmCCCTCCCATGCCATTATrrTCAGTGGGGAAGm 
TAGAGGTGAGCTGTTGGCCACAATATCAArmAAGTGTTCATAGCAGTTVi^^ 
GOCTCCTGGATTTACCACCAAGAGTCCCAAAATATTAATGCTCTTCCCTTTTTCTACCCrc 
ATAGTTGTATCTTATITmAAAATGAATrrTCATGGCCAGGCACAAGTGGCTACCCGCCTGTA^^ 
CCANCACTTTGGGAGGCrGANGCANGANAATTGCITGAACCAGAAGTTTGAGACCAGCCTGGGCA 
ATGTANTGAGACCCCCTGTCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAG 

SEO ID NO: 1073 ACAAGAAGCTCAGCGCAAAGCGTGCGGATTTGCAGTCCACCTTCrCTGGAGG 
ACOAArrCCAAAGAAGTTTGCCCGCAGAGGCACCAGCCTCAAAGAACGGCTGTGCCAAGATGGCT 
riTACTACGGAGGGGTrrCTTCTGCCTCGTATGCAGCTTCAATTGCAGCAGCTGTGGCTCCTA^ 
AGAAGATrCAAACCACTCrGAGTAATCTGGTTGTrAAGGGCACAAACTTGATCATCCAGGAAACA 
CGGCAAAAACTCGGAATACCCCAGAAGAGCCTGTCITGCTCTGAGGAGTTCAAGGAACTGATGGA 
CCTGCCGACGTGTGGAGCCAGGAACrrAAAACAACATTTAGCCAAAGCCACAGCrrCAGGGATTA 
TGGGOAGCCCAAAACCAGCCATCAAGTCCATCTCGGCCTCAGCACTCTTGAAGCAACAGAAGCAG 
CGGATGITGGAGATGAGGAGAAGGAAATCAGAAGAAATACAGAAOCGAnTCTGCAGAGCrCAA 
GTGAAGTTGAGAGCCCAGCTOTGCCATCTTCATCAAGACAGCCCCCTGCTCACCTCCAOGGACAG 
GATCCGAGTTCCCCANGCTGOAGGGAGCCCCOGCCACAATGACGCCCAAGCTGGGGCGAGGTGTC 
TTGGGAANGAAATGAT 

SEO ID NO: 1074 ACAAATCTGGAGTGGCTGTAAAATTCGGTTCTCAGAGATGAACTTGCAGATr 
CGGACrrrCAArrGTTCTGTTGTTITAGTTmCTTATCAACTGGGGAACTGmG 
GTTAAAAGTAGAGAAGAGCTTrrCATAGTrCCAACATTAGTTGTTACCGGGCGCAGTGGTGTGTGT 
CTGTAATCCCAACTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGAGGTGGAGGTTA 
CIX}TGAGCCGAGATCGCGCCATTGCACCGCAGCCrrGGGCAACAGAGTAAGACTCATCAGCTCX:CA 
ATCCCCAOATrCTCCAGTACAAACTCTrCATAGTTTACTTGACCATCACCATCAATATCTGCT^ 
TO^^TCATITCATCAACTTCTTCATCTGrrAACTTCTCCAAGGTTTGTCATCACATGGCGAAG^ 
CAGCACTAATATAGCCATTGCCATCCTTATCAAACACACGGAATGOTCTCTAATTCTrCTTCACTG 
NCTGNTCrrrCATTTTTCTTGCCATCATTGGCAAAAATTCAGGGAAGTCAATO 
AGCATCTACITCArrAATCATGTCCTGTAACTCTGCTrCTGTGGGArrCTGCCCAANANAT^ 

C 

SEO ID NO- 1075 ACGCGGGGGGATGGGAGGGGTGTTCATGATCAnTGGATATAGCAATCTACT 
CrGAGAAATGGAACACAGGGAGTTACCTATCACTTrCACrrATAATrCCAAAAGATGACTACAAC 
CATOTCCATOCrCAGATTCAAACAGTmCCATATCACTTITGGGTGGTAAGATGAm 
Gi'l T l 1 1 1 1 1 lAATTGGCAGCACCACTAACCATTCOTACATTCrriTmGTATGTGTGGrn^r^^ 
TTAmAACCCGCAGCCGACATCGTAGTTTCTTGTmGrrmJTTITAC^^ 
TATGrrACCATCCTAAAAAACACTATATTAAACATGQAATAAATTGTCrrTITATGAATrAGGCm 
rrGAACATCCTGTGTTGGGATnrmGTnTrCAATTGCAACAAAAGCTCTG 
TTTAAAGrrcACATAATCATCTGTAANACArrATGTATTITGTGGAAATACTANAATT^^ 
ATTTGCCATTATATCGAT^GCTATITmGA^TAATGCAAAAGTATATGACTITG^^^ 
ATAACCATAAATATTAAAAGTGTTGAATACTAACANTGCT 

SEQ ID NO- 1076 ACATGOAATCCTTTGAAGGTATATTCAAAGAA CAGTAT GATACCATCCATCG 
CrrGGAAACAAACAAGTTGCGAAATGTTGCTAAGATGTTTGCTCACCrm 
TCCATGGAGTGTTCTTOAATGTATAAAACTGAGTGAAGAAACCACTACATCATCCAGTAGAATTTT 
TGTCAAAATATTTTTCCAGGAACTGTGTGAATACATGGGTCTrCCTAAACTTAATGCAAGATTAAA 
GGATGAAACTCTGCAGCCATIXnTTGAAGGATTATTACCCCGAGATAATCCAAGAAACACTCGGTT 
TGCCATCAACTTCrrrACrrCTATAGGTCTTGGAGGTrrAACGGATGAACTGCGGGAGCAT^ 
AAATACACCAAAGGTCATTGTGGCGCANAAACCAGATGTTGAGCAAAATAAATCCTCCCCATCCT 
CTTCCTCITCAGCGTCCTCCTCTrCANAGTCrcACTCATCCGACTCT^ 

CAGrrcAAAATCrrCCAGTGAAGAAAGCGACTCrrCATCCATCAGTAGTCATAGCTCTGOT 
TAATGATGTAAAOAAAGAAGGGCATGGGAAGACCAGAAGTAAAGANGTAGATAAATTGATCAGA 

AACCANCAACA 

SEO ID NO' 1077 ACGCGGGAnTGACAAAGATGGTGATGGAACTATAACAACAAAGGAATTOG 
GAACrcTAATGAGATCTCTrGGGCAGAATCCCACAGAAGCAGAGTrACAGGACATGATrAATGAA 
G-rAGATGCTGATGGTAATGGCACAATTGACrrCCCrGAATTTCTGACAATGATGGCAAGAAAAAT 
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GAAAGACACAGACAGTGAAGAAGAAATTAGAGAAGCATTCCGTGTGTTTGATAAGGATGGCAAT 
QGCTATArrAOTGCTGCAG/ACTTCGCCATGTGATGACAAACCTTGGAGAGAAGTTAACAGATGA 
AGAAGTTGATGAAATGATCAGGGAAGCAGATATrGATGGTGATGGTCAAGTAAACTATGAAQAOT 
TTGT 

SEQ ID NO: 1078 ACrGTCACITAACCCCTATTAACATACGGTCTTCAAGCCrTCCAGTATCAGCG 
CCrmGCATAGCATCGCTGCTAGTCGTTTCACTAGCATGCTGGCrAGGAGTOT 
TTTCrrCTCCTTGATAGTGTCTGTACTATmGriTmGCTITGGTTATTGT^ 
AGCTATGAGCTGATCATTGCTCCTTCTCACCrCCTGCCATGATACrGTCAaTTACCTTA^^ 
GCTGAATAmAGTAGAAATGATGCTTCTGCTCAGGAATGGCCCACAAATCTGTAATTTGAAATTT 
AGCAGGAAATGACCTTTAATGACACTACArmCAGOAACTGAAATCATTAAAATTTTATTTG^^ 
ANTTAAAAAA 

SEQ ID NO: 1079 A Cri - iTi ' JTI l - l ' lTi ' ri ' n i 1 1 ' l ri in 1 1 AAAAT GTCCC AGGa'GCCTTCCTGTC 
CTCCCATCCTGCATCCTGCTTTCTGTTCCCTGGATACCGTAGGATGGrmArrTCAGTTCATGCAC 
AAArrA^r^CTGGACACTGNGGAGTCATAACAAGAGTGGGATGGAGGTTCCAGGGCCAATCAG^TT 
CTTTGGAGGGAATCTTGAAAAGNAGGrrAAAGCAATGCCCACAAAAGCCACTGCrrCTCOT 
ATCCCCAGCAATGAGCTAANAGCCAACCTCA 

SEQ ID NO: 1 080 ACrCTGAAGTCAAAGGCCCAACATTACAGAGCGCACCTCTGCCTGAAATACA 
AAACTAAGTTAACTAGCAAGTTACAQQAAATAGCCTTCAGTAAATrCCACAAGCCAAGTGGCTAC 
TGCATTGTCCCTGAAGAAGGAGGGCCCAGTGTrcmCTGGGTGTGTAAGGTCTTACTTAGTrCAA 
GGTTTGTCCCTrCTTAGrrGTAGGTTGGGCCCrcrrTCGTTO 

CAATTGAATATATGTGCACAGTGATGAAACCACCACCACAATCAGGATAATAAAOAAGTCCrrCA 
CCrCGAAAAGTrCTCTCCTGCCCCTGGATAATCCATCCCACTAATTCCCTATAGGCAACCACAGAA 
CrcCTrTCTGTCrCTGTTGATTAGTTTGCATTnCTAGAATmATATAAAT 
GTTCTTTTmAATCTGGCTTCCTn'ACTCAGCATGATTGTCTrGAGAm 

TTGTTCAAAATATGCTGAGTATAATGTGTTATCAATGCrCCATCXjCATTrrGTTGCTGAGTAGCATC 
TCAATGTATGGATGTAATn-GTTTATCCATTTGCCTGTTGGTGGANGAAGCNCCGGGOCATTrGGA 
TT 

SEQ ID NO: 1081 ACCTrrOTGCATGTTGCCTTCATTCCTGAGCAGGTATCATCCTCAGGGAACCA 
GCATGGCACCTACCAGGCCAGGCTCTGTTCnTAGGAGCAAGGAGCITCITGCGCTAACAOTrCTGG 
CCTGAGACCTGGATrGAGCCTTGGCAGACrrCTTGTCTAAATGTrGGCCATTCAGTCrcAGGCCCT 
CTGTTCCATGGAArrGGGAATCTCCAGGTGACCTAATCCTCATTGGTGGCrTGATOTTrGCrrGGTAT 
CTTCCAAACrCAGnCCCAGACTAGArrGATACCTGGAGCCCAGCTGCCTACTCAGCATTrCCACT 
TGGGTGCTTCATAGGCATTTCAAACCTGATGTGTrrAAAACACTrGATTAGGCTCCGGTmCCTn 
GGCITCTGCrrrTCAGTGAATGGCATGACTGCCTATGTGGGTGGCAAGCCACCCANGTGCCGAGGA 
AAGAGACTGAGGGCACGAGCTGITCCANTATAATAAAATATATAAAATA 

SEQ ID NO: 1 082 AC ! nr ni l l - i - lTl I 'l l U J l ll ll I GOTAAGGACCAGTATTGTAnTGACnT 
TTAAAAAACCATTnTACCCTGGAGCTAAAATACGAGGTTATACTGTTCTGCITAATACAATACTG 
GCAGCCTAAAAATGCTTCAAGAAACCATATCCCCAACAGCGGCAGAGCATCGGGAGGAGACCCTC 
TGTCTCTGAGGCTTCGGTGCACirCTGCTCAAACGGTGGCGGGAGTGGAGGTOjCTGCTC 
GACGGTGTGGCCATGACACGGGCAGCACGGGAACGGAAGACGCCGGA GACCCA GAAGGCGCATG 
ACTGCCrGGCCTCGAGTCACTAAAAGCAGTTTGATTTCACTCrrGTCTTACTmCTAOTATGGCTT 
CCTCTACTCTCTTAAACTIKrTCAGrrCGACAACCGAGTrCCCAGTATITGAGATAGAGCCACATG 
AGTCTGCCATKrrGGCGrrTGrrOGTGTTCCAAACATCACCACAGT 

SEQ ID NO: 1083 ACTGTAAAAGrrCTGACACAAGACAGTGTTNGTG GrTACT TTTCATCGACTTT 
AGCATOTGATCTCAGGGACrCAGACATACGTCTAAGTrCTATTCTGAGrnTGGCAACAGAGCAGT 
GACAGATArrrCTGAATGAACAATITITAGGTGrmCAGCCATTTGAAAAGTAITGCCAACACAC 
TArrrGGTGTTAGCfCAACAGTCACGTTGTGCCAAGAATrAAAGAACTCTAAAGTCTACAAACATC 
TTACTTCACCAAGACTAACTATAArrOAAGGGTITACrATTrGTTrAATAAAAAATCACACATCAA 
CrmATCCAAAACAGCAACTACrACAAAAGGAATGACAAGAAAAAAAATGACTTCACAGAAATA 
CACTAAAAAAATCTGACAATGTTATGCAAGAACCCGCCCAAAGrrTNAGTGTTTAATAATGAATA 
GCACACNTGACCAANGTCCAAGATGTGAAGATNOCCATGTTCAAGAAACTGGOGGGGAAATCACT 
CTACAAACTAACT 

SEQ ID NO: 1084 ACTGCTGCCGAAGTTGCCCCAGTTCCATGGGGTTCGTGTCTITGGCATCAACA 
AATACrGAGGGATGGGTrrrGGGACAGCTCCATGGGCATGGGGAAGGCACTOAAACAGAGGACT 
ATAAAACATCCTTCTCTTATrCTCCATACrGTCTTCTACACCTTTAAAGCCTGAGAACTATACAACC 
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cpr> m MO- 1 OR? ACAACCTTCAAACATTCCACTrmATAAAAAAAGGGOCACACAATCGTOGT 

^^SS^^^TCATTAGGW.TCA^ 

TGNTACGATTCCGGCA 



r??tJ^?JIL^^CT^CTO^TCTAGTrOANGTra 

cS™^SSi-TACcroGorrrGocn™ 

^AAm^AASSAGGAT/^AAACCAAAAGTATTAm^ 
§^?^T^^^SSC^^TGGAATCrAACAAAC^ 



SEO ID NO- 1088 ACrrCIGTGAGATTACGGNCGCTATGACATGGCrCAGOTCGG-^^ 



ggtcagtotctatctgi 

Sg^SS^cggcciHGG^j.^^^^^^ 



NGGAAGGCTGNGGGCANANAGAAATTACNTTK 



TGCraAGATITGCNCCACrGGNACCCTGCCCCGGNCNGGGNCGNTrC^^ 

^^^Itccccaotggg^^ 

SEQ ID NO: 1089 ACrAAAGGAGATAAOTGTAAGTriTCCXA^^^ 
ATTTCCATCTNCNTNGNNGCCCTN 



TTTTNT 
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SEO ID NO- 1091 A cniM 1 li 1 irni Til i i u i irn I ' l i n i ccttaaaatccatctgactggg 

TTAATTCAAAAAACCTGTmTCCACKn'CTNAAATTC^ 

GcrmcAAcnrnnrrnGTAATTcmCAATGAAT^ 

TT^TGATATCrCCITmATATCCCAAACTGTTmCTGATC^ 
ATCTCATTGAGCTTCrrrAAAACCAACATTCTNAATrCm 

TGGTANGANCCATCGCTGAAAAGTTAGNGNGANCCrrCTGOACCACTGTAACACACrrr^ 

"StiJaaaagtccnngtac^^ 

CTGTOCTANTGGATCCAACTOGGTCCAANCrrGGNGNA^^ 
SL^GimCCNCAACAATITCANAai^^ 

GNCCNNATAmGANANCNNACTCNNATNATTNGNNNNGCCTNNATGCCT^ 
C 

SEO ID NO- 1092 A Crrn -i l 1 1 1 1 U 1 11 UT ™GNTTmTITGGGGCAGTrCAAGmAATAC^ 
ACTACAXijVGATTA^^ 

CAACTOAGGCOTTO^ACCAAAGGAAGA^ 

^S^^taS^ccananttc^ 

•rriTTOTrrGGAAAAATAriTmTNTrrrrr^ 
^T^^WANCNA^r^^^GGGNNANACT^TCC^mTNr^ 

SEO ID NO- 1093 CATCOTACAAAGATTTCTGCNGTGATTraTGTGAAGAAGAGAACGTn^ 

SSStcatgaaWctc^ 

^SNSmG^T^ 

NATNGTANA^n"^NTNCNTT^^^nmCAAANGG^^W 
ANGGNCTTATGCGCTOATTATNAAAA 

SEO ID NO- 1094 ACCTGGTNAANCACTGTGGCAACATACCTCTCnTCmTATTAAT^^ 

NCACTCAA^^^ 

TC^O^GGTOCT 

A^GGAGC^^ 

mGATcScAS^TOSANCn™ 

NyaTGG™ANGAAAAGCACCCCCCATGGaqNAAACACTGCACATGA 

atS^Jtgtttcgat™ 

AAANArmGGTGGCGAr^TIT^^TNGGAATTAGAGGG^^^ATNAAAAAT 

SEO ID NO- 1095 ggtacgcggggattcttcccctctctacaaccctctctcctcagcoot 

TTOTGGm^ 
CCACCrrOCACAAGTACT 

SEO ID NO- 1096 ACTGCCCCmCACATCAAAGAACTACTGACAACGAAGGCCGCGCCTGC 
TCCCATCTCTCT^^ 

?§ggoSSaa^^ 

GTCCTGCAGGO^^^^ 

TTGGAATGCACAATTTTTTTAATATGCAAATAAAAAGTTTAAAAACTTA 

SEO ID NO- 1097 NGTACCACCATTTGNACCTrAACGAAGAANAANATCTTCAAGTI^ACCC^ 
Ai^AGAG-rrrrAAAAAAC^ 

tonSg^agtgatc^ 

GCCATCAGTmCrOAAATGGATATTrCANATTTGCnTNAAGAAAA'^ 

agtoIaaattco^a^^ 
tS^tgnto^^ 

^^XScGAA^^ 

TGl^NTOTGCNCNNANTCCCANAAACATrrGACCGGGAAGCT^ 

TAANGTNGGGGCCNCCCANTAATTGGNGGGCGCCCATGNCCGTTTCAGTGGGANCCNCGNCNACT 
NTG 
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SEO IDNO: 1098 GGTACGCGGGATGGACTCTGCCACTGCCCCCGACAAGAra^^ 
CT^ATGCGG^^ 

gtocSctSI?ta€c/Sgggaaggagc^^ 
ctc^c^caStgatcgWtngggaa^^ 

NTCCTCTCnT^TGrmGTCnTNCNA 
NTCCATciAOTAATAGNrrC^ 

ctnggggngggcgg 

SEO ID NO: 1099 ACTTGTNCCTAGTTTTTCAAGGTATTGGCrrGTTCTATAGATGC.^^ 
CAGGTCATCTAGCACmCTGTim 

ttaaIgggacttgaggaagc^^^ 

NGlS^CCTAGTmGmrCTGGrrCAATAG^^ 
S^SAGTGAAGcSGGTGACATAAAAACTArimGACG^^^ 

HgoSncS^ksatgccntoccc^ 

S^NGGrrrATGNTITAANCACAAAANGGGTTCNTrGGNAAACNGANNACT^^ 

aaanctcaaaaaatnttntitgtgna 

I^CTSAAACrGATTGNrrACAmTrATAANACAAACN^ 

GGCAGrnAACTTAAAATGCATAGACAATWCATNCmTAAATITGTCATrACTGNCA^ 
ATCACAATTTCA^^ 

ANQGCAANAAGTNATTATCTGNTGATCAm'GACAGGCA-TOA^ 
NTGAAAAGGCAANCTATCCTCTTACTTTGCACACrrGAArrGTTGCGCTGATCAACACT 

GCTGGTACA^TrrCCACCAANA 
TGCAACCAAATC 

SEO ID NO' 1101 GTrCNCGGGGAGAANCTTGGACCGCATNCTAGCCGNCGACTCNCACAjL 
Gi^TTGCCA^^ 

GCCAGAGANACCACAGNCANACCTGGAGCCATGAAGGACAC/^G^^^^^ 

CCCANACCCTCrrCAQAGGTTGGGGTGACCAACNCATTlX3GACrTCACACATATGAANAATCTC^^ 

GirmASrANANAAAAGAGNNTGC^^^ 
J?MlfinOGCCTTCTCAATACTTGG>rm 

GSS^CTAA^^^TmTATG^mTC^^^ 

nnantataaaanctng 

SEO ID NO' 1 102 A crrnT i u u nri - ii i ii n u 1 1 1 n 1 1 1 1 1 n u u ^ ^ccciTAArrGGGGG 

N^SS™CAGNNCCC^ 

NATTCCCCANTGTNAANCTGGCANCNCNCCnTJAGGTmTTGAGNGTrTATCAACT^^ 

ANCCCTGCCNTTCGCGTTTmTCAAGGGAAAAANCTTCGGGTAAACAATGGTAAi^^ 
ACTCACCGATAACAGTGCCCCAAAGGGCA^^ 

tggaaaSaI^^ 

IS?^^^g^5aaS^acg^^^ 
naaancgggttcntgccrrnataaaaaannnaaanacct 

SEO ID NO- 1103 ACACTTGAAACCAAArrTCTAAAACrTGTrmCTTAAAAAATAGTTGTT^^ 
ASfrAAAoi^TAACCT^^ 

Tf^GGTmNAAGTCTCAAGGCCNTGACAGANAA^ 
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NAGGNTCCGTAANCATAAAAAACCGGGTITGAArrOTCmTCCAANGANrrrCGCK^^ 
GCAAAAAAATGGTNA]WAANATTAr[XOTGGGGNANTNCNTGCCC>r^^ 

NNCAAA^^IT^r^IT^OT^AACccAATTT^^ 
TTcrrrnTTT 

SEO ID NO: 1 104 ACrGTTTATTAACCAACCAGCTTAGAAAAATAATCATGGTAGACACCTTAGTT 
CATTCTrCTAATAAGCCTGTTGATCTGGTCCrCCCTGTTGCCAGCATCTCCACCTTCTACAA^ 
GTGGTC r i-I-LTCTTCATrCCACCTCGTGGAGAAGACAATTTGAAGGGCCACAGGAAGTTATTTGCC 
TCirraAAGCGTTTTCCAACAGTATAGATCTCATGAATCAAATCCTCCATGCANATC^ 
TrACCAANAGATCTANCAATCAAAGCGTrATCTGTCAAANCAATTTNCTrCTTATTGATm 
NACCANGCnTGNAGATTAAGOTCATTimrGACTTCAGATTGGGGTACCCTTGNTCNTNG^ 
GCTAATGGGCNAATITCATCTATOATTTOCOTGNCNGTrNCTTANCNNGANCC^ 
CTNATCT^'GGTG^^'ANAT^r^^^•GNATITANCTGTTTCCTCGTGTNA^ 
AnrCATAANNTNATANCNANCG 

SEO ID NO- 1 105 A Ci - rrn - rn 1 1 1 1 1 1 1 TG TiTn ' iTi i i ctcagctaagcccatacagaagaacc 

AGACTGATGAGGAAAAAGAACAGGAAGAAATAAAAGGAAAAATGAAATAACTAAGriTGAGAAA 

GAATCCrGGAAAAAAGTAAAAAATNCAAAATAAAAAGGAAGGCAGAAGGAATAGGCNCTCAAA 

GGAAGGAAAAAAGAANGAAAAATWAGGCTAmGGATAGTITAGTTAAGATATACTGAAAOT 

TGTGCCAACCTGNANAGAAGATCAGCCAATATAAAGTCAAGCCTCCCTTACCTTTACCTTAC^ 

CrGCCATCCCANrGTCACCTGGGTCrGGTCCCTAmrrCj^^ 

TAGAATmCAAACrGGTCCTTTATGNNANATTGCTWl'l'l Hill NCmTNCATnm"CCCTrTNTAN 

NTAATNCTfWCTTTTOGGCTOAAANGNmmGTCCmA^ 

TTAANNGAATTGCTTNNNCXrrTTTT 

SEO ID NO- 1 106 ACTAAATGGTATCaTAGATTAAAATTrrGTGCTTGATAACAGCTGKITnTC 
TACATTAGAAATAAGATGCCACACAAGGAACTACATTCCAGATTTAAAGAAATGAAAGGATACCA 
mGTGTGTATAACAGATTATrGTrCATACnGTAAAGCATCTrATGTCATTGAGAATATAAAGAA 
CAGTGCC^^AQAAGAC^^GTGAAAGGTAAGCTCTAGCTTAATGTCr^ATGATITGr^C^ 
AGGAAGGTAAGGATTGGTCAGAGGATCTAACTTGATGTGAGCAGTAGTAAACCTGTTTrAGATAT 
CATACTGNTAATATTTTATTGAAAATrrAmCAGAGCGGAGAAACITAAGCTAAAGTCTGTTATA 
CAGAATrGAAAGCCnTCGTATCrroGACCTTCCAACCATTrm 
NAAGCTAAATNGNlTrAATACCACTTTCCTTTGTACCTTTGGCCGCGAACACGC 

SEO ID NO- 1 107 ACTTCGTGTGCTCCGACCCATGGTGACGATGACACACCCTGGTOGCATGCCC 
GTGTATGTTGGTITAGCGTTOCCTGCATTGTTCrAGAGTGAAACAGGTGTCAGGCT^ 
ACACAAArmTAATAAGAAACATTTACCAAGGGAGCATCTTTGGACTCTCTGT^^ 
CTGAACCATGACTTCGAGCCGGCAGAGTAGGCrGTGGCrGTGGACrTCAGC/£j^^ 
TGCrGTTCAAAGAAATrACAGTTrAOTCCATTCCAAGTTOTAAATGCTAGT^i 1 1 1 1 1 1 1 1 U 1 1 1 1 
TCCAATAAAAAGACCATTAACmAAAAAAAAATAAATT^AAAAAAAAGTACCTNGGCGTNACCAC 

GC 

SEO ID NO: 1 108 ACCAGTAAGGCTGGATCrrACAQAGAAAGACrATGAAATACTTITCAAA'^ 
ArrAATGGAATCCCTTTCCCTGGAGGAAGTGTTGACCTCAGACGCTCAG^^^ 
AAAATATnTATAACTTGTCCATACAGAGrnTGATGATGGAGACTATITrCCTO^^^ 
TGCCrCCGGATTTGAAGAGCTTTCACTGCrGATTAGTGGAGAGTGCTTATTAACTGCC/^^^ 
TGTIGACGTOGCAATGCCGCTGAACTTCACTGGAGGTCAATTGCACAGCAGAATGTTC^^^ 
TCCTACTGAGTTGriGCTGTCATTAGCAGTAGAACCTCTGACTGCCAAmCCATM 
TTCCGTGAAGAATmAC^LATGAATGAAAANGrrrAAAGAAAGTITmAATNGT^^^ 
AAT^NCAGAATGGGNAAGA^TGGN^^^^ANTT^CACCAr^GGATGGA^^^AAAGTATCCC^ 

GOGNGTCCAAG 

SEO ID NO: 1 109 acgcggggatgcxaaggtcatgaaggatgcaaagacgaagaaggtan^ 

GTCAAAGAAGAAGGCTGTTCAGAGACTGGAGGAACAOTTGATGAAGCTGGAAG'nCi^^ 

GAOCGAGAGGAAAATAAACAGATTGCCCTGGGAACCTCCAAACTCAATTATCTGGACCCTA^ 

CACAGTGGCTrGGTGCAAGAAGTGGGOTOTCCCAATTGAGAAGATrrACAACAAAACCCAG^^^ 

AGAAGTTTGCCTGGGCCATTGACATGGCTGATGAAGACTATGAGriTTANCCAGTCTCAAG^^ 

AGAGTTCTGTGAAGAGQAACAGTGTGGTITGGGAAAGATGGATNAACTGAGCCTCAOTGCOT^ 

GTCCTGGGGGATNAGAGGCAThWAGGCTTNNCANTNCCCANCATCrm 

TGGAAAT^mT^TAANGGlWAACTTAATCAGT^KTCTATTGGNC^ANc^rl I lAAAAATATTTNNN 
ATTTCAAATTTN 
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TTCCTGTGGCAAAGTACCTCTATAATTTGGGTGACCAGTATGCACTGAAGATGAGGTTTGTGGACC 

ATGTGTTTGATGAACAAGTGATAGATTCTCTGACTGTGAAGATCATCCTGCCTGAAGGAGCCAAG 

AACATTGAAATTGATAGTCCCTATGAAATCAGCCGTGCCCCAGATGAGCTGCACTACACCTATCT^ 

GACACATITGGCCGCCTGTGATTGTTGCCTACAAGAAAAATCTGGTANAACAGCACATTCAGGAC 

ATTGTGGTCCACTCACCTTCAACAAGGTGCTCATGCTGCAGGANCCCTGCT^ 

CTACATCCTGTCTCACCGTTATCATCTATGGrcGGTGGACTTCTTCATCCCAAGGATCCA 

SEQ ID NO: 4323 ACTCTCTGCAGATGGTCCAAAATTGTAATGGAGTCTGTATT AGAA GAAAATA 
AGGGTAAAATCAGGCTGAACTGCATGTATATGGCTCCACTGTGGCTTGTGACACTTTTAAAATC 
CCGTATGTCAGTGTATCTGGATACACGAGGAAAAGGAAAGAGTCTCAGAGTGGAACAAAGAGTG 
GGAAGAGGTGATCTGTAATGTNACAAATrGTGCTATTACTCCAAGGTlXAACT^ 
ACATGG 

SEQ ID NO: 4324 ACCAGGTGGGGAGAAGTGTAGCAAATCTCAGTGCCAATTTGAGGGGAAGCC 

agtcattccaggagaagagctcaggggaaagagctgttgactrrcataatgcagtcttaa^^ 

cagtcaccctcctgccacatggcagaagccaggtggcagtgatggtggtgggggaaaca 

cacagtctctggcaagccccaccgggaaaggagggctca 

seq id no: 4325 actatccctgtaactgccaagagctcaggagccaggctagtgatcacaccag 
gggttagagttcactgctgaactccctgatggcaggtctgtgtttattactacattaaaacaaagt 
ctctgacttataaagcgaggtcgtaaaaattacaagttgcatgactgaaaaaatgc^ 
aaaatcagtcatatctttaacaccaacaagcaatttcccaccaacgaatgtagt 

seq id no: 4326 acagagtcttrrgcttcctcccacccctagggggaaaaactgcmgtc 
gggaagttgtctctgaaacccggggacagaggacgcaggacagactaggagggagccgggagga 
tgggctgcagctgtggaggagggtttcagaggagagaggtcggagagcanaggcctgagaagcc 
agaggcaggtggagagagggtggaaagtgagcancgggctgggctggagccgcacacnctctcc 
tnccatqttaaatagcacctttagaaaaattcacaagtccccatcca 

seq id no: 4327 acataagc\taatcagttatggacagcttcrrgtataaattgctattcagcaa 
tacataaactgccrrcaaagamatgcttacaggtagacatrcaatitaccaataaaacagcatgt 
tctgaaaatatgggcacattitaaaacatattaagacagttctgttaaccataatagtcccacagt 
atgactgagtaataagaatctacttcaaaagaaaaaaaaaaattaatcagtatagtgcatgat^ 
attcaacatagttcccagggaacagaccagtcact(>fattgcagactccttcataccagccatcat 
cattcttcrrnataacataaatgattgcaccctccataaatgacagctcatcatcct^ 

TATAATCATATATTGCACAACnTrCTCAATATAATTCTTGGGGGCCCAAGC^^ 

ATATGGATCATTATACTGAACTACTGCAGCCTCCTCATCTTCATAATCCACTGGTGGTGG 

GGGGANGT 

SEQ ID NO: 4328 ACTGGCACAGCTCATCTGGAGCCGAGCCTTAGGCTTCCCTCTAGAAAGGCCC 
AAGTCCATGAGCACAGAGGGTCTGATGAAGTTTGTGGACTCTAAGTCAG GGTAA AACTGGAGACT 
GGGTGAAAGTGACTACCANAAAGTGAGGAAGCCrAAATAAAAAGTATACU"i"riGTTTCAGGGGGC 
CnTAAAGACTTAAGATTAAATTATATCTGAGGCACTGATAATATGTTTG^^ 
TAAGACTTTAAAAGATGAAAAATGGTCCCTTCTTCCTAATCANCTNCCTTCCCCTGCCT 
GTTGGCCCATCATACNCATTGGTCCTGGANGATGAC 

SEQ ID NO: 4329 ACGCGGGGAAAGGAGAGACAATTATGTTCCTGAGGTCTCAGCCTTGGATCAG 
GAGATTATTGAAGTAGATCCrGACACTAAGGAAATGCTGAAGCTTTTGGACTTCW^ 
AACCCTCAGGTCACTCAGCCTACAGTTGGGATGAAmCAAAACGCCrCGGGGACCTGTTTGAAT^ 
TTTTCTGTAGTGCTGTATTATTTTCAATAAATCTGGGACAACANATANA^^^ 

SEQ ID NO: 4330 ACATCnTTAGAAACATCACrmAGCTCTGTGATCAGTCnTrGAACAATCAT^ 
CAAATGTGGAGATCTAGATTTAAGATCrrmGTTTTAACTGTATCAAATGTCCTO 
TTATTTCGAAAATGTGGAAACrCAGACAATCTTTGGTGAAGATTACnTAT^ 
TTATTTCCACrCTGGCAGTrrTGATAAGTGCTGAAATATTCTTmAAGAGACTGGTm 
TAAGCTGAAATTCTGTGTCTGTATTTCTITAAATTITITC 
rrCCATATTCTACTTGCAAATCATTATATGTTGCCTCCrTTGCAGTTCCT^ 
TCVVTATTAAOTCCAAACAATTTCTTTGATGGTCNGANOANTTCCTGGAAGCCT^ 
GCCTCCGGAATCT 
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SEQ ID NO: 4331 ACGCGGGGCTTTTTCGCAACGGGTTTGCCGCCAGAACACAGGTGTCX^ 

ACTACCCCTAAAAGCCAAAATGGGAAAGGAAAAGACTCATATCAACATTGTCGTCATTGGACACG 

TAGATTCGGGCAAGTCCACCACTACTGGCCATCTGATCTATAAATOCGGTGGCATCGACAAAAGA 

ACCATTGAAAAATTTGAGAAGGAGGCrGCrrGAGATGGGAAAGGGCTCCTTCAAGTATGCCT^ 

CrrGGATAAACTGAAAGCTGAGCGTGAACX3TGGTATCACCATTGATATCTCCTTGTGGAAAm 

AACCAGCAAGTCTOGGATTACAGGTGTCITACAlTITmATC^^ 

SEQ ID NO: 4332 ACl-lU' lT ! UU n-l- I T l T r i' l -ri HH 'rr r i'ri l 'nn'GAAAACACANTCrrTGCTCTGTO 
CCCAGGCTGGAGTGCAGTGACACKATCTCGGCTCACTGCAAACrCCGCCTCCCAGGTTmACNCC 
ATTATCCTGCCTCAGCCTTCCGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCTGGCTAAT^^ 
TTNGTATTTTNANTAAAAACGGGGrrrCACCCGTGTTANCCAGGATGGTCrCAATCTCCT 
GTGATCTGCCCrCCrCGGCCACCCAAAGTGCTGGGATTACAGGNGTGAGCCACTGTACTT 

SEQ ID NO: 4333 ACGCGGGGGATGAGGTTrmAAGATTATGCCATTNCANAANCAGACCCNTGC 
CNGCCATTTCNCCAAGTTCAAGGCATTTGTTGCTATCGTGGACTACAAATTGCCA 

SEQ ID NO: 4334 ACTTGGGTCGCrGTCTACTGCTCCTTCATCAGCTTTGCCAACrCT^ 

GGAGGACACOAAGCAAATGATGAGTATCTTCATGCTGTCCATCTCTGCCGTGGTG ATGT CCTATCT 

GCAGAATCCTCAGCCCATGACGCCCCCATCGTGATATCAGCCTAGAAGGGTCACATTrrGGACCCT 

GTCTATCCACTAGGCCTGGG(nTrGGCTGCTAAACCTGCnrGCCTTCAGCTGCC^ 

GAATGAGGCCGTATCGGTGCCCCCANCTGGATAGAGGGGAACCTTGCCCTTTCCTAGGGAACACC 

CTANGCTTACCCCTCCTGCCTCCXrrCCCCTTGCCTTGCTGTNGGGGG 

SEQ ID NO: 4335 ACTGCTACTTGAATAACTCAGTTAACGCTGTTTTGAAGOTACATGGACA^ 
GTTTAGGACTTCAAGATCACACTTGTGGGCAATCTGGGGGAGCCACAACTTTTCATGAAGTGCAT^ 
GTATACAAAATTCATAGTTATGTCCAAAGAATAGGTTAACATGAAAACCCAGTAAGACTTTCC^^ 
TTGGCAGCCATCCirmAAGAGTAAGTTGGTTACTTCAAAAAGAG 
TTATTTTAAGAGGTATITCAGTTTTAAATGCAAAATAGCCnTATTTTC^ 
AGTOAGCTTTCAAACACrATTTAATCTTTATrrTAACTTATr 
GGATTTGGANTNAAAAATAACTTTCCCTITOTCCGGA 

SEQ ID NO: 4336 ACAGAGGGTGCCCAGCAGGGTCTTCTACAGTGGCTGTTGAAGAGGCTGAAGG 
CAAAGATGCCTTTTGATGCCAACAAACTGTATTGCAGTGAAGTGCTGGCCATArrGCT^ 
ATGATGAAAACAGGGAATTGCTTGGGGAGCTGGATGGAATCGATGTGCITCnTCAGCAGTTATC^ 
GTGTTTAAAAGACACAATCCCAGCACGGCTGAGGAGCAGGAGATGATGGAGAATCTGTTTGATTC 
CCTCTGCTCCTGTCTAATGCTTAGTTCCAATCGTGAGCGCTTCCTGAAGGGCGAAGGGTCTT^ 
TGATGAATCTCATGCTCAGGGAAAAGAAGATCTCCCGGANCAGTGCCCTGAAAGTGCTGGACCAT 
GCCATGATTGGCCCCGAAGGCACAAACAACTGCCATAAGTTTGTTGACATTCTT 
ATCTTTCCCCTTTTATTGAAATCTCCCANGAAGATCAANAAAGTGGGAACCACTGAN^ 
TTAAGAACATGTCTTGTTOSfATCCrGGCrmCCTCCTGGCGNAC^^^ 
GCTTOTGAATAArrCACTGAAAATGACAGTGANAAAGGrrGACNGACTATGGGAGT^ 
AATATCTNGGGG 

SEQ ID NO; 4337 ACTACCAGAGCGAGGAGCAGGCAGAGGAGGAGCTCCTGGACATGGCGGTGC 
TAAAGGACTACATTGCCTACGCGCACAGCACCATCATGCCGCGGCTAAGTGAGGAAGCCAGCCAG 
GCTCTCATCGAGGCTTATGTAGACATGAGGAAGATTGGCAGTAT(XGGGGAATGGTITCTGCATAC 
CCTCAACAGCTAGAQTCATTANrCCGCrrANCAAaAANCCCATGCn'AAAGTAGATTGTCrAAC^ 
AAGTTGAANGCCATTGATGTGGAAGANGT 

SEQ ID NO: 4338 ACTGAACTGGAAGGAAGAGGAAAAAAAGAAATATTACGATGCTAAAACTGA 
AGACAAAGTTCGGGTCATGGCAGACTCTATGCAAGAGAAGCAACGTATGGCAGGACAAATGCTTC 
rrCAAACCTTTTTfAACATAGACCAAATATCCCCCAATAAAAAAGCTCATCC^ 
GACCACCTGAGTCAGGAGAATCTACAGATGCCCTCAAGCTTTGTCCTCATGAAGAATTCCTGAGAC 
TATGTAAAGAAAGAGCTGAAGAGATCTATCCAATAAAGGAGAGAAACAACCGCACACGCCTGGC 
TCTCATCATATGCAATACAGAGTTTGACCATCTGCCTCCGAGGAATGGAGCTGAC^ 
AGGGATGAAGGAGCTACTTGAGGGTCTGGACTATAGTGTAGATGTAGAAGAGAATCTGACAGCCA 
GGGATATGGAGTCAGCGCTGAGGGCATrTGCTACCAGACCAGAGCACAAGTCCTCTGACAGCACA 
TTCTTGOT 

SEQ ID NO: 4339 ACTTGCATGTAGGACAACTCAGTTAGAAAAGTATAGTGAATGOAT GGAA TCT 
ACTGTATGATAAAAATGCrACAAACACCATTTAGTTGCCATCAATAAGAAArrrACrrGT^ 
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AAAATCCAAATGCTGGCATTGTCCAGAAAAATTTAACAGGTrrATTTATA^ 

ACCGCTGAAACTTGTTCACTGAAACATTTTAAClTGCATrAATGCm 

AAAAATTCACACACAAATGAAAATGGAAAAACTGCCCCCGCGT 

SEQ ID NO: 4340 ACGAAAAAGATATGACrCTGCAAGCAAAACAGTGTAAGCTGCTITrTTT^ 
AGACCrGGACATTTTAAGACAGAAGCTTTGCAAAACATTACACAATTT^ 
AGTCTCATTrGTTACATCGTCACATTGCTAGTCAGAGAAATGTTGCAGTGATGAAGA^ 
TTGGACCAACCCAAGTCCTCATTCCTACAACATTCATrTACAAAGAA^ 
AACAAAACATTCTTGGNTTTCTrCATATTGAAGCCCCCCAAAAATCCCTTOT 
CATAAATTATAGGTCTTCATITrTCAAATCCCCCCAACTGTGTGAGAAATGTTTC 

SEQ ID NO: 4341 ACTGAAGAGCTATrACAATCCAAATimGCCGTTTCATAANTGTNATAANTN 
ATACTAATTCACAAAGTATTGTNTAATGGTGGATGACAAAAGAAAATCrGCTCTGTGGAAAGA^ 
AACTGTCTCTACCAGGGTCANGAGCATGAACGCATCAATAGAAAGAACTCGGGGAAACATCCCAT 
CAACAGGACTACACACnGTATATACXrrcrrTGAGAACACTGCAATGTGAAAAT C^ 
TTATAAACTTGTCCTTAGATTAATGTGTCTGGACAGATTGCGGGAGNAAGTGATTCTT^ 
TAGATACTTGTCACTGCCTATACCTGCAGCTGAACTGAATGGT 

SEQ ID NO: 4342 ACACACACGGAATCATCCTATTCATCTCCAAGCCACAGAAGGATGTATCAGC 
CCTGTAGCTAAAAGGATGTGTGGCGGAATCGTTTGGTAATGGCAGTTTTGAAGATGAAGTCTGAA 
AAATOTAANATGATTTTGCCATCTTCTTGAAATCTTTAAACAGGGNGGA/^ 

tggcag^rrccagccitacaacatcttccacaaagngngggtntgtggccagtgagot 

aggtgggtagttgacaaaattcctccaccantgataatacictgatgctctc 

tactccctgttaacaaggtcccrrgagaccttgccgatritgttgatccatgtgccaa^ 

gctacgtagatcactagagtcacaccagggtgccgactcaaaaactctctaatagcctgggagca 

ttcccagcagggactccaggacaagaaccaggtgatggagcagctgatggatgggtgaaaatctc 

tttctgacgtaaatttttttataaaattaacitccacgtgattggtggtgti^ 

cagatcrrccggtcatgccccacrco^tttcgtaaagcanacaggcctc^ 

gtcata 

seq id no: 4343 acctctcaacarraccaaaatcatttctttagagggaaggaataatcattcaa 
atgaactttaaaaaagcaaatttcatgcactgattaaaataggattatm 
ttttatatgaattataaactoaagagcrtaaagatagttacaaaatacaa aagt tc^ 
aataagctaaacgcaatgtcatttrraaaaagaaggacttagggtgtccgttttcaca 
tgttgcatttatgatgcagtitcaagtacttotcatgctgctgaagtccttctccagctg^ 
gccacccgcgt 

seq id no: 4344 acaaagccctaagtttattgtaagtgaaaactgagggaattcctgtcnt^^ 
aggaataatgattcatagatctagataggtggaaatatcattcaaaatagtcacrrgagctcaca 
aaaaaagcaaggaagaattctcatgtcctttgtcttccritctgtag(xattaacrg 
gtgaggaagacaggcttcccttccttcccccrccttagtgattttt^^ 
aggactttctggttcatttttgtttgttttgttttc 

gttgcccangctggagtgcggtggctattcacagatgctatcatagcacactacagcctccaactc 

ttgggctcaagcatcacgcccagcagtttctggttcctttaacagcaaaaggaaagagagg^ 

gattcttacctcagggttttttggtrgtrcattgttttta^^ 

cacaaggctaaaggtracagctgagatcntggaaccaaaggcagagcaagcagagcccgttgtc 
tggccccacaccactgcaggcaggtggatanaaotgcngnccctctcataotatgcccata^ 

AGG 

seq id no: 4345 acgcggggctcactgagcaccgtcccagcatccggacaccacagcggccctt 
cgctccacgcagaaaaccacacttctcaaaccttcactcaacacttccttccccaaagccagaaga 
tgcacaaggaggaacatgaggtggcrgtgctgggggcaccccccagcaccatccttccaaggtcc 
accgtgatcaacatccacagcgagacctccgtgcccgaccatgtcgtctggtccctgttcaacacc 
ctcttcttgaactggtgctgtctgggcttcatagcattcgcctact(x:gtgaagggacagga^ 
ggttggcgacgtgaccggggcccaggcctatgcctccaccgccaagtgcctgaacatctgggccc 
tgattctgggcatcctcatgaccattggattcatcctgttactggtattcggctct 
ccatattatgttacagataatacaggaaaaacggggttactagtagccgccatagcctgcaacct 
ttxscactccactgtgcaatgctggccctgcacgctgggctgttgccctgccccitggtx^ 

seq id no: 4346 actacattrtataacaatagagagtagctgaaaatactacatgctaacacag 
ataatatgatacacaacctcaggggggaagctggcagggagcacgtggcagaggccacaggttt 
agactaagagccirrcaatggactgctgaatggattggatctgctgttrcagctgcgagcot 
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TGATGGTGACAGAACAGGCGATGACAGGCCTGGAGACCCCACAGGCTCTCCCCAGGGCCTGCTTG 
GAGCGCACAAACACGTAGGGCACATrCTTGTCTTCACACAGCAGCGGCAGGTGCAGAATGATCTC 
CAGTGGCrCGGCGTCTGCAGCCATCACGATGAACTCAOAGATGCCCCTGTTGAGGGTTrrGGT^^ 
CTCATTGGCrcCirrCCGAAGCTGCTTATAGTrACATGACTGCTGAAC^ 

GTGAGGTGGGCATCGGCAAGGGGATAGGCCTTTGGATTCACATCACCTCAGTCATGGCTGCGGTT 

CCGCGGGCTCAGCACTCCTAGGGGAGCGCAGCTGACGTTTCAAAANCACTCGCGTGCCCCCCCX^ 

GTACCTTGGC 

SEQ ID NO: 4347 ACGCGGGCGCTCCTAAmCAATATTGTTGCGTTTCAGGGAATAGGGAAGCC 
AGAGGAGAGGGAGAGAGATAGGGGAAGAGCCATTCAGTGAAGCAGTCAGAACACACACAACATT 
TAGTGATTAAGTTCCCCACCTTATATGGGTGCAATTCATGGTGTGCCAAAACAACATAATAATAGC 
ATCAAACGTCACTCATCACAGATCACCAAAACAGATATAATAATAATAAAAAAGTTTGAAAC^ 
GCAAGGATTACCAATATGTAACACAGAGACTCAAAATGAGCACAAGTCATTGGAAAAATGGAGC 
CAOAAGACTTGCTCGrrACAGGGrrGCAACAAACCTTCAATITGTAAAAACACAATTA 
ACACAATAAAGCAAATCACAATAAAATGATGTATACCTGTATACGACTTACTGGTATTTATCT 
GACTACTATCAGCAACATCCTTAGATATTGTTTCTAATTCCAGAGTTGAACCATCTAGAATG/^^ 
CCATTGCAGTGAAACTCANTTGAATTATTCCACTTAATTTGCTTTCTAAATGAA^ 
AACTCACCAGTGTGTGATTTGGTTGTAATCATATAAAAGAGTAAATTTATCTCTTTGCTC^ 
GACTATTG 

SEQ ID NO: 4348 ACCAGAAGACCTTAGAAAAAGGAGGAAAGGAGGAGAGGCAGATAATTrGGA 
TGAATTCCTCAAAGAAirrGAAAATCCAGAGGTTCCTAGAGAGGACCAGCAACAGCAGCATCAGC 
AGCGTGATGTTATCGATGAGCrCATTATTUAAGAGCCAAGCCGCCTCCAGGAGTCAGTGATGGAG 
GCCAGCAGAACAAACATAGATGAGTCAGCTATGCCTCCACCACCACCTCAGGGAGTTAAGCGAAA 
AGCTGGACAAATTGACCCAGAGCCTGTGATGCCTCXTCAGCAGGTAGAGCGGATGGAAATACCA^ 
CTGTAGAGCrrCCCCAGAAGAACCTCCAAATATCrGTCAGCTAATACCAGAGrrAGAACTTCT 
AGAAAAAGAGAAGGAGAAAGAGAAGGAAAAAGAAGATGATGAAGAGGAAGAGGATGAAGATGC 
ATCAGGGGGCGATCAAGATCAGGAAGAAAGAAGATGGAACAAAAGGACTCAGCAGATGCTTCAT 
GGTCTTCAGCGTGCTCTTGCTAAAACTGGAGCTGAATCTATCAGTTTGCTTGAGTTAT^^ 
ACGAACAGAAAACAAGCTGCCGCAAAGrrCTACAGCTTCTTGGTTCT^ 
TGAGCTGCACAGGAAGA 

SEQ ID NO: 4349 ACTGTTATGTGTGGGTGAGCTGACCTGGATGTAGATGTTTTCCTCTCTCTTGCT 
GACCCCTCCGCCAGTTTTGTCTTGTGATGCCATTAACACATCTCTOrCTTTCTGA 
CCATTGGTGTCCCAAGAAATCGTGAGAATAGTTAGCCCCCCTCTCCCCAGCCTGTTGCm 
GTAGTTGrrCACAGTAGTTGAGAAGTTGAAGAGCTTTTGCCTATTGAAGGTGCACTGAGAATAAAC 
TCTTTCCTGCCACCAGAATTGCAGTGGTTCACGGCCTGCACTCATTCCCATGAATGCAGTTAATAG 
CCACAGAAATGTCACATrAAGCAAAGCAGCCAGGGTCTCATCGTGTTGAGACTCGAGTCTCTCAA 
ACCTTGGATTCATTCCCTGGTGTCmGAGCX^TCAGTTTCCTCATTGGTAT^ 
GTGTCTCACAGGGTCATTACAGAGATTAAATGAAATAAATGAAATAA CATAGACC AGGAGGGCGT 
GGTGTTTAAAAQTCACANATGGGGCACCCTCGGCCATCCANCCCAGTGTTTTCTTTANCCCCTATG 
ATGTTCArrmGGTATATCCCATTAGGTGCCCATATTTAAAATTGGGAGATTrCACATAA^ 
AAGGC 

SEQ ID NO: 4350 ACCAAGTAAAAATGCAGTCAmTGGATGAAAAATGTATAACATGGTCACAT 
AGrrCCCAGATTTCTTAAACAGAACATAATGAACAAATCTAATTAGCCTTGCOT 
CGTGAGCmCAAGAGGCAAACTGTTATOATOTCGAAGAAGCACAATGCrrCAAATATTGCTGT^^ 
AGTATTAACTCACTGAATGAAC^ACTATACAAAGCAGACATAAGAACrC:^ 
GGATCATTTACTAATGGGAAAAGAGATATAAGAGATATAAGTGGGCCCTGATGACATAACTGTTT 
GCCrCAAGATGCTCAGTCTAGTGTTAGAGTCAGACAAGAAAAATCAGTGGTTATAATTTCAAAAT 
GTGTTGTGGTAGGAGTAGGCAACACAAAGTGCCATGGAAAAATTTCAGAGGCACACTCAAGTTCA 
GCCTTATTACATAAQACATACmGAAAAGGTGACACTTGAGCTGAGATm 
AATTAATGATGCAAGTAATGGGGGAAGGGCATTCCAGGTATAAGAAATAGCACAGACAAAAATA 
TGGAGAGTAAAAAGACATAGCACTTCAAGGGAACCCAAGCAGGTTGGCAGANCTAGAACAATGG 
ATCAAAAGGTNTTTTT 

SEQ ID NO: 435 1 ACGCGGGGGAAGGTTCAAGTGGAGCTCTCCTAACCGACGCGCGTCTX3TGGAG 
AAGCGGCTTGGTCGGGGGTGGTCrCGTGGGGTCCTGCCTGTTTAGTCGCrrrCAGG^^ 
CCOTCACGACCGTCACCATGGAAGTGTCACCACTGCAGCCTGTAAATGAAAATATGCAAGTCAA 
CAAAATAAAGAAAAATGAAGATGCTAAGAAAAGACTGTCTGTTGAAAGAATCTATCAAAAAA^ 
AAAAAANAAAAAAAAAAAAGGTC 
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SEQ ID NO- 4352 ACGCGGGOTGACTGCTCAGGAAATTTTCCAAGACAACCATGAAGATGGACCT 
ACAGCTAAAAAATTAAAGACTGAGCAAGGGGGAGCCCACTTCAGCGTCTCCAGTCTGGCTGAA^ 
CAGTGTCACCTCTGTTGGAAGTGTGAATCCTGCTGAAAACTTCCGTGTTC TAGTG AAACAGAAGAA 
GGCCAGCTTTGAGGAAGCGAGTAACCAGCTCATAAATCACATCGAACAGTTm 
AAACACCGTATTTrATGAAGAGCATAGACTGCATCCGAGCCTTCCGGGAAGAAGCCArrAAGr^ 
CAGAAGAGCAGCGCTTTAACAACTTCCTGAAAGCCCTTCAAGAGAAAGTGGAA^ 
AATCATITCTGGGAAATTGTTGTCCAGGATGGAATTACTCTGATCACCAAAGAGGAAGCCTCT^ 
AGTTCTGTCACAGCTGAGGAAGCCAAAAAGTTTCTGGCCCCCAAAGACAAACCAAGTGGAGACAC 
AGCAGCTGTATTTGAAGAAGGTGGGTGATGTGGACGATTTATTGGACATOATATAGGTCCGTGGA 
TGTATGGGGAATCTAAGAGAGCTGCCATCGCTGTGATGCTGGGAGTTCTACAAAACAAGTTGGAT 

GCNGCCATTCA 

SEO ID NO: 4353 TGTATGTATCCTGTTGACnrn'CCAGAAATTTriT 
GAATTTAATCAGACTTTCTGATTAAAGGGTmCTnCr^^ 

GGTATGAATTTCTOAAAAAAAAAAAAAAAAAAAAAAAAANGTCGCGGGCACATGTGTC^ 

GCATGGACAATCATGTGTAGGCTGTAAAAGGCANAAATCTTAANCCACTTGAATATTTGGGATGA 

GTCTAAGTTTTCCAGGCGAAGGAAGAGGGGAGAmCTGTCCAAATTGTTTCCGTTAATGAA^ 

CTGAAAATCCCCATACTGACrmGAAAAGTCAAGClTACATCANAAAGTCAGCAG™ 

TGGAAGGAAAAGAAGTTCATGATAGCTTCACGTCAAGGAGAGATGAGTTGAATCTGCCAAAGACC 

GAANGAGAAGAAAAAAAAGAAGTNAAGGTATCCTATTATTTAAACrCCAAAGGCATTGT^ 

TCAAAATTGGGAGAKAAAGGAGGTGANAAGGAACGGAGGTGAATCTTCCrrTCTAGAGAAACrC 

AACATATAGGNCTAAAATTGATAAA 

SEO ID NO: 4354 ACTTrmA(>TTTAArrTTTTmAAGACAGTT^^ 
AGTGTATTCTAGTTGCAAAGTATGCACACATATCTTGAATGGCrrrATT^ 
TGAACV^CATGACTGTGATGCACAAATTCTTTACGTGTAAGGAGTCTATGCAT^ 
TTrrATGATCGGGTGATGAGACAGTTATACmCAACrGCCATTATT^ 
TCTrrACAGrrArrATAAAATTGTATTTATTTTATACAGATGGGTm 
GTITACTrCAGCrrGTTGACCmCTTTGTCTTATXnXjCATGT^ 

TAAAGGCTGTGGCAACTGTAATTAATTTTTGTAATGGGCTGGTCACACGTGGATCTGGm 

TGCATTTGGGATGATmGGTAACCAGATCACCnTITCAGAAATTTAGATC^ 

CATTTTCTCAACAAAAATTAATAGCTGGTTCTATTTTrmAAAOT 

TTTCAAA 

SEO ID NO- 4355 ACTTAAATGAAGCATATTCATGTAATGTGCl i i i 1 1 n 1 1 1 IGGCCAGCmTC 
TAAGCAAATAGArrGTCrGAATTAGTCACAGAATAATTTTGTGAAAATTCA 
TACCCrmnrrrmATATATTTrTAAGGTAT^^ 

TGTTTTCATTAATCTTCGACCTGGAGAGTGAAATACrGATATTTCTAGAAAAAAAT^ 
GATTArrrGAAATGCTGAGGAAAATGTCCCCCCCATrAAAACTrGTAAATAAGGAACTATATC^^ 

rrCAGTAGCTGTGTTCTGTTCCATCTTTTTTITIT^ 

AGTGCAGCGGCACGATCrTGGTTCACTGCAACCTCCGCCTCCCAGGrrCAAGCGATTCT^ 
CAGCCTCCCGAGTAGCTGGGGACTACAGGTGTGCGACACCATGCCTGGCTAATTTTTTTGT^ 
TAAGTANAAATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCA^^ 
CCCGCCTTGCCTCCAAAGGCTGGGATCACAGGCGTGAGCCCCATGCCCGGNCCAmi l i l 11 1 1 iT 

SEO ID NO- 4356 ACGCGGGGGAAACGGAAGTGAGCGGCGGGGTCGACTGACGGTAACGGGGCA 
GAGAGGCTGrrCGCAGAGCrGCGGAAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAGGACA 
GTArrCCAGTTACTGAACriTCAGCAAGTGGACCTTrrGAAAGTCATGATC^^ 
TTTCrrGTGTGAAAAATGAACrmGCCTAGTCATCCCCTTGAArrATCAG/^^ 
CAACCAAGATAAAATGAATTTrrCCACACTGAGAAACATTCAGGGTCTATTTGCTCCGCTAA^^ 
ACAGAIGGAATTCAAGGCAGTCCAGCAGGTrCAGCGTCTTCCATTTCTTTC^ 
ACTGGATGTTTTGAGGGGTAATGATGAAACTATTGGATTTGAGGATATTCTTAATGATCCATCACA 
AAGCGAAGTCATGGGAGAGCCACACTTGATGGTGGAATATAAACTTGGTTTACTGTAATAGTGTG 
CTGrcATGGAAACCGAGGGCTGCATCTTGTTTATAGTCATCTTTGTCCTCGGNCGC^^^ 

AGGGCO 

SEO ID NO- 4357 actgaggataotgcaaaggtcaactcacgggacacaaaaaacgattgttta 

CATTCCAGTrCAACAACTTAGGCAATACTGATATCAACTACATCAAAGATGATACC^^ 

AGATrrGATGATAGGCAGOTAGGCTAGATGAAAGATCTTTrcrrGCrrrGGArrGGG^^ 

TTGAAAAAAAGATATTrTGATGAAAATGCTGCTGAGGACTTTGAAAAA^^ 

TAAACCTCCTAAAAAACCCmGTGAAATTAAAAGATTGCArmAACnm 

AGCTAGGTGCTQAAGATCCCTGGTATTGTCCGAATTGTAAAGAACATCAGCAAGCCACAAAGAAA 
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TTGGAirrATGGTCCCTGCCTCCAGT 

SEO ID NO- 4358 ACCGGCCAAGCCTGGTCCCCnCTTOTTGGGCACTGTGTATGGGCGGAGAAA 
ATCCAGCrTGTTCTrcCTGATGACGCAGAGGTCAATGTTGCTTCCGGAGCCCAGGTCGTTGAA^ 
GCCAGCTGCGATGGCTTCGCTCACCAGATTCnTGGCTTCCTCCTCCTCCATGTCrGGC^^ 
TCTTCAAATACAGCCATTGCrcCCAAGGAGCCAGAACCCATGGTGACATAAGGCAACTTATCAGT 
TGATCCATGAGGATAGATGCTGTAGAGGTGAGGTCCAGTAACATCTACTCCCCCTAAAACTACGG 
CToS^AATCTAACCrrGATACCTGAAAAQCATCTGOT^ 
SiS^ScaA^GAGAGGGAGTGGAGCTCC^^^ 

CTCrraTGTCTGCAGCTGTCCCANCACCACAACAATAAATATTAGGAGATATGAAGT^^^ 

^CAOTSoTCAGCAACAACCATCCCrrCAKTroCTOT^ 

CTTATAGACCCCGCGT 

SEO ID NO- 4359 ACCAGCAGCTGCAGGATGCTTCCATGCAGTGTGTGTTGACCTTTGAGGOCCTO 
ACCAACAGC^GGACAGTCAGGCCAAAAAGATAAAGATGGACCT^ 

CAGTOAGCCAAATCTCTACXATIWAGGCTCAAAACTGAAGGAGATCmGACAAGATC^^ 

CT^Crcr^i^CCTGTTCAATCTGOTCGGGGCTCTGTGTCTGTCA^^ 

CTGGAOTTCTTCAATACAAACTGGCAGAGAAA-m^^ 

SatgaaS^gca^^^^ 

^ggI^cmtcttgctc^ 

gga^gS^tgcctttcga^^^ 

A^^^AGAcScmCTA^CGCATGTC^GGGATGATrc^ 

TS?^f^COV^^AAACCGACAGGAGATTCArc^ 

TTGGCACAAAT 

SEO ID NO: 4360 ACrGTTGTCCATTTCATGAGAGTAGGCTTGAGGACACCATGGOCMG^^^ 
oi^QOTOCCAGCCrAAGCGTTrrAGA<^^ 
A^SAGGATAGGGTCTCAAGATATAATCCrrmATAGGCG^^ 

rCAG/^S^GGATCAAGCXrrGCTrGGAAGCATGCTGGGATGGCCATrrGGAAGGAGTGTTGCAA 

^^tgocctt^ggg^gccaggagcttaaggg™^^ 

CT^^^XScn-t^GCTTCCAGGAGGAAGAG^^^ 
SBr^CroGAATGGTGAACCa^TGGAGAGG^^^ 

T/^TCACCGGGTAGGGTCTGGTCCATraAGGCTGTANAGTTTGAGGGGTCANACTTTrGAG^ 

GGACTGATCGTCCAGCTAGGGGTGTCITCATATGGCTGGGAATCTCCCCCGTAAGAGTrArc^^ 

(XTCGGCATCTrGTOATGGTCCAAGGGGCTTCTGAGGCGATCGGGCACATCA 

SEO ID NO: 4361 ACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCC^^ 
GGACTGCAOAGAGTATAACGCAGACAAGGCCATCGTGG.^^^^^ 

CCCAGAAGOTGTTrGATGCGGTOGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATT^^ 
ATGGTTTCTGGACT^^ 

ctaaaS-ctccatctacctgaga^^^^ 

S^ASTTCAGCCCATGATQGGGGCCGGCCTCAATTATGAATGTTACCAATr^ 
CATCCACAAATCCGCTGGTGATCGGTGCCACGGTGA^ 

SagaaSag^^^^ 

AAATT^GGG^CTTrCrCACANAGGATGTAGC^^ 
GCTCAT^TGGATTGTGTCXn-ATGCGCTCATGAACGTCTGTGGAArc^ 

GTCCTGC 

SEO ID NO- 4362 ACGCGGGATTGTTGCVUGATGGCCGCGCX^CAGTGATGGATtQ^^^^^ 
AACG)^GCGGTOGGOAGCAGGCAC^^ 
GoSGGAAACAAGATCGGAGGrc^^ 

StSSgtaggoaagac^^^ 

?GGA^GGACCCTGGGGAAGCGCGGCCAGATATCACCCACCAGAGm^ 

?G^ScAOA^CCGAATTC^^^ 
5SiSSCTS5^SAGCAGCTGl?G^^^ 

^^ScmSAGrrobA^^^ 

CGTGAGCTOTGCcS^NCANTG^ 
TGTGGAATNTA 

SEO ID NO: 4363 ACCCCTITATGGCTCTrCa:AGGTrACATGAGAACTG^ 
CrcAAGTAGVGCCCTGAArrAGTGTTCATTAACAAACT^ 
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GCrCAATrCAAAGATAAAGGGGGGTTAAACATTAAGAAAAATGGCATTTTrm 

TACCX:CAAGAGAAGTTAATAGGTCTACCTTGTAGATGCAGGGAAATCAAAAGGTCAAACTATAAT 

TrTTTATAAAAACTAATTTTITmACGCTrAA'l 1 111 i 1 1 1 1 GAGACAAGGTCTTGCIT 

TGrrGCCCAGGCTGGAGTGTAGTGGCACGATCTTGGCTCACTACAACCTCCACCTCCCAGGC^^ 

GTGAACCTTCCACCrCAGCCTTACCAATAGCAGGTACATGGCCACAGATCATCAAACC^^ 

TTOTGAAATCATGAACAAGATGAAGTAAAGCrGGACGTGAGTITGGAGCACCTGTCATAACA^^ 

CACTGTGGCCTAAAGTmrCACGTGGTCTTCCACTCCAAAAAAOACG AATO 

ATTCANGTAANTCAGGGCTTGTGTAGANGATCCCCAATTCACATCTGGTTmGNAGG 

ATA 

SEO ID NO- 4364 acgcggggaaacgacaggotaaaggaggtctcactgagcaccgtcccagca 

TCCGGACATCACAGCGGCCCTTCGCTCCACGCAGAAAACCACACTTCTCAAACCTTC^ 

ttccttccccaaagccagaagatgcacaaggaggaacatgaggtggctgtgctgggggcaccccc 

CAGCACCATCCTTCCAAGGTCCACCGTGATCAACATCCACAGCGAGACCTCCGTGCCCGACCATGT 

CGTCTGGTCCCTGTTCAACACCCTCITCTrGAACTGGTGCTGTCTGGGCTrCATAGCA^^ 

TCCGTGAAGTCTAGGGACAGGAAGATGGrrGGCGACGTGACCGGGGCCCAGGCCTATGCCTCCAC 

CGCCAAGTGCCTGAACATCTGGGCCCTGATTCTGGGCATCCTCATGACCATTGGATTCATCCTGTT 

ACTGGTATTCGGCTCTGTGACAGTCTACCATATTATGTTACAGATAATACAGGAAAAACGGGGTTA 

CTAGTACCCGCCATACCTGCAACCTTTGCACTCCACTGTGCAATGCTGGCCTGCACGCTGGGGOT 

GTTGCCCCTGCCCCTTGGTCCTGCCCCTAAATCANCANTTTmACCCACCACCTGTCTANAGT^^ 

SEO ID NO: 4365 ACGTCTACTGTAAAACCTATCCGAGGCACAAGCAGAGACAGATGTAGACCCT 
TTCCCrCCAGAGTCACGCACATACTCGTCATCXiCATCAOTGGGAGAATGGTTGTATCTTATGGAA 
GGAArrATCACATCAAGGAGTCAGGGGAAAGTGACTGGAAGCAAACGCCCTAAAAGITACCCATC 
ACGTrrCAGTGTAAATGAGTAACTATAGAAGACATTGCGTTATCTTATTTCCAAAACGTTCCAACT 
AAAAAACATTTTCCTATTAAAATAGACCrrCCAAAAAAAAAAAAAAAAAAAAAAAA^ 
TGTGACCGTGGTGCCCACGGGCAGCITCTGCAGAACATCCrCATCCTTCAGGGACTTGCCCrrGG^ 
GTCCAGGCGGAGGGACTGGCGGCGGGGT 

SEO ID NO; 4366 ACnTrrrri T l T r J - l i ' i J 1 1 J I ' n TNGGCATTTACAAATAGTTTATAAGANAAT 
CATTTGGGTGAACAATmCATITCACAAAATAAATAGCTCATATCCAAAAAAGACACC^ 
AAGCAGGGAAAAAAAAGCCCAACAAAAATAAAGTCATGCCCGTGGCACAGCCCTAATGTTAGTTr 
TAGGGAGGAAAACACCAACACTGAGACATACAAAAGTNTTTGAGGAGAATTACATGTTATTAAAT 
TTGTCCTAAAACCTGGGGGANAGTTCAGGGAAAGAGAACAGCTGGTATATrrAANAAAGATTTAA 
ATAGAAAACArrACACATGAGAGCAAAATAGTCCATTTCTCTTAAGAAAACCAATAAACACTCAC 
AAACTrrCATrmAGGTTrTCAGCAATGTnrmCACATTAGTCCAACTGCCACCCCA^ 
ACCAGCCAGCATGACAATTAAGGTGACAACAGGTCCAGTTGGCCATACTCCCTGTGCACCCTTGG 
ATGCTTTTCAAGCCAATGAGGGTAACATTTGTGGTGGCAGG AGCCA GAGTCAAGGGAGAATCACT 
TCTCTTCACTCANGAGGAAACCTCCn'CrTGAGCTrm'CGGAAri^ 
G7TGGGGT 

SEO ID NO: 4367 ACAGTAGTCTGACGTATTTCCCCTTCTGTCCCCTAGTAAGCCCAGTrGCTGTA 
TCrGAACAGTTTGAGCTCTTmGTAATATACTCTAAACCrGTTAT^ 
GCAGAACCXTTTGAAAAAAAAAAAAAAAAAAAA 

SEO ID NO- 4368 ACAAAAAAAAAAAAAAAAGCCTCAGCATTTTATCATTCCATGGAAGGAGAAT 
CirTTGAAAGAAAGCATTGCCTCCTACC^^ 

GCATACAAGTAATGTCGCTAGGGCrrAATAAGCAGCCGTTTGCTAATGTGCTTCCriTCAAAGGG^ 

TGGACCmAAATTGCTGCAAAAGGTAAATrGTATTTTTTTTTAAQTATTGGTGTTC^ 

CTAGGCrAAAATTTGCTAAATGCCrrGGTITCTrrrAAAAGTrCATGTAATATTTCT^ 

AATArrrGCAATAAGAGTCTCGArmAAAAAACACATGCATACACACAATTAAGAGCTCATGTCT 

TAGCAAGATCTGGGAAACCAACArrGCGAGAGTAGCTATTTTGAAAGAATAATTCTCCAGAAGTT 

AACATCTAATATCTAGTATCACCAAACAGTATCGCTGTrCTCrriTATTCATrrGAAATGAA^ 

rrATATAACTAACAATTGCCAAATAGATGAGAGAGCAAATCATGTGAGAAAATTCAGAATACCAT 

CTGTTTCATAGTCGCACAGATTITGGACrmCACAAACATTGGGAACTAANTrAAAA^ 

GTCTA 

SEO ID NO: 4369 TCCGGATTTCTAACAGTCCTTGCTTTGGGGGGTOTGCTOACAACTTAGCTCA^ 
GTGCCTTACATCTTTTCTAATCACAGTGTTGCATATGAGCCrrGCCCTCACrCCCTCT^ 
TTTGCACCTGAGACCCTACTGAAGTGGCTGGTAGAAAAAGGGGOTGAGTGGAGGATTATCA^ 
ATCACGATTTGCAGGATTCCCTTCTGGGCTTCATTCTGGAAACrmGTrAGGGCT 
GTGCCCACATTTGATGGAGGGTGGAAATAATTTGAATGTATTTGATTTATAAGTTrrri ri 1 1 1 iGG 
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GrrAAAAQATGGTTGTAGCATTTAAAATGGAAAArmCTCCITGGm 

ATrcrCTGTAAGTGTAGCTCAAATAGGTCATCATGAAAGGTTAAAAAAGCGAGGTGGCCATGTTA 

TGCTGGTGGTTAAGGCCAGGGCCTCTCCAACCACTGTGCTACTGACnTGCTGTGTGACCCTGG^^^ 

AGTCCTTAACTATAAGGNGCCTNATITrCCrrCTGTTAAAATGGGGAT^^ 

CAAA 

SEO ID NO- 4370 ACGCGGGGAAAAGCCGGGGCAGAAGTGCTGGTCTCGGTCGGGATTCCGGGCT 
TGGTCCCACCGAGGCGGCGACTGCGGTAGGAGGGAAGAGGTTTTGGACGCGCTGGCCTCCCGC^^ 

ctgtgcattgcagcattatttcagttcaaaatgaactatatgcctggcaccgccagcct^ 

GACATTCACAAAAAGCACTTGGTTCrGCrrCGAGATGGAAGGACACnTATAGGCTT^ 

CATrGATCAATTTGCAAACTTAGTGCTACATCAGACTGTGGAGCGTATTCATGTGGGCAAAAAATA 

CGGTGATATTCCTCGAGGGATTTTTGTGGTCAGAGGAGAAAATGTGGTCCTACTAGGAGAAATAG 

ACTTGGAAAAGGAGAGTGACACACCCCrrCCAGCAAGTATCCV^.TTGAAGAAATrCTANAAGAACA^ 

AGGGTGGAACAGCAGACCAAGCTGGAAGCAGANAATTGAAAGTGCAGGCCCTOAAGGACCGAGG 

TCTTTCATTCCTCGAGCAOATCTCTTGATGAGT 

SEO ID NO* 437 1 ACATCTCTGGCACAGATGCTATTGGTCCTTAATGTCCTGTGATTTTAGGA^ 
AGTTTGGATTTAGTTCAATITATTCAGAAACCAAATGTGTTTAATTAGCTTC^^ 
GTAAGGGTATGCTGGmAGTATCTTTATAAAATATATATAATGTATAGGTAAATCATAGTCnT 
ATCATACCTAAAATACTGTATCATTTAAACCAATATCTATTTTAACAAAATGm 
CAGCTGATGAAACACTAAATCCAAAGGTATGACAAAGAGGACTCAGTCTGTGTTGCTTAAAGAA^ 
rrTTAAAAGGCATGTGCCCATAACGTCATTCGAGGAGAGGGAAGGAGCCCCCCTTCTC^^ 
TACTATAGAATTCCTGTTGGGGGTGGCCANAAATTANCACTTCTAAATATTTCANANA^ 
GTGTTAAAAAAAAGTCTTrrATGCAAAAGAACGTCTTTCTGCATCANTGAA^^ 
AATTTCCTAATANCACAAAAAACAAAGCCAAGTCTATACATTTGATCANATATCAAAGTAATGG^ 

AGGTGGCTATTATGTCACCTANCA 

SEO ID NO- 4372 ACCCn^CAGAAAACCGACGACCACCAATAGCATCTATTTCATCCATAAAAATG 
ATGCATGGTTGATGATCTCTAGCATAATTAAAC^TrrCTCTGATCAAACGAGCACm 
TACTTdCrCTrrGTAAAAAAAAATAAAAmTrrrGAATTATrCTACCTTTG 
AAGAATCATCTTTAAGAAATTAAGCCATTTACATGTTTGTGT T^ 
TGCATTATATGTTTCAACCTAGTCTAAGTGGGTCTITmACATTm 
ATACAGCGATATAATmGGrrGTCAAATrCCTAATGCAACCATTTAGTCTAAACTTAG^ 
TTGTGACAATAAGATGTGTTCAGGGGCTCCCTGTTTlTAAGAGACTCTm 
ATGTITITATCnGAGTCAATATGAmGGTATmGGAriTACTTTTAATCTTAA^ 
TTATAGCTTCTCANAACATGTGGATGGGATGGGATTTTCGNTATTTTGCT 
ATATATGGACTATTCCTATAACCAAAGTCTCTGACAAGGTGCACXrrAAriTATATT^ 

SEO ID NO- 4373 ACTTTGAGCAGGATAATAACATAAATrrCATTTAAAAAGTTGTATTTAT^^ 
CCAGTAACCGGAAAGAATTATAAGTAArrATGGAAGTATTATATTCTGACCAm^ 
AAACAAAGAGrrCCTACTAAAGAGGAATATITrCAAGATGAT^^ 
ASArTGTITGTTITAATAAAGATTCTrrTGCAAATAA^ 

CTTGATGAAAAAATCTTAAAAATGAACCACTCGTGGTTrAAGAAGGGGGGAAAAAA/^ 

GCTACATATrGAAGTrCTAGAATGCAGCACCTCAACrrCACATCTTCCATAAGCATTTAAG^ 

GAAATCCAAGTGATGTCTTGACGATCGAATCACACTTTATAGTTCrrCCAATO 

TAGCATCTCATCTCAATTAAAAGTTCCCTAATACAAAATATTGTGCTAAAGAGTGCTAAG 

CATGCTOCTGCCATTCX:CTGGCCCGTTCATGCTTGTAGCTGCCX:CATGAGACAGCAAAA^^ 

GGCTCAAACrCCCACTGTATACnTrCCTAACTCCATGCTTCTC^ 

TTGGTrr 

SEO ID NO- 4374 ACAGCTTrGTAGTATTTTTAAATCTATCCAGGATCTTITACT^ 
CTGTATOTGTGGTAGAGAGAAACAGAAGTCCAACCTACTATAC^^^ 
GA^CACTTAGGACGAGATGTGTTCTGATGlTGGAGrrAAGTGCATAT^^ 
GGGACCTAGGTCTATACAGGAAATTCATTTATGCTTCATATACATGTACTT^^ 
TTTTTrGGGCTOCCAAAAGCmATTCGCAAATATGCT^^ 

TCTAAGTCAATGGAATGAAGAGCTGTGTCCAGGGACACACCACGCCGTGCTGAAGGAGA^ 

rrGTGTCCACCTCrrArrCATAGACCCAGTCATGAGCACAAGACT^^^^ 

GGCTTAAACCATAGGCTGATTTCrrrTTCAGCACTrmACT^ 

TGCCAACCTGAATGCAAAAAGTCCCCNCGAATGGTGCCTGCTITGA^^ 

CAANCATNACTCGGCCTGNrmACCACGTTCAGCCCCTCCAAACCATGGCCACAACCGGCC 

ANT 
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SEO ID NO- 4375 ACTTCAACKKiAGGGCAAGAAGTGGAGAGTGGAAAATCAGGAAAATGTTTC 
CAACCTGGTGATTGAGGACACAGAGCTGAAACAGGTGGCTTACATATACAAGTGTGTCAACATO^ 
CATTGCAAATCAAGGGCAAAATTAACTCCATTACAGTAGATAACTGTAAGAAACTTG^ 
TTCGATGACGTGGTGGGCATTGTGGAGATAATCAACAGTAAGGATGTCAAAGTTCAGGTAATGGG 
TAAAGTGCCAACCATATCCATCAACAAAACAGATGGCTGCCATGCTTACCTGAGCAAGAA^^ 
TGGATTGTGAAATAGTCAGTGCCAAATCTTCCGAGATGAATGTCCTCATTCCTACAGAAGGCGGTG 
ACrrrAATGAATTCCCAGTTCCrGAGCAGTTCAAGACCCTATGGAACGGGCAGAAGTTGGTC^^^ 
CAGTGACAGAAATTGCTGGATAAGCOAAGTGCCACTGGGTTCTTTGCCCrCCCTTCACACCA^^ 
ATAAATCTGTATCAAGACGGTCTTTTCTAGATTTCCTCTACCT^^ 
CTCTGAAAANACANCTACCTGCCTTCACTGAAATATACCTNAGCTGAAAm 

AGTCAATTG 

SEO ID NO- 4376 ACTTTTTTTTTITITn 

ATATACAGTCTATGTGTTTANAAmGTGTTGTAAGTAAACTACAGCm 

AA 

SEO ID NO- 4377 ACCACATCCTGCTATGACTCCCGGGCCTGGGTTATCCAGGTCCCATTGAGTGG 
TTTTCCrCTTGGCAGArrCTCAAACAGTCGCAGCTCTTTGGA 

CTCTCTGGGAGAGCCGCTGTTCCCnTCCTGTAGCAGCAGCATTTATGAATGGGGTGAATGGGGCT^ 

TTGTCGACGGCACAGCTAATGCCCGAACCCAGCCCCTGTCGGCAGAGA^ 

ATGTGAATAACAATGTTTTCTGTmAAGGGTGTCAGGAGTTTCGCTI^ 

CTGCAGTAGTAACTCTTCTTTCTCTTGAGAGTAAAAAATGAAATAAAAT^^ 

AAAAAAAAAAAAAAAAAAGTACACAATAAACATTAAATTAATATAGCTGTTCrTAm^ 

ArrAATATATTTATTTTCATATAAGCATATAAATAATAAGCrrAAGCATATTAT^^ 

AATTATTCTCnTrGGACrGCTGTAAAACATTTrCAAGATTCCm 

ACCTGTTrAAATTGACTCTTCCTTCTATTATTAGCCTTTAAATCCT 

TT 

SEO ID NO* 4378 ACAi^TTGAGACGCAGGAAGCAGCTGAAAGAGCTATrGAAAAAATGAATGG 
AATGCTCCTAAATGATCGCAAAGTAmGTTGGACGATTTAAGTCTCGTAAAGAACGAG^^ 
AAOTGGAGCTAGGGCAAAAGAATTCACCAATGTTTACATCAAGAATTTTGGAGAAGACATGG^^ 
GATGAGCGCCTTAAGGATCTOTTGGCAAGTTTGGGCCTGCCTTAAGTGTGAAAGTAA^^^ 
GAAAGTGGAAAATCCAAAGGATITGGATTTGTAAGCTTTGAAAGGCATGAAGATGCACAGA^ 
TGTGGATGAGATGAACGGAAAGGAGCTCAATGGAAAACAAATITATGTTGGTCGAGCTCAGAAy^ 
AGGTGGAACGGCAGACGGAACrrAAGCGCAAATTTGAACAGATGAAACAAGATAGGATCACCAG 
ATACCAGGGTOTTAATCmATGTGAAAAATOTGATGATGGTATTGATGATGAACGTCTCCGG/^ 
AGAGTTTTCTCCAriTGGTACAGTTCACAAGCTTCAGGCAAGGGGCAGCCrGAACTA 
TGTTCAGGCAm'CCAGCNCAGCAGGTCATTCACCCrTTTTCNCT^ 
TGACTCCTGGAGGGA 

SEO ID NO- 4379 ACAGTTTAAACAACAGCTGAAAGAACTAAAGAAGCAATGTGGTCrrTC 

GACAGAGAAGCTGACGGAACAGAAGGAGTGGATGAAGATATAATTGTGACCCAAAGTCAGACCA 

AOTCACCTOCCCCATTACAAAGGAGGAAATGAAGAAGCCAGTGAAAAATAAAGTGTGTGGCCAC 

ACCrATGAAGAGGACGCCATTGTTCGCATGATTGAGTCCAGGCAAAAGCGGAAGAAAAAGGCCT^ 

rrGCCCrCAAATrGGCTGTAGCCACACGGATATAAGAAAGTCAGATCTTATCCAGGATGAAGC^^ 

TTAGAAGGGCAATTGAGAACCATAACAAGAAAAGACATCGTCATTCCGAGTAGGAAAAGreA^^ 

GCCTGCAGGGACACCAGCAGCCTACCTCCTACCCCAGCTGTCTGTTGAGAGCAGTGCT^^ 

GCAGTT 

SEO ID NO: 4380 ACTACCAOATAGAAATTCTGAAArrGGAAATTGGAGGCCAAAGC^ 

GGACTGCAGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCrGCTGCG^ 

CCCAGAAGGTGrrrcATGCGGTGGTGGAAGCrGTGGCCCX}CGCATCTCTGATTCCAGA^^ 

ATGGTTTCTGGACTGGGTCCCAGCrGGCGTGCTGGACGAArrCGGAAACACCTTGGTCn^ 

CTAAAATCTCCATCTACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCC 

AGCTrrACATTCAGCCCATGATGGGGGCCGGCCTGAATrATGAATGTTACCAATTCGGCAm^ 

CATCCACAAATGCGCTGGTGATCX5GTGCCACGGTGATGGAGGGCrrCTACGTCATCTTCGAC^^ 

gcccagWgg^^ 

AATITCCGGCCrrrCTCAACAGAGGATGTANCCAGCAACTGTGTCCCCGCTCANT^ 

CCCArJTTOTGGATrGTGTCCTATGCNCTCATGAACCGTCTGTGGAGC^ 

TCCTGCTG 
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SEO ID NO' 4381 ACCATATAATCCAACTATCATGGTAAGGCCAGAAATCTTCTAACCTACCAGA 
GCCTAGATGAGACACCGAATTAACATTAAAATTTCAGTAACTGACTGTCCCTCATGTC^^ 
ACCATCCCrrcrGACCCTGGCTrCCAGGGACCTATGTCTTTTAATACTCACTG 
GTTGCTTCTAATCCTTArrTCCCATGTGCACAAGTCTTTTTGTATrc 
TACTGTGGAATATTCATTTGACATCTGTCTCrmCArrTCT^ 
rmGCACCTGCTGAACTTCATITCTGTATCACCTGACCTCTGGATGCCAAAACGT^ 
TGTCTGTTGTAGAATTTTAGATAAAGCTATTAATGGCAATATTTTm 
ACTGTCACrAGGGCAATAAAATTTATACrCAACCATATAATAACATTT^ 
AGrirnATTTTAAAGTCTTAGCAAmCTATTACAACTm 
GACTAACATAGTAACAGAATCTTTATGAAATATGACCTTTrCTGAAAATACA^ 

SEO ID NO- 4382 ACTGGGAGTCAGAACGTCTGGGTTCrAGTCTTGACTGCCATTAACTAGCGAG 
ATCCGGAAAATGAGGCCCATAGGAAACAAGTGACTTGCTGAGTCCAGATAACACTG 
GAGAAACATGAACCAGAAGCTACTGAAGTTGGAGAACrrGCTACGAriTCACACTATT^^ 
AACTGCACAGTCTGTGTCAAAGAAGAGCATTAAGACAGTQGAGGCATGGGTTTTCATCTGOT 
CCTOTGTGGACAGCTCAACTGTGTGCCTGGCCCTGGCCAACAGATGTGCrCACTGGGGCTGC^ 
TCTCAGTATAGGCTTCTGGTAACAAAAAAGGAAGAAGGACCATGGAAATCTCAGTTATC^ 
AAAATCTAAAAAGGTGGTAGAAGTATGGATTGGAATGACTATTGAGGAACTGGCCAGGGCAATG 
GAAAAAAACACAGATTATGTATATGAAGCrn'ATTGAACACTGATATTGACATAGATTCACT 
AGCAGACTCACATTTAGATGAAAGTCTGGATCAAAGAAGTGATAACGAAGGCAGGGATG/VAGT^^ 
AAAAGTGGAGTAAATTAAAACAGGACAAAGTCNGAAAAAAATAAAGATGCTGTAAAAAGGCCCC 

AGGCAGATCCACTT 

SEO ID NO- 4383 ACGCGGGGCCTGCTCCGCTTGAGGAGAAGCGCCAAGTGCGCATGGGGACGCT 
ATAGCAATTCGTrrGCTGTCCTTCCTCTCOTCGAAGATGACAAGGCCTACC^^^ 
CmGGGCCGTCAGGCAGTTGGTTGGGACCCGCTCCAACCCTCGGTrCTTCCTGCAATACAGT^ 
TACAATTTGTCATGGCTACTCTGAGATAAGACCACrnTITATCTGAGOT 
GGACTTTGCTGGCTCACNGAGACAOTCTCTATGGAGCTTCAGTAGCAAATAAGGACATCATCT 

TATAACCrACAAGCAGTTGGACAGATATTCrACATITCCTCATTTTCTCT^ 
CTGGTAriTGTACCATGOAATAACrTrrTGAAACCCAGAATTATCTGGCT^ 

SEO ID NO- 4384 actaggtgctgcaatgcaaagggttatgacaaaactgtctgtaaatgtagga 

TCTGAATTGGTCriTGATAGTmCCTGATrrGAGAAAGAAGTCT 

taagagaatggacaaggtctgcttatgtccitctggcagcctagcatagct^ 

TCAGrrATAAGTCAAAACAAATAANGCACArrTTTTAAAAAAAATTCCCCCC^ 

GTAAAGCCATGACATTrCATTrGGTAACCTGTITAAGAATTATAAAAATCArrrCAm^ 

CCCATACTGCCCAANACAAAACTTCCAGACAATTCTGATGCCATCCAGTrm 

GCATATTAAAAAAAAAAAAAAAANAAAAAAAAT^m■CACCGTNCTAAATGTGATGGTGCT 

CGAANCATATTANANCTGGACNTGACATmGTNCCITGACAAACCCTC 

SEO ID NO: 4385 ACACAAGCTrraAGGAAGTGCNAAGGACTGACCTCTAGGCCAGAACAAG^^ 
GGAAAACrACCAGGCCCATCAGGCCTATAACCCAGACACX:AGCATGGACAAAACTCAGTrATA^ 
GAATrC^GAGACAAAATrCAGTGACACTCTTCTACCACTTAmAGGGTTCT^^ 
AACAGACrrAGTTTITn*GrrmGTTTTACAAACCT^ 

ACTAGGACTACGATGTTAAGACAACCACTAGCAGACAGCTGCGGACAGTTACTGGGTCTGA^^ 
GAGGCIT(XCAACTCAAACAGAGAAGTCATTGGGATAAACTCTGCTCm 

TATCCTTAAAACAGGACAAGATGGGATGACCACACCANGTAACATCACCGAGGTCTCAAAGTAGC 
AGCTAAAATGTGCCATAACTCTTACCCTGTCTAGGACCTATGTTCCACTCTCTCA^^ 
ATCrATACATOTTrrCrCAATAAGGGCTTTAGTrrCTCCTCCAGCOT 
CAGAAACCAATTCTACCTCTTT 

SEO ID NO- 4386 ACGCGGGGCCCTGTTTTTTGTGNTTTTNCGAGCTCAGC^^ 
TTACGTCrTAATTTCCAGGACr^^ 

GGTGGNCTTGT 

SEO ID NO- 4387 ACTCGGGCAAGCCATGATCTTTmCCATACTCGCAAAACAGCTA>nTGG^ 
GCAGCA^^^^^ 

AGAGGGCTGCGGTGATOGANCGCrrCCCANAGGGCAAANAOAAGGTmGCT^^ 

TGTGOXGCGGCArrGATGrrGA^^ 
GACGGGAATCCTGACAATGACACCTACCTGC 
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SEO ID NO- 4388 ACAmGTTAAAA^^GGTGAGCC^^^ATTGATACATTAr^AAGTCCATAGTTT 
ACATTAGGGTTCACTCmGTGCAGTTTTATTGGTTGTTATATm 
ACATGTATCCACCATTACAGTATCATTCANAATAGTTTCACTGCCTTAAAAATCCTCT^ 
CTATTCATCC^CCCTTCCCATGGACrACTGGTAACCACTACCITI^ 
CTTTrCCAGATTACCATATAGTTGGACTCATACAATATGTAATCCTrrCAGA^^ 
CAATATGCATTTAAGGTTCCTCCATTATCCAOAOATGGTTTTAGTTGTCACAACrrG 
ATCAGGGGTCAATGCTACTGGCATCTANTGGGTGGTAGAAACTGANGTAGCTCATGACATACTAC 
AATGCCCAOGACAACCCNCCTACAAAAAAGATTATrrCAGCCCCCAATGCCAACCCATTGCT^ 

SEO ID NO- 4389 CGTACCTCTATAATITGGGTGACCANTATGCACTGAAGATGAGGTrrGTGGAC 
CATGTGTTTGATGAACAAGTGATAGATTCTCTGACTGTGAAGATCATCCTGCCTGAAGGAGCCAAG 
AACATTGAAATTGATAGTCCCTATGAAATCAGCCGTGCCCCAGATGAGCTGCACTACACCTATCTG 
GATACATITGGCCGCCCTGTGATTGTTGCCTACAAGAAAAATCTGGTAGAACAGCACATTC^^ 
CATTGTGGTCCACrrACACGTTCAACAAGGTGCTCATGCTGCAGGAGCCCCTGCTGGTGGTGGCGGC 
CrTCTACATCCTGrrCrrCACCGTTATCATCTATGTTCGGCTGGACrrCTCCATCACCAAW^ 
GCCGCAGAAGCCAGGATGAAGGTANCCTGCATCACAGAGCAGGTCITGACCCTGGTCAACAAGA 
NAATAGGCCTTTACCCGTCACTTTGACGAGACCGTCAATATGTACACATGATGGGACCCAAACTCT 
CCTGGAGAGTTCCATTCACrrCTGAAAACGTGCAOTGrrrCAAGGTGCATGm 
ATTGOATArrXAAGATTTGCTTGGAGCANGCTGGGCAAGGGTGGCTTCATCCCTC^^^ 
ATGCTTGGGATTACA 

SEO ID NO- 4390 ACTTTTCTCAGTGACCrCnXjGAATAAAAGAAC TGACr m 
TTAATOTTGCATGAAGGTTGGrrrCGCTmTITCC^ 

GGGATTITGCCTTATGCTGAGGTCAAACTAAGGATAAGCAAGCTTTTGTCCTrcATm 

TGTCATACTGTrATGTrGACATATnCmATAAGAGAATAGAGGCAAAAGTATAGAACTGAGGAT 

CATTTGTATTmGAGTTGGAAATTATGAAACTTCACCATATTATGATCATACAT^ 

AGACrGACCAAAGCrCACCTGTTTmGTGrrAGGTGCTTTGGCTG^ 

TCCCrrrGGTGTTOTGTATGTCTCTrCATrTCCTCTCAAAATCTTNAACT^ 

GGCAGCAGGGATGCTGGCATCTGTGTATCCmATACTGTTTACTGATAACCCACA^ 

ATGGCANACCTAAGCTCANACCCTGCCrrm'CCrrGGCAATNCATAAG^^ 

SEO ID NO- 439 1 ANNCnTAAGCGGCCGCCNTNCAGGTACGCGGGGGGCGCTTCTAGGGCT^ 
NCCGTCATCrrCNGGAGCCGTGGAGCTCTCGGATCANCCGACACCATGGGTTTCGGAGACCCTGA 
AAAGCCCTGCCGGCCrCCAGGTGCTCAACGATTACCTGGCGGACAANANCTACATCGANGGGTGT 
GTGCCATCACAAGCNGAATGTNGCAATATTTGAAGCCGTGTCCAGCCCACCGNrrGNCGAl^^ 
TGTTATGCCCTACAGTNGGTATAATCACATCAAGTCTTAACGAAAAAGGAAAAAGGCCCAAAACA 
TGTANAACAATGGACNGCATCAAATTAAACTTTT 

SEO ID NO- 4392 ACATCAATAACCGGGGArrmATTTrmGCTATACTATm 

GGTGTGCTTGCTAGGCAATATAACCATCACAGAAGCAAACGTTAGGCAGAATACTGGTAA^ 
AACAAAGCAAGAAAAGAGCTACCAGATATACAGACAGGCTTGGTTCCACATTCACTACTGCAm 
TCAAGCGCCAAGTATTAACTGCACATITITGTTCAGCTAGAAAGGAGGGATm 1 1 TTOTIT 
TTTTCTITriTrTrTGGTTTGNTTTAAATCAGTGCATAAAT^^ 

CAAACAGATGGACTCTACAGCTAAGTGGAATATCAAAGGTAGAGGGGTGATTCTGTGANAO^^ 
AGGCCITGACTArrCTCAATICrCCCCACrGCAAGNGTCACNCAAOT 
CrTAAACACATOTGAGCAhrraACCraNATGAATGCCCCTTO 
AANCATOATCAGGGCTAGCCTCATTCAGGGNAGAAAT 

SEO ID NO- 4393 ACCCCCAAGATATCAGGCCAGANT^fTmGAGTCAGTTGCTGTGCNGCN™ 
TTCTTIOTCrCCACATCTTCrGAGGCTITAGA^ 

TCTGNAAGTNCTTAAAGAACCAGCTrCTrAGAATGTTC^GTTCTCAATGTGCT 

TCCTAAACATmTAAAACTCTNCCCrmCACCTCCAATTCC^^ 

TCCAGGAGGGGTANAGATTGCGCCNGACATAGCmACAGOTGGrnTAAAG 

SEO ID NO- 4394 ACCAAGGCTrGTrnGCAGANGGAGAAGCACTTCCATTATCTGAAAAGAGGC 
CrrCGACAACTGACAGATGCCTATGAGTGTCTGGATGCCAGCOT^^ 
CTGCACAGCrrGGAACTGCTAGATGAACCCATCCCCCAAATAGTGGCTACAGATGTG^^^^^ 
CTGGAGCTGTGTCAGAGCCCAGAAGGTGGCTTTGGAGGAGGACCCGGTCAGTATCCA^^ 
ACCCACATATGCAGCAGTCAATGCATTGTGCATCArTGGCACCGAGGAGGCCTATGACATCATTA 
ACAGAGAGAANCTTCTTCAGTATTTGTACGCGGGGTCTCTCrTTACT^ 
GAGGTTGNGGTGCTAGTTTCTCTAAGCCATCCAGTGCCAT 
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SEQ JD NO: 4395 ACCATTCCAGAGAAAACACAAACCTTCCCAAATTAA.GGAGGGCAGAATAATT 
TTGAGATGCAATTAGACAAATCTCTATGAAAAAAAAATTAAGGCTCCTATAATTAACAGTCTGm 
ATGGAGTAAAArrAATTTGTATCTTAATTCTTCAGGACCCAGAAACATTAGAAAGTGCAOT 
TATACTGTGTTAGGGTCCTCTATACCTAAGATCTGGCAGAACACTTTACTTTTTCAAGTACGCGGG 
AGAAAAAGAAGATGGAGAATGAGTCGGCCACGGAGGGGGAAGACrCTGCCATGACAGACATGCC 
TCCGACAGAGGAGGTGACAGACATCGTGGAAATGAGAGAGGAGAATGAATAAGGCACGGGACGC 
CATGGGCACTGCAGGGACAGTCAGTCAGGATGACACrTCGGCATCATCTCTTCCCTCTCCCATCGT 
ATTTTGlTCCCTTTTrrTGTTTTGTTTTGGTAA^ 

TCTAGCATACrGGGTATGCTCACACTGACNGGGGGACCTAGTGAATGGTCTTTACrGTTGCTATGT 

AAAAACAAACCAAANAACTGCTTATACCCCTGCmACGAAAACCCAAAAAGACCCAC^ 

CGGTGNCOTTGTGTCCTCCTCCCT 

SEQ ID NO: 4396 ACAATTGAAAATTCACTGCTGNAGCAAACATGGACAGGGCTCTGGACGATGA 
AAGATGGCCTTCATAAAATGCAAAGTGAACACGTTTCACTCTCATGTCAACCTGT^ 
TTTCACCAAACCAAGACTTCAAAGTTACTTGGTCCAGAATGAAAAGTGGGACTTTCTCT^ 
CTTACTATCTGAGCTCCTCACAAAATACAATTATCAATGAATCCCGATTCTCATGGAACAAAGA^ 
TGATAAACCAGAGTGACirCrCTATGAATTrGATGGATCTTAATCm 
TTATGCAATATTTCnTCGGATGAATATACTTTACrmACCATCCACACAGTGCATGTAGAA^ 
GCCAAGAAACAGOTCCCATAACAAAGGOTATGGArrTTGGTGCCCT^ 
TCTGCTGATTTGGAAAGTAAAATGT 

SEQ ID NO: 4397 ACAAAAAGTCAAGCCCTGAAGTAmrmCCCCCAA^ 

ACTTTAAAAAAAAAAAAGAGAGAGAGGTAGTGTGTAAAAATATrmACITAAA^ 

CrGCACCACrTATATTATAAATGAAAGTTAACCTTCTACATACTAACATTATm 

GTCAAATTTGTAAGACTCACAGAATGTTAAAGCCGGAATGCCATTAAAAATAGTATATAAATGGA 

AGAACTTCNGTGAATTCAACAAAANGTACCGCGGGGTAATTCCAACACTTTGGGAANCTTGATGG 

GGAGGATCACTTGAGCC 

SEQ ID NO: 4398 ACGTGATACAGCGAGGTGCTCAGTCCCCrCTGATCTTTCTCTATGTGGTTGAC 
ACATGCCTGGAGGAAGATGACCTTCAAGCACTCAAAGAGTCCCTGCAGATGTCCCTGAGTCTrCTT 
CCrCCAGATGCTCTGGTGGGTCTGATCACATTTGGAAGGATGGTGCAGGTTCATGAGCTAAGCTGT 
GAAGGAATCTCCAAAAGTTATGTCTTCCGAGGGACCAAGGATTTAACTGCAAAGCAAATACAGGA 
TATGTTGGGCCTGACCAANCCAGCCATGCCCATGCAGCAAGCACGACCTGCACAACCACANGAGC 
ACCCTTTTGCTTCAAGCAGATITCTGCAGCCTGTTCACAAGATTGATATGAA 
TGGGGAGCTACAGAGGGACCCATGTCCAGTAACTCANGGGAAGAGACCTTTGCGATCCACTGGTG 
TGGCmTGT 

SEQ ID NO : 4399 ACTGTGTGC AGCAGCTCAAGGAATTTGATGGGAAG AGCCTGGTCTCAGTTAC 
CAAGGAGGGTCTGGAGCTGCCTGAGGATGAGGAGGAGAAGAAGATGGAAGAGAGCAAGGCAAA 
GrrrGAGAACCTCTGCAAGCTCATGAAAGAAATCTTAGATAAGAAGGTTGAGAAGGTGACAATCT 
CCAATAGACTrGTGTCrrCACCTTGCTGCATTGTGACCAGCACCTACGGCTGGACA^ 
AGCGGATCATGAAAGCCCAGGCACTTCGGGACAACTCCACCATGGGCTATATGATGGCCAAAAAG 
CACCTGGAGATCAACCCTGACCACCCCATTGTGGAGACGCTGCGGCANAAGGCTGAGGCCGACAA 
GAATGATAAGGCAGTTAAGGACCTGGTGGTGCTGCTGTTTGAAACCGCCCTGCTATCTTCTGGCTr 
TTCCCTTGAGGATCCCCAGACCCACTCCACCCGCATCTATCGCATGATCAAGCTAGGTCTANGTAT 

TGATGAAAATGAAAGTGGC 
SEQ ID NO: 4400 ACCGCCAGCTCTCTGCTCTCCACAGGGCTCCCCGCCCCACCCGGCCTGATAAA 

gcgcgccgactgggctacaaggccaagcaaggttacgttatatataggattcgtgttcgccgtgg 
tggccgaaaacgcccagttcctaagggtgcaacttacggcaagcctgtccatcatggtgttaacc 
agctaaagtttgctcgaagccttcagtccgttgcagaggagcgagct ggacgc cactgtggggct 

CTGAGAGTCCTGAATTCTTACrrGGGTTGGTGAAGAITCCACATACAAATTTm 

ATTGATCCATTCCATAAAGCTATCAGAAGAAATCCTGACACCCAGTGOATCACCAAACCAGTCCA 

CAAGCACAGGGAGATGCGTGGGCTGACATCTGCAGGCCGAAAGAACCGTGGCCTTGGAAAGGGC 

CACAAGTTCCACCACACTATTGGTGGCTCTCGCCGGGCANCTTGGANAAGGCGCAATACTCTCCA 

CTCCACCGTTACCGCTAATATAAGTAAAGTTTGTAAAAATTCATACTTAATAAACAATTTAAGG 

ANTCATGTCTGCTTACAGGTGTTATTTTGTCTGrrAAAACTAATCTGAA^ 

TGT 

SEQ ID NO: 440 1 ACGCGGGGAGGATAGGCCGAAGCTNGACCCGGAGGAGATGAAACGGAAGGT 
GCGCGAGGATGTGATCTCCTCCATACGGAACTTTCTCATCTACGTGGCCCTCCTGCGAGTCACTCC 

atttatcttaaagaaattggacagcatatgaagacaggacatcacatatgaatgcacgatatgaa 
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GAGCCraGTrACAGTrTCGACTCCTCrCTGCAAGTGAATTGGCCX:AGAAAGGTGTAAGAGACTCTT 
TOAATXHJACATAAAATTCTtKnTGTTAAGAACAAGTTTGGCTCTGGTAACTGACOT^ 

aatatSLvactatttgggaagtatcaaacgatgtctcgtgatctogtgtacgtggaggct^ 

ACCTGGTCAGAGACTGATGTOCCrrANCGGCAATGGTrAGAG<nTnX:AGTGCATCC^^^ 
GTCGCCCCCATCCTCGGNTTCCTCACATTCANGGAGCCrrGACTTTGGATCAGACTrc 

SEO ID NO- 4402 ACTGCTACCATTACATGGTTCCTrATTAAATrrGAAAAGTGCCTQAAAGTTTG 
GGCACCAGAAAGACACCCCAACAACATCTGTCTAAACT^ 

AGTTACArrGTOAGAAGTGCTGAAGGTATGTGATGTCTTrCCCGGCACAAAGGTGGCOT^ 
TCACCAGATGGTOTAGCCACCATCTGAACCACCCATGAAGAAGTrCCX:CTrCCGCTGAGTTACGAG 
GACGTTGGCTCCCTGCATGACTGCAAGCTOAGCANCATTGGGAGGGCATCCAGGAGGTGGAGGAG 
GAATGTTGCCAGCAGTAGCCCCAGCTXXAAATCTGGCACCCGCATCATACCCTCCTTCACCANCAC 

TCTGGAGCCAGGTGOATA 

SEO ID NO- 4403 ACGCGGGTATCAAGAATTGGnTGTrCATTAAAGCAGGCAGTAGATATGAGG 
ACnrCAGCAA-mAGGAACCACCCATTTGCTGCGTCrrACATCCAG-IXn-GA^ 
CATCTirCAAGATAACCCOTOOAATTGAAGC^GTrGGTGGCAAATrAAGTGTGACCGCAACAA^^ 
GAAAACATGGCTTATACTGTGGAATGCCTGCGGGGTGATG'ITGATATrCTAATQGAGTTCCTG^^ 
AATGTCACCACAOCACCAOAATTrCGTCGTTGGGAAGTAGCTGACCTTCAGCCnrAGCTA^ 

GACAiiiLGCTGTGACCTrrCAGA^^^ 

CCGGAATGCCTroGCTAATCCCTTGTATTGTCCTGACTATAGGA-lTGGAAAAGTGACATCAOAGGA 

GTOATTACTTCGTrCANAACCATTTTCACAAGTGCAAGAATGGCTTTGATTGOAOT 

TCATCCTGTTCTAAAAGCAAGTT 

<!EO ID NO- 4404 attgtotccatttcatgagattotgcttgaggacaccatggocaangatcto 

ATGGTTGCCATCCTAAGCGTTTTAOACTTrrGACCCAGANATnTrTGTTTCT^ 

ATTTTAGAGGATAGGGTCTCAAGATATAATCCrrmATAGGCOGCAGGTCnAANAGATGAGGG 

CCANAGGAACGGATGAANCCTGCTTGGAAGCATGCTGGGATGGCCATTTGGAAGGAGTG'TTGCAA 

GG^GCATCGCOTGGCTC^^ 
TCTCGGCCTAATCTTGTGGCTTCCAA 

SEO ID NO: 4405 ACAGTTTTATGTGGGGAACAATrCATGCAGGCTACTGGAAAATTAAATOTAT 
TA^CAACrccrroTGATATXrrTTGCCATCACC^^ 

CATrGCTGATACAAATGGAGAGGGCAGAGAAGACTTrATACAACCAG-rTITrcCATTGCAOAGTCT 

TAAGAAAGATTATTAGATGACTrACCTATATGACTAATGCCATCAGOAAC^^ 

GGGGGTTGTCCATCCCTCrrCCATACTGAGGTGGAOATGCTCATGCAATACTmAAGGATGCAT^ 

GTCCANCCTTCAGTrATrCTTCACrGCTCTTGGTGAAGGTATGTGGGAGAAAAACTAA-IT/a/^^ 

CGmCCrAGCCrCTGATGGAGAAGGAA(>CCATrCTGATACCAGAACATGGTTTATAAGG 

TAGAAAAATCCCXAACCAATCTTAATTGAACCAAAGTCTGAACCAATGGAAAAi^^ 

AGTGTATAT-mGCAGGTrTAANACAACTCAAGGACAATTAAAAACAATGGACTTTACATGTT 

SEO ID NO- 4406 ACGCGGGGAGGCTTGAGGGAAGCA-raGAGGTCCATGGCAAGCCCAAGGCT^ 
GCrcGAGTTQTTajTCGCCCACCCGGGAnCCTCAGGAGTCCCAG-rGTCCAAGGAG^^ 
GCGGGAAGCOACQGCCGCGGAGGTATATGGGACAOGTrGCT^^ 
AAAGACCTCCACTCrrCAAACAGTTCGGATAGAGAGGAGTCCXT^ 
CAATTTCGGGCGATGGTOOTCACGCGGATCCGICTCTGGGTGCTGGACTG^^ 
AACroGA-TCGCrccrCTGCCTCCTTGGGGGATCGGGGTG-ITGTGCTOATrOACAACTTC^^ 
ATGCCAAGTGTAGATGTAGGATCTAGGCCACAGATTITCCACTGACTCGTGCCACCAACACCAAG 

CT^^^CTS^S^CGGTCCmCACATTCAGAOATACCNCATC^^^ 
TCTTCA-mCCCXlAGAAGTCrTTNCCTTCAAAGTACACCAAAATGCCA-ITCGGAAATCTCCA-n 

AATCTTTAA 

SEO ID NO- 4407 A CI - r iT l ] 1 1 I Tl - i - lTfl 11 11 111 lA AAAGGATGAGAAGAGATTTTAG^TTC 
ACTGrnrCALG-rSAACAAGGGATAC^^^^ 
GAS^A-TCAkAGACACTCmTAACAaGCAAATAArrC^^ 
TTO^^TACAGGTTCITATACAAATGTATAACTAA^^ 
ISS^GGCTT^GAGA/^GOCTTTO 

SG^^CAAAAGGAA-irrGTOAGGANAACAGAAATTAACTGTCANATOTC^^ 

AAATTATCCAAAGTTTGA^ 

ATTTGCNTAT 
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SEO ID NO- 4408 AC rrn i il'i 1 1 1 1 11 1 1 1 i iTl-iTrn"lTAATGACATCCTAAAA1TCANAGGAG 
GGGCCAGCGGGACCTCTGGGCTCAGCGGCTGTGAAGGAGGGACCCGCAACACCCGCTAAGGCAG 
GTAATTGCAANAAGGCACTCGCGAGGGGGACrrCAAGCCCCTCTTCTATTTCTTC^^ 
GGGGGATGGGGAAAGCTCCAAGGGCGAGGGAAGCA 

SEO ID NO- 4409 ACGCGGGGGTATTGGGmGNCTGGCCTCGATTNAAAGAGACAGAANCTGTC 
NNGGTCCTGGAAGACGGTCCCCANTACCCn'CCCCCCAAGTCCriTGGGACCGCTTGGGTCC^^ 
GCTGGGGAGATGGTTTGTGGCGGCTTTGCCTGCrCCAAGAATGCGCTTTGCGCTCTCAACGTGGTC 
TACATGCTGGTGAGCTTGTTGCTCATTGGAGTGGCTGCTTGGGGCAANGGCCTGGGTCT^ 
AGCATCCACATCATCGGGCTGAGTCATTGCTGTGGGATCTTCCTT 

SEO ID NO- 4410 ACCGCCAGCTCTCTGCTCTCCACAGGGCTCCCCGCCCCACCCGGCCTGATAAA 
GCGCGCCGACTGGGCTACAAGGCCAAGCAAGGTTACGTTATATATAGGATTCGTGTTCGCCGTGG 

tggccgaaaacgcccagttcctaagggtgcaacttacgocaagcctgtccatcatggtgttaacc 

AGCTAAAGTTTGCrCGAAGCCTTCAGTCCGTTGCAGAGGAGCGAGCTGGACGCCACrGTGGGGCT 

CmAGAGTCCTGAATTCTTACTGGGrrGGTGAAGATTCCACATACAAATTTm 

ATTGATCCATTCCATAAAGCTATCANAAGAAATCCTGACACCCAGTGGATCACCAAACCAGTCCA 

CAAGCACAGGGAGATGCGTGGGCTGACATCTGCAGGCCGAAAGACCCGTGGNCTTGGAAAGGGC 

CACAAGrrCCACCACACTArmGTGGCTCTCGCCGGGCAGCmGGANNAAGCNCAATACT^ 

AACTCCANCCGrrCCGGCTAATATAAGTAAAGTTTGNTAAATTCATAOTAATAA^ 

CAGTCATNTCTG 

SEQ ID NO- 441 1 ACCAATCCITITGACGCTATTCCATTAAGATAGAGAAAGAGAATCCT^ 

atcattctatgaagcctgtaataccctaagaccaaaaccaggaaaggatataaccaaaaaagaa^ 
actacagaccaacatncctgatgaacacagatgcaaagattcttaacaaaatacatctaaccaaa 

TTCAACAACATATCAAAAAGATAATCCACCATGATCAAATGGGTTTCATACCAAGGATGCAGGGA 

tggtttaacatacacaagtcaataaatgtgatacactacataaacagaattaaaaacaaaaatca 
catgatcatctcaatagatg-itgaaaaagcatctgacaacatctagatccctctatgattcaaact 

CTCAGa^AAATTGGCAGACAAGGAGACATACCTAANTGCTATNCAAAGTCATCtATTGACAAACC 
C 

SEQ ID NO- 4412 ACTGTGTGCAGCAGCTCAAGGAATTTGATGGGAAGAGCCrGGTCTCAGTTAC 
CAAGGAGGGTCTGGAGCTGCCTGAGGATGAGGAGGAGAAGAAGATGGAAGAGAGCAAGGCAAA 
GrrTGAGAACCTCTGCAAGCTCATGAAAGAAATCTTAGATAAGAAGGTTGAGAAGGTGACAATCT 
CCAATAGACTTGTGTCTrcACCTTGCTGCATTGTGACCAGCACCTACGGCTGGACAGCCAATATGG 
AGCGGATCATGAAAGCCCANGCACTTCGGGACAACTCCACCATGGGCTATATGATGGCCAAAAAG 
CACCTGGAGATCAACCCTGACCACCCTATTGTGGAGACGCrGCGGCAGAAGGCTGAGGCCGACAA 
GAATGATAAGGCAGTTAAGGACCTGGTGGTGCTGCTGTrTGAAACCGCCCTGCTATCITCT 
1TCCCTTGAGGATCCCCAGACCCACTCCAACCGCATCTATCGCATGATCAAGCTAGGTCTA 

SEO ID NO- 4413 ACCGCrCCAGGCX;AGCTGTGTCNNANATCTGAGCCTTGACAGCAGCGGTGCC 
CAACATCACAGTGCGGGTGGAGAACTCAACCCCGATGGTGGTGCGGCTGTCGTGGCTGAACTC 
TGCGCGTGAATCGGGAGAGTAGArraGTCTTCCCCACACCrGATTCGCCGATCNGa^CCACCT^ 
AGACAAAAGTTATAATCOTCCTCAGTTCCATTCCACATCITGGCTCCCGC^^ 
TCTTTCTGGGACAGAGGCGACAAATCTGTGTG'nTGCTCATGCCCTCAACCCTCA 

SEO ID NO- 4414 GGTACTCAACACCAACATCGATGGGCGGCGGAAAATAGCCITTGCCATCACT 
GCCArrAAGGGTGTGGGCCGAAGATATGCTCATGTGGTGTTGAGGAAAGCANACATTGANCTCAC 
CAAGAGGGCNGGANAACTNACTGAGGATGAGGTGGAACGTGTGATCACCATTNTGCNGAATCCA 

CGCCAGTACITIT^IT^^mTmTTTNAT^^^ 

AKGATANTAGNCTATANAATGCCTGTOTGCTrATmCAATCACCA^ 

NGGAATNTNTTGmrCITmAAGGAAim^GNTCAATTN^^ 

GTGANTTCTTCCT^^m'GGTTACATTTTCTG^mTNAAANGGGATNr ^ 

>nsfArmGAAATTATGAAAATTACTNGTTrCbrrAACTNATGN^ 

NNTTTGTTCANTTGTCCTAATAT^I^^^^CTC^ 

SEO ID NO: 4415 GGTACTCrrGATGAAAGACCGTGAAACCAACAAATCAAGAGGATTTGCTTNT 
GTCACCrrrGAAAGCCCAGCANACGCTAAGGATGCAGCCAGAGACATGAATGGAAAGTCATTANA 
TGGAAAAGCCATCAAGGTGGANCAAGO^TCCANACCATCATTTGAAAGTGGTATNACGTGGACra 
TCTTTANCTCCAAGAAGTATAGGCCCTCCAANAGGTNNTATNNGTNGAATAAGANGNTG 
ANCCNGNGGANTTTTNC^mCTG^n^ANGTCTCTTTGT^ 

^R^rITNT^r^A^T^T^fNTNNTGGAm 
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NTm'AGATNTCNTGGTANNTTTTNTITm^ 
TNTrrTATTCrrTTTNNATNfN^^ 

QYTCT i T iTiri 't^TTANTmArmrNriririTi'rm^^ 

SEQ ID NO: 4416 GCGTGGTGGCGGCCGAGGTACrri"rrri"l"i'l'rrri-rri"l'l ill l iGGAGCAGAA 
GGAACrCTTTATTGGAAAGTGGATGANAGAGGCNCCTCCAGCCGTGGGCATCCTGAATGGG 
AGAATGGACAGTGTGGGAAGGGGAAGGGCANCAGGGACTTAGGACCAGATGGGGCCTGTAGCTC 
TGNGGACGGCACAGGTTCATNANGGACCGGCTCCNTOTC>n^GGGGAACNAATCNGGCCArc^ 
CTAGANCCTTCNCANNANTTCTTGATTCCTGNNCAGTTTANTNT^^ 
NATTCANCCrmGCTTCANCTGNrmNNTGTTNATGTTGCNATTGANCN^ 
GNACCTTrrATCITNGCNNTNrrTrCCTN 
TTGNATNTNTITOTANTNNTTCnTIOTTlOT 

Tam■ATT^ra^mcTTTATTOT 

TTGTANCrmATTmrcmTCTATrTATCTGNAT^^ 
NAN 

SEQ ID NO: 4417 GGTAL T r n r i l - l 11 11 1 1 i lU - IlU - inu - l ' llT llUU-n-lTTWAAAATNAAArnr 
ATTTOTCATTGATANAANCCATGAAAAACKrTTNCCCm 

GGAANATTTNTNGCCATAAANAANATCTNGCTGATTNGNAAAAGNGAAANANCAGT^ 

TCAGGGNAAANAAAAANGACNTAGTAAANGATNGTTTAATTrrTAAANCCAAACATAANCATAN 

GGCTTCNTTATNAACANCCNGTNTAGCCCATTNCTAAATTNTT^ 

ATAAj\NAGGCATKn>jAATATTGGTTNGGCCANCNGGANTCCCCNAAA AAAT rA 

AAATTTCAGCCAACCTTTITGCACAAAAAAAAa^GGGNGGGG^ 

TTGGAastACCCTTTATACCCAAAANNTNGGTTTNNAAAAATTm 

SEQ ID NO: 4418 GGTACT rnTiT n 1 i ri 1 1 1 1 1 1 1 fiTTTmGcrnTiuTrrn-ri ri'1'1 ITITTT 

TTTTTTTTTTTTTTTNANCCA^ 

CTTTTAGGGCTGCTTCCNCCrGAAGGAANATCCTTTTGNAANCCn^ 

CAAAGGANANNGGANCACCCAACNCNNAAAACTACCGTTTGGGCATGGNTAAAAACCNGG 

TTITANANCATCCTGGGCNTTTCACATCCATGAANTAGGAATTGGG<^^ 

TTITGGGNTNCCNCTTCNCCNCTmTGGAGAGGGATGAAGGAAATCCTTTG^^ 

CNNGNGCa^AGGGNNCACCCCCCGAAAAGGCCCCNNTTCTTGCCCGGNGGGCCNTTTA^^ 

GAAATTCCACCrCnT^GGGGGCGGTTNTAANGGATCCAACCNGGGNCCAANTTNGGGTO 

SEQ ID NO: 4419 GCGTGGNCNCGGCCGACGTNCGCNGGTAAGATAGTTAANCGTGCNTAANNN 
AACmCCAATNTACATACTCNGCrrAAAATTTGGGGGAAAATrrAGA^ 
ATNGGAAANTTGGTATAATGAATGAAACATTTrGNNATATAAGATTCATATNNACTTOT 
TTGATAAAGNNAGGCNTGGTTGTGGTTAATCTGGNTTATTTTTGNNCCACNNGT^ 
ATNACTOGNNNNANCCTCATGACrrANGAAAANANNGGAAANANCTTTGTGAOT 
GCCATGCTNNTAAAAGNAANAGNNGNCAAGGCTQ^ANATTTGAAAAACGT^ 
TTOCNCNTTTNGATGNATCNAACTNGTCGTITOGTCATNGG>^ 
AAACNTNTCCTATNGANTTNNCTTTCTCKrANAGTrm 
TTTArmTTTGTTNGNGTNTANNGNTTCCCTmCTNCTANC^ 

SEQ CD NO: 4420 Ac nTrriTrrrn rm i i ru iT T T Arrrrrn rrn 1 1 1 1 1 i 1 1 1 1 itni m 1 1 

TTTTmrmAGGGrrrCATTATTTATTTATGAC/^^ 

AAANTTTNTmGAAAmANGCCNTNGGCCTTGGCCAATCGGAN^^ 

TNCTACNAANGGNCCCCCAAATNGCNTAAGTTTTAAACTGGNCNTT 

NAACCTCGGCCNCGrmATTTGGAAGGANGCCTGGNCCrrGGCNCCCATNANGC^ 

GANATrrCCNNGGGACGTNCCTTGGNCNGNANCACCCNAAGGGNNAATTCCAGCNCACTO 

NCCGNTACTNATGGANNCCAACrTCGGNNCCCAACTTTNGNGGNANCAAGGGGNANAA^ 

CCCNGGNNGNAAANTNGTTTCCCCTCACAAmCCCACAANATANCAACCCGGNANCTTO 

TAAAACCCNGGGGGGCC 

SEQ ID NO: 442 1 NCCAGCGGCCGCNCNGNCNGGCNCri'lTl-n i 1 1 1 i 1 rTrrTTTTTGGTTGGNN 
CNNTATNGNNNACATNNCTACTGNGCATACNCATATATACNNTGTATTTO 

AATATGNACNTTGATCACTNGGNTNACAACGTAAATATATTGNGAACATTGTGCrOTCTACAAC^ 

GTTAAAAGAATTGAATNCTTGGAGGAAACACANTNTANTAAACNATCTTGTG^^ 

GCATAATITNTTrrCTAAGGAGGCTTAATNNTTmAACAANGNCm 

CTNGTCTTATATNGCTTNCTATANANGATGGAAACNTGCCCTrCCATrTAGCCi 1 1 1"^ 

mrACCACGACCTAATCACCAATCAAGrrACCCATTTTOGTmAAACCN^^ 

CCGNmCTACCCAATrm-CCTGGGGAAACNCCNAAGGGGNCTrCATT^ 
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CCTACCCTrAAAGAAGGNACCTTTCCTTTTAAANAA 

SEQ ID NO: 4422 GGTACTNTCOT>rrTCTNCriTrTn^ 
TTTTTTrTTTmATGGNTGTGGNTTNN>^^ 

ANCTCTrrNATTNTGAAACCAAANCTGGATTCTGGACNCTrrGCA^^ 

TGATTGGTANCATAACCTGGGTAAGGTTACNNGAGTGAAGGTATTGANGAAGATNCNCAATTGAT 
(nrCTGTGAATTTGGTGCAAOTGCACCTATGATTTGKTGCNACCATCTTG^^ 
NAACTATGCAAAAAAGAGAGTTCTNGAAGGCCTGAACTA(nX3NCNGGGAGACTGOT 
TTTTrTAAAAAAGAAAGGTCACATNTTATACAATATAAAAATNCACAAAGCTN/^ 

SEQ ID NO: 4423* ACAGGAGAGAATCAATAAGGAGATAGTTAATTCACCACTACAGCnTGTGCA 
TTAAAGACAAGTITGCAGAAAGCTATGTCTTACAATCCTCTTTrAGrrGCATCA AACA 
TCAATGACTTTCTACTTCrrATAGGCATTTTATGCNrrTN^^ 

TCNAACNCCNACCNAGATGCNGCAAACCAAATCATNCNAACTGAAAGATGAATGNTATTAGTGA^ 

TNACATTGGmOTTTAATGhnsrGNATTGGTTCAAACAAAACAT>WAATO 

ANTTTGNCNAGGCGaWTGACmCCCCCTOTGATNCCACACTT^ 

TCNCAAAGNNNNGNGATCTATACCCTTTTTTCTT^ 

AAAAAATT 

SEQ ID NO: 4424 A crrnTi ' iTi ' i J i i n J i J i"i - J " i itj i r iattaatatttgggncttacaaatga 

TCACnrnAAATGGACrrTTCTGTAANAATGTAAAACTCAAAAATTTGCC^^ 

CNCNCAAATCCCCAAAAAGGGTTOn'GGGTNGTCTTNATTAACGCAAATlSrrTO 

TNTTACTGNAGGATCTTGAATATGTTTTACAATAANGAANCTNCAAAGTT^ 

ATTGNNAACTATAAATAACATITGTATTAAAAAOAAACTGGGNAATACAAAAATNGNGA^^ 

GNGGGCNNGCAATTGNGANGCCAGAATATTTNrmGCTTTGGGAGCNGGTGCATCOT 

ACCCNCNATGNGGNNTGNAATCATCTGGCTGGNACCCmTAm 

GmATTTCANAACriTTTTGNTATTACCGAAGTTTGTNAAACT^ 

GWGCTTTNAAAAATGGTrrGNANCTGCACCCTTAAmTrGGGCT 

SEQ ID NO : 4425 ACGCGGGGTATroAACrrGGGGGTTGGTCTGGCCTACT GGGCTG ACAT TAACT 
ACAATTATGGGAAATGCAAAAGTTGTTTGGATATGGTAAGTGTGTGGTTCTCTT^ 
TCAGGTGATTNAATAATAArrAAAAACTACTATAGAAAACTGCAAANCAAAGGGA Al-liCriC 
ANGGGAACCrrTTGATTTATNAANTAAAATCCWI^ 

GTGGAAGAAAAAAACCTTTCCATTTGTTAACTGNAAAACAAAAAGOTAAGGGAT^^ 

TTTGGNAANITNrrATTrrAAAACTTATCTGTTTNj^^ 

GTCTACCTTGATTGCTTTAAAAGGNNGGCCTTITGTTAATNAAGGAAGGG^ 

TNTAAACTTTTTAAANAANNATTmAANNANTACCNACC^ 

AAAAAAAANCTNNCTTGGAAAOTGGNNAATAAAAGGNNAAAA^ 

GGGGGAANANCNTANNCCNrrNANAmrmNCNTCTANNA 

TTTAANCATAA 

SEQ ID NO: 4426 ACACAAGCTTTGAGGAAGTGCAAAGGACTGACCTCTAGGCCAGAACAAGAT 
GGAAAACTACCAGGCCCATCAGGCCTATAACCCAGACACCAGCATGGACAAAACrCAGTTNTACT 
GAATTCANAGACAAAArrCAGTGACACTCTTNTACX^ANTrmANGGNT^ 
ANCAAACTTANTTTTTNGTCTTGGTGTANNAACCm 
CTTTGATTNAATTTTNTmTNC™ATT^^ 

SEQ ID NO* 4427 ACTTTCCAGCCACTAATTGAGATGTAGTTATGAAAGATTAGAATTGCCrTAAA 
AAGGGGTCATGATACAGCTATCAAAGCAATATTCAAGCOTGATTTCAAGGATGCTTTCAGGGTTA 
AAATTAACCTTCCTTGNCAAAAAT^m}CACAACTTNATCAAAAAC^ 
CNTTTNTTTTCnTGACTTTCCCCATTTCTrCATNTNAAGCAATO^ 
NGNCTNAArmTTTTTTCCCAGChmTTTTn^ 

AGAACTTGGTCAATNNNAATCTTANTNTTCCGNCCAAAAAGANAACNAAAGANCN 

CATNCATGGNCTTGGTCCATNCNTGAAGGGTNKITGAATTCTTGGAAANCCAT^ 

NTTCITNNGANCXrTGGNNCGCNTrGNKmAGACC^ 

TNNACTGGNACCTNNNCNGGCGGGCCCnTrAAAANGGNCAATTNCAC 

TTTGGGANTCCAGCTCGGGNCCCACOTGGGGGNATNKmGGNATATACN 

GNTGTTT 

SEQ ID NO: 4428 ACAAGTCATnTAGGAACTAATAGAAATAGGAATGTGGGAAGGCCAGGTGGT 
TCTGTAGAATTITGAACAAGGCTrrTCCAAGAAACTCCTCC^ITCCGCCCC 
AGrrGGTOANCTGAAGTGGATTGACAGOTGAOTCCTTCTOTTTGCAGGGNm 
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NNCrCNTGTNCAANATTTTmCCAGGGTCCGOATTCATCANATCACCT^ 

TGrma^ACTNGATTTTANCAATTCCTTNCCNAGGCTGNNGAACNm 

TGGOTAACTAAGNANACITITAANGGGGGClTmAATGAGTC^ 

ANACTTTATCCAA>rmCCXIWrTTTTTAAGTAAGGC^ 

NTANTTTTAANGGGGGGTTNTTTTOTAGTAAAAAGTCTTa^TC^^ 

GCNGGNGNANATAGGGGGTNAAGCCCTTTGGCTAGGGTTATGGNNTrGGCThrm 

GGGGCCCrrCNACTCCCAATTTTNAThrrGAAANTAmrrGGC^ 

NGNCCNC 

SEQ ID NO: 4429 ACTCTGGATGCCAGCACCAAGATCTATGCTGTGCGCGTGGATGCCGNCCATG 
CCGATGTATACAGAGTCCTTGGGGGGCTGGGCAAAGATGCACCGTCTTTGGAAGAAGTAGAhR^G^ 
CATGTTGCTOATGGAAGNGCTACTGAAATGGNAACAACCAAAAANGGNTGTAAANCC^^ 
ANGNACTTACACAGACCTATTGAGCAGAACATAAACAACCTCAATGCTCCAANGCANATCGGAAT 
TGTGAGATTGTCCTTGTTNCATAAGACAmAGCOTCATITGATGANTNCACTACAGCANG 
CATGTCNCTTCTNCNCTGCCATANCKrACATAAGTNAACTGTTGm 
TNTa:AAGGGNGAAACCT^^•AGGGTTGCAACAGTAAGGNTGG^WANAANGTT^^ 
NGCCrrrGGCCCCNNTTTNGCAAGANTANCNNAAAATNTT^ 
CTANCAGGA 

SEQ ID NO: 4430 CGTGCCCNNACGATTNCNCNGCTTTGAAAGTGAAACTATGGNTACCAA^ 
CTATTNTNATTATAGCATNrrACAGTGTCTGGAAAANGATNTAACAATAANTAACTGT^ 
CAOSrAGACATCCATITACITAATCACAGAAGTGGATCNTGCTACATAhn^^^ 
TCTACAAATTCTCCAGGCTNACGAATGTCATCAANCTNNAAAATNAT^ 
AGAGATATCITGCNGCTTTTT 

SEQ ID NO: 443 1 ACTTTrnriTiTriTiTiTi^ 

TCACGAGCGCTCrCGGTAGCTCAGGAAAGCGACATAGTCTCTANCACTTAGCCCTCTCCTACAATG 

CAAAGCAAAAAAGACTGTGGCTCCAGGACTCTCTGTGGGCGGAATCGGCNCTAAGGAGTTGGNGC 

AATTATTTTGTTGCAAGGNANGAAGCCNAAAAGCCnXjCATGCAA^ 

OTGTTTCCCCCCrCCTTNrnWAAANAANGNAGCTGGTAATXSGCAAANA^ 

TTTAAANAGOsTGNCAAAAAraAANAANCCNGACAGNGAAACANGNCCTTTrC^ 

TrrmrmGTGNANNTTAAAAACCCCX:CNrrcCT^ 

AACANCTTGCCGGCCTT 

SEQ ID NO: 4432 GNCTCGNCGCCTTTN>n^GTCrrCKCACCAANCCCATNGNNGCNW^ 
AACACAmTCC^^AAGGAATNAGCATGGGTTCOTAGGAAGCAAAC^^vrAGCTCTCCATA 
CACTTTCACAGGATGATTAGGTGGACCTGCAATGAANANAATACATTTCAAAAGATGGGTTCT 
CTTACACX:AAGTTTTCACrGATNTACrrTAAANAAAAANC^ 

AAAAAATTCA7NACGGATAAAATANATCTCAGGAAAAGGTCCAAGTCCTCTCANAGACATACATT 
TGCTAATTAATATNAATTTAAAGTITGACAACAAAATTNCTATTGGGANCTACOT 
ATOCTOAGTrCNTOKrrCimfTTTACNTGC^ 
ATTTm'CTTNOTNfOTATNATNCAACATNGGrrGTTCT 

SEQ ID NO: 4433 ACGaJGGGGCGTCTTGTTCTTGCCTOGTGTCGGTGGTTAGTTTCT 

TGTTGGGACTGCTGATAGGAAGATGTCTTCAGGAAATGCTAAAATTGGGCACCCTGCCCCCAAC^ 

NAAAGCCACAGOTGTTTTGCCAGATGGTCAGTTrAAAGATATCANCCTGTCTGACTACAAAGGA 

AAATATGTTGGGTTCTTCTITACCCTTrrGACTTCACCTTTGTNGCC 

TTGATAGGGGCANAAAAATTmANAAACTCAA>rrTGCCAN>n'GATGGTNC^ 

ACTirrWCAANTAACCATGGGTAArrACAOCTAAAAAAAA^ 

ATTCCTITNGT^^^CAAAACCCNAACCCNCCATTNG^^mAAG^ 

NAANGCATTCTNNTNAAGGGGCXnTrTrNrmATTAN^ 

hrmAAAATNACCNNCCCTNNTGGGCCCmriT^ 

TTCNATTTAC™ANAAA^^^GGGGAAT^ITTNCCCACTTGGN^ 

AACCC 

SEQ ID NO: 4434 acgcgggggctgcgcgccgcctaggtgtctgggcgatctatgggcaagagca 
agggccacnatnacagattacggcgaggagcanctcaacgagctggaggccctggagtccatct 
accctgactccttcacagaattatcanaaaatccacccagcttcaccattactgtnacgtct^ 
ctggaaaaaatgatgaaactgtccagactaccxrmaagtttanatacagtganaaatccc^ 
aaactcccotitatgaaatattctcosfamaaaatntraaaaana 
ttaaaattactancatttcatgcmannaaaatcmggtatnatgatgam 
cnggancaataaaaaa^^^aanggatataa^r^tttaaatanaattt^mtam 
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natrataanntgaatciwaaantttanrranantttat^^ 

anaaotatnnataaaantaataatngnagnanttattnanatnanctanatnantancan™ 

atnnnhttantagtnaaananttwntgaaatgtgaattca^^^ 

aaaataaa 

SEQ ID NO: 4435 ACGCGGGGGCAGTCAGCGAGCCCACGTGCTTGTGTTGACTGGACAACTTCCT 
GGTGGAAAACCGCGACTCTTGCAAGTGGGCAAACnTGACOTTTTOT 

ccggagccanagccanctactcaacacctctaantaaaaagcanaaaaaacat>^ 
cggtnaggagcatncctaotataanannggttttnataatnggatn^ 
ncct>maaaaatamt^tnnancatctnnaaantanaacatcrgot 
tatttnnggttatnnatnanaactacrrtatnntatt^ 

seq id no: 4436 accgccgggttgaaaaagaaacaaagg aatac ntgagagttggggagaaa 

GTGGAGAAAAAAATGTGTTTGAAGCTCTTTCTGAGCTCATAATTTTAACAGCT^ 

ATGGAAAGGAAATCAGAAGTCNACTCA^^^GAAAAGGTACCACAGCTGTATGCATATTTGGATGGA 

GGTITCACCCATCCAGCCTGTC^(:^TACC^WGTGGCTGCCTT^ 

ACCTNCTCNGCNNANCCOSfGOTCTmCCCTNANGCCAN 

CTTNGTNGACATTCTCCTNCTGTNCrTNTTGCTCNTTAATGGGA^^ 

TNTCCTCNGNNCCGTNNNTCmGATNOWCCNCTGGCCGCCN^ 

CNCTNCCT!^CC>mTNCCCa^CNCCTNCCCT 

CCANGACCCTACTNTTCTCCTCTT 

SEQ ID NO: 4437 ACirnTITITITITrrr^^ 

AANTGCAAAATCrGTrCCTGGCATTAAGCTCCTTNTTCCmTGCAATNCNGTC^ 

CATNAATGCTTTCTIOTCCTraATGGTNTGNAANCGGNNATGGNCAAAOT 

AAACTNAAGGTAAATTTTTNAANANACC 

SEQ ID NO: 443 8 ACCCTNCAGAhn^TNGTGNT>nTrACNCCATTTTCTACAACr^ 
GATACCTTTATAGGANAANGCAAANATTATATTITrGTGTCTAACATATATAATCT^ 
GTNGATCTGTATATTCTTAATCATCTAATNAACCCACCCAATTTGAAAAC^I^^ 
GAA>rrCACAAATANAACAasrrCATATGTAAANGGCTGGACACTCACTCAATATO 
ATACTNGTTNAAAmTGCTNAAGTNCCAAAATGCACATT TAAANC NATT^ 
AANTTCAAAAAATTTTAATACANACAANCNANTTGATTnm^ 

SEQ ID NO: 4439 ACANGNGGAACAATCNGGTNTTTTTAATCAAAGAAGGNGGTGTTCAGTTGCT 

gctcacaatanttgataccccacnatttggaaatgcattggataatattaattgntggca^ 
tatcaactacattgatagtaaatttgaggactacctaaatgcaaaatcacnantnaacanaot 
atatgcctgataacagggtgcantgttgtttatacttcattgctccttnaggacatngac^ 
cattnngaatattgatrrattgaa]^gtttgnataaaaaantgantatca^^ 

AGTAANATNCNNTTACNCCNAAATGAAATNCNATCAT^m'AAAAATCANT^^ 
AGAACATTATATTANAATTATTNCNAATrrONlNANAANCGTATTAATAAT^^ 
CNTrANAAAANTAACGGNCCCN>rmACTm'ATNCNTGGAGN^^ 
AATAANTG 

SEQ ID NO: 4440 accatagccaaatctgggacaagcgagtgtttaaacaaaatgactgaagcac 

AGGAAGATGGCCAGTCAACTTCTGAATTGATTGNCCAGTTrGGTGTCGGTTTCTATO 

TGTAGCAGATAAGGTTATTGTCACrrCAAAACACAACAACGATACCCAGCACATCTGGGAGTCT^ 

ACTCCAATGAATTrTCTGTAATTGCTGACCCAAGAGGAAACACTCTAGGACGGGGAACGACAAT^ 

ACCCTTGTCTTAAAAGAAGAAG(>TCTGATTACCTTGAATTGGATACi^ 

AAATATTCACAGTTCATAAACITTCCTATTTATXjTATGGAGCACAAG 

CCCATGGAGGAANAANAACWCCCCNNGAAANAAGAANAATTTTGTGATTAANCTGC^ 

NNNNNNNNNNNNNNNNN>^ 

GGGNCG^^^^mTm'GNANCCANTTNGNCCCANNTNGGNNNT 

SEQ ID NO* 4441 ggtactggatccaggtgaggttgtgggctgngncccgaaggtgcitggctcc 

TTrATGGCGCATCACAAACTCAATCCAGAAGAaXjCrCGATCCAGGGGCTTCACC^ 

ATGAATTCTTGATAATTTCATGATATTCrCmATAGATAGGGTCATTAATGACT^ 

TrGAGCAAATCTCTACrrGACATGGTCCTGATGTCCACACTGAGGGCTGCTCCCTTGGC^ 

GAGCAATGTTATCATGTTGATCCGCAAACAAGGGAATGCCCACCATAAGGGATCCCATGGTANAT 

CGCCTCATAAATGCCCATTGGTTCCCCATGAAGTTATAAAANGGrrGGTTTGGGATGACCAANAA 

GGNCATTCTGGGGTAACCCCCCGCGTACCTNNCCNGGCGGNCGmWAAANGGGCNAATTCA™^ 

NCTNGNNGGCCGTNA>n^ANNGGmCCNANCTNGNNCCNANOTGGNNNA^ 
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^^S^NNCCT^W^^^AAAATTNNTTNCK^ 

SEQ ID NO: 4442 GGTACGCGGGGAGGAGATCGCCATTATCCCX^AGCAAAAAGCTCCGCAACAA 
GATAGCAGGTTATGTCACGCATCTGATGAAGCGAATTCAGAGAGGCCCAGTAAGAGGTATCTCCA 
TCAAGCTGCAGGAGGAGGAGAGAGAAAGGAGAGACAATTATGTTCCTGAGGTCTCAGCCTTGGAT 
CAGGAGATTATTGAAGTAGATCCTGACACTAAGGAAATGCTGAAGCrrTTTGGACTO 
GTCCAACCTTCAGGTCACTCAGCCTACAGTrGGGATGAATTTCAAAACGCCTCGGGGACCTGm 
AArrrTrrCTGTAGTGCTGNATTATTTTCAATAAATCTGGGACANCCCX:^^ 
NNATTTACNCCChrmN>TOGCCTNN^ 

N>rrrTNCCGGTNGGNCKITCMAAAGGGGGGAATTN^^ i i i i AAAGGGG 

TTTT^TCCCNNCTTNGGGGGN^^SIN^^^ 

SEQ ID NO: 4443 ACCTATrAGTAGTCACCGCCTTrTCCCTTCCTCCCAGCCCCTAACAACCA 

ATCTACnrCCTGTCTCTACGGATTTGCCTACrCTGGACATTTCATATAAATAGGTTAATACGATGCG 

TCCTTTTATACACAAATGTTCATAGCAGCArTACTCATAAAAGCCCCAAAGCGAAACACCTCAAGT 

GTCCATCAACCGATGAATGGATAAACAAAATGTAATATATCCACACAATAGAATCrrATTCGTCA 

ATAAAAAGGAATGAAGTAC iUUMU - J '1 14 1' lU U'ri4i4TGGGATTTTTTAGGTAGNGGGTGT^^ 

TGAACGCTTTCrrAATTGGGGGCTGCTTTTANGCCTACTATGGGTGGTAAAT 

SEQ ID NO: 4444 GGTACXjAUTGTNrrAGTGATGAGTTTGCTAATACAATGCCNGTCAGGCCACCT 
ACGGTGAAAAGAATGATGAATCCTAGGGCTCANAGCACTGCAGNAGATCATT^WATATCGCTO^ 
GTGNAGTGTGGNGAGCCAGCTAANTACriTGACGCCGGTGGGGATAGCGATGATTATG^ 
AGGTGAAATATGCTCGTGTGTCTACGTCTArrCCTCTGTNAATATATGGNGTGCTCACA CNATA NA 
ACCCTATGAAGCCAATTGATNTCATAGCrCAGACCATTCCTATGTATCCAAATOGTTCT^ 
GGANTNTAATGTTACAATATGGGGANATTATTCCGNAAGCCTGGTAGGGATOAGAANTOT 
TCANGGNGACCNAAAAATCNTAATNNGTGTCTGGTTAAAGAATGGGGGTCTNCTNCC^ 
GGTCCTANNAAAGGGGGTCGTNAGNGGTTGCNGCNTGTTGTAKrAmTN^ 
CACTGGNGAAAANhriTGNATAATTCTGG NCTCT C> WCNN TATNGAC^ 
TTTNGTTTGGGNNTTGGCNGGGGGTTTTATTTTAAATTTTGGGGG^ 

SEQ ID NO: 4445 ACNCGGGTGCAAGAGTCTCGCTCAGCNNNAAATANGNTNGCTTTC^^ 

TACANTGCCCATTTTGAAATTGCCTATACAGNCITAGNGACCATTTAAACCGGACGAACTA CGT^^ 

TTAATTTTCACrCTTNATGTTCAATTANCAGTTCAAATTAAAGAA 

TGAATGGTTTTGTATTAAATTGCmGAAATAGATITCATTTCTTGT^ 

CAATGGGNCGTGAGCTAGTTGAGGGNTAACOTGTANGTTGCANAGTGCATTNGCTTGNTTGNT^ 
ATCTTCTCTGTGATGAGGTCAGTGCTCTGATmTGAAGGAGGATATTCACTGAAGCTCATAGm 
TAAACAAGGAAATCACTGATAANAATGGGAAThmGTNCTGNGTTCTGGGAAAAAC^ 
GCNACTGATTTCAGCCAGCCTTTGCCACTACCCCTATAATTAAOTGCCAGT 

ANAGANAGTTrAACTANTATTTGNGGNCCAANAAAATATTCACGAGTTTGOTANAGATrACCCT^ 

TGCATGTGCAGAGa^CTmAOTGCCACCAGCTTTTCAAGAAAAANGCCC^^ 

NCCGCGAACCACGCTT^^WGGGCGGAATTCCAA^^AANANTTGGGCGGCCGTTO 

ANTCCNGCCTCGGGGTTCCCAAANCCTTNNGGGGGAATCCNATGGGGCCATAG 

TGGGGGGGNANAAATGNNTTTCCCGTCTCNCAATNCCCCCNNNANCANTATCNC>^ 

AA^OTTNAA^fNTGNGTAANANCCCCNGGGGGNCCCCNNAGNAGGGCGAGCC 

SEQ ID NO: 4446 ACAAATGTTTTTrATTCAAAAATACAAAATAAATTATCTGTAGGCATGGACA 
ATGACAGCAGTAAACCATTATATATT^rrcGCAACTGAAACCAGTAACTGATOGT^ 
CAGCCAGCCTTTTTCTTCATTITCTNCANOTGACT^ 

TNGGGCTTCCTGNCACAGNNCA>rmCTAGTAGGGCCAGCTGNATTANNGAA>nWC^^ 

CGGGGAGNOTANNNCCTCCCATTNCACCCATTGCACCCATTCCAGGGNCCTTCT 

mCTGTGACTACAACTITCTGCTGTAGTTAACANAAGAGGCCNCACNAANCAGC^^ 

AGCAGGGTCTTCACANACCrrrGTmGGGCCCAATGGATTCCCITITNTC^^^ 

ATCTCCANCCATTAGNANNNTANCCCACTTTNTGAANGAANCTTTGG^ 

TCAAAAGATCCTTCCACAACCTGCATTTCTTAACCAATGGGCCAhrmGGNGGNATT^ 

TTTAATAA^^^T^^^WACCCAATTTTT 

SEQ ID NO: 4447 ACnTGGGAGAATCGTGGTGTCCTGGATGGCCTGATCAATGTATTAAAATCA 
AGCAGTATGTCTTCTTAAAATACTCrrAAATCCATACAAATGTTGCTTGGGT^^ 
rrCTCCTACIXjGCATGATAAAATTTGAACATGTGTTATGAAAAGrrCACACCAArmGGrm 

CCTTCTTCATTAAKCCCGGTGGGCrrCCCTTTTCCATTT^^ 

TTTNTrrGGCNGNCNA2m'C(>rNGTGGNCmAAAAA'ri'l"i"ri J 1 IGGTTTGCAAGGGCTGGAi i J J i' 
NTTTTTTTmCCATNGCATGGGl^TTTCCCGCCCAGGG^ 
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NCTTGNGGGAATTAAAGGGCANAA>rrTGGGGGACCGGCAAATCNACNAANAAAAACCa^ 
CCC^m'GNGGGAANGCCAAAAKTAAAGGGGNTN^INTI>^TATTAA>^^ 

TNCANAANAGGNGGGGNGCGGGGG(XN>1NAAANATAGNNGNGNCCCAAGNGGGTCCAAAAAAA 

NrrTTTNGGGGGNGGGG>WCANNTTCCCATTCCm'Cr^ 

NTTT 

SEQ ID NO: 4448 ACCTACTGGTAGTTGGGTTCAGGGAAATGGGATTGACTTGGCCTTCAGGCTCC 
TTTGGTCATAATTTTAAAATATGGGAGTAGAAAACAACAAAGAATGGAATGGACT^ 
TGAAAGAGCATTTATCGTTTGTCCCTTGAATGTAGAATTTGTTTTTTGAm 
AAAATGTGACAGTrAAAATGGNGCATTAATGGTITITAT>rnOTAAT^ 
TAATmACCTTTCCCAGGGGGANC^^^AAGGC^^mAAAAr™ 
T^O^GTT^AACC^GNAGTNGCCAAGNTATAANTNC(:™CAGGAA>^^ 
AAANCAANNNTTrT>rmAACCTTTGGAACCNTO 
TTTAAAANTTTTTTANTTTAAAAATNGCNANTCCTT^ 

GGGGGGNTAACTTCACCANNTTNTAAACCCCNCCNGGGGGGGCCCTTTAA^ 
ACCCC 

SEQ ID NO: 4449 ACGCGGGTGCACACTTTGGGAAGCATCACCATGAAGGTGAGGGACAGAGCTC 
TGGAGCTTTCACCGCCTTGAGTGTCCTTOTCCAGTGATCTTAGACCTTGGGAGG/^ 
CATTCTCAATGAAAATCTGAGGGAGCCCTTTCCAAGGCATTGGAGATTGAGAAGTTNACGACTTC 
ANTAGGAGAAAAGGAACTGCANAATGTGGNTGAAAATAGAANGCAAACTTGNANTTAAAAOT 
GGNACTGNAGGNCCGGCCCNAATAGGCTNAAACCTGGG^r^^NCCNNAACTm 
NNGGGGGNCTNA>INACCAATGT(>JTNGANATNGAGACCCTCCriTGGCTACACAGAGNAAC^ 
CTNTACITANAAATACAATATNAANANAAATAATCCATTNOT 

GGCAANCCATCCGOTANGGGCTrmGTTOAAGGGGCANGGCTGGANim'GNNGAA^ 
NATAAGTTTTTITTTNTrTGAAAGGA 

SEQ ID NO: 4450 GCGTNGNCCCCNGCCGTTTTNCTGGCNGAGCTGAGGCTCNAATTCCmfNAGG 
NCAACGTGGTGGGACTCACCGmrWGGCCAGGGTGCTTATGGAAACANGTGNCGAGGACGCCGA 
ATGTlSrrGNACCAACCAAAACCTGNCGCCGACGGCNTNGNNAAGNGNACACAACCCANANAC^ 
NCCCCATCTGTTm'GCCCTGGNTGCKrCANCCCTACCATCACTGGGCANGNCTANN^ 
NNGAGGAANNNCCTGAACTTNCTNTGGANAGTGAAGATAAAGGATGAAGGCTACNAGAAGACC^ 
AGGAAGCTGhriTNGCTCCNNAAGAAACNTAAANCCTGGNATGATAT(>^ 

CAGNGAGTGANANCTGGCGCCANGCAAANTGANAAACCNTCNGCGTATCCAGCGCAGGGGCCCG 
NGCATCATCTATAANGAGGATAATGGTATCATCAAGGCCTCAGAAACATNCTGGAANTACTCTGC 
TTAATGNAAGCCCNCTGGACATTrrGNAACGTGCCCCTGGTGNGCTAGTNGGACCTTrm 
TTACCTGAAACGCCriTCCGGAAANTTNATGAATTGTTCCTTNCNGGCCGG^^ 

SEQ ID NO: 445 1 ACll"lUlUUlUU-n'l"llllU"'lll"ri"l"lTri"rATTAGGGCAAGTGCATGTTCTGTA 
ACATATTTCACTTGCAAGCATGAAANATGAGGTCTOTCTNTriOTACATGGGC^^ 
TGGTCACAATANA<XNCCACAGNGGGGGCAAATNACCTTNCTTTCGGGCAGGGGGCAAAGC^ 
GTTAAAGGACCACCANGNGGNGTTTCCTCCTGCAGCTGCTrAACCTTTO 
GNAGTTTTTTCCACTGTTOTCTCTTGGCAAAGNAATCAGNGATACNT^ 
CNGTTATITAAGCACCATTlSrrGGGGACCTTGGTANC>m'CAGTGT>m^ 
GAAAAGTCAAGGGAAGANTTANTGGTTGTGGANGCr^A(XiG^mCATANATNANm 
CAANCCTCGNTTTCATGAACAGNACACTATTACAGTAACCAAGTTTTT^^ 
TCTCCACNCTTNCOTTTGTAAGGNCATTAAGATATCCNCANATTCANTGTT^ 
AAATNCANNAAAGAATTGNCTTGAAAAAATGGAAANACTAAACNNGNGANTGC^^ 
GTAATTTAGNGNGCAAAAANCCTGN^^S^ITmGNGGGAAAAATT 
rmTTTTTA 

SEQ ID NO: 4452 GGTACAATGCCTATaJCrrCTTAATCCAAAACGTTCTGNGGCTCCAGANGGA 
GGAAGAAATAGAATTTCTCTACANTGAAAACNCGGNTAGAGAAAGCCCCAACATTO 
GGGATCCTGTCCTTCATGCAlWrCrrrCATTGGCTCTTrrGAGACT 
TOTACNGTGGTGCCTCGCCTGGNCAAGNCTTGTNNATATTCTNACCAAA^ 

NACCGCANAANATTTAAGGGNGAAATATGGNATGGATGATTGTTGATATOGCCCTATAAACCTTG 

TAAAGTGNNTTGNTITTTCTTTGCANACTTATGGATCCCTACCCACCTT^^ 

ACCrmCCCTGNACGGCC^ICTANAGGGCTAATTNNNNNCANC^ 

C(>IANCATCNGTACCCNAGCTT^GGCGT^^TNCATGGGNCAATAGCTTTNT^ 

GTNATTCCTCTOTCAATTTTANANAAA>rmCCXjNNNTN 
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GNGNNCGCNATANAAT 

SEQ ID NO: 4453 ACNGCGGGGCTCACTGAGCACCGTCCCAGCATCCGGACACCACAGCGGNCCT 
TCGTTCTACGCANAAAACCACACTrCTC/VAACCrmACrrCANCACT^ 

ATGCACAAGGAGGAACATGAGGGGGCTGTGCTGGGGGCACCCCCCAGCACCATNCTTCCAAGGTC 

CACCGTGATCAACATCCACAGNGANACCTCCGNGCCCGACCATGTCGTm'GGTCCCrrGTTTO 

ACCCTmTCTTGAACTGGTGCTTGTCTGGGCTTAATANCOTTTNGCCT 

GACAGGAAGATGGTTGGCGANTTGACCGGNGCCCANNNCTATCmTATCCGNCAATGTGCCTTGA 
ACAThn'GGGCCOTGNTTNTNrGGC>m'CCATANTGACCATTNGArrAATO^ 
TGNTCTTNNACAN^^r^rITCCOTTATTTTG 
AGNCACCCATTCTCGGNAATCTTTTGNANTTATNTTKTTA^ 

SEQ ID NO: 4454 GCGTGGCCGCGGNCGATGTNCANCACAGGCTACATGTAGGCAAGCTAAAGCT 
ATCATGAAAGGAGGATACAGNATGGOTAAGATCCTGGTCTlSrrmGTATCATTNACTATC 
GCANGGGGAAGCAGCATGTCrrCTTGGCCCNTGNNCTGNACATACTGGAGCNAANGTCTATN^ 
ATGATA(>^CT^^'CANGCAAC^™CCATNTNTTCTAACAG^rmGAA^ 
CATCCTATAGGGAAATTNCrmGGAAAGGCNNATGAANATCTANT^ 
TNGCTNNCAATCAATCTTNrmTTTCGNATACCTACTNa^AATT^ 
NNAACACNCnTNGTrGGAATCTNANTCTNTACTNGATCTNTTCT^^ 
CNNANGAAATCrTNTTGGAAATATTACCTGTNANrrCA>m 

TACNANGNTTCATCNCAAAGCTTTGGAOTACAANTGGGAAAANCn>JATTNA>^^ 
CTTTCTTTGATGNNAAATATmAANNCTTNTTGNGC^ 

SEQ ID NO: 4455 AT^m^CCATGAAATATCCATGAACATACTTATANGTNAAGTAr^ATTTATTTG 
AATCTACANAAAACAACAAATAATTTITAAATANAATGATCrrTCCTAGATATO 
ATACANNATAGCTANATTGANGCCAAGGGCCAANAGAATATCCGNACTTAAATTTCA NGA>rr TGA 
ATGGGTTNGCTAGAATGTGATTTTbn^AANCATCACATATAATATGATGGGACAATAAAm^ 
TATTAGTCAAATTTANCTTGQAAAATCCNGNAKCTTTITrCTGT^ 
GCrmTNCATGATCCACAAAGTCCTThmrCrNNGCGCCCr 
TGGTNNNNAGGmrrmCCNCNCATCTTACCm'CNCAG 
AGCANATOTTAAGTN^^m'GC^^^CANAACAATAAATTATr^ 
NTCTGNATATNTrCGGGNTAGAATGNAANCAANCOTATATTTNGNGCCT>nvJA 
N^^ITAANNTACTTTTNTATT 

SEQ ID NO: 4456 acgcgggggagggcggtggctcaggctcctggaaaggaccgtccacccctcc 
gcgctggcggtgtggacgcggaactcagcggagaaacgcgattgagaaatggaaaagaaaatga 
aataaatcagcagttatgaggcagagcctaagagaactatggcaacatcaggtgactgtcccaga 
agtgaatcgcagggagaagagcctgctgagtgcagtgaggcgggtctcctgcaggagggagtac 
ctcggcccnanc^r^cc^^^aaggncaa^^r^cnacncnntngl^ 
ntngnacccaacntnggnntannaanggnattacmotttch^ 
nantnccc^wnacattncaacccgaancnt^aangggnaancccngggtgnc^ 

NTAa^TANATTTATTNNCTTGGNNOTACrrG(XCGrm 
TNATTAAATCGCCCACCCCCCGGGAAAGCGTTTGCTTTTTGGNGCn^ 
TTNCnTGNCNTGGCNTTNGGTrGGGGAACCGGTTTAACTTAm 
NATTAGGGTTACCCNGAAA 

SEQ ID NO; 4457 GGTAC iU ' r i-lun TriU -ll' l U'llTN' n 'lU' i lU- i - i CriTl-iU-ril-iUU ilU'iTNAAT^^ 
NCXlAAAirmATTTTAATGCNCAGNATGANAAATGAACTrrmA^ 
AAANGACNCGGCAAATAAATTANACCTNTGTTAAAGCGAAGGTCAGCTAAATOTCC^^ 
GATNTAATGGGCNCCGATAAACANATTCCACAGTCNTTTITAATANAGTAT^^ 
TTGNTAAAAACTGGTCCAAANATNGACAGCNCGTGGGAATGCTTAACAGGGGNGGNGATCAGGG 
ACNCNTTT<XTGGGNGCCNNTrATGATGATGTTGTCCACNCNCANAATOAC 
TGCACTCAAAAAAACCTGTNGCTTCATTTAGAAAAC>rmCTGNGTATNCCCAGGG 
TKmCTATGGGGNCCnKCCCTTATTTNCATTNCCACTANNGG^ 
GCCCTNGTGCTTGGCCCCNNGTNCNGCCCTGCTATNTCCTGCNATATGTNNANC^ 
NCWn'CTTNGGmTTTCNATAANACmcrrTGCN 

SEQ ID NO: 4458 GCGNGGGC>mGGCCGANGNNCACNGGCTGCTNCCN>nSfGT^ 
ATCNGAANCANGAGTNACATNTNNCAAACNNNGACTTANGAGCNNCOTCNGNG 
NTCAACATGCACATGGCGGACTCTGGATTGATGCTGGAAGTmATACNGAAAATGANAACAACAA 
TGGAACANAAACACmCTGGAGCATATGGCTTCTANGGGCGCCNNNAACAG^^ 
CTGGAACITGAGArrGAAAATATGGGTGCTCmh™AATGNNNATACCTO 
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ATACTATGCCAAAACANTCTCTAAAGACTTGCCATGAGCTGTAGAANTTCTTG(^ 

AAACAGCCATTGNGAGAAGCAAATATTGAACT^GAGrcGTGGAGTNATCOT 

AANGTTNAATCCTATTTACAAGAAAATGTTTTTGATNATNWTOTGCCNC^ 

GCGCmGGACAOGACAATT TTGNGN CCCACTGNTAAAATTCAAATTITTrAGTC^^ 

ATNGG>OTATTTTACCrCCCCTTTTCG 

SEQ ID NO: 4459 GGTACTT ri - n - i ' lTn -i Trn " ! 1 1 L i 1 1 n TAGGGCCAGGTTTATTCCCTCACAT 
GGGTGGTTCACATACACAGCACANAGGCACGGGCACCATGGGANAGGGCAGCACTCCrGCCTTNT 
GAGGGGATCTTGGCCTCACGGTGTAANAAGGGANAGGATGGTTTCTCTTCTGCCCTCACTAGGGC 
CTAGGGAACCCAGGAGCAAATCCCACCACGCCTTNCATNTCrmANCCAAGGAGAAGCCACCTTGG 
TGACGTTTAGTTCCAACCATTATAGTAAGTGGAGAAGGGATTGGCCTGGTCCCAACCATTACAGG 
GGTGAAAATATAAACAGTAAAGGAAAAATCCCGTITGGATGAAGGCCCCNGGAAAGGAGCANAT 
GACCCCrrCNAAACNTNTTTCCNGGGAAAGGCANNTNNCTGGGCCnTm 
CIT^GGCAGAAAGGTTAGGGAAAATGATGGGGCTCOTTTG^TNGNATT^^ 
NGGCmGNGGAAmCCCCCCCANGOANCOTCCCTTTNGNOTGOTGGC^ 
AN>nvIAAAACTNCmAANNAACGGGNNTnsiNCCCa^^ 

SEQ ED NO: 4460 GGTAKTrTNGGGAGCTACAGATAGTAAAGATGATGATGACATTGACCTCm 
GGATCTGATGATGAGGAGGAAAGTGAANAAGCAAANAGGCTAAGGGAAGAACGTCTTGCACAAT 
ATGAATCAAAGAAAGCCAAAAAACCTGCACTTGTTGCCAAGTCTTCCATCTTACTAGATGTGAAA 
CCTCGGGATOATGAGACAGATATGGCNAAATTAGAGGAGTGCGTCANAAGCATTCAAGCANACG 
GCTTAGNCTGGGGCTCATCTAAACTAAG^^^CCAGTGGGATACGGAATTAAGAAACT^C AA^ 
GTGTGTAGTCGAAGATGATNANbTITGGAACAGANNTGCTGGAGGAGCCAGGATCACTGCTm 
GGACrATGTCGANCCCATGGATGTGGCn'GNTTTCTACATGATCTAAAATCCATCTGGAN(^^ 
TTTNAAArNNNAANATTm-CNGGATrnGTCrCGNGCTO 
AT^^^C^TTGC^•GGCTNC^^mGGGNGTATC^CAT^O^ 
GG^^^^GGCCNGTTTAAATGGGGAAAr^SfTTOAAN^^ 
ACCAANTGGGNTAANNNTGGG 

SEQ ID NO: 4461 NCNAGCGGCCG(>lCNGNCGGGNACGCGGGGCTCTTCCTGa4CrACATNA 
O^CANGATCAANGTGAAGAGGATAACCCCATGCNGGAACTTCACATNNGCGAACnCT 
ATOTGTGTTGNGNANAGTGGACACAGACTGACGCNANCNNNCANGGTGNNGGAGCAG>rmACAG 
GGCOTACCChrrGTGTTTTCCAAAGCTAGATCACTGTCAGATNCTT^ 
GATNGNTGNNCACrrGNACANTTNGAGGGGCCNANGCAGAAGANAT>r^ 
GTNCNGGANTOTGAGTTAAGAAAAAACAACTTCTCAGATCTGGAAACTTTGG>r^ 
GAANACATCGm'CTGGGGATNAAATATNACCCAAGCATNGGTOTCTACAGCCTGTACT^ 
NNGGCnT^GGTACGNCAOmrCAGCmCTCAAAACANGAAACTCAC^^ 
NTACACGAAATCAGCTNAGAAGAAGNCCTGCCTGTTNCAGCCANAAGTTTGATNNGGTC^^ 
CTGGCAAATAATTCCGTTCTATCCAAAAAACCNTAACATNrmGGCC^ 
AAmx:CCGGGNTCNTTTGTGCX:CGACCGCC 

SEQ ID NO: 4462 ggtacttgcatgtaggacaactcagttagaaaagtatagtgaatggatggaa 

TCTACTGTATGATAAAAATGCTACAAACAGCATTTAGTTGCCATNAATAAGAAATATAOT 

AAAAAAATCCAAATGCTCGCArrGTCCAGAAAAATTNAACANGTTTATTTA™ 

CTGAACCGTGGNAACTTGTTCTCmGAAACNAATTANCTTGCATTNATGCCh^^ 

ATTGCNATTATACTCNCACAGATGAAATTGGTmrACTTnsIAATTO 

TTTCCNCATGOTATTGCriTCTC>nmTvrmCTCT>^ 

TGCc^mT^GC^r^^m•ANCNTTTAAAT^mc^TCTACANCT^^ 

TNTTATT^^'CGG^^^^NNCTITITATTNGTCT™ 

NTTTTTAmTATGNATCTC^rmTCTCTATCATAATANCTTCCAT^ 

TNGTTNNNTTCNNNTNNG 

SEQ ID NO: 4463 ACANAGTmTrTCAACAACCTGAATTTTAAGTTTCTrrC^ 

CTACATGGACCTTCCATAATCTTTCTGCAATGTGATCATGCTCTGANATGATOGCATAAAGGAGGC 

NTTGGGGTGGGATGANGCANGCTGGGCTXjGGGCrTCTNTTTCCAG>n^>^ 

NCANCNCANTNTITm'AATNfNTATATTCTAANGChrmcrGA^ 

ANATCCTGAACTATGAAGACTTNCrrrrnGGAAAGAATATCATTATTGAN^^ 

GCmCCTCCGTGGTOTAAATCACTATCTAGCNATATNACCNTATNGATCTCCTCCACCA^^ 

CrAAAGAACTTATGGOCTCNCTCGGNAANOTATNAATTTNTCTCCACTCCA 

AATGAGGGCTNAAATGN>nNlAAAAAAAAAAATNrrCNCCKrGACCCAT>m 
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NCTCTNNAANGATAAATTGNCArrTGCTTCCAAAAAATCTN 

SEQ ID NO: 4464 ACCACCCTGAGTTCCrGTCC^GGCCTATCAAGCCCrCCCCACCATACTTTGGC 
CTCCTCCTGGCCrCTOTGGGGCGGCTCTCACATTACCmCAGAAAGGCTTGCAGGC^ 
GGACACCTATAGOTGACAGGAGTGGAAGCAGCTCCCCTGACTCTGAAATCACCGAACTGAANm 
CCATCAATAAAATCATGACTOATCTTQTAGCGGATGATTCnTCVW^ 
NAGTTTACAGCTCTGACTITACAClTCGGNTmGGANACT^ 
TATTTTATTATGCNGAAANGGTNTTTGGGAANCmGTCACTKGTAm 
CNGCNGNNOAANCACTCTAANGCGGCGAATTOCAAGTNNANTGGCCTGCCGAAACTAN^^ 
C^fNGANCTACGGTACC^^S^GCTTNNGNNNANNATANGNNATA^^ 
ATTATNNATTNNCNKmACGNNANTN^ATTm 
GGGAGNCTNAAAT^rA^^TNATGNNTANTCAN^^^^NN^^> 
GT 



SEQ ID NO: 4465 GTACGCGGGGGATTTNCNATGCGGGGAANANGTGGTTTTGACANAATGCCTC 
CTGOTCNGGGTGGGCGTNCCATGCCTCCAThn'ATAANANATTATGATNATATGAGCCCTO 
GGACCACCTTCCCCTCCTCCCGGACGAGGCGGCCGGGGTGGTANCAGAACTCGGANTCITNC^ 
TCCTCCTNCAO^ACCACCTATAGGGGGAGACCTCATbWCCTTOTGACATAACAGC^ 
AACCNNNNCAACTGCNTGGTTTGGTTTC 

SEQ ID NO: 4466 ACNAGAAAAGGGTCCGAGCACAAGCCAAGAAGTTTGCGCCCTCATAAGCAG 
COACCTTGTGGCATCGTCAGAAGGAAGGGATrGGTrmGGCAAGAACTTGrrrACAACAT^^ 
AAATCTAAAGTTGCTCCATACAATGACTAGTCACCTGGGGGGGTTGGGCGGGCGCCATCTTN 
GCCGNaraT^GGTGTGOOGACATTGANTTNCOT 
CCGTCCTTITNGGATTTTAGAATGGmOTOTAAN 
AGTATTCTCAAACTGCTG>fNAANAATATAAAACriTrTAANACTT^ 
GCCGAANT^CCCQ^CTCNTGGGAATGCCAGNCCATGCTTT^T^WArc^ 
>rrTGGNCC™aAGCTGGGCrGTAOT 

TAAAACCTTTCATATAAACNNGCGNGTCTGGNNCTGTCTTITC^ 
NNATTTTCGGGNNCTTGCG 

SEQ m NO: 4467 GGTACITITITTTITriTITIT^ 

GAATGACACTGTATACAGGTGTGNGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCA 

GGGGCAACAGCCCCAGCGTGCa^GGGCCAACATTGCACAKrGGAGTGCAAAGGTTGCATGCTANG 

GGCGGCTACTAATAACCCCGTTTTCNNGTAnATCTGNAACATAATATG^^ 

CCGAATACCACTAACAGGAGGAATCCAAOTGGTCATNGAGGATGCCO^AAATCAAGGGC^ 

ATmTNAATGCCCTTTNNCNGCTGANGCATANOCCTGGGCCCC>IG 

TCTTGTCraCTAGACTTCACrNGTmTANGCGAATTGCTTT^ 

TTATAT^r^AAANANNNGGNTTGNACAATGGGCr^CNNTAC^m^A^nrGN^^ 

NTOTCCTNTGGATNrmGATCACCrcrCTCAACCm>^ 

ANGANAAACmcn^TATTrmmCCTGGNNT^^ 

SEQ ID NO: 4468 ANATTTGGCCCTGGTTGATTTCTCTTCTGAATAGTTTCCATCrc 
GACCTCTCAAOTATTAGTGCGA(:L\CCACTT(rAGAGGAGriTGAATTANAAGGATT^ 
AGANCTTCTTTCAGGAACrrGGATTnTOWAAAGGTCACC^ 
TOAGC^WCGACGAATACNACNGCAANGCTNGATCmTATNGGNAAATGGATTG 
CTAGGCTNATTCAGTGTGAAANTGANGNAGGGAAATTGTTGGGTATCACGANANATCCCNGAATT 
AATACn-GGAAGACNCCAGNGTAGCNAAAGAGAACCGArrCTACAAGAACAATCTGANATAGAGT 
CGCTTG CTOTAAN TGGGAOTCCANGGTNTAAAANCAGGGTGNTT^ 
^^^ATAT^ mTTT^ nrNNCNNGANANGAATO^ 

GGim-GGNCTrmCCTAAAATCrrGGNGGCCTTNAATATTAAAC^^ 
TANTNTCTTT 

SEQ ID NO: 4469 GGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGOTCTATTTCGACCGC 
GATGATGTGGCn-CTGGAAGGCGTOAGCCACTTCTTCCGNGAACTGGCCGAGGAGAAGCGCGAGGG 
CTACGAGCGTCTCNTGAAGATGCAAAACCAGCGTGGCGGCCGCGCrCTCTTCCAGOACATCANAG 
AANCCAGCTGAAGATGAGTGGGGTAAAACCCNAGACGCCATGAAAGCTGCNATGGCCCTGNANA 
NAAAGOTGANCCAGNNCCirrTGGAACTTAATGCCCTGGGNGNNTGTNGCGC^ 
^^^TGTANANTNTTCTGGAN^a^TTTANCTTCT^^'ATNANGTGGGy^^ 
NATTG^mGATNANT^m^ANCTAAN^ITT^^^ATTNTGT^ 
TGAAATNrrCTTOTAGATNGGATAANCANTTTCAGCTATGTATTTAAGA^^ 
NNNGNTGACCNTTGGGATANGGCTOTCTT(mTATTTAAAATOGGT^ 
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T^^^sITCATGAGGNTTGCTNTTTNAATlW^CANACANCTCNCX;OT 

SEQ ID NO: 4470 GTACGAGATGGCACCCCTCCAGAGCCChnTCTATGGAGATNAGATGAATCTT 
TThrrcNCTGTGCCATAAGATCAANCNGTGTGACTACCCACCACrCCCNGGNG 
NAAGTNACCANAACTNGTCAGCATGTNCATbrrGCCKTGACCCCCACCANATAOT 
ACGTGCACCAGGTGGCCAAGCAGATNCACATATNGATGTCCAGCACCTGAGCATGNATGCACCNG 
TCCTTATCAAANNCANGCACCACnTrGCCTTACTTGAGTCGTCTC^^ 

GCCTANAACAGATAANACCCAGGGNTCAGNANGTTCCCCAAANGCTGCNCAACCTTACANCANAT 

GCTTNAGGCATANNNAACTGANGGAGGGGCGCTGGCCACAATGTGNACTGATGGOT 

AANTTCCTTT^^TTATACTGTGTGNAGCA^mTC:^AAANTTGGT^ 

ACNCAGC 
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Figures. 

SEQIDNO:4471 

MASRSMia.LLLLSCLAKTGVLGDIIMRPSCAPGWFYHKSNCYGYFRK^ 

QSYGNGAHLASlLSLKEASTIAEmGYQRSQPrV\n[GLHDPQKRQQWQWroGAMYLYR^ 

WSGKSMGGNKHCAEMSSNNNFLTWSSNECmRQHFLCKYRP 

SEQ ID NO: 4473 

MEKIPVSmLLVALSYTLARDTTVKPGAKKDTKDSRPKLPQTLSRG 

EALYKSKTSNKPLMIIHHLDECTHSQALKXWAENKEIQKLAEQF\a.LNL^ 

PDGQYWRIMFVDPSLTVRADITGRYSNRLYAYEPADTALLLDN^^ 

SEQ ID NO: 4475 

MRAWIFFLLCLAGRALAAPQQEALPDETEVVEEWAEWEVSVGANPVQVEVGEFDD 

aeeteeewaenpcqnhhckhgkvceldenntpmcvcqdptscpapigefekvcsndn 

ktfdsschffatkctlegtkkghklhldyigpckyippcldseltefpliyv^^ 

vtlyeiudednnlltekqklrvkkihenekiu.eagdhpvellardfek:^^ 

qfgqljdqhprogylshtelaplraplffmehcttrffetcdldndkyialdewagct 

qkdidkdlvi 

SEQ ID NO: 4477 

meklviqlkesfggsseivdqleveirnmtllvekletldknn\a.^^ 
ceaskdqntpvvhppptpgscghggvvniskpswqlnwrgfsylygawgrdyspqhp 
nkglywvapl>m)grlleyyilynn.ddlllyinarelritygqgsgtavyn>^^ 
myntgniarvnlttntiavtqtlpnaaynnrfsyanvawqay 

SEQ ID NO: 4479 

mterrwfsllrgpswdpfrdwyphsrlfdqafglprlpeewsqwlggsswpgyvrpl 
ppaaffispavaapaysralsrqi^sgvseirhtadrwrvsldvnhfapdeltvktkdgv 
veitgkheerqdehgyisrcftrkytlppgvdptqvsssi^spegtltveap^^ 
eitipvtfesraqlggpeaaksdetaak 

SEQ ID NO: 4481 

msstspnlqkatolaskaaqedkagnyeealqlyqhavqyflhvvkyeaqgd 

irakcteyldraeklkeylknkekkaqkpvkegqpspadekgndsdgegesd 

lqnqlqgaividrpnvkwsdvaglegakealkeavilpikfphlftgkrtp\^ 

pgtgksylakavateannstffsisssdlvskwlgeseklvknlfqlarenkpsot 

slcgsrseneseaajookteflvqmqgvgvdndgilvlgatnipwvldsairrrfekr^ 

iplpepharaamfklhlgttqnslteadfrelgrktdgysgadigits^almqpv^ 

qsathfkkvrgpsradpnhlvddlltpcspgdpgaiemtwmdvpgdkllepwsm 

lrslsntkptvnehdllklkkftedfgqeg 

SEQ ID NO: 4483 

MHKEEHEVAVLGAPPSTILPRSTVINfflSETSWDHVWSLFNTL^^ 

VKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILlVmGFILLLVFGSVT^^ 

EKRGY 
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SEQ m NO: 4485 

MRTIAILAAILLVALQAQAESLQERADEATTQKQSGEDNQDLAISFAGNGLSALRTSGSQ 
ARATCYCRTGRCATRESLSGVCEISGRLYRLCCR 

SEQ ID NO: 4487 

MPAPEQASLVEEGQPQTRQEAASTGPGMEPETTATmASVKEQELQFQRLTRELEV^ 

QIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTGVSKPRVSDAVQPNNYLIRTEP 

EQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHS 

FIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSWSRAQSPSYV^ 

SLGSGFGSPSVTDPRPLNPSAYSSmPAARAASPYSQRPASPTAIRRIGSVTSRQTSNPNG 

PTPQYQrrARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAWQHLGPSLQRTVHD 

MEQFGQQQYDIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTY 

YSPVYRSPNHGTVELQGSQTALYRTGVSGIGNLQRTSSQRSTLTYQRNNYALNTTATYA 

EPYRPIQYRVQECNYMU.QHAVPADDGTTRSPSmSIQKDPREFAWRDPELPEV^ 

QFPSVQANAAAYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACG 

LWGK^TDENKIAMBCNVGGIPALLRLLRKSroAEVRELVTGVLW>^ 

ALSTLTNTVrVPHSGWNNSSFDDDHKIKFQTSLVLR2m'GCLRNLTSAGEEA^ 

EGLVDSLLYVfflTCVNTSDYDSKTVENCVCTLRNI^SYRLELEWQARLLGLNE^ 

KESPSKDSEPSCWGKXKKKKKRTPQEDQWDGVGPIPGLSK5PKG 

TLLAESSNPATLEGSAGSLQNLSASNWKFAAYIRGGRPKRKGLPILVELLRMDNDR 

SGATALRNMALDVRNKELIGKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSK 

NMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLR^ 

WQNHFITPVSTLERDRFKSHPSLSTTNQQMSPnqSVGSTSSSPALLGIRDPRSEYDRTQP 

PMQYYNSQGDATHKGLYPGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNF 

DAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSYRAEQ-^ 

GSPDSWVYDQDAQQRNSFFLTLFRLR 

SEQ ID NO: 4489 

MSGIALSRLAQERKAWI^HPFGFVAVPTKNPDGTMNL]^^ 

FKLRMLFKDDYPSSPPKCKFEPPLFHPNVYPSGWCLSILEEDKDWRPArriKQILLGIQEL 
LNEPMQDPAQAEAYTIYCQNRVEYEKRVRAQAKKFAPS 

SEQ ID NO: 4491 

MCDRKAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKJCEFDK^ 
GRNFGSYVTEIETKHFIYFYLGQVAILLFBCSG 

SEQ ID NO: 4493 

MENFQKVEKIGEGTYGWYKARNKLTGEWALKKIRXDTETEGW 

HPNrVKLLDVIHTENKLYLWEFLHQDLK:KFMDASALTGIPLPLIKS\^^ 

HRVLHRDLKPQNLLINTEGAIKLADFGLARAFGWVRTYTHEVVTLWYRAPE^^ 

YSTAVDWSLGCIFAEMVTRRALFPGDSEmQLFRIFRTLGTPDEVVWPGVTS^^ 

FPKWARQDFSKVWPLDEDGRSLLSQMLHYDPNKRISAKAALAHPFFQDVTKPWHLRL 
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Figiire 2. 

S]^QIDNO:4472 

AAGATATAAAAGCTCCAGAAACGTTGACTGGGACCACTGGAGACACTGAAGAAGGC 

AGGGGCCCTTAGAGTCTTGGTTGCCAAACAGATTTGCAQATCAAGGAGAACCCAGG 

AGTTTCAAAGAAGCGCTAGTAAGGTCTCTGAGATCCTTGCACTAGCTACATCCTCAG 

GGTAGGAGGAAGATGGCTTCCAGAAGCATGCGGCTGCTCCTATTGCTGAGCTGCCT 

GGCCAAAACAGGAGTCCTGGGTGATATCATCATGAGACCCAGCTGTGCTCCTGGAT 

GGTTTTACCACAAGTCCAATTGCTATGGTTACTTCAGGAAGCTGAGGAACTGGTCTG 

ATGCCGAGCTCGAGTGTCAGTCTTACGGAAACGGAGCCCACCTGGCATCTATCCTGA 

GTTTAAAGGAAGCCAGCACCATAGCAGAGTACAtAAGTGGCTATCAGAGAAGCCAG 

CCGATATGGATTGGCCTGCACGACCCACAGAAGAGGCAGCAGTGGCAGTGGATTGA 

TGGGGCCATGTATCTGTACAGATCCTGGTCTGGCAAGTCCATGGGTGGGAACAAGC 

ACTGTGCTGAGATGAGCTCCAATAACAACTTTTTAACTTGGAGCAGCAACGAATGCA 

ACAAGCGCCAACACTTCCTGTGCAAGTACCGACCATAGAGCAAGAATCAAGATTCT 

GCTAACTCCTGCACAGCCCCGTCCTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCA 

TTATTTCAGAGGGGAAACCTAGCAAACTAAGAGTGATAAGGGCCCTACTACACTGG 

CTTTTTTAGGOTAGAGACAGAAACmAGCATTGGCCCA 

TAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTTCTTCCCTCCTCCCCTGTCT^ 

GGCTGTeTCGAGCAGTCTAGAAGAGTGCATCTCCAGCCTATGAAACAGCTGGGTCTT 

TGGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTC 

TAGACCCCTTCAGCTTCTACACCCTTCTGCCCTCTCTCCATTGCCTGCACCCCACCCC 

AGCCACTCAACTCCTGCTTGTTTTTCCTTTGGCCATAGGAAGGTTTACCAGTAGA^ 

CTTGCTAGGTTGATGTGGGCCATACATTCCTTTAATAAACCATTGTGTACATAAGAA 

AAAAAAAA 

SEQIDNO:4474 

ACCGCATCCTAGCCGCCGACTCACACAAGGCAGGTGGGTGAGGAAATCCAGAGTTG 

CCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCCTACACTCT 

GGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGA 

CCCAAACTGCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAG 

ACATATGAAGAAGCTCTATATAAATCCAAGACAAGCAACAAACCCTTGATGATTAT 

TCATCACTTGGATGAGTGCCCACACAGtCAAGCirrAAAGAAAGTGTTTGCTGAAAA 

TAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTCCTCCTCAATCTGGTTTATGAAAC 

AACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCCCAGGATTATGTTTGTTGA 

CCCATCTCTGACAGTTAGAGCCGATATCACTGGAAGATATTCAAATCGTCTCTATGC 

TTACGAACCTGCAGATACAGCTCTGTTGCTTGACAACATGAAGAAAGCTCTCAAGTT 

GCTGAAGACTGAATTGTAAAGAAAAAAAATCTCCAAGCCCTTCTGTCTGTCAGGCCT 

TGAGACTTGAAACCAGAAGAAGTGTGAGAAGACTGGCTAGTGTGGAAGCATAGTGA 

ACACACTGArrAGGTTATGGTTTAATGTTACAACAACTATTTTTTAAGAAAAACATG 

TTTTAGAAATTTGGTTTCAAGTGTACATGTGTGAAAACAATATTGTAT^ 

GTGAGCCATGATTTTCTAAAAAAAAAATAAATGTTTTGGGGGTGTTCTO 

AACTTGGTCmCACAGTGGTTCGTTTACCAAATAGGATTAAACACACACAAAATGC 

TCAAGGAAGGGACAAGACAAAACCAAAACTAGTTCAAATGATGAAGACCAAAGAC 

CAAG1TATCATCTCACCACACCACAGGTTCTCACTAGATGACTGTAAGTAGACACGA 

GCTTAATCAACAGAAGTATCAAGCCATGTGCTTTAGCATAAAAAAAAAAAAAAAAA 

A 
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SEQIDNO: 4476 

CGGGAGAGCGCGCTCTGCCTGCCGCCTGCCTGCCTGCCACTGAGGGTTCCCAGCACC 

ATGAGGGCCTGGATCTTCTTTCTCCTTTGCCTGGCCGGGAGGGCCTTGGCAGCCCCT 

CAGCAAGAAGCCCTGCCTGATGAGACAGAGGTGGTGGAAGAAACTGTGGCAGAGG 

TGACTGAGGTATCTGTGGGAGCTAATCCTGTCCAGGTGGAAGTAGGAGAATTTGAT 

GATGGTGCAGAGGAAACCGAAGAGGAGGTGGTGGCGGAAAATCCCTGCCAGAACC 

ACCACTGCAAACACGGCAAGGTGTGCGAGCTGGATGAGAACAACACCCCCATGTGC 

GTGTGCCAGGACCCCACCAGCTGCCCAGCCCCCATTGGCGAGTTTGAGAAGGTGTG 

CAGCAATGACAACAAGACCTTCGACrCTrCCTGCCACTTCTTTGCCACAAAGTGCAC 

CCTGGAGGGCACCAAGAAGGGCCACAAGCTCCACCTGGACTACATCGGGCCTTGCA 

AATACATCCCCCCTTGCCrGGACTCTGAGCTGACCGAATTCCCCCTGCGCATGCGGG 

ACTGGCTCAAGAACGTCCTGGTCACCCTGTATGAGAGGGATGAGGACAACAACCTT 

CTGACTGAGAAGCAGAAGCTGCGGGTGAAGAAGATCCATGAGAATGAGAAGCGCC 

TGGAGGCAGGAGACCACCCCGTGGAGCTGCTGGCCCGGGACTTCGAGAAGAACTAT 

AACATGTACATCTTCCCTGTACACTGGCAGTTCGGCCAGCTGGACCAGCACCCCATT 

GACGGGTACCTCTCCCACACCGAGCTGGCTCCACTGCGTGCTCCCCTCATCCCCATG 

GAGCATTGCACCACCCGCTTTTTCGAGACCTGTGACCTGGACAATGACAAGTACATC 

GCCCTGGATGAGTGGGCCGGCTGCTTCGGCATCAAGCAGAAGGATATCGACAAGGA 

TCTTGIGATCTAAATCCACTCCTTCCACAGTACCGGATTCTCTCTTTAACCCTCCCC^ 

TCGTGTTTCCCCCAATGTTTAAAATGTTTGGATGGTTTGTTGTTCTGCCTG 

GGTGCTAACATAGATTTAAGTGAATACATTAACGGTGCTAAAAATGAAAATTCTAA 

CCCAAGACATGACATTCTTAGCTGTAACTTAACTATTAAGGCCTm 

TAATAGTCCCATTTTTCTCTTGCCATTTGTAGCrrrGCCCATTGTCT^ 

GGTGGACACGGATCTGCTGGGCrCTGCCTrAAACACACATTGCAGCTTCAACT 

TCTTrAGTGTTCTGTTrGAAACTAATACTTACCGAGTCAGACTTTGTGTO 

TTCAGGGTCTTGGCTGCCTGTGGGCTTCCCCAGGTGGCCTGGAGGTGGGCAAAGGG 

AAGTAACAGACACACGATGTTGTCAAGGATGGTTTTGGGACTAGAGGCTCAGTGGT 

GGGAGAGATCCCTGCAGAATCCACCAACCAGAACGTGGTTTGCCTGAGGCTGTAAC 

TGAGAGAAAGATTCTGGGGCTGTCTTATGAAAATATAGACATTCTCACATAAGCCCA 

GTTCATCACCATITCCTCCTTTACCTTTCAGTGCAGTTTCTTTTCACA^ 

GTTCAAACTTTTGGGAGCACGGACTGTCAGTTCTCTGGGAAGTGGTCAGCGCATCCT 

GCAGGGCTTCTCCTCCTCTGTCTTTTGGAGAACCAGGGCTCTTCTCAGGGG 

GGACTGCCAGGCTGTTTCAGCCAGGAAGGCCAAAATCAAGAGTGAGATGTAGAAAG 

TTGTAAAATAGAAAAAGTGGAGTTGGTGAATCGGTTGTTCTTTCCrCACATT^ 

GATTGTCATAAGGTTTTrAGCATGTTCCrCCTTTTOT 

ArrAATCAAGAGAAACTrCAAAGlTAATGGGATGGTCGGATCTCACAGGCTGAGAA 
CTCGTTCACCTCCAAGCATTTCATGAAAAAGCTGCTTCTTATTAATCATACAAACTCT 
CACCATGATGTGAAGAGTTTCACAAATCTTTCAAAATAAAAAGTAATGACTTAGAA 
ACTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQIDNO: 4478 

CGGCCTCTCATITCTCCTAGCCCTTCTGTTCTTCCTTGGCCAAGCTGCAGGGGATT^ 

GGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTCCCAGGTGTTGAC 

TCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCGCAGCTTA 

GGCAGCGGAGGTTCTGTGTCCCAGTTGlTTTCCAATTrCACCGGCTCCGTGGATGAC 

CGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGA 

GTGGAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAG^ 

TCCAAAGTGAGGGAATATGTCCAATTAATTAGTTTGTATGAAAAGAAACTGTTAAAC 

CTAACTGTCCGAATTGACATCATGGGAGAAGGATACATTTCTTACACTGAACTGGAC 

TTCGAGCTGATAAGGTAGAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAG 

GAGAGTTTTGGTGGAAGCTCAGAAATTGTTGACCAGCTGGAGGTGGAGATAAGAAA 
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TATGACTCTCTTGGTAGAGAAGCTTGAGACACTAGACAAAAACAATGTCCTTGCCAT 

TCGCCGAGAAATCGTGGCTCTGAAGACCAAGCrGAAAGAGTGTGAGGCCTCTAAAG 

ATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTG 

GTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGAGAGGGTTTTCTT 

ATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAAGGACTGT 

ATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATATACTGTACA 

ACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATG 

GCCAAGGTAGTGGTACAGCAGnTACAACAACAACATGTACGTCAACATGTACAAC 

ACCGGGAATATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACT 

CTCCCTAATGCTGCCTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGCAT 

ArrGACnrrGCTGTGGATGAGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGC 

ACTGGTTAACATGGTGATTAGTAAACTCAATGACACCACACTTCAGGTGCTAAACAC 

TTGGTATACCAAGCAGTATAAACCATCTGCTTCTAACGCCTTCATGGTATGTGGGGT 

TCTGTATGCCACCCGTACTATGAACACCAGAACAGAAGAGATTTTTTACTAT^^ 

CACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATGCATAAGATGCAGGAA 

AAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGTCTATAACGAT 

GGTTACCTTCTGAATTATGATCTITCTGTCrrGCAGAAGCCCCAGTAAGCTGm 

AGTTAGGGTGAAAGAGAAAATGTITGTTGAAAAAATAGTCTTCTCCACTTACITAGA 

TATCTGCAGATATCTAAGTAAGTGGAGAAGACTATTTTTTCAACAAACATm 

TCACCCTAACTCCTAAACAGCTTACTGGGGCTTCTGCAAGACAGAAAGATCATAATT 

CAGAAGGTAACCATCGTTATAGACATAAAGTTTCTGGTCAAAAGGGTTATAGTTAAT 

GCTCTGCACTTTTTCCTGCATCTTATGCATTACAATGTCTAGm 

TGTTTGTGTCATAATAGTAAAAAATCTCTTCTGTTTGGCGTATAGGGATTC^ 

AGGAAATATTGCCCAATGACTAGTCCTCATCCATGTAGCACCACTAATTCTTCCATG 

CCTGGAAGAAACCTGGGGACTTAGTTAGGTAGATTAATATCTGGAGCTCCTCGAGG 

GACCAAATCTCCAACTTITrTTTCCCCTCACTAGCACCTGGAAT 

GGCAGATAAGTAAATTTGGCATGCTTATATArrCTACATCTGTAAAGTGCTGAGm 

TATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAATAAATCCCAGA^^ 

ATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACOT 

AGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAGAACCA 

GACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATmC^ 

ATTATGAACrmTATTTACATGACTCTAAGACTATAAGAAAATCTGATC 

AAAGTGCTAGCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTAT 

GAGTGTATCAGTTGTTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTO 

AAATGAAAATITAAl'1 1 'i'lTmCTAGGACGAGCTATAGAAAAGCTATTGAGAGTAT 

CTAGTTAATCAGTGCAGTAGTTGGAAACCTTGCTGGTGTATGTGATGTGCTTCTGTG 

CTTTTGAATGACTTTATCATCTAGTCTTTGTCTATT^ 

GTCTATAGGATTGGCAGTTTAAATGCrmACTCCCCCTTT^ 

ATGTGCTTCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 4480 

CTCAAACACCGCCTGCTAAAAATACCCGACTGGAGGAGCATAAAAGCGCAGCCGAG 

CCCAGCGCCCCGCACTTTTCTGAGCAGACGTCCAGAGCAGAGTCAGCCAGCATGAC 

CGAGCGCCGCGTCCCCTTCTCGCTCCTGCGGGGCCCCAGCTGGGACCCCTTCCGCGA 

CTGGTACCCGCATAGCCGCCTCTTCGACCAGGCCTTCGGGCTGCCCCGGCTGCCGGA 

GGAGTGGTCGCAGTGGTTAGGCGGCAGCAGCTGGCCAGGCTACGTGCGCCCCCTGC 

CCCCCGCCGCCATCGAGAGCCCCGCAGTGGCCGCGCCCGCCTACAGCCGCGCGCTC 

AGCCGGCAACTCAGCAGCGGGGTCTCGGAGATCCGGCACACTGCGGACCGCTGGCG 

CGTGTCCCTGGATGTCAACCACTTCGCCCCGGACGAGCTGACGGTCAAGACCAAGG 

ATGGCGTGGTGGAGATCACCGGCAAGCACGAGGAGCGGCAGGACGAGCATGGCTA 

CATCTCCCGGTGCTTCACGCGGAAATACACGCTGCCCCCCGGTGTGGACCCCACCCA 
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AGTTTCCTCCTCCCTGTCCCCTGAGGGCACACTGACCGTGGAGGCCCCCATGCCCAA 

GCTAGCCACGCAGTCCAACGAGATCACCATCCCAGTCACCTTCGAGTCGCGGGCCC 

AGOTGGGGGCCCAGAAGCTGCAAAATCCGATGAGACTGCCGCCAAGTAAAGCCTT 

AGCCTGGATGCCCACCCCTGCTGCCGCCACTGGCTGTGCCTCCCCCGCCACCTGTGT 

GTTCTTTTGATACATTTATCTTCTGTTTTTCTCAAATAAAG^^ 

TAAAAAAAAAAAAAAAAAA 

SEQIDNO: 4482 

GGGGGAGGGTCGGAGCTCTGGTGGAGAGAGTGTTGTCTAAAACAAGTTCCGGAAGG 

GAGGCTGCCCITCGCGGTCCGAGAACCACCGGCCTCCCCAGTTTGAGGGCTGTTACC 

CCGTGCGCGCTTCGACGTTGCTGCTGTTGGCTCTCCTCGCCCCTCGTTCCCTTGGG 

CCGCCTGGGAACTCCGCCATGTCATCCACTTCGCCCAACCTCCAGAAAGCGATAGAT 

CTGGCTAGCAAAGCAGCGCAAGAAGACAAGGCTGGGAACTACGAAGAAGCCCTTC 

AGCTCTATCAGCATGCTGTGCAGTATTTTCTTCATGTCGTTAAATATGAAGCACAAG 

GTGATAAAGCCAAGCAAAGTATCAGGGCAAAGTGTACAGAATATCTTGATAGAGCA 

GAAAAACTAAAGGAGTACCTGAAAAATAAAGAGAAAAAAGCACAGAAGCCAGTGA 

AAGAAGGACAGCCGAGTCCAGCAGATGAGAAGGGGAATGACAGTGATGGGGAAGG 

AGAATCTGATGATCCTGAAAAAAGGAAACTACAGAATCAACTTCAAGGTGCCATTG 

TTATAGACCGACCAAATGTGAAATGGAGTGACGTTGCTGGACTTGAAGGAGCCAAA 

GAAGCACTGAAAGAGGCrGTGATACTGCCTATTAAATITCCTCATCT^^ 

AAGAGAACACCTTGGAGGCkjAATCCrATTArrTGGGCCGCCTGGAACAGGAAAGTC 

CTACTTAGCCAAAGCTGTAGCAACAGAAGCCAACAACTCAACATTTTT^ 

TTCCTCTGATCTTGTTTCTAAGTGGCTAGGTGAAAGTGAAAAACTGGTTAAGAAT^ 

ATTCCAACTTGCCAGAGAGAACAAGCCCrCCATTATCTTCATTGATGAAATTGATTC 

TCTCTGTGGTTCAAGAAGTGAAAATGAAAGTGAAGCCGCACGTAGAATTAAGACGG 

AGTTCCTAGTGCAAATGCAAGGGGTTGGTGTAGACAATGATGGAATTTTGGTTCTGG 

GAGCTACAAATATACCCTGGGrrCrGGATTCTGCCATTAGGCGAAGATTTGAGAAAC 

GAATTTATATTCCCTTGCCGGAACCCCATGCCCGAGCAGCAATGTTTAAACTGCACC 

TAGGGACCACTCAGAACAGTCTCACGGAAGCAGACTTTCGGGAACTTGGGAGGAAA 

ACAGATGGTTATTCAGGGGCAGATATAGGTATCATTGTACGTGATGCCCTTATGCAG 

CCTGTTAGGAAAGTACAGTCAGCTACTCATTTTAAAAAGGTTCGCGGACCTTCCCGA 

GCTGATCCTAACCATCTTGTAGATGATCTGCTAACACCTTGCTCTCCAGGTGACCCT 

GGTGCCATTGAAATGACGTGGATGGATGTCCCTGGAGATAAACTTTTGGAGCCAGTT 

GTTTCCATGTCGGATATGTTGCGGTCACTATCTAACACAAAACCTACAGTCAATGAA 

CATGACTTGTTGAAATTAAAGAAGTTTACAGAAGATTTTGGTCAAGAAGG^^ 

CAAAGACAAGGAAGATGCTTACCATATGTATTCTTTCTTTCATAGATATTm 

TTTGGATCGCATTAATTGTTTCCAGTAAAACTCTTTTACCAC^ 

TCACm'CAGAGTTCCATTAGGTTTTATATTGTACTTTTCCrC 

TCCTATTAACAAAAGGTACAAAATAACAGGTTATGAGGAAATGAGCGATATATGAA 

CGGCATAAAAACAGAAATTACCCAGTAAAAAGGATGTCAGAAATTGACATACAAAT 

ATTTACAATTTITATGAATGGTGGTCTTTGCAAAGAGCATTT^^ 

TACTAAAATGATATATGGGtTTATTTTATATTTTCAAAAAAA 

TTATCAATGTAAAATTTACGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQIDNO: 4484 

GCTCACTGAGCACCGTCCCAGCATCCGGACACCACAGCGGCCCTTCGCTCCACGCAG 

AAAACCACACnrCTCATACCTTCACTCAACACTTCCTTCCCCAAAGCCAGAAGATC 

ACAAGGAGGAACATGAGGTGGCTGTGCTGGGGGCACCCCCCAGCACCATCCTTCCA 

AGGTCCACCGTGATTAACATCCACAGCGAGACCTCCGTGCCCGACCATGTCGTCTGG 

TCCCTGTTCAACACCCTCTTCTTGAACTGGTGCTGTCTGGGCTTCATAGCATTCGCCT 

ACTCCGTAAAGTCTAGGGACAGGAAGATGGTTGGCGACGTGACCGGGGCCCAGGCC 
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TATGCCTCCACCGCCAAGTGCCTGAACATCTGGGCCCTGATTCTGGGCATCCTCATG 

ACCATTGGATTCATCCTGTTACTGGTATTCGGCTCTGTAACAGTCTACCATATTATGT 

TACAGATAATACAGGAAAAACGGGGTTACTAGTAGCCGCCCATAGCCTGCAACCTT 

TGCACTCCACTGTGCAATGCTGGCCCTGCACGCTGGGGCTGTTGCCCCTGCCCCCTT 

GGTCCTGCCCCTAGATACAGCAGTTTATACCCACACACCTGTCTACAGTGTCATTCA 

ATAAAGTGCACGTGCTTGTGA 

SEQIDNO: 4486 

ATATCCACTCCTGCTCTCCCTCCTGCAGGTGACCCCAGCCATGAGGACCATCGCCAT 

CCTTGCTGCCATTCTCCTGGTGGCCCTGCAGGCCCAGGCTGAGTCACTCCAGGAAAG 

AGCTGATGAGGCTACAACCCAGAAGCAGTCTGGGGAAGACAACCAGGACCTTGCTA 

TCTCCTTTGCAGGAAATGGACTCTCTGCTCTTAGAACCTCAGGTTCTCAGGCAAGAG 

CCACCTGCTATTGCCGAACCGGCCGTTGTGCTACCCGTGAGTCCCTCTCCGGGGTGT 

GTGAAATCAGTGGCCGCCTCTACAGACTCTGCTGTCGCTGAGCnTCCrAGATAGAAA 

CCAAAGCAGTGCAAGATTCAGTTCAAGGTCCTGAAAAAAGAAAAACATmACTCT 

GTGTACCTTGTGTCTTTCTAAATTTCTCTCTCCAAAATAAAGITCAAGCATT 

SEQIDNO: 4488 

CTACTGTTGTTTTTGAGGGGCGGGCAGCCGCGCCGCCGCGGCACTTTm 

CGGGTGCCGCAGCAGCGACCCCTCGGCGCCGATGTCCCTGATCCCTGGAGCGACGA 

CGGCCGCTGCCTAAGCTGGGAAGAGGAATGCCAGCTCCTGAGCAGGCCTCATTGGT 

GGAGGAGGGGCAACCACAGACCCGCCAGGAAGCTGCCTCCACTGGCCCAGGCATGG 

AACCCGAGACCACAGCCACCACTATTCTAGCATCCGTGAAGGAGCAGGAGCTTCAG 

TTTCAGCGACTCACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCT 

AGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTG 

AGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCT 

AGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGACAGAGCCAGAACA 

AGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGG 

GTAACTCAAGAAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAA 

GCAGGGAGITTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGC 

ATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGA 

CAAACACTGGTTCAGCCATCAGTAGCCAATCGGGCCATGAGAAGAGTTAGTTCAGT 

TCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGG 

GGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCC 

CGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGCTGCACGGGCAGCC 

TCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTC 

ACCTCCCGGCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGC 

CAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCCATC 

CCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCAC 

AGCATCTGGGACCTTCACTGCAAAGGACTGTTCATGACATGGAGCAATTCGGACAG 

CAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGG 

CTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGC 

CGTGTCTCCCGACTTGCACATTACTCCTATATAtGAGGGGAGGACCTATTACAGCCC 

AGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGT 

TGTATCGCACAGGTGTATCAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGA 

AGTACCaTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTACGCG 

GAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCA 

GCATGCAGTGCCGGCTGATGATGGCACCACAAGATCCCCATCAATAGACAGCATTC 

AGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAG1TGCCTGAGGTCATTCACA 

TGCTTGAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACC 

TGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCAAG 
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CATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCT^^ 

GCCCTTCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGA^ 

GAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGATGCAGA 

AGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAA 

AATGACAATCATTCGAGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACA 

TTCTGGATGGAATAACTCTTCTrrTGATGATGATCATAAAATTAAATTTCAGAOT 

CTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCACGTCCGCGGGGGAAGA 

AGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGTATGTGA 

TCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTG 

TGCACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTA 

CTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAGCAAAGACTC 

TGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAA 

GATCAATGGGATGGAGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGT 

TGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAG 

TTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTAG 

CAACTGGAAGTTTGCAGCATATATCCGGGGCGGCCGTCCGAAAAGAAAAGGGCTCC 

CCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGGTGCAA 

CAGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATAC 

GCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAGTGTCTTGTCT 

GATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACAT 

GGAGAACGCAAAAGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATA 

ACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGT 

CTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGT 

GGAATCAGAACCATTTTATTACACCTGTGTCGACATTGGAGCGAGACCGATTCAAAT 

CACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGTCG 

GCAGCACCTCTTCCTCACCAGCACTGrrAGGAATCAGAGACCCTCGCTCTGAATACG 

ATAGGACCCAGCCACCTATGCAGTATTACAATAGCCAAGGGGATGCCACACATAAA 

GGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCAC 

CAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGAT 

GACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGC 

TATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTC/^ 

CACAGTATGGACrGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGAC 

CTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTACGATCAA 

GATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTC 

CATCTTGCTGATTTCATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAG 

T:QjQYrnTrTTTCcnrrn:GAGGAATTATCA 

SEQIDNO: 4490 

GGATGGGAAGCGAGCATGGTGAGTCCTCAAGTCGCAGCTGGGCCTGCCACGTGGGA 

GTGGAGGGTGGAGGAACGTGTGGAGTTTCGGAGTCCAGCCCAGTGCGAGACAGCCT 

TGAAACCGTGGTTGGCGGGCGCTCCACTCCGCTCTGGGCTCGAACCCTGCCTGACCC 

TAGCTGTGCCCCCCACTTTCTCCCTGTCTGGCCCCTGCTCCCCGCCCCCTCACTTAGA 

GGAGGGCACGGGGAAGGGCAAACGGTCCAGAGGGCGGGCGGCTGCGGGCTCCTCT 

GCATCATGTGAGGAGGGCGTGGGGAAGGACATCCTGGTGGGGCCCGATCTGGGCTG 

CCTCCAGCCCGGGCCTGTGTCTTGGACTTAGTCGTGGACCTGGAGGCCAGTGCCCGG 

CTGGCCCTGTCACCCTCTCGCTGTGACGCCAGCGCCTGCTGACTGGAGGACCCAGGT 

TCCTTCGCCTGCTTITTCTCAGGCTGCCCTGAGGATCTGTGTTTGGTGAAAAGG 

AAATTCACCTGCAGGGCAGGCGGCTCTAGCAGCTTCAGAAGCCTGGTGCCCTGGCG 

ACACTGGACCTGCCrrGGCTTCTTTGATCCCAACCCCACCCCCGATTTCTGCTCTGCT 

GACTGGGGAAGTCATCGTGCCACCCAGAACCTGAGTGCGGGCCTCTCAGAGCTCCTT 

CGTCCGTGGGTCTGCCGGGGACTGGGCCTTGTCTCCCTGGCGAGTGCCAGGTGAGGC 

TGCGGCGGCTCCGACGCAGGTGGAGCTGCTGACCTGGCCCCnTCTGCGGCTGCGAG 
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GGACTTTGAACATGTCGGGGATCGCCCTCAGCAGACTCGCCCAGGAGAGGAAAGCA 

TGGAGGAAAGACCACCCATTTGGTTTCGTGGCTGTCCCAACAAAAAATCCCGATGG 

CACGATGAACCTCATGAACTGGGAGTGCGCCATTCCAGGAAAGAAAGGGACTCCGT 

GGGAAGGAGGCTTGTTTAAACTACGGATGCTTTTCAAAGATGATTATCCATCTTra 

CACCAAAATGTAAATTCGAACCACCATTATTTCACCCGAATGTGTACCCTTCGGGGA 

CAGTGTGCCTGTCCATCTTAGAGGAGGACAAGGACTGGAGGCCAGCCATCACAATC 

AAACAGATCCTATTAGGAATACAGGAACTTCTAAATGAACCAAATATCCAAGACCC 

AGCTCAAGCAGAGGCCTACACGATTTACTGCCAAAACAGAGTGGAGTACGAGAAAA 

GGGTCCGAGCACAAGCCAAGAAGTTTGCGCCCTCATAAGCAiGCGACCTTGTGGCAT 

CGTCAGAAGGAAGGGATTGGTTTGGCAAGAACTTGTTTACAACATTTTTG 

AAAGTTGCTCCATACAATGACTAGTCACCTGGGGGGGTTGGGCGGGCGCCATCTTCC 

ATTGCCGCCGCGGGTGTGCGGTCTCGATTCGCTGAATTGCCCGTTTCCATACAGGGT 

CTCTTCCTTCGGTCTTITGTATTTTTGAITGTTATGTAAAAC^^ 

TTGATGTCAGTATTTCAACTGCTGTAAAATTATAAACTTTTATACTTGGGTAAGTCCC 

CCAGGCGAGTTCCTCGCTCTGGGATGCAGGCATGCTTCTCACCGTGCAGAGCTGCAC 

TTGGCCTCAGCTGGCTGTATGGAAATGCACCCTCCCTCCTGCGCTCCTCTCTAGAAC 

CTGGGCTGTGCTGCnTITGAGCCTCAGACCCCAGGGCAGCATCTCGGTTCTGCGCCA 

CTTCCTTTGTGTTTATATGGCGTTTTGTCTGTGTTGCTGTTTAGGTA^ 

ATATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 4492 

GGTAGCGACGGTAGCTCTAGCCGGGCCTGAGCTGTGCTAGCACCTCCCCCAGGAGA 

CCGTTGCAGTCGGCCAGCCCCCTTCTCCACGGTAACCATGTGCGACCGAAAGGCCGT 

GATCAAAAATGCGGACATGTCGGAAGAGATGCAACAGGACTCGGTGGAGTGCGCTA 

CTCAGGCGCTGGAGAAATACAACATAGAGAAGGACATTGCGGCTCATATCAAGAAG 

GAATTTGACAAGAAGTACAATCCCACCTGGCATTGCATCGTGGGGAGGAACTTCGG 

TAGTTATGTGACACATGAAACCAAACACTTCATCTACTTCTACCTGGGCCAAGTGGC 

CATTCTTCTGTTCAAATCTGGTTAAAAGCATGGACTGTGCCACACACCCAGTGATCC 

ATCCAGAAACAAGGACTGCAGCCTAAATTCCAAATACCAGAGACTGAAATTTTCAG 

CCTTGCTAAGGGAACATCTCGATGTTTGAACCTTTGTTGTGTTTTGTACAG 

TCTGTACTAGTTTGTCGTGGTTATAAAACAATTAGCAGAATAGCCTACATTTGTATTT 

ATTTTCTATTCCATACrrCTGCCCACGTTGrmCTCTCA^ 

ATAAATCTGATGCACCG 

SEQ E) NO: 4494 

CGTTGGCCAAATTGACAAGAGCGAGAGGTATACTGCGTTCCATCCCGACCNGGGGC 

CACGGTACTGGGCCCTGTTTCCCCCTCCTCGGCCCCCGAGAGCCAGGGTCCGCCTTC 

TGCAGGGTTCCCAGGCCCCCGCTCCAGGGCCGGGCTGACCCGACTCGCTGGCGCTTC 

ATGGAGAACTTCCAAAAGGTGGAAAAGATCGGAGAGGGCACGTACGGAGTTGTGTA 

CAAAGCCAGAAACAAGTTGA.CGGGAGAGGTGGTGGCGCTTAAGAAAATCCGNNTG 

GACACTGAGACTGAGGGTGTGCCCAGTACTGCCATCCGAGAGATCTCTCTGCTTAAG 

GAGCTTAACCATCCTAATATTGTCAAGCTGCTGGATGTCATTCACACAGAAAATAAA 

CTCTACCTGGTTTTTGAATTTCTGCACCAAGATCTCAAGAAATTCATGGATGCC^ 

CTCTCACTGGCATTCCTCTTCCCCTCATCAAGAGCTATCTGTTCCAGCTGCTCCAGGG 

CCTAGCTTTCTGCCATTCTCATCGGGTCCTCCACCGAGACCTTAAACCTCAGAATCTG 

CTTATTAACACAGAGGGGGCCATCAAGCTAGCAGACTTTGGACTAGCCAGAGCTTTT 

GGAGTCCCTGTTCGTACTTACACCCATGAGGTGGTGACCCTGTGGTACCGAGCTCCT 

GAAATCCTCCTGGGCTGCAAATATTATTCCACAGCTGTGGACATCTGGAGCCTGGGC 

TGCATCTTTGCTGAGATGGTGACTCGCCGGGCCCTATrCCCTGGAGATTCTGAGATT 

GACCAGCTCTTCCGGATCTTTCGGACTCTGGGGACCCCAGATGAGGTGGTGTGGCCA 

GGAGTTACTTCTATGCCTGATTACAAGCCAAGTTTCCCCAAGTGGGCCCGGCAAGAT 
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TTTAGTAAAGTTGTACCTCCCCTGGATGAAGATGGACGGAGCTTGTTATCGCAAATG 

CTGCACTACGACCCTAACAAGCGGATTTCGGCCAAGGCAGCCCTGGCTCACCCTTTC 

TTCCAGGATGTGACCAAGCCAGTACCCCATCTTCGACTCTGATAGCCTTCTTGAAGC 

CCCCAGCCCTAATCTCACCCTCTCCTCCAGTGTGGGCTTGACCAGGCTTGGCCTTGG 

GCTATTrGGACTCAGGTGGGCCCTCTGAACTTGCCTTAAACACTCACCTTCTAGTCT^ 

GGCCAGCCAACTCTGGGAATACAGGGGTGAAAGGGGGGAACCAGTGAAAATGAAA 

GGAAGTTTCAGTATTAGATTGCACTTAAGTTAGCCTCCACCACCC 
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SEO ID NO- 1 1 10 GGTACTATTATACTAAAAGCTCCTACTGTGATGTGAAATGCrCATACTTTA^^ 
AGTiUU^ 

TSAGTW;AGTATOTTGAATCAGTAGmC^ 



GGGCTAGTTTCTGTGTAACn'GTAAATACTACAAAAACTrATTTATACrGrrOT^ 
?mCArAGAm^ 
TOT 



SEO ID NO- 1 1 1 1 GGTACTACAAGTrrAGTGGCrrCACGCAGAAGTTGGCAGOAGCATGGGOTC 
TGGCrTCGCAACCCAAAAACAAATGAATTANG CTGCAGAGG ACTGGTTTGGT 



AAATGCTTTACCGGACCACCATGGCGCTGACTGTGGGAGGGACCA^^^ 
CATGGCirOiCAACCCAAAAACAAATGAATTANGCTGCAGAGGACTGGT^ 

^^JSS&rrccrmcArrGGT>^Ti^^ 



aoaaI^t^^^ 
SSmgI^^aot^ 
SEO ID NO: 11 12 cgaggtacatgctctatctgatgctcgatgtgtcttt^^ 

TTGC^CmACGATGAACATGTTAGACATrGGTGGAGGATTC^^^ 
ISSOT^^CAGCCCTCTGrrGGATATCT^ 
TO^AACCCGGAAGC^C^^^ 
I^TOTOAAAATGA^T^ 

TG-S^T^^G^TGATGGTGTnATGGTTCrmGC^ 

ISgIgg^ac^^^^^ 

cSTOCTGTGATGAACOTGATCAAArrGTG^ 

S^ScnSttaatgatgcattcangatogggnta^ 
tgaaaacttttttgcct 

SEO ID NO' 1 1 13 CGAGGACGCGGGATTGCAACAAGAAAAGCCAGATTTCTGCTTTITG^^ 

cg?S^ItcotgoatSctggSaat^^ 

Sm^AAoIrGGTCC^^^ 
^SSrGACT^GCAi^TCAACCAAGGTrCC^^^ 

GGGAAATGCCGCCCATTTAAGTACCTGCCCNGGCNGCCGCrCGA 

SEO ID NO- 1 1 14 NCTmAGTAACGACCCAATCTAAGGAGCCTrGGAGQCn-GTGAAAA^ 
TCCACAAAGATGCT 
rAlrrACCTCAGCATGAATA^ 



i^G^GC^^^ 
CTOTGTCCATrCCTTCACATCTGTAGAGGAGTTCGATATAACCAOT 

SSSfS^G^ATCACAAAC^^^ 



CTNCTGTAGAAGNGAATTAAGTAATACTrrCTGGGOCn^^ 

^SggS?^^S^SW 

GGGC 

SFO ID NO- 1 1 15 ACGCGGGAGAACTGCGCGAGAAGCGAGACCTTAGAAGGCAGCGCTTCCCGC 

^S^c^ocAATxxx;cGGOAcrrGTi™GAT<K;aTaj^ 

TnTCATGCTXTlKrTGGAAACnTITAATCCC^ 
CCrrnTGATCCACTTGTCAA 
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SEQ ID NO- 1 11 6 AC>GCCAACGGTTTCCCrrGGGGGCTITGAAATAACACCACCAGCGGTCTTA 
AGGTTGAAGTGTGGTrCAGGGCCAGTGCATATTAGTGGACAGCACrTANTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGGAAAGCGGTCTG 
NCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTTGCTGCTGATOAANATGATGAC 
GATOATGATGAAGAGGATGATGATGAATATGATGATGATGNTGATTTTGATGATGANGAAANCTG , 
AAGAATAAGCTCCANTGAAGAAATCrATACGAGATACTCNNGCCTTANATGCTCATAAGTCTATr 
CnWATGGGAATNGAOTCANATCCCArirriWfNCmCCriTATC^^ 
CTCTrAANCANNTCTAANNCTCOT*AAT>n'CACCCACATAGGACCC^^ 
rrrAANGCTAATAATNGCATrWCCTGTTTATNTAAAATNNGTNGGQ 

SEQ ID NO* 1 1 17 CGAGGTACGGGTATCACnrCCGGAGCTGGTGAAGATCATCAACGACAATGC 
CACATACTGCCGTCTTGCCCAGTTrATTGGAAACCGAAGGGAACTGAATGAGGACAAGCTGGAGA 
AGCTGGAGGAGCTGACAATGGATGGGGCCAAGGCTAAGGCTATTCTGGATGCCTCACGGTCCTCC 
ATGGGCATOGACATATCTGCCATTGACnTGATAAACATCGAGAGCrrCTCCAGTCGTGTGGTGTCT 
TTATCTGAATACCGCCAGAGCCTACACACTTACCTGCGCTCCAAGATOAACCAAGTAGCCCCCACC 
TOTCAGCCCTAATTGGGGAAGCGGTAGGTGCACGTCTCATCGCACATGCTOGCAGCCTCACCAAC 
CTGGCCAAGTATCCAGCATCCACAGTGCAOATCCTTGGGGCTGAAAAGGCCTGTTCAGAGTCTGA 
AGACAAGGGTACACTCCAAATATGGACTCATTITCCACTCCACCTTNATTGGCCGACAACTrGCCA 
AAACAAAGGCCGAT^^CCCATCX^•GCAAACAATCATATTGC^^ACAATCATGGT^mTA^^ 
CATirrrGGGAAACTTCANACAArrAAAACATNGCrmANCTGGNAAACCCAANATrGT^^ 

SEQ ID NO: U 18 NCTGAAATrAGTAGAAGCTAGGCCAATGATCCATGAATTGTTAACrGAAGGG 
CGGAGATCGAAGACTAACAAAGCAAAAACCCTTGCTACATGGGCAACAAAAGAACTGAGGAAAC 
TGAAGAACCAAGCTTGATCTGTTACCATTGGGATGATAACCTGAGGACCCCCACTGGAAATCTCCC 
ATCTTTTGAAAAACCrGGAAGTGAGGAGTGTGCACGGATGCTGAATGTTTGGGAATGAGAGGATG 
AGTGAGTGAGGCnTGAAAACACACCACATTGAAAATCCTGCCCAGCAGCAGCCCGCAGCCCGCCA 
ACAGCAGCGCTGTTAGTrGAGCTAAGTAAAGCACTGCTTCGTAGAAAACCATAACATCGGCCATC 
rrGGAAAAAAGAAAAACAATGGAGTTACTTATITAAAAAAAAAAAGAAAGAAAGTTA'mCCTTCC 
AGGAAAGNTAGAAGTACTTrrCTOTCrmGGCCAATGCCCANTG GAATC CTGGTTTGGGGAGQAG 
GAAOGACTOGGTTCACTGOGOKGCTn'GrnGTAAAAGGCNCTGGCCTmrrCT^ 
GGANCTGGGT^NAAANC^WC^TIT^maTGGCCGGACCCCTAOGGNGAA^TC 
GTTCrrTGGACCACCNGT 

SEQ ID NO: 1 1 19 ACGACAOCAOACTCCTCCAGGGCrCTGTCCACTTGCAGGAAATTCAGTTCAT 
GAAGATAAGAAGTTTCrrGACAAATACATGCCCCAGTTCATGAAACATCTTCATTATAGAATAATT 
GATGTOAGCACTGTTAAAGAACTGTGCAGACGCTGGTATCCAGAAGAATATGAATrTGCACCAAA 
GAAGGCTGCirCTCATAGGGCACirGATGACATTAGTOAAAGCATCAAAGAGCTTCAGTnTACCG 
AAATAACATCTTCAAGAAAAAAATAGATGAAAAGAAGAGGAAAATTATAGAAAATGGGGAAAAT 
GAGAAAACCGTGAGTTGATGCCAGTrATCATGCTGCCACTACATCGTTATCTGGAGGCACTrCTGG 
TGGGTTTTTTTCTCACGClXjATGGCTrTGGCAGAACCCTTCGGTTACTrGCATC^ 
CTCAAGCNOA<>GCACACGAAATCrArrmCTNCTAATATGa^GGTTCCAl^ATGACACAA^^ 
TCCrmAAGTCCTNNGGCGCGACCNCGCTANGGCGAATTCCACNCCXTGGCGGCCGTCTTATGGA 
TCCACnTGOACCAACrrTCGGNAACATGGCTACTrGTTCTNGNGAATGG>rrCCCTNrAA^^ 
ATCAACCGN 

SEQ ID NO" 1 120 NCrGTTCAAAAAGGAATGCCCCACAAGTGTTACCATGGNNAAACTGGAAGAG 
TCTACAATGrrACCCANCATGCTGrrGGCArrcmTAAACAAACAAGTrAAGGGCAAGATrCTrG 
CCAAGAGAATTAATCrrGCGTATTGAGCACATTAAGCACTCTAAGANCCGAGATNGCTrCCTGAAA 
CGTGTGANGGAAAATGATCAGAATAAGANAGANNCCTATNANAAAGGTACCTCGGCCGCTCCAC 

GC 

SEQ ID NO- 1121 ACGCGGGGCTAGTGGCITGAGGTATCCGCAGGAGCGGCCGGGTGGCGGGAG 
GAACCGTrACGGGAACTGAAGTrGCGGATTAAGCCTGATCAAGATGACAACCTCCCAAAAGCACC 
GAGACTTCGTGGCAGAGCCCATGGGGGAOAAGCCAGTGGGGAGCCTGGCTGGGATrGGTGAAGT 
CCrGGGTAAGAAGCTGGAGGAAAGGGGTrTTGACAAGGCCTATGTTGTCCrrcGCCAGrrr 
GCTAAAGAAAGATGAAOACCTCTTCCGGGAATGGCTGAAAGACACTTGTGGCGCCAACGCCAAGC 
AGTCCCX3GGACTGCTTCXK3ATGCCTTCNAGAGTGGTGCGACGCCTTCTTGTGATGCTCTCTGG 
GCTCrCAATCCCCACCCTCATCCAGAGTTTGCAGCCGAGTANGGACTCCTCCCTGTCCTCTACGAA 
GGAAAAGATTGCTATrGTCGTACCTTCGGCCGCGACCCCCC 

SEQ ID NO: 1 122 ACCTrGACTTAAATCCAAGAGCAAAAGTTAAOACTCrCCTCCTCT ATTnTG G 
TAAACAACTGCATGGTAAACrTAGATGACTaTCCCCCTGGATTTTACCTGGGAGTGGCL i 1 1 i i A 
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CATTTITATITAAAAGAGCKSCAGGmGGCACTTTrATACrGATGTCACCAATGTTAATA^ 

GGATCrCAGGAAGArrCATATTCTTTACAGCTGATACAGCACGGGCrGGAGCTCCTGCTAAGCCAG 

CCrCAGATTITTCCAGCrrATmGTGCATCAArrrGTGTAACAATCTCATr^^ 

ACCATTCCCCCCATCACCAiTIXn-GCAAGAATATTGTGAACXrrTGTCrACATGGAA^ 

AGCTCACAGACATTTrCAAAACATITGTCTAATGTTTCCACAAATACCTrGAATrAGATCTAAAAT 

GCCAAGTCACTrTCTGAAGAATCX;NCCCNGAAGACAAAATATACGTGCTAATON 

TTGGTGGCAGAACCrCCAATAANAATCCCCCTmAGGNAATCCAACATTTOTimh^ 

ANGOAAG 

SEO ID NO' 1 123 ACATOACCTAATnTTACATCATAGTAAAACAGGCCCTATGGAGAGAGGACA 
TGGGTTTCTCTGCTGAACAGCXl^TrATrrATACTCGTTCCAAGGCrrCTAACATQATGATACTAm 
CCTCGTATrACCACCATTCCAATATTGTTCTGTTGTCCACTAGTCGCCATCTCCACACArrCA^ 
TCACAAGGTrCATAAAGGGATCAAATCCCCGCAATATTCCTTGGACATGTCTGCCACCATTTAATT 
TCAATGATAACTTCITGTOCATAAArn-mCAACTCGGGAGGGTGAGCnTGCTCATGGTGTATAC 

TCCGCGGGCTCCCCCGCGTACC 

SEO ID NO- 1 124 COAGGTACTTGCCCCTTCCCCAGAAAAGCGGGACTTGCTGCrAAGGGTGAAG 
GACCAAGGCAGTTGTCCCTGCGTGGTCTGACACCCTTGAAACGTGGGTGTATAATCAGAGAGGCA 
TCCCTGCAATGATTAAACACCAAGGGAAGGCTGCCTTCCCAGTCTGTGACCAGCGCCXSGAGTmG 
GGTCCACGGATAAAACGTGTCrCTITrGTCTCTACCAGAAAATGAAAGGAATrGAAATTAAOAGA 
AGGGAGAGATTOAAGTGTAGTGCCAAGATrGAAAGGAGAAAGTGGTTGAGGGATGGTGAGGGAA 
GTTGGAGAAGAGAGTAAAAAGAGGCTGCTTACCAGATTTGAAATTGGTGAGATGTITCTrGGGCT 
CGTCGGTCTGAGGACCroAGGTCGTAGGTGGATCrrrCTCAGGGAGCAAAAAACCAGGANGACCG 
AGGATrGATCrTCCAAGGGANGTCCCX:GATCCNAGTCATGGCACCCAAArrCATGTGCCrNCATTT 
GAANAAANCNCCNACAGCTTTNTGTGACCACAAGGm-GTTTTmAACm^ 
GTCCCAAAAANAATACNCCCCCNTrC 

SEO ID NO- 1125 CAAAATAACAGGTTATGAGOAAATGAGCGATATATGA ACGGC ATAAAAACA 
GAAATTACCCAGTAAAAAGGATGTCAGAAATTGACATACAAATArrTACAATTmATGAATGGT 
GGTCnTGCAAAGAGCAnTATA riTi C ' l 1 1 11 U f r TACTAAAATGATATAGGGTTrATmATATT 
TTCAAAAAAATTGTTAAACATCATTOTATCAATGTAAAATTTACGrrArrAAAAAATTAC^ 
ATAGAATCmACTCAGGGATGGGCAATAAAACAGCAAAGAGCTTTGTCA 
mGAATTATAGACAGTCCTCTTAATAGTTTITAATAAGTTQATATTTrmC^ 
TAATTrCrrTAAAAGCCGTAAGCTTTGGCrcGGCATGGTGGCTTATGCCTGTAATNCCAGCAC^ 
GGGGGGCTACGATGGGTGGACCCTGANGNCAGGAGTCAAGAACAGCTGGNCACATNGNGAAANC 
CGrn^CTAAAAAACAAAAATAACITGGCCTNAAGGGANTGTrmTrATO 
TGGGGGGG 

SEO ID NO- 1 1 26 ACGCGGGGACTATGGCTTCCAGCACTGTCCCGGTGAGCGCTGCTGGCTCGGC 
TAATGAAACTCCCGAAATACCGGACAACGTGGGAGATTGGCTTCGGGGCGTCTACCGCTTrGCCA 
CTGATAGGAATGACTTCCGGAGOAAOTGATACTAAATITGGGACTCTITGCTGCGGGAGrrrGGC 
TGGCCAGGAACTTGAGTGACATTGACCTCATGGCACCTCAGCCAGGGGTGTAGCCAAGTAGACAA 
ATGGAATCCTGTGCTGAACCCGAATCTTCCAAAAAACAGCCTACAATCTGTGACCACCACAA^ 
GTGCCCTGATGGCAGCTGAAGTTTGATTCAAATGGGCACrmCXrCCCTrcCTGCCTAGm 
rrGrrccrrGAGTCCACGCAGAArrCCATTCTCTGGTO^IGCAGACAGGCTTAAGCnT^AAAGTAr^ 
GCIWATrcrGTAAAAGTTCTGTANCTmGGCCGCGACCACNCTTANGGCGAAATTC 
GGCNGGCCGTACTA>rrNGGATCCNAGCTTGGNACCAAACTTGGNGGAANAATGGGCAAAATGGT^ 

NCTGGGGGNAAATQTNTTCCCTNCCAA 

SEO ID NO- 1 127 ACAATTXrrrCACAOACACATGAAGCAGAACATITGAAAATCAAAACTrACTT 
AAATACAAGATTCTAAAAGTGAATGGCAATTTAATGGTTAAAATTTGAC^^ 
AGCAAGCrACAAAATCCATCACCACCAACAGTrrCAATGTTAGCACTAAGTA'lTAAACCAAAGTA 
ATGC^TATTCTGGTTTIXjCTrCTrCAAGAACAGCCCGTrrATTCTGmCAATCTC^ 
CTGGTAGOTAATAGTTACACTGACAATTCCCACAAAAAAACAGTGTTGGTTCTTGCCATCACTGAA 
rnATAGTATAAATCTCCCAAGGCTITAATCTAGTrCATCAAGTGCAGTrrAGATAAGAAAGCCAT 
AAAAACAAATGGGCATACTGNCAGTTAATGAGAAATACAACTGCACATGCNCAATrAATATTACT 
CTCrcrrAATCTACTAAGAGAACAGGTTATrAGTAGAmACAAAAANCNTCTNTTGAATTC 
AGGGCCXn-AmCCGTGGGTCriTATATNAANAATANGNAGCCANACNGGTATACTGGATNAACC 

CAGTCNAAGGCTGNT 

SEO ID NO- 1 128 CGCGGGGCTCTrCCTGCTCTCCATCATGGCGCAGGATCAAGGTGAAAAGGAG 
AACCCCATGCGGGAACTTCGCATCCGCAAACTCrGTCTCAACATCTGTGTTGQGGAGAGTGGAGA 
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CAGACTGACGCXiAGCAGCCAAGGTGTrGGAGCAGCTCACAGGGCAGACCCCTGTGTTTTCCAAAG 

CTAGATACACTGTCAGATCCrnGOCATCCGGAGAAATGAAAAGATTGCTGTCCACTGCACAGTTC 

GAGGGGCCAAGGCAGAAGAAATCTTGGAGAAGGGTCTAAAGGTGCGGGAGTATGAGTTAAGAAA 

AAACAACnCTCAGATACTGGAAACTTTGGTTnGGGATCCAGGAACACATCGATCTGGGTATCAA 

ATATGACCCAAGCATTGGTATCTACGGCCTGGACTTCTATGTGGTOCTGGOTANGCCANGTTTCAA 

CATCCAAACAAAAANCGCAGGACANGCTTCANTTGGGCCCAACNCAGAATCACCAAAAAGGAGG 

CCNTGCNCTCGGTCCNNCWAAAATTTAAGGOATATrCrrCTGGCWANAAAATNCCGK^ 

AAANAAATAAAAAANTITA 

SEO ID NO- U 29 CGAGGTACGCGGGACTCTTATrCTGCGCCTGCGCGCGGCTACAGCACGGTTC 
GTTmCCTTrAGTCAGGAAGGAOjTrGGTGTTGAGGTTAGCATACGTATCAAGGACAGTAACTAC 
CATGGCTCCX:GAAGTrrTGCCAAAACCTCGGATGCGTGGanTCTGGCCAGGCGTCTGCGAAATCA 
TATGGCrGTAGCATTCOTGCTATCCCTGGGGGTTGCAGCTTTGTATAAGTTT CGTGT GGCTGATCA 
AAGAAAGAAGGCATACGCAGATTTCTACAGAAACTACGATGTCATGAAAGATTTTGAGGAGATGA 
GGAAGGCTGGTATCmCAJ^AGTOTAAAGTAATCrrGGAATATAAAGAAmcrrTCAGGlTGAATr 
ACCTAAAAGTTTGTCACTGACTrGTGTrCCTGAACTATGACACATGAATATQTGGGCTAAAAAAAT 
AaTTCCTCriXjATAAATAAACCArrAa^AANN>WN>nWN>^ 
GGCNGGCGCrTNAAAGGGCGAAAT^CACCA^^^T^rmGNCNTTACTAATO 
CAAA>nTGGGGGAAAATTGGGNTAAATGTTT 

SEQ ID NO: 1 1 30 acaacatcagtncaataaattcacaaaactatattacagctagttaatcagt 

TTAAGAATrGTTCCCGTCAGTCACArriTITGOCCCTCAOAAGrr CATr CCTAAAGAm 

TCTAAATrrCTAGCTACCAAGAAGTTAAGAATGATrATAAGAAGCTTTCCAAGGAGTTAT^^ 

rrTGTAGACCAGAGGCCAACTATCATCACCTCAAGTCTGCrCTCACCAACAGCCCnTGTATr^ 

GGGAGAAATCTCTAGGAAAAAAGTCAGACACCAGTGTAGTCACrATCTCCCATGTCAAACCTAGG 

GOACTAAAATGGTCAGTArrACCATAAAATGATAATTTTGAGGTnACCTTAAAAGGCITATrCT 

GlCrCAAAAATTAGATAAAGAATATCrrCTACTGAAATGAATATTGACTTAACATAGAAGACTGNT 

GTCCTTrrGCCGTOTATAGGGAAAGGCATAGACATGAGTCnAANCTNCTGGATA^ 

TGNNAGCTGGAAAATCTGGAAATNAAAGCNGGCTNAAAANCTNAAGGCCATCTm 

TGGTnTAAAANNCNTTAAGGGTTT 

SEO ID NO: 113 1 ggtactcaaaaatcttcccctactcaaattcagaaatctgttactagatgtgt 

GTGTCTACCTGTOTGTGTTrrAGAArrAOCrTCTOTGOAAGTrCCTrAGTCC^ 

GTTTCCAAAGGTCTGTTTGATAAGrrGACCAAATATACATGrrGATATGGTrrcGCTGTGTrc 

CCCAAATCTCATCTTAAATrGTAGCTCCCATAATTTCCATGTGCCATGGGAGGGACCCAGTGGGAG 

ATAATGGAATCATGGGGACAGGTTTTTCCCATGCTGTTCTCATGATAGTGAATAAATCTCATGAGA 

TCTGATAGTmATAAAGGGGAAGTTCCCCTGCCATGCTCTCTTGCTTATCACCATGTAAGATGTG 

ACrTTGCTCATTCrccmGCCATAATTGNGANOCCTCCCAACCATOTOONACTGNGAATCAAATr 

AAACCTCTTrCCTTTATAAAmCrrcATTCCAAATATATATNAAACTCAGGCCCAAAG 

TCATGCCTGTAATCCTAACCCCTTTGGANGGTGANGGGGGCCAAACACTrrAGGGCAGGAGTTNA 

AAACNCCCCGGGCCACATGGGGNAAAC 

SEO ID NO- 1 132 ACGCGGGATTGTNGGAGTCCTGTGCACAGATTCACAAGGACTrAATCTGGGT 
TGCCGCGGGACCCTGTCAGATGAGCATGCTGQAGTGATATCTGTTCTAGCCCAGCAAGCAGCTAA 
GCTAACCTCTGACCCCACTGATATTCCTGIXJGTGTGTCTAGAATCAGATAATGGGAACATTATGAT 
CCAANAAACACGATGGCATCACOGTGGCAGTGCACAAAATGGCCrCTTGATGCTCATATCTGTTCT 
TCAGCAGCCrGTCATAGGAACTGGATCCTACCTATGTTAATCACCTrATAQAACTACTAAAGTTCC 
AGNAGTTAGGCCArrCArrTAATGNGCArrANGCACCTrrNCTGGTTATTTAAAi^^ 
TCTAATGCTCTATGGGCCCGACTATCAAAGATATTA0TAAGAAAAGGATCmX3TTrrTGAACAACA 
AGGTCCANGTCACrmGTATATAANAATITGGCTGGATITCAATAAAATTTGTrTGGNGGGGNGG 

^mANNNNNNN^^NA^^IT^wcNNN^ 

CNNAAANmGGGGGG>I>n^TTrNTrGGGGNCCCCCCNN 

SEO ID NO: 1 133 AarTAGCTCAGAACTGAGGGTmACTrnTGGAAGAAGTCAGGAGTGGATG 
TTGGGAACCAGCTTGCAGTrGCTACCACAGGCCCAOAGCTCTGrrGAAAGGAAGGGCOT 
GGCATGGTGGCTCACGCCTATAATCCCAGCCCnTGGGAGGCTGAGGCAGGTGGATTGCCTGAGG 
TCAGGAGTTTGAGACCAGCCrGGCCAACATGGTGAAACCCCATCTCrACTAAAAATACAAAAAAA 
ATTAGTCGGGTGTGGTGGTGGGCATCTGTAATCCCAGCrrCrCAGGAGACTGAGGCAGGAGAATT 
GCTraAATCTGTGAAGTGGAGGCTGCAOTGAACCCAGATTACCCCATTGCACTTCCACCCTCGGTG 
ACAAAGAACGAAACTm-GTCTTCAAAAAAAAGAAGAAAGGGAAGAGCCCTTOGGTrGGGCCCCA 
GTGGC^CACGT^^TGTAATCCCACCNCTTTGGGAAGGCCAAGGTAGGTGC^^^rANCT^ 
AmrCAAAACNAGCCTGGCCAACTTGGGGAANCCCTITrrCTrAAAATGTNAAA^ 
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GNGGGGGGGGCCnTATNCCCCTCrrTNGGAG 

SEQ ID NO: 1 1 34 GGTA Cn - lTriTl mill I 'l' ll I'l H 1 1 U ll l U 1 11 1 1 f l l AAGGATTTTAAA 
AATATATm^JTTTTGAAAATATTOTGAAAGTAATTAAAATCACCTATTGGGAAGGAAANAAATC^ 
ACNCAATGAAAmAGGAANANAAATGCAAAAGCAGCTGCANAATCCTGCCCCTTCCACCCCAAG 
GGGCACAGCCCGGANATGAANCTTAT^T^AGGATCACAAGTGCNCCCT^aGGTANACC^ 
ACmXjTCAGTCTTNrrGGCTGAAGGGCCTTGTCCAAAGTCAGGATCANAAATAGGNCAG^ 
ATGOTGGCTrrOTTGGNCTOGNANGTTAAANAAGGGCAAGGGCnTGTITrCATGAGC^ 
TGGAACACTCCCNGGGTACCTGCCCXrGGCGGGCGNrrAAAAGGGNGAATTTCNCCCCANTGNGGG 
ONfNGTTNCTNATNGGTrCCNAACCCGGGNCCAACCTGGGGGAAAAATGGGCNNANNTNGm 
TNNGGAAA 

SEQ ID NO: 1135 GGTACl-j-i ri'l l i 1 1 Hi i 1 1 Tl i U 1 1 U 1 1 I CJ l 1 1 NG AANAT GG TTTA TTTTAC 
ATAAAGTTACTGNGAAAGGGAAAGAAAACAATANAAAAGrmAATATATTrrATGTTTTCACATT 
ATACTTmAAATAAAGTTAAACGTTTGATATGCrrCCTACTTCTCTTGATrAGT^^ 
ATCAACTTCAAATGGrrAGANANATT>n'ACCAGGTCGTCATCAGTCCATTCTTCCTCCrCATGTrrG 
GATTTCTTTGANACGTAATCATCATCTTCATAGCCTTTCCTCTTCTTCGTGArrACGrrAAGT^ 
GCTCAGCAAAATGACATCGCTCCAACTTmGTTTCANAACAGCAGCTTCTTCAAAACTNGGGNGG 
GTCATACTTCTITACTAAAGTTCTCAAGCCmCANTATATCTAANAAACTGGGACCAGGCAANT^ 
TTG>riTrrriTrNACAATNAAACTTTTTGANAANAACCm 

SEQ ID NO: 1 1 36 ACGCXOGTAAGATAAACTArrTAGGAAAAAAGTCTGACGAATTTA ATGCA TG 
TOGGAAATTGrTACTCAATArrAAGCATGATGAAACTCATACTCXjAGAGAAAAATGAAGrnTGA 
AAAATAGGAACACTCTGAGTCATCGTGAGAACACmGCAGCATGAGAAGATTCAAACTTTAGAC 
CACAATTTTGAATACAGTATATGTCAGGAAACCCTCCTTGAAAAGGCAGTATTCAATACACGGAA 
GAGAGAGAATGCAGAAGAGAATAACTGTGACTATAATGAATTTGGGAGAACnTrCTGTGATAGTT 
CATCCCTCTTGTIXrATCAGATACCTCCATCAAAGGACAGTCACTATGAATTTAGTGArrGTGA 
AGTrCTTATGTGTGAAAGTCCACCCriTCTAAACATGATGGGGTATCTATGAAACACTATGATrc 
GGTGAAAGGTGGGAATAATTTCAGGAGGAAAATGNGTTTGTCNC^CNTCANAAAGGNGATAAAG 
GGGAAAAACCCrJTNATGTrATGNATGTNGGGAAGCmrmTGGNGAAGCCCATTNNCT^ 
AANGGGG 

SEQ ID NO: 11 37 Ac nTrnTm n i i 1 1 u u 1 1 1 1 u GGiTnii ri ii 1 1 n 1 1 1 riTTTNGACT 

ACAACC^GNGTTTATIWrGATITGTCACCACTCTTn-CATGTCrrrGNTTTOT 

ATmAATAACCAAAACTTTACTAACATACNAATGAAAAAAACATGCCCNNCATGTNTGCATGGCA 

GGTAGTCAGGAAATNTGGCCAGCTGACTGGTTCCTITACCAAGGTTrGCAAANTAGGTl'GNGm 

AACACCTTNTGNGGOTCTGNGTCATTrCCAAGTTGAAAAATTTCANCCAAAGAGCAACATGTCAC 

ATTGATTAAAAATGGGTAATGACCCAAAAACATTTOTGTTAATCTAAKGGGAAANGNGGGNCC^ 

TAATTAATNAATTTNCCCGGAGCCCCCCOTrrnTITOmT 

TACCnTGGCCGGGCCCCNCTAAGGGGGAATTCCANNCCCTTGGGGGCGGTTATTN 

SEQ ID NO: U 3 8 GCGTGGTCGCGCCGAGGTACTAGTCTCTACTTGGGGACAAGAAAATAGAATA 
TGCAACTCAGAAAOGAAAOAGCCCAAAGACGAGAOAACCrGCTTGTTAGCTCATTAACCTGTTTA 
GTAAAGATCTGCrn"AAAATGCCTGATGCTGTGCAGTATCATACAAAACAATCTTCAGCCTTCAAA 
GCAGCrGATGCACOTCnx;AGAGATCnX}TTrGTCTGATTAACAGTCrcCIX3CC^ 
GTCirGGTCACTTTCGGTCATCGATGGTTTTAATGAATTCTACTGCTGCTGTG^ 
ATAGGACTCCTCTCCAGACAGACAGCTAGCATAAAAGCTACTGATATACTGCACAGTAGACAGCA 
AACAGGGTGGATTTGCCrrrATCAACACAAACACCAACACAGGAACAAAGTCArrCCGCTCCANG 
GACAGAGTCCTCATTGGCCAGCTCAGGANGGTCATAATCGTAAAACACATTCTCAGGATGCACTG 
CCrrrGTCCNGGGGGOmATAANCCTTAATGGCCTGAATTCTGAATGGGCNNATGGCCrrGGGCN 
TNTNCAAAANAACCCTTGGT 

SEQ ID NO: 1 139 A Ci TiTri Tl - l I " IT 1 1 1 1 1 1 i 11 1 li 1 1 lA lTn 1 1 1 1 1 1 GAATCAAAAGCAGGGT 
TTATTTTTCTATCAAATCCCXAATCCATGTrCCAGCCAATGGATGAAGGGTGAATC AAGCCCC ACA 
TANACTCTTGGTAAAAACAArrcTAACrnXrrAAAAAAAAAAAAAAGCCAACACACl ri^ 
hnriTCAAAAAGCTCCCAGGCCTrrGGGAACAGCTGAAACAAATTCATATCCTGACTAGG 
TTCTNTTAGGTATTTGGATGGTCCCTCrCTGCrrGCCACmrrGCACAGATGAGGCACT 
OrrGCAGGTCACTCACAATCCTAGCTCCACATCACTCCATGGGTTGATAACCTAAA ACCCCG TTAT 
GATrrCCATTTATAATGCCTAAAACAGCTOA AAANA CTO TATTN AATTCTGCAAATNlillTGG 
GCCACrANTTTCTTGGCCAAOCTTAGGCCTTATTTTGTGGTTTTAAA 
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SEO ID NO: 1 140 A Ci i u u i N irnnrii u 1 1 i ganatggagittcgctctgttgtccaggct 

OGAGTGCAATGGCAGTrGrrTCACCGTAACCrCTGCCTCCTGACTTCAAGCAArrC^^ 

GCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGTGCCCGGCCCANATAGGTrcATCn^^ 

GTAACAACCCAAAAATAAATCTCATAGTCAAGATrrGGTAGATAAATTTAAAATTAAAATATTCT 

GCAGTrGGGAGTGAAATOTGATAGCACATACGTTGACATrATGCATTTAGAGATGTrATA^^ 

GTATAGGCAGTNTACACAGCACTACTCAAGAAGCCAAAGAAACACTTGTGCAGTGCTAAGTGTCA 

CATGTCTGCrrCCCCAAAGGCTAGGAATAGTGATCCTGCTNTAATGNGAGAACCCAAATCATGTTT 

ATAAAATAAGGACCCTGGGAACA^^rraCAGGCCCCAATGAANTGGGGTCCT^CCGGCCCTTAKAT 

TCCTAAAATrCCCTTNCCTCANACAANAAAmGNGGGOTACmANCTTTGGCCGO^ 

AGGGGAAT 

SEO ED NO- 1 141 ACCTGAGATCCTGGTGCACAACATAGTGATCTTCATGCGAACTTCAGTGAAG 
ATTrCATACATTGGCCTCATGACCCAGAGCTCCrTGGAGACACATCACTATGTGGArrGTGOAGGA 
AArrCCACAGCTATTTAACAACTOCTATrGGTTCrrCCACACAGCGCCTGTAGAAGAGAGCACA^^ 
ATATOTTCCCAAGGCCTGAGrrCTGGACCTACCCCCACGTGGTGTAAGCAGAQGAGGAATTGGTrC 
ACrrAACTCCCAGCAAACATCCTCCTGCCACTTAGTAGGAAACACCrCCCTATGGTACGa3GG 
AATGGCTCCACGAGGGrrCAGCTGTCTrm-ACTTTTAACCAGTGAAATTGACCTGCCCGTGAAGAG 
GCGGGCATAACACAGCAAGACGAGAAGACCCTATGGAGCTTTAATTTATTNATGCAAACAAGTAC 

CTNGGNCGCGACCACGCTTAN 

SEO ID NO* 1 142 GGTAC rn - i ' i 1 U 1 1 1 1 1 1 i 1 trri l l l i l U l I NGAAATrAAATACmTAATTA 
rcATrCTAGCCAGGATCATACTAAGTAGGATCTCATGACAGTCACATATGCAGCGACrrCACCTAA 
ACCGNGGCACTGAATGCTCTGCCATOAGCCOCAAGCAGCACAGTGATCATCACCCACAAGGACAG 
GTTGCTOGGATGAGGCACCCTTrCCTnXlATGrrTAGGNTCTrCTCAC^ 

AGGTCCCAGCCACACAGCGTT^^^^TTAGGGATTAAAGTAGTNGGAAAAAAATAAAGAGAACACA 
NCATCCATCCnAAAGAAAAAAAGTAAATCC ACTT TATGGNGGACTTCAGCTATGGGACAAATTT 
NQGATCAGNGGTrrrCAATCTGNACATAATCTITITGTrACCTGGGAAAAAAGNGGCAAGG^^^ 
TGCCCGGGNGGGCGGrnAAAAGGGNCNAArrTCCACCCCACTGGGGGGCGGTTATTAANGGGAN 

CCGACCNG 

SEQ ID NO: 1 143 A CTn - lTl T rri L- li - l 1 1 J 1 1 1 1 i 1 1 i i ' ^ ^'^™'^'^'^ll^JJ}!^^^^^ 
ATITCmCTITrrrAAriTAAATNGAGTNCATrTAmGAAACANACrGGGCCA^^ 
AATTCCTGGTCAGCACCNCCGATGTCCAAAGGNGCAAWCAA^GAAGGGCAGGCGTGATGGCTN 
ATINGTITNGGATTCAANGATNGGCTITCrCCArrCATrNGNCTITITA^ 

AACAGNGTAAGNGAACCTGCTGTI'GCCXrrCAGCAACAAGTTCAACATCNTTAAAGCCCTGTAAAA 

TGACAGCCTITITCAGGrrGCCAGCTCCTNATCCATGTATGCAATGCTGrrcn'GCAGGG^^ 

GATGTIKmAAAGGCATANTrTGGCCACCAGGCNCCTGAAAGGCNAhriTGGGAACCCr^ 

GAA^^NccCTCCTITNNCNTnAArrNAAA^rrGG^m^ 

GGTTTTTTGCCCNNG 

SEO ID NO- n 44 GGTACrCAGAACTGTGTAAAAGTTAACCACCAAAACrATGTnTAAAGAAAA 
AAGTTTAAATGTAAGACTAATGTCTTGGGGACrrTACTAAATG^ 
AATOTAnAAGTATTGGCAATrcrAATrTAAAAATmAAATTm AATTATTGGAAA m 
CAATAGTrrCAAGAGCTAAGAATCAGATTGCrmTAAAACAGAAl I'l I'l l 1 1 1 V rTCCTQAGGGAC 
TCATACrrGACAACrCCAGrrAAACATAATACTCCACCCAAATCCC 

ACCCTGGAATAGTAAAATTATAAAATGGTArrrCTAAATTATAATATATATACATAATGCAC^^^^ 

rrAACTGNCACATTTACCAGCAGAATTATGAAATCAAAAACAAATTCTACATTCAAGGGAO^ 

CGATAAATGCKnTTCATTGGTrrAAGAATCCArrCCArrCTTrGOTQGrrrCTACTCCATAm 

AANTATCCCCAANGGACCTCGAAGGCCAATCAATCCANTTCCTGAANCCACrGGCATAGGTCAAG 

NGNCCTTGGGCTATNAGCAATTCCTCTGAANACTGGGCCCTAAACTGCNCGGAA 

SEO ID NO* 1 145 ACTAAGCGGCCTTGGATACCTGGCCGCGGGATGCTGGGCGGCGTCAGGTGAG 
CGGTGOTCGCTGGGCCTCAGGTAACCATGGAGAAAGAGCTGCGGAGCACCArrcrm 
TACAAAAAGGAGATArrrACCACCAACAATGGCTACAAATCCATGCAGAAAAAAOTCGGAGTAA 
TTOGAAGATrCAGAGCTTAAAAGATGAAATCACATCTGAGAAGTTAAATGGAGTAAAACTGTGGA 
TTACAGCrGGGCCAAGGGAAAAATrTACrGCAGCTGAGmGAAATCCTGAAGAAATATOT 
ACTGGTGGAGATGTCTTTCTGATGCTAGGAAAAGGNGGAAAATCCAGATITGACACCAAT^ 
CmTTACTAAAAAAATATOGAATCATNGGTNATNATGATGCCTGGGGGTAAAAAAGGNm 

NAATA^rmcATCCT^AANAA^rmr^ATITrcAANGGAOT^^^ 

N 
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SEQ ID NO: 1 1 46 A tmr iT I T ITV m r I ' IT I 'l IT Vn i I INGAGGCTTGNGTTTTATTCAAGGCrT 
GTCAGCCrCCCTCCTTCCCCCTGCGGGTGAGGTAGGAGGGATAGGGTTGGGGGAGGGAACAGGGG 
CTAGGAAGGGGCAAACTGCAGCTGCCCTTTGTAAACATATTCCANTGCAGTGGCAATGGTGCGCA 
TGTCCAGCrCGATGCTCCGAGCCCAGTTCTCCACATCCCCAATTTCCTTGAGTGCCTGGTTGAAGTT 
hrrCCACCATTCCGATCCACTGGCCTGTCTGCnGGCAAATTGGGCAACCTGGACCTGTAGGGTCTT 
CACCTCATGGGCCAACTTTCTCTGQNTCATGTAAGCCCTGGCCACACCCC CATTGAN GNGATCCAC 
CCAAAC H ^ l lT^ ^ NNAGGCAGGG^n<CTmAANNAANANCC^^TmCC^^' CTGGGANNN 
CCNTGGNGNNAAT^WG^n^^GGGCCNGGGGOT^mT^AAGAAGNNGGG/^^ 
G 

SEQ ID NO: 1 147 ACACTGrrGGTGTTATATGGGGATGGGGTTCTCGGTAATTTrGTTrATTATTTA 
TGTTTATTATTATOTTTTATCATTAArTATTCAATAAArrTTTAmAAAAAGTC 
AATCTTCTGTGGGGGTGGGAGGGACAAAAGATTACAAACCAAAACTCAGGAGATGGTAACACTG 
GAATTGATAAAATCACCTGGGATTAGTrGTATAACTCTGAACCACCAAACCTCTGCTATCAAGCCT 
TGCTACAGTCATGGCTGTCCAGAAAGATTTACAGTTATTTTTCTGAGAAAGGATCCATGGGCTITA 
AGAACrrCAGAACTrrAAGAACTTCAGAAGrrCTTAAGTTGCTGAAACTCAAGTAACCAAAJSIlTG 
AATGCCATCCAAAAAAAAATACCAAGGGAOTCAAGGNTTTGAAAGGCACANTCTTATTCrrAAAG 
NGACTGGTTNAAACCNGGCCAAAACCANmAAmrCCTGGNGAATCCNAGNNACCNAATGGC 

ArmccGC 

SEQ ID NO: 1 148 ACCATTTCGCCCTnTGCCTTCATGCCAGGAAAACTTGTCCATACTAATGAAG 
TCACTGTTTTACTGGGGGACAACTGGTTTGCAAAGTGCrCAGCAAAGCAGGCTGTAGGT TTAGT TG 
AGCACCGGAAAGAACATGTAAGAAAAACAATAGATGACTTAAAAAAAGTGATGAAAAATTTTGA 
ATCCAGAGTTGAATTCACAGAAGATTTGCAGAAAATGAGCGATGCTGCAGGTGATATTGTTGACA 
TACGAGAAGAAATTAAATGTGACTTCGAATTTAAAGCAAAACACCGAATTGCTCATAAACCGCAT 
TCCAAACCAAAAACTrCAGATArrTrTGAAGCAGATATTGCAAATOATGTGAAATCCAAGGATTTG 
CTACTGATAAAGAACTGTGGGCTCGACTTGAAGAACTAGAGAGACAGGAAGAATTGCTGGGTGAA 
CTTGATAGTAAGCCTGATACrGNGATTGCAAATGGAGAAGATCCACATCTTCTGAAGAAGGAAAA 
GGAANTCGTACNCCAATGTGAATGCGATGCATCAAGTACAGACTCTATACTCCTGGCATAAGGTG 
TTCAGTCAAACCKrCNNGGCAATGATANCCNTGACTNTCANGAAGGTCCT 

SEQ ID NO: 1 149 ACAAAAAACTGTOACATCAAGAAGGGCAGOAGAAACAAAAGGCATnrCTAT 
AACATCTATCTGATCCTAACAGAGTATGTAGGAACAGAATAGTAAGTCrrTAGTGCCATAAGATCT 
TAACATCTCACTTCTACTCCTGCTCTCCTAGTTCCCCCCAAAAAAGAAATACTGACCAGTGTCTCT 
ACmAAACCCTACCTGAAAClTGAGACTATGTCTAATATAGAAACTCACATAACTAGCCCAGGTA 
ACACAGCAAGACCCCATCTCTACAAAAATATTAAAAATTTAGCTGGGCATG GTGGC ATGTGCCTG 
TAATCCCAACTACTAGGGACAGTGAGGCAGAAGGATGGCTTGAGCC CAAOAG TTTGAAGCTGCAG 
TGAGCTATGATCAAGCTGCAATCCCCCTGGGTAACACAGCAGGACTCTTITrNAAAAAAAGGAAN 
NNNANGAAOTCCrnWAGTTCAmCCCTANAAGTATNmCAATGACCATACA^ 
TAAACnTNAATTT 

SEQ ID NO: 1 150 ACATGAATrAGAAGCGTCCATCTAGGATTATGGCCAAACTGTTTTAAAAATG 
CAGAAATCTAAAATTACATCTTGAAAATATGAAGAGATGGTCTACACACrrCAAAAATCAAATGT 
TGCTTATAOCAGAGATGTATGACAATCACGGGATTCAAGTGACAAGCAGTAAGATCTCAAAAATT 
AATACTGGTCAAAGATAATGGGAATATITITGCAmCACTGAAAATACATTGACTACTAGAATAC 
GAAATCTAGCAGGAACrCAGGGAAAAAAAArrACAAAATCTAAAGCCAATTAOTAATATTTCTT 
ATTACCTAAACAACAGCATGACATTAACAGAAAACTGCCCrGCATTrCAATTGCCAATCrcACGTT 
AATTAAAGTTCTTCAAAAATGAAATGCAACa^AAACGGGTTCAAACTTCAAATGAAAGTCTGC^ 
GGCTCAGATTAAAACATGGNATGGTTCANATGAAAGTAATTAAAACACT ArroG GTTTGGTCTTAT 
TOAATGGAATAAATOTTGACKTTCITrGNAACTTGGACTACTOAATn'CATr^ 
A' ri ' rriLM ' r AANGGGGCAAGTGCTrAAGTNCTrAGGAGCATThfTCAAGGCC 

SEQ ID NO: 1 15 1 ACTACCACAGCCTTTAGGTGACATTGATTrATAACTrGGTCACAATTCACTGC 
ATTTAGGAAAACCAGCATTCTrATCTGGTCAGTGCTCGCTrCTTAGCAACCC CTAA TTAAAm 
TCATCTCTAAATCrrAGCTTCAACTrrATrCAATTACATTTGGCrGACGGCTGTTTTCT/^ ^ 
TAAOTGTTGACCATAAATGCAAAACTrCCAGTATCTGNTGGGTTTTATTAGCAGATGCTGCTTTrA 
rrrAAAAAAAACCGACAGTATAACTGNCATAATrATGGAAGGCACTGCTTCCGATAATrATATTCT 
ATTAAAAAAACACCATTTATAGTGAACTCTGCACTGATAAATAAACAATAAATATCTCAAGTGCC 
NAAAGGACAGAAAGCrCTCCCTAAGArrAACACTTTGOCCNAAATTGGCAGCATATTATTCnTAA 
AGTCTGACAACTGAGTCTGCACTAAACACCTGGAACTGGGCTCTTTCANGGGCCTTGGAAGAAAC 
CAAAArrCCCAGAACrrAATGNGGCrrNTNGGGGAAGGGCCCAGGGAAAGAAAANTCTAAGCTTT 
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GGTTTNCKjNCCNCTTnTAAAGAACCGGGGNACAAATrANGC 

SEQ ID NO: 1 1 52 accagtgagaagacagctttgcagtcacactggagatcagagttccaggctg 

CAGCATGTCACCAACGCCAGGGAACCTGOCACTTAGGCCTGTTGGCTGCTCAGTTTAACTCCGGCT 

AAGCTGCCATGCATGGGCrCAGCTACTrcATCAGTCATOTTGTCAAACTGTTCCCCATCCTCACTG 

NCAATTGTGCTATTCTGCCACTCAATCTOGGGGTTAGATAAGaSCTCATCATTCTCAGATGCATTC 

TrCCACTarGACTGGTCCTTAGACCAAAACCCrrcAACrrTTCCATAGGAACT^ 

ACTGCTCTCTGCACCTTGCATGGTAGCCTTITITITGGCAGCTGGOT 

CTITGGGAAGACTCGTCAGACCAAGTTNCTGGGrrCATrrCOCAAQTATK3GCCTCAAGTCTC 
CTT^G^^^GGGATCCAAGCCCAC^K^^AAAANGNGAACCCCGT^mTCACmAACAACATCGOT 
CKAACCTACrATCAGAAGCrm"GNGCATAANTrGGTGGGTCCTANCCAATGAANANTrCCNAN^ 
TGAACCATCCCriTIX3GGCAACmTCCCAAATrTGAAAATCCrTO 

SEO ID NO: U 53 NGTACTCAGCTCTGTTCrcCTACrCAGGCI'GGGACACCCTCAACTATGTCACT 
GAAGAGATCAAGAATCCTGAGAGGAACCTGCCCCrrCTCCATTGOCATCTCCATGCCCATrGTCACC 
ATCATCTATATCTTGACCAATGTGGCCTATrATACTGTGCTAGACATGAGAGACATCTTGGCCAGT 
GATGCrGTTGCTGTGACTmGCAOATCAGATArrrGGAATAriTAACTGGATAAT^ 
GTrGCATrATCCrGTTTTGGTGGCCTCAATGCCTCCArrGTGGCTGCTTCTAGGCJ-l^ 
GCTCAAOAGAAGGCCATCTCCCTGATGCCATCTGCATGATCCATGrrGAGCXjGGTCACACCAGTG 
COTCrCTOCTCTTCAATGGTATCATGGCATTGATCTACTTGTGChrrcOAAAACATC^ 
NTAACTACTACAGCTTCAGCrACTGGGTCmGTGGGGCTlTCTATTGNGGGGCAAC^ 
NCrGGAAGGACCTGATCGACrrGNCCCCTNAACTTAANGGTrCrrcCCATGGCTITNGOT 
TTTTCCGGGG^^'GGTCCC^TTCAGGGGACITmACTNCCTATTG 

SEO ID NO: n 54 ggtacaaatatttaagagtgttgattgggagtaagggaatgtcaactgccaa 

TAAAGTCGAAGATGAAAGAATAGGACTITACACAQAGCATATTTAGTrATGGGTCTCTGTCTCCTC 

CCCACACAGAAAAACTCCCAGACATTCATGACriTCATCCACCCTGCCTGGCAGATAGGCCATATIT 

CCTGACX:CCCTOGCTCTrCACCTCAAAAGGTTTATCTTTCCACCACACTAGCAAAGACCCTCAGGA 

CACACAACrCATACTGCCAGCTAAGCATCTACCCCGAGGGACAAOGCAAGCACACACTAGGGCAG 

CrGGOCCATCCTGGCCCTAATCCCTCCAGCGGTGTCCACACrGAGCATTGCAGCACTTGTAGAAGG 

TGGTCATCGGCTCATCTGCAGANCGGGTCTGAAGCTGCATGAAGTAAGCACGAAGATGTTCGCAT 

rrGGGACACGACTTTTGCAGTAGAATCAACArrCTTCCAGCAGCTGrrCACCAGCACATNATO 

rrcmCAGriTNOGTACCTGGCCCGGCGGCCGTTCNAAAGGCGAATTCX:NCACACTGGCGGCCGT 

CTANNGGACCCACTTGGNACCAACnTrGGGGNAACATGGCATACTGGTNCTGaGGNA 

SEO ID NO: 1 155 ACTGAGAACAATrrAACTCTGAACACAAAAGTrrAGTGAArrTGCTACTGTTC 
CATTACAGGACAATTAAAAATOAGACTATATCAACrrCACTAGAAT TTAATT GCTAAAGCTACOT 
ATGCACATCTATTAAACTAAAAGAAACGACTITAACCCCrrCAGTTG'il-J"lUAAGACAOCACTCCT 
TrACAGOOAOTCAGGTTrGGTAAATATAAAGGATACATAAAAAATACAGTATAAACTGCATAAGC 
TTAACAGTAGCAAAAACACTGATGAACTTTTAAAAAGTCAAAAATATATAAAAATATTAGCCTGA 
AATCGCAAATTTTCAAACACCGATCnXjTGTAAAAATGTTTAAATATTGATGmACTCCAAAAATAT 
ATACTTATTCTATrriTrrrCTAmGCAACAGTITATAAAGQCAAACAAACACCTGCAATO^ 
ACAAAGCCCAATTCACAAACTTTGGATCAAGAAAAATCCCrrCCTATGGTITAACAGCATCTTCT^ 
TACCATAGTGAATGAATCTAATTGOTCCCAAANriTAATTCACATACTTTACCAAA>nTATrrA^ 
TACGGAAAAGGTrGGTNAAAAGACAGTAGTCTGACAATATTTACATTCAAANNN 

SEQ ID NO: 1 156 TCGAGCGGGCCGCCCGGCCAGGTCTGCAAAGACCCATCrrCCCTCCAGTTAA 
TACACTCCCAGGATGGGCTGCAGAGOGGGAOACTCTOAGAGAAGCTGGAGGCCCACAAAAGTCC 
ACTGACCCTCirrCrGTCCCAGAAATGAATAAAGGACCCAGTrcTGCrrTCXnTa^^ 
ACAAAGrrGTTTGTGCrCCAAGAAAATGTGGGAATAAAAAAATCATGTCCCAGGTCATCTTrGTGT 
GTGTGCGGGGGAGGTGGATGGGAGGAAAAGGCATGTATTAATAGATACTGCTOCTATAAAATGAC 
ATAAAT^ATAGCCCT^GATCTG^m'CTGTAAACAATGCCAGCTTCT^CAGGT^AT^GGCA^ 
CCTAATATACCTAGCCCAGATCCmCATAAAGTCAAGTOCTATATTTCCAAAATAATCCTATGAA 
ATCATGAAAGGTGTGAAAGGTTAATATGAAGACCCCrrCl'IirGNGNGClliiiTCACCCITTATA 
AAGAAGrrGCAATrrATCAGGACCCCAATAAGTAArrAAACCCTCAGGGGTAGCrCACTTATGATC 

caacctgacactggc<xatatttataaatcatgggtac:attgagggoaaaagottgccggcaata 

GNACCTT 

SEQ ID NO- 1 157 ACCTAGTGGCTGCTGTCrn3TmGCTCCATITITrTCAGOTC^ 

CTCACCCTTTACrmGCrGCTACCATCTrCTCTCXAGGAGGATCGTAAGOTTrGGGACGGGC^ 

CCCACAGAGAACACCTGGAGATGGGAGGCTCCTAmGGAGAGCCCCTGGTAGAGGGGCTCTGCr 

GAGGAGACCCCAGATAGGACTCTGGGCTX:ATACAGATGCCACrrATCATTATCTGAAGGGGTGTCT 
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TCCrCCmATGCACTGAGGGATCATGGCAACGTAACK::AGTGTAGTCT<3GOT 

GATATCCACrrCACTGCCCAGCTCTAAACTAAAGOAATOATCTOGAGTGGAGGACAGGACCCCTO 

OOGAAAGGGGAAGAAGTTOTAAGAAGGTGAANGGGGCAACCTGGTCNGGTmGGTAAACTTT 

GGGAAATGGCCAATTGGGTTCACCGTCTGGGGGGGCTGCTrATTANTCTNCTGGACTAAGGGGGC 

CAAANAGATCACAAGTGTCATTCAACGTGGTCANAAGGCATCTGCATTGGGTTrcANGCATmTO 

CCCACANGGCTTCCAATCGAACTCCTrAAATCCATTOm-CAACATCCANC^ 

SEO ID NO* 1 158 ACACTTGAAACrAAArrrCTAAAACrrGTrmCrrAAAAAATAG 
ACATTAAACCATAACCTAATCAGTGTGTrCACTATGCITCCACACTAGCCA^ 
TCTGGTn-CAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGArriU-rrUCllTACAATTCAA 
GTCrrCAGCAACTTGAGAGCTITCrrCATGTrGTCAAGCAACAGAGCTGTATCTG^ 
GCATAGAGACGGTITGAATATCTrCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAA 
CATAATCCrGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGCAAGTTGGTTCATAAACCAGAT 
TGAGGAGGACAAACTGNTCTGCCAAmCTGGATrrCmArnTCAGCAAACAUi I lui i lAAAGC 
TTGACTGNGTGGGCACTCATNCAAGTGATGAATAATCATCAAGGGGTTGGTGCTTGCrTGGATTAl" 
ATAGAACrrCTTCATATGTCTGAGTtXCAATAAATNGNCANCarAACCnTrGGAAANGGOT 
CAATn-GGGCCAAAAACCCTTGGOGCCNTITGGNTCCAGGriTACTGGGGGACT 

SEO ID NO: 1 1 59 ACCCTrrAAGATATCCATCTTrrrCTTTTTAACCCTAATC^ 

TTITArrGTATAAAAAGTTrcACAGGTCAATAAACITAGAGGAAAATGAGTATTTGGT^ 

AGGAAAAATAATCAAGArnTAGGGCTnTAl"l"l"rnUl 1 1 1 GTAATTGTGTAAAAAATGGAAAAA 

AACATAAAAAGCAGAATmAATGTGAAGACATITTTTGCrrATAATCATrAOT^^ 

rrAGrrrAGTGTGTGTGCAGAGTCCATTTCCCACATCraCCTCAAGTATCnTCTArrm 

ArrCCCrmAATCAACrGTAGGTTAmAAAATAAATTCCTACAACrrAATAAAAAAAAAAAAAA 

NNNAAAAAAAAAAGNCCrrTGGCNGGAACNCCChrrAAGGNCAANTaWCO^ 

TANGGGTCCAACCTNGNCCCAAGNTITGQGGAAACATGGGCAAACTGGTTCCTNGGGA^ 

CCGTrCCAATTCCCCCANTNCAANCCGGAGCCTAAAGGTNAAACCCGGGGGCCCATGAGNGNCTA 

CCTCKCT^AATOGGTGGCC(>TGNCC^mTCAAT^GGAAACN 

SEO ID NO- 1 160 ACACGTrCTrGTTGTCrcGCTCGGCAACAAACACCACTTCCTGGCCAGTCTTC 
TGGTTATGGAGTCGGACAAATGTCTGGTGAGGAGTGAGrrCAGCACCAGTGTTCACATCTACCAG 
CTGGAAOAACAAGGCGAAGTTCTGGTGGCTGTCTGCGATGAATOTGCCCTrGGCmGGCr^^ 
TGTCACCCGGGTAGTTrTGGGTGCAATGCTCTGATCCTTATCCACGGTGGAAAGATCAACATrrGT 
GATGCX;AACrrCAGTGGAGATCTOACTC:rGAGCTCTACGGTATITGCAATATACCGGTrGTCACC 
rrCAACrrCGACAAGGAAGTCATAATAACCACTGGAAAATITGACGTTC^TGAAATrTAArrCAA^ 
AACATCCCCTACAGGGGTGAAAGATGTCTTCTGGAAGACAATGGCn'CTGGAAACCACAAGATTTA 
GCATGGTCTAArrrAACAATGGCCrOAATCAAAAGCrrGAGACCGAACATTrGGTGACim-(^^ 
CCCCAAGAAAACCTGGTCATTAATGTCNGAACCAAAACCCTTAGGNNCAACCNCANTrGGACCTG 
GNANCCAT^^TCCAAAACNCNANNACTT^NNAAGCCCCCr^AAAG^^T^GAAA^ 

SEO ID NO- 1 161 ACGCGGGACAGCACGGrrCGTmTCCTTTAGTCAGGAAGGACGTrGGTGTTG 
AGGTTGGCATACGTATCAAGGACAGTAACTACCATGGCTCCCGAAGTTITGCCAAAACCTC^^^ 
GCGTGGCCrTCnXJGCCAGOCGTC:rGCGAAATCATATGGCTGTAGCATTCGTGCTATa:C^^ 
TGCAGCrrTGTATAAGrrTCGTGTGGCrGATCAAAGAAAGAAGGCATACGCAGArrTCTACAGAA 
ACTATGATGTCATGAAAGATrnGAGGAGATGAGGAAGGCTGGTATCmCAGAGTGTAAA^^ 
TCrrGGAATATAAAGAATrTCTK^GGTTGAATTAOTAGAAGTTTGTCACT^ 
ACTATGACACATGAATATGTGGGCTAAGAAATAGTTCCTCTTGATAAATAAACAATTAACCAAAA 

AAAAAAA^^^^^^^^^^w^^^^WNN^^ 

SEQ ID NO: 1 162 GGTACAGCTTrroTCCCAGrrTGTTIXnt^CATnACTGGATGCAmATGGAAG 
GCACATTACAGGTOTTGIGGGAAGAAACAGAAAGAAATCACAAAAGCAATTAAGAGAGCTCAA 
ATAATGGGGTTTATGCCAGTrACATACAAGGATCCTGCATATCTCAAGGACCCrAAAGTTTGTAAC 
ATCAGATATCGGGAATAAATICrAirACGTTACCACTAATAAACnTATTTrACAGTAAAAAAAAAA 

AAAAAAAAAAAAAGT 

SEO ID NO- 1 163 ACGCGGGGGGAAATGCGTGTTCTAGCTrTCTGTGTGCTTAGGTGCCCGAGCrA 
CTGAGGGTCTAAGTCCGGGCAGCCGAAGAGTGTGGTAGGTAACGGTCCTCAGCGCAAGGGTCATT 
TCGTCGCTGGGAAOGGACGGCCCTCGCCCGCGGTGATGGTGGTTAGCAAGATGAACAAAGATGCG 
CAGATGAGAGCAGCGATTAACCAAAAGTrGATAGAAACTGGAOAAAGAGAACGCCTCAAAGAGT 
TGCTOAGAGCTAAATTAA'ITGAATGTGGCrGGAAGGATCAGrrGAAGGCACACTGTAAAGGTAA^ 
TAAAGAAAAAGGACTAGAACACGmCTGrrGATGACTTGGTGGCTGAAATCACTCCAAAAGGCA 
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GAGCCCTGGTACCm 

SEQ ID NO: 11 64 acatattattagcatagaattcatcattttcagcagataggaccacttctctt 

AAGTCrrrACTGATTCCCGGCACTCTGGAAAGATCAATCCGATTGTTGTTTATGCCTAGTAGTrCGT 

GGACCATGGCCTGATATGTCCACTGGTTTAGCAATGGGGTGATGGCATCATCACAGCXjATCTAAA 

ATAAGGAGCAATGGAGGAACCTCTGTCCGACGGAATTCAAACAOTTCATATTCTTrAGTTATCACr 

TGCTTAACGCACTCTGCAAGTCTCTTTGCTGCCTCTGATGAGAGCKjATAACGAATC^^ 

TTCTTCAGAGATAAAAGGAGAGCTOTAAGCCCTTGAGTTGGTCTAAATAGCrGGGCTGGAT^ 

attcgaccctggcagcaacccaaaatattganggaaaacaaatgtggggtcacagcaatgtaatc 

ccataaaattoctgaacttaaccccaactttctggtcattancttragcccatgc^ 

tgctoacacattctggaataaatgaaatamaggganitgggcttngaccncngaa^^ 

CCAmrCNlTGNGGCGAAAAACNAATGCCrTAGGGGrnTAAN 

SEQ ID NO: 1 165 CGAGGTACATGAAAAAAAATAAAAAATTrGATCATGAATCAAAC CCTG GAAC 
AGATGAAGACAAAAGCGGATGAGTGAGTTATATAAACTTACTTCCAT TCTGT TTCGGATTn'AAGT 
TTGAGAGACTTGCTAATGAATCrCCTrrATGTTGTTTTCCITITCAT^ 
TTGTCCTTTTTTTTCTrAATGTGGATTTCATTGAGTTGATTTm 
TGT 

SEQ ID NO: 1 1 66 ACGCGGGGCATCACAGTCAACTTCACAGCGACCAAAGTTAAAAAGAGTGATG 
AAAGAAAAQACCAAACCrCAGQGTGGAGAGGGCAAAGGCGCTCAQTCAACTCCGATCCAOCACT 
CCTTCCTCACTGATGTCTCAGATGTTCAGGAGATGGAGAGAGGGCTGCTCAGTCTTTTGAATGAT^ 
TCCACTCTGGAAAACTTCAAGCATTTGGAAATGAATGTTCCATTGAACAGATGGAACATGTTCGGG 
GAATGCAGGAGAAATTAGCTCGCITGAATnGGAGCTCTATGGGGAGTTAGAGGAACrrCCTGAG 
GATNAGAGAAAAACAGCCAGTGACTOCAATCTX}GATAGGCTTCTGTCAGATTrANAAGAATTG 
rrCTTCCATACNAAAACTCCATTTGGCAGATGCNCNAGATGGTCCCAATAOT 
TGAAATGNANITTCnTTCnTGNGAATTGAAAAAAACNNCAGNCTTT 
TNCCNAATNArrmCCCACCNGGGATnA^nWNTTAAAGGCCNGTTNAGGAAl^^ 
NGTTAANAAAGNGAACCCNNNAATTKCCCAAATGGGGGCN 

SEQ ID NO: 1 167 CGAGGTACGCGGGAAAGAGGATGGTCAGGAGTATGCTCAGGTAATCAAAAT 
GTTGGGAAATGGACGGCTAGAAGCAATGTGTTTCGATGGTGTAAAGAGGTTATGTCACATCAGAG 
GAAAATTTAGAAAAAAGG'iTTGGATAAATACCrcGGACATTATrrTGGTTGGTCTCCGA^^ 
AGGATAACAAAGCTGATGTAATTTTAAAATACAATGCAGACGAAGCTAGAAGTCTGAAGGCATAC 
GGCGAGCTTCAGAGCATGCTAAAATCAATOAAACTGATACATITGGTCCTGGAGATGATGATGAA 
ATTCAGTTTGATGACArrGGAGATGATGATGAAGATATTGATGACATCTAAATrGAACrCAACArr 
TTACArrCCATCTmCTOAAGATTGCCTACAATrrGGATTTTOATCATGACCAAG^ 
TirAlTAGCATGAATGCCATlTGGTTAAACNAGACrGATTGGTTCTAAGANATnTTGGGTTm 
AAACTGNTAATAATGCTGAAATATCTTAAGTGAGATGTTAACCCCCTTTGGCCTTTAAT 
AGCTTITGGTNAAAAACCTTGTTACTAATTCCAAAAAAAAAAAANCCTCCTTCT^ 

SEQ ID NO: 1 1 68 CGAGGTACATCTTATGTCCAGTTTAAAGAAGATAGC CATC CTnTGATCTTGG 
ACTrrACAATGAAGCTGTGAAAATTATCCATOACrrCCCTCAGTTTTATCCrrTAGOGATTO 
CATGATTGATCrTGATGGATTTTCATACGATTGTAAATGAGCrATATTAAAGTCTATTAAAGGAAG 
CCCTTCTTGTrrGAGGGAGAGATTrcrrGTGCTTTCTCATATTrAATTTGCTGT^^ 
ACCTAGAGTTTTTGATGGAACTGATATATTGACAGTrCrCACCGAAGTCCTTrTATAAAGAA TTGC 
TACTCCAATATATGGTCAGATTAGATGCAAGAATAAAGCAGTrGTCCGAGTCTAAGTrrCrATnT 
ATTAATAAAAACTAAAATGOTAAAAAAAAAAAAAAAAAAAGTACCTGCCCNGOCGGGCGC^ 

SEQ ID NO: 1 169 ACTTTCCTGCCTTrTAGTTCCTGTGCACAGCCCCTAAGTCAACTTAGCATT^ 
TGCATCTCCACTTGGCATTAGCTAAAACCTTCCATGTCAAGATTCAGCTAGTGGCCAAGAGATCCA 
GTQCCAGGAAC(XTTAAACAGTTGCACAGCATCTCAGCTCATCrrCACTGCACCCTGGATTTGCAT 
ACArrCTTCAAGATCCCATTTGAATTTmAGTGACTAAACCArrGTGCATTCTAGAGTGCATATAT 
TTATATTrTGCCTGTTAAAAAGAAAGTGAGCAGTGTTAGCTTAGTTCTCTTTTGATGTANG 
TGATTAGCTTTGNCACTGmTCACrACTCAGCATGGAAACNAGATGAAATTCCATTTGTA<^ 
GAGACNAAATTGATGATCCNTTAAGTAAACNATNAAAGTGNCCATTGNAACCCAAAAAAAAAAN 
AANAAAAAAAAAAGTGCCACChrrmCAGTGGCTAGNAATTTAATGCCNGGCTCGGCTAC^ 
r^^AANGNGGGGTTGAGCCNT^m^GGAAAATNGAAAACCAACCGGGGGGCCAAGGGTAATNCNT 
TGGCNAACCNCNANACCAAAQGGNTTNANCCTrTGGTTNAAAACCTtTTGC 

SEQ ID NO: 1 1 70 CGAGGTA Cl 1 1 1 1 1 1 U 1 1 1 11 i ' l n IT l i 1 1 1 1 i 1 1 GTCAGNGCTTTCTACTTTA 
TTAAACATCAAAGCCCAAATAGATCTTCCCTGTGGAGGAGGACTTAAGGACACTAGGGGAGGAGA 
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AACKjGACACCTGGGAAGAGAATCACACCACAGAGACCAATCTTCACAAAAAGGGTCCAATATTG 

AmCTAGGGAGGAGCAGGGCATGGTCAGCrcAAATTTGGTGATAACGTCAGGATGAAGOACCCC 

AAGCTTCCGACGCTTTGACCCCTGGCAAAGATCTCTGCACATCGCCCGGGGAAGAAAGCAGGCCC 

TTCTGATGCTTTGATCACATATCCCCCCTTGNCTTCACCAGGAGGCCATNGAGCAACTGCATAAT^ 

CTGTCCAGCAACOZATGAATGATCmAAACCCAGGArrCTTGGTGNAATAAACAGCACAAAANAT 

GTCTGNAATTITrTGCCCTACATCTGGATTAAAATCrrrTAl^AC^ 

imCAKTGGAAAOGGCATTTTACCAATTGCTGCTATGGCTTOAN GANG CCCCNAAAA^ 

CIGNCNCCCGNAATTAACrGGNTTAGGATCCTATGNGGCNGCCTTnTGN 

SEQ ID NO: 11 7 1 CGAGGTACGCGGGGAATACTAAATGTTAAGTrCTAGGCAATTATACGGGGAC 
TCAGAAGGACCrGGCCGCTGCCrrCATTGAGTITAAAOGGACAGGATTGCCCTrCCGTCAAGAAA 
GTATGTAAGTGTTGGACrGCACAAATTAATGTTTTTCCCACAACCGAGACTTrGGAGATTAAGAAC 
TTATrrGAGGATrrAAGAATTAGGGAAATAATTTGGTGGAAACCGGGAATGAGTTCTATTCTTAAA 
CAGCCrmriTITCTTTTTAATGTrGGATATACGGCGAGGTAGAG^^ 

AGATTGACGTATATGmCTGCATTATTTTTACAACAAGTTTGTGTATCAGAACCGGGAGTGCCGG 
GGGAAGGGAAAGAAAACAAACAGGTTTCAGAATTGAATAGGCAAAGTGACTGG'nTAAAGAATA 
AGTANTAAAGAAGGTCTrATCrAAAAAAAAA>nWNNNNNNAAAAAAAGTO 

SEQ ID NO: 1 172 acgcggggaattcgtcaccaggaggaagacggagctggctgcccagcccaa 

AGGCCCATGAGGGGATGCAGTTATGGGCTCTGTCGCCXiTGGATTGTTArmGTGTCAGTAAGTAA 

TCCATAAAGTGCCAACATGGGAAAGAAAOjGACAAAGGGAAAAACTGTTCCAATCGATGATTCCT 

CTOAAACriTAGAACCTGTGTGCAGACACATTAGAAAAGGATTGGAACAAGGTAAmGAAAA^ 

GCrrTAGTGAATGTGGAAl'GGAATATCTGCCAAGACTGTAAGACTGACAATAAAGTGAAAGATAA 

AGCTGAAGAAGAAACAGAAGAAAAGCCTTCAGTTTGGCTGTGTCrrAAATGTGGCCATCAAGGCT 

GTGGCAGAAATTCTCANGAGCAGCATGCCTTGAAACACTATCTGACGCCCAGATCTGAACCTNAC 

TGNCTGGGTCTTAATTTGGACAACTGGANTGTATGGGGGTACCTATGTGATA ATGA AGNNCAATA 

TTGTAGTTCAAACCAATTNGGTCAATGGGTGAATATNTCAGAAAACATGCCCCTTTCAAATTCNAA 

NCCCCCANAAAAAAATGGGAAATTACCTTNAATAAAAANTTrAAAANA 

SEQ ID NO: 1 1 73 A CU^l - ri ' l lU UUU-jl 1 1 rrri ' lUUl - iU GGATATCAGAA ATGCA T mAA Trm 
ATn'OAAAACAACTTAAATTTTTAGACAAATGATTTTAGTATATAAATTTGCT^^ 
GAATATAAAGATTTCrCTCATTAATCrrCCATGTGAAGGGTATrACAAGCCTGGAGGAANATACTT 
TCTGCACACAAGTATGTATCTTATOTOTGCAGTATTGOAAACCAATGGTGTAGTGCTCCTACACAT 
AAATGGGGTCAAGTGACATCACAAATTAAAAGGGGGAAAGAGAAATATTCTAGTTAATCAGATGC 
NAGAAGCAAACAAGACGCAAAAACTGTGCAAAlTVAGACCAAGCCAGTAACTrTAGTTACCACACT 
GCAGATTACACTGGAATAACAGGGTTGTGANGCTATAGTGNGCACCACATTTAAACAGCNAGAAA 
GAACn>ITrTATArrGAAANGCTGGAATGANGGATTTTACTrAAAGCCAA>rr^ 
GCCAAACCAAACCAAACTGGACCCCCCGTACCi rrri ri-ri ihR^NhTIWrriTTCmTNANCCATGO 
GNGGmTTNCCCGGATNANGGGTrrTACCCGGGNAAATTTCN 

SEQ ID NO: 1 174 A ClM lMll rri ' ri ril i ' l 1 l ^^l'14^1 H ^^^i^^^ l ^IGGANATGAGGTCTCGCTATGTTG 
CCCAGGCTGGAGTGCAGTTATTCACAGGNGCAACCACAGGGCACTGCAGCTTTAAACrCCTGGGC 
TCAAGCXjATCCTCCrGCCTCAGCCTCCCAAATAGTTGGGACTAGATGCACGCACCACCACGCCTGA 
CTCAGGACATTATTCTTAAAGGNATTATCCAGGAAACAGATAAGGGCATTCATAAAACACACGGN 
11 1 1 ! C1 r fAGCTCAGNGTTAACAATGAAAGTAGATTCCACTATTGAAGCACAAGTrGCAAATT 
GGTAACATAGTGAACATATrGCTGTAGGAAAGGGGGTrCANTGTGGNGTG TTATAT GAGCACTTG 
AACTTTTTCAAGGGGNCATAAACCCAG'l'l i IN I'J GGCNCCAAANAAATTTCCmTNAGGGATTCA 
AmTCCTCNAAAGGAANAACACTGGGGACATmGGGGGCATGAACTmAAAGNGG^ 
CAAACAGGTCAATGTNTTTN 

SEQ ID NO: 1 175 GGTACTAGTCAGTTATCCAGATTATCXjAGTGTTTACCACATCATCTATTrrCA 
GAATGCTCCGAACAGTTTCAGTTGCAAGAGTCAGAGCACTGACTGATACCAACAGAGGCrGGACA 
ACCAGTTCCTCCAAAATGTTGGAAATACCACCCrrrCGGACATTAATGCCCGCAGTTTITTCTCCCT 
GGGCATGCCGGTrrCTTAGTTXTrGTTACTGTAGAAATGGGATTCAGGCCGGCATTn'CAGCT 
TAOATGGAATGACCTCCATAGCATCTGCAAAAGCACGAACGCAGTAGGATTCCATACCACrCAOT 
GTTCGTGAATATTCAGTTAATCCGTAGGGCCAACTCTATTTCTGGAGCACCACCTNCTGCAATAAG 
AGCCCTCTTCmACTAAACAACGAATAACACATAGGGCATCATGAATGGAGCGCTCACTI^ 
ATCACCAGTTTGrmGAACCCCCGAACCAACAArrGGAACTGGTITmCAGGGCTGGG^ 
CTGGAATCITGAGCAGTTGG 

SEQ ID NO: 1 176 GCTACAGAAAAGCCXJAOATTrAAATACATTTAAT ATGTC ATnTAAAAATGA 
TTTTAATAATTCATTrCTTAAAACACrGAATGAATTrTGAAGCTTAATGTr^ 



171 



wo 02/29086 



PCT/USO 1/30732 



TATCmGACATCTAATTTACCATCAAGTrGTAAAATTATTrCMjAAAAATACAGAACT 

TGTATACTTATATGGAATCTGCATGTOAGGTGTTTGAGGGCATATGTITGAAAGAGGGAGCATCAC 

CACAGGAATCCmCTOTOAGGTGGAAACAGTGOTCCTGAATCATTGTOCTCACACCTAACTTGAA 

ATCTGGTCTTACTTTCATGCTGTTATGATrrCACCTGGTGAATCAAGTGTTITAAATAAGAAAGGN 

AATAGTTGOTAANGCCCATGGTATTTAAATGAAAGTAGTTAGAAAAAAOCTCn'CCTATTCTACC 

ATTTTTAATTCrriCnCCCTTCmKn'ACCCAGNGATCAAGAGTTTCT 

AAATITGTNT 

SEQ ID NO: 1 1 77 ACGCGGGGAGGCGAGGCAGGTTCCGAGGTTGGAACACCTGGCGAGTCCTCG 
GTGTCGOTGGCCGGCAGTCATCTCGCGGCCGTTCAGAATTATAAGGCTGTCTGCAGAGATTTGAAA 

aatggcaacaaatgaaagtgtcagcatctttagttcagcatccttggctgtggaatatgtattcac 

TTrrACCTGAGAATCCTCTGCAAGAACCATTTAAAAATGCTTGGAACTATATGTTGAATAAT^ 

CAAAGTTCCAGATTGCAACATGGGGATCCCTTATAGTTCATGAAGCCCTTTATTTCTTATTCTGm 

ACCTGGATTTTrAmCAAmATACCTTATATGAAAAAAATACAAAATTCAAAAGGATAAGCCAG 

AGACATGGGAAAACCAATGGAAGTGTTTCAAAAGTTCTTCTCmAATCACTTCTGGTATCCACTG 

GCCmGAATTGGGGGAACCTATTATmCCNGAOTATTTCAATATTCCTTATGATTGGGAAAG/^ 

TGCCANGAATGGTNTT 

SEQ ID NO: 1 1 78 GGTACACACCAGAGCACCACCTCATTCAGGCTGTGCTCCTGTGTCTGCTGGAT 
TTATTCCCCATCCTGGAGAAAACCCTOCACTGGAAAGGGGATQGAOCTCGACCCACCACCCATTG 
TGATGAGGTCCTGCGGCTGATCCTGACCCACATGGAGCCAGAGCACCGCCTTCTTTTACGCAGGAC 
CTACGCAAGAAACCTGCCGGCTTTCGTGAACAGGTTXjGGGATCCTAACTGTCCGGCAOT 
GGCTGGAGAGAGTCATCATTGGTTATCrGGAGGTTTATGATGGACCTGAGGAGGAAGCTAGACTG 
AAGATATTXjGAAACCCTAAAACTTNTCATOCAACATACTTGGCCCAGAGTITCCTGCAGAOT 
OTCTTACrGAAGGCCCTmGAAACTGATTTTGTGATGTAGCAAGGGATCCAAACCTTACACCTGA 
GTCTOrrAAGAGCGCCCTGCTACAGGAGGCCACAGACTITCTGATIXn'CCTGGACCGCTGTrCTCA 
AGGACGGGTAAAGGGGTCTCCTGGCCAAAATTCCCCAAAGCTGTGAAGACAOAAAAAGTNGGTG 
AACTATATCAANAAAAGTGCAANCAGGTTTNTTAANGGCGGCACCCCTTOCAATGGGA^ 
ANCTTGGTATTACTTTCCCAAGAAGAAAANGGATTTTTrCCChmXCATTITGGATGAATG 
ATTTAAAA 

SEQ ID NO: 1 1 79 GGGTACi ri I'l ri i'l'Cl 1 rrrn ' rn ' ri ' i AGATGGAATCTCACrCTGTCTCCAGG 
CTGGAGAGCAATGGCATGATCATGACTCACTGCAACCTCAGCCTCCTAGGTTCAAGCAATTCTCCT 
GCCTCAGCCTCCCAAGTAGCTGTGACTACAGGCATGTGCCACCATGCCT^GCTAAl'l'l'ri' I'l i'l "H i 
GTArnxraGTAGANACAGGGTITCACCATACTGGCCAGGATGGTCTTGATCTCrTGACCTCGTGA 
TCCACCTGCCTCAGCCTCCCAAAGTTCTAGGATTCCAGGCGTGAGCTGTCACACCCAGCTTCAGTG 
AGTTTCTTAATTCTGAGrrCTAATTTAATTGCACTGTGGTCTGAGAGACTGTTATGATTTCCGTO 
TTTACATTTGCTGAGGAGTGTmATTTCCAATTATGGGCAATTrrAGAATAAGTGCTATGTOGTGC 
TGTGAAGAATGTATATTCTGCTGATTrGGGGTGAANAGTTCTtjTANATGTCTATTAGGTCCACOTG 
GTCCANAACTGANGTCAAGTCCTGAATATCCTTmAANTrCCTGCTrCATTGANCCNAAArrGG 
CAGGGGGGGNGTrAAAANmrCCNCTATNATTGNGTGGGAGGCTAANNCTNTT^ 
AAAAAACCT^GGTTTTATGAA^^S^NTAACCTGCCTCCTATATNGGGGGGCATGTGTATT^ 

SEQ ED NO: 1 1 80 ACTCTCTGAGCCCAATATACAGAGAAAGGAGGAAAAAAGCTAGAATTCrATG 
CATTACTACACAGGGGCCTAGCACCCTCCAGCTTCCAGCAGAGCGAAGGGAGCAGG J'l l l I'CJ 1 1 1 
TTCCCACAGAGCTCGGTGGTGTrGATrCCATACAGTmTGTTCAGACAGGAAGGGATAAAAATGA 
ACTrCGAACAGAAAGGGGTAGAGACTCTTTTCCCATTOTATTCTGCTCAAGGTATTTCCCCCCAAA 
TAAATTGAGAACCATGGAGTAGAGAAAAGAGACCTCAAGAACAGGGCGACTGAGCACAAGAGGG 
AAAAAAAAAANAANAAAAAAAAGACTGCAACTTGCTCCCAGGGACTGGAGAAAATTTAAAAAAA 
GGAAGGTTGGAATCCATCAGNGGTCTATTTTAATCATCTTCTCCTTCATTCCTCCm 
CCTTNATCATCATCTmATCTTTNTTACCTTOATCCTNATCCCCnKr^ 
TCCTCCTCTTNATNATNANN 

SEQ ID NO : U 8 1 NCCGAGGTACGCGGGGAGAAGCTTGGACCGCATCCTANCCGCCGACTCACAC 
AAGGCANGTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAAATrCCAGTGTCAGCATTCTTGCTC 
CTTGTGGCCCTCTCCTACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACAC 
AAAGGACrCTCGACCCAAACTGNCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGA 
CTCAGACATATGAAGAAOCiCTATATAAATCCAAGACAAGCAACAAACCCTTGATOATTATTCAT 
CACTTGGATGAGTGCCCACACAGTCAAGCTTl'AAAAGAAAGTG'mGCTGAAAATNAAGAAATCC 
AGAAATTGGCAGANCGGTTGCCTCCTCAATCTGGGTTATGAAACAACTGACAAACACCTTTCTTCT 
GATGGNCAATATGTCCCAGGAATATGTrTGGTGCCCATCTTTACAGNTAGAGCCCATATCACTGGA 
AGAAAT^CAAANCGNC^^^'ATGCTTACAAACTGGAGATCAGCT^^•GTGCTTGCAC^^•GA 
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TTTAAATGCTGAAACCGAArrraANAAAAAATTCAANCri'irrrilA • 

SEQ ID NO: 1 1 82 ACAAGACACTACGGGAACAGTITGCCTCCCrrCCCAGCCrCAA aiACAATTC^ 
CCATGCTGGGQCTGATGTGGGCTAOTAAOACTCCAGTTCTrAGAGOCGCTGTAGTA'l'rri'l'rrriJ'r 
IGGCTCATCCrrAGGATACTTCrrTTAAGTGGGAGTCTCAGGCAACTCAAGTT^ 
TTTTGTTTGTTTITrGAAACAGGATCTrGCrCTGTCACCCAGGOT 

GCCCAGTGCAGCCTtXACCACCTGTGCTCAAGCAATCCTCCATCTCCATCTTCCAAAGTGCTGGGA 

TGACAGGCGTGAACCACAAGCTNCCAACCTAGGCCTTAATCrrGCrGGTATnTCCATGGACTAAA 

GGTCTGGCATCTGAGCTCAOKn'GGCITACACAGCTr^AAGGGCTGNTCC^ 

TTGTGAAGCTCKSTGGCCCAAACAAAACTGCTATr^rrGAGCCAAAATTG CAAAA GClUM 

ACTGGCCTGAArrACCTGGAAGCCAATTGTTGGACCXCGGTrCCAAACCTTTTGCCTGGTAGGAAA 

GNTTAANACNCCTAATTAC IT n ' 1 

SEQ ID NO: 1 1 83 GGTACCTGAGTGCTCTTCCCATGGTCTCATAATT(l^TATCAGGTrrGlTnTGT 
DCTTCCCCCACAACCTGOACACTGCTrrAGAATCCACCAATrrAAAAATGCCTTrCTCTCGCTGG^ 
TCCACTTGATGTATTTAGGACAAGTAGCCTTGTCCTGGAGCAGTGCCAGTAAAAACTCCCAAAGAT 
AAATTGTGTTTCCCmCCGTCTTTGTTTTTCTTCTrCAC^ 
ATCTCGTCGTGGTGGTTTAGllUl'lCllCCriTrTTCCTCTrAGGCTGTTCT 
GTGAGTCTGCATA' ri -r r i'CriGCACCrGCTGTGTTTCCATCACTTCAGGAATCCCATCTAATGTGA 
GGACACATGGGTGACTGGGGCAACAACCATGTCATCrrCAGGTGAACTAAATATA'rTATTATTTAT 
TCGTTTTTCATCCAGCATAGGGCCAGGGGGAATCCATATTGAGGAGTGCCrCAACAGCCTCAATAA 
GTnx:AATTGTTTCATCCCCCGTCATGGCAAGAAGCTTCAACTGTAANGGTGATGTC ATC 
GCTATGAATTCTTCTTCACCACATNCAJ^GAACTCTCAGTAATCATGTCACTGGGCrcm 
NGCTrANACCX}GCNTAAACTATrGNGAAATATCAGCCCNNGGAACATGTTCCCCAATrACCG 

SEQ ID NO: 1184 GOCA C i Tl -l- ITrnTl ' lTl I ' l 11 l i i^ fNATACTNTTmCTCANT^NAAG'^TAAT 
ACCAACTNCNAAAGATTAATGGGTTGCTCTACTAATACmCATACAAACCAGNAGCCTCC^^ 
ACGCCAAmT^AGGCCKn'CCrrACCAAANGAAAAAAGGCTGmCT m'CC^ 
CCTGCCTTGTAANACACCANANTTCGGCTGAATCTGAAGNNTTGNGTTTTACTA^ 
AANTACCNAANAGGNTTTGGT>rmATGGCraCCCNCCGTANCCTGGCACTAAAACAGNa^AG^^ 
TOAKrmGCTTGGAAAAATATTNTTTGCTmT^ 

TTCCAGCCAGCTGGGCNCACTTCCCCATGrrrGTCATTGAACTGGAAGGCCTGAACTATTOTCAAA 

OTCTTATCCCAAAACGGCCAACAGGGAGGNCATTTACAGTGATCTGCCCAAAAAATACCCTTATC 

ATCGATGATTAAAANGCCC^^ItfAACCAGAT^rCCTIOTTmACCCT^^AAAAC 

NCAATGGTGCCNCTTCNGGTmGAATCCAAAAAGAATGTTGATTGGGTCCCCAGCCCCCTTGG>OT 

TTAAGNGTTTTGGGCCCTTGCmATTGACAAAAAGGGGAANAATTCCCCGNNGCCCCNNATO 

SEQ ID NO: 1 185 GGTA crri 1 1 1 i ' l u 1 1 1 i l l 1 1 1 i i i i J ggcataaaggaatat aaaac tattta 

TTAACCACTGTTCACCAGTATITACAATAAAGTAAACAATATACAGTrGGATAACATrrTGA'rrAC 

TACAAAGTTGTTCnrCCTGGCTmGCTGAACCAGTAAAGCAAACTCAANATTGAGCCTCC^ 

ATGAATTGGGGTAAAGAAAAAACATGCAGGTCAATAGGTTAGGTTACAAAAGGTTGTrCACACAT 

TTATGACAGCAGGTCCTAAACTOCCAACACCTCTAACCATCTGATrAOG'nTCTAT OAGC CAAGTC 

TTACATATTCCATTCATCATGACCrmAGTCAATGTAGCAAC AGGG ATTCCAACATTTTGCTAAG 

GAATGGCCCGCTAGGGAAACTmAAATGTTCAmAACTTAGTTTTGTT^ 

ACTAACArrTGTCTAGTTTCCTCATCTGGATGAGGAAACCNGCTCTGATGGCAG TGATA AAATTTT 
TCCTTTTAGGAATTITGCAAAATNAACCCACTGCAATrrATAGACACTTTAAAAAT ^ 
AAATTGNCCNTAT^r^AAAATTACTTATT^TACTTTGNAATGNT^GG^^^C^fC^ 
mTrACCTTGGAANGGGATTATCCTCTTATTTTGNCriTGGGAAA^ 

SEQ ID NO: 1 186 ACCCTTAAACrGGCAGGACATTTTTGAAATCACAAATTtGCACATAAAGAAT 
GTCACGAACAGCCATGTATCCATATACAGCAATCAAATAAGGAACTTATGACCTAAAGCAAAOar 
AAACTrrCrrGAAACTTAACArrCTATACCAACTAGGCAACCTCTGCCCAGGATGAGAGTTGGATT 
TTTCAAAAACCTCTAATTTAATAGTGCAGCATTrCGTmCCCrGATGGCCTGTGm 
TTTTAAAGACTGCTTGTTCAACTATAGCTGCAGCCTATATCCCAGCTATGGAAAAAAAAGTA 
TTAGTTCAATTTTTGCCAGTTGTTTCTGTATITAAAmAAAAAAAAACACAC^ 
TTTAGAGGTTTATTATCAGTCTGTGCATAACTAAAAGTTCAAAGCAAATTCAATmGCTTAAGGG 
AACArrGTAAAGTAACAArrCTTGGTATTACATGCCTCGTATGATCCATTTCAAACCATAGAGAAT 
TACACCriTGTGTCACTGTITCAAGAGACAAACAOATTTTGAACAGCTAAAACATCTrAAAAATGC 
ACCAAGCTTATGAAAGTCTCAAACAAAACTrcAATTTTCTGTACCTraGGCCGGGACCACGCT^ 

SEQ ID NO: II 87 GGTACACAGCTTAAAGCTATAGGTTGCAGCTTGGCTCTATCTGCTGTCTCAAT 
AACAGCCnTTCAACrGTCCACGTATCnTAAAACrrCTGCATATTTTTTAATAGCCATCT^ 
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tggga tttgaaaaaagtatttccaatgtitmaagtcttctottattaataaaa 

cttttaaatctatatccgcatcctcagggaaatctggatgactgtcgccagagccatcttttcgga 

atattcccccgtcatctccttccttcaattccccacattctgcaataacgcacaatttaocagom 

ttcacctttcacttcracatitrccaatatcctrgccactccrattcc^ 

ccacatgtttcccatccaaatgaggagttggaactgttgtgataaaaaactgagaaccgtttotgt 

tgc ggcct gcatttgccatgctcagtaaaccctcccgatcatocrrgtaatggaaattttcatcttc 

aaattmcaccataaatacrrrcrccacctgtcccatrctgatttgagaagtctccaccctga^ 

ataaactrcttaataaatogatggaaaanggcatccctttg^^ 

tgtgtncaatggcccttittctcctgtacctccatgaanaaagccctaotcctgn;^^ 

seq id no: 1 1 88 ggtaccr mgtg ttitacrrcagtgaggagattggagtctgaatcgatctgt 
tttccaagagapctgagaaatttttgtattcagcagttggaaa 
acttcccttttttgatgtagatgca gatat tctatacagttctgttgtot 
cttttgtgataaaattcaaataagatmamcrrggtaattttggc^ 
atccttgagcaatcrgtatacaatraagaqatttctgacamattcttacactaaatcgatcaac 
tctaggatttaggcatgtraacttctgrtgtgttttgaatctctccangagttgcatgtanat^ 

TlTATTT CTGTXjCC CTrAAAC CCATT TAGAAAATAACTACCAAAGTAAAAATGTAGAGGAAATAGG 

AAATGTAnTrnCATGAACATmGATACAAArrTCATCArmATGATTCACCANTTTCTTGCA 

TTA ATTTGA ATTTAAGCATTTAATTCAAAGAGGAGGGGGANCCATCCCATTATTGGATACCATGNT 

GGGCTTrrAAAAAACTTCNTTCCNTTTAT^ 

TTTTTT^^'CANNGGANAAAAATTCNAAC^r^CCCCNAGC^ 

SEQ ID NO: 1 1 89 GGTACATGACCTAATTTTTACATCATAGTAAAACAGGCCCrATGOAGAGAGG 
ACATGGGTTTCTCTGCTGAACAGCCATTATTrATACrCGTTCXAAGGCTTCTAACATGAT^ 
TTTCCTCGTATTACCACCATTCCAATATTGTrCTOTraTCCACTAGTCGCCATCTCCACACATTCAT 
CTATCACAAGGTTCATAAAGGGATCAAATCCCrGCAATATTCCTTGGACATGTCrGCCACCATTTA 
ATTTCAATGATAACTTCTTGTCCATAAATTTTTTCAACTCGGGAGGGTGAGCm 
TACTCCGCGGGCTCACAGATGCCrrGGAACGCAACGCACXjGCnTCCTCCCCGCGT 

SEQ ID NO: 11 90 AGGTACCACGCTGGTCTAATGCAAAAATGGAG ATTGCTACAAAGGACCCTTT 
AAACCCTATTAAACAAGATGTGAAAAAAGGAAAACTrCGCTATGTTGCGAATTTGTTCCCGTATA 
AAGGATATATCTGGAACTATGGTGCCATCCCTCAGACTTGGGAAGACCCAGGGCACAATGATAAA 
CATACTGGCTQTTGTGGTGACAATGACCCAATTGATGTGTGTGAAATTGGAAGCAAGGTATGTGC 
AAGAGGTGAAATAATTGGCGTGAAAGTTCTAGGCATATrGGCTATGATrOACGAAGGGGAAACCG 
ACTGGAAAGTCATTGCCATTAATGTGGATGATCCTGATGCAGCCAATTATAATGATATCAATGATG 
TCAAACGGCTGAAACCTGGCTACTTAGAAGCTACTGTGGACTGGTTTAGAAGGTATAAGGTTOT 
ATGGAAAACCAGAAAATCAGTrTGCGTTTAATGCAGAATTTAAAGATAAGGACTTTGCCATTGAT 
ATTATTAAAAGCCACTCATGACCATTGGAAAAGCATrAGTGACTAAAGAAAACGAATCGGAAAAN 
GGAATCAGTTGCATGAATACAAACTITOTCTrGAAAACCCCCTTrCAAGTGTGATCCTGATGCCT^ 
CCANAACCNTTT^^^GGAT^GCTTTACCCCCCCCTrGTGGAATTTGCCTTGCAJ^ 
GGCGGG 

SEQ ID NO: 1 1 9 1 ACGCGGGTGG AGCATGTGTATTATGTGGCCAATGTCTTCACTCTAACTrGGTT 
ATGAGACTAAAACCATTCCrCACTGCrCTAACATGCTGAAGAAATCATCTGAGGGGGAGGGAGAT 
GGATGCTCAGTrGTCACATCAAAGGATACAGCAmTTCTAGCAGCATCCATTCTTGTrrAAGCCT 
TCCACTGTTAGAGATTTGAGGTTACATGATATGCTrTATGCTCATAACrGATGTGGCTGGAGAATT 
GGTATTGAATTTATAGCAl'CAGCAGAACAGAAAATGTGATGTATnTATGCATGTCAATAAAGQA 
ATGACCTGTTC TrGT rCTACAGAGAATGGAAATTGOAAGTCAAACACCCTTTGTATTCCAAAATAG 
GGTCTCAAACATTrTGTAArmCATTrAAATTGTTAGGAGGOTGGAGCTArrAGT^ 
TCCAATACACTCrrTAATATAGCACTGAATAAATGATOCAAQTTGTCAATGGATGAGTGATCAACT 
AATAGCTCTGCTAGTAATTGaTTTATTTTTCTTCAANTAAAGNTGCATAAACC 

SEQ ID NO: 1192 ACAAAAGATGrrGGAGTCCAGTAAGCCCATACCTAAATTAACrGTAAAGCTT 
CAAGGACTTTrTGTTTATTmATATACTAOTTTTCrrCTGTGACy^ 

TAAGGCCATTCCCTGGCTCTITGCACGAATTCrTATATGGCCGGTAACAGCAGCATCTGGTCCCT^ 
GTGAATTGGCTTCTCAATGCTGACATTTGCAAGTGCATCTTCACCAAATATGGAACGAGCATAAAG 
GTT GGCT GCXrATAAAGCCACAGTAACCAGAAAGGGCCmTCTGGAGTCAGGCATTTCATATTGGT 
TGACTTTAATATGTGCTGTAAGTAGTCATTTAAATCAACCATGTTGGTGTTAACTGTCACmGTT^ 
TCCCATrCAAATTCGGCCCACATCTGACGGAATrCTGCATCANTGCAAGTTGCAGGCTGGATATAA 
GTCCATGATGTCGATGTGAATATCACTGAGAACCACACAATITCTGTCACTTGCTGCTCCAGAAAC 
ATCATAAACTATATTACCAAAAATTATTCCATTmCTGTTGATGCTACriTGACGTrAGCriTAAT 
AmGCGAAAGTChrrGAGGAGCAAGAOTCAAAAGGAGACNGNTTTTTCACAAGTTTNANATCCCC 
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TANTGGNACTAGhTTTCTAANGNGGCAAriXn'GCAAAATTTCACTGGTrTGGGT 

SEQ ID NO: 1 193 ACGCGGGGCTTGTAAGGTGCGGCTAGAAACTGGGGACATGGCAGCOCCTGGC 
CCAQCGCTCTGCCTCTTCGACGTGOATGGGACCCTCACCGCCCCGCGGCAGAAAATTACCAAAGA 
AATGGATGACTTCCTACAAAAATTGAGGCAGAAGATCAAAATCGGAGTGGTAGGCGGATCGGACT 
TTGAGAAAGTGCAGGAGCAACrOGGAAATGATOTGGTrGAAAAATACGATrATGTGTTTCCAGAA 
AATGGCTTGGTAGCATACAAAGATGGGAAACTCTrGTGTAGACAGAATATTCAAAGTCATCTGGG 
TGAGGCCCTAATCCAAGATITAATCAACTACTGTCTGAGCTACATTOCOAAAATrAAACTCCCGAA 
GAAGAGGGGTAOC 

SEQ ID NO: 1 194 GGTACTTGTTGTTGCnTGTTTGGAGGGTGTGGTGGTCTCCACTCCCGCCITG 
ACGGGGCrGCTATCTGCCn-CCAGGCCACrGTCACGGCTCCCGGGTAOAAGTCACTTATGAGACAC 
ACCAGTGTGGCXriTOTTGGCTTGAAGCTCCTCAGAGGAGGGTGGGAACANAGTGACCGAGGGGGC 
AGCCTTGGGCTGACCTATGACGGTCAACTTCGTCCCTCCGNCAAACACCCAATGAATGGTGCGGG 
TATNATAANACTGACAGTAATAGTCAGCCTCGTCCTNAGTCTTCANTCCANAGATGGTAANGGAG 
GCAAGA 

SEQ ID NO: 1 195 ACTrAGCATOGGAACTGCAATGTTAGGAGCAGGGTTGTGTGrrGGATTTGAC 
ATAGATGAAGACGCATTGGAAATATTTAATAGGAATGCAGAAGAGTrrGAGTTAACAAATATTGA 
CATGGTTCAATGTGATGTGTGCTTATTATCTAACAGAATGTCCAAGTCATTCGATACAGTAATTAT 
GAATCCTCCCTTTGGGACCAAAAATAATAAAGOGACAGATATGGCTTTrCrAAAGACTGC^^ 
AATGGCAAGAACAGCAGTATATrCCTrACACAAATCCTCAACTAOAGAACATGTTCAAAAGAAAG 
CTGCAGAATGGAAAATCAAGATAGATATTATAGCAGAACTTCGATATGACCTGCCAGCATCATAC 
AAGTTTCACAAAAAGAAATCAGTGGACATTGAAGTGGACCTAATTCGGTTTTCCTTTTAAA^ 
CGCAAACAAAAGTCGTITAAAAACCTATTTAAAAATGAATAAAAAATTGGTTTACCGANAAANA^ 
AT^m^S^NTA^rmATATAATNG^r^CCCCGGGGGCAGTTCGGGCGGTCCCCCGGGTCTC 
TTTNAA NAGG TGTmGGANCGGAAACAAAATCCCGGGGAtU'Iiri'riNCAGNCTrCNGANCGCCC 
TTCCGATTn'CCNTTTCCCWITGNAACCTCCGGGACCATTTTT™ 

SEQ ID NO: 1 196 GGTA(mTACTGAAAGAACACTAGTGTTCTTTCCTTTCCGTrGTGAAAAAAGT 
TGmCTGAGGAArrGAAACCCCAGAAGATAACTACAACAAAAACATGTTAA' 14 ' j ' rrri ' rr AAAAA 
TGATGATTCAAAGGCAGATTTGAAGGGAAOTAATATTTAGGTGGCAGAAGAAGGCAAATGCAGCC 
TCTGAAGGGAACTGTTCTAATTATTACCTAAAAAATAAAGTTACACAACTATATTCAAGGACATGA 
GATAAAGCACTGCTTGAAAACCAGAATGACTGAACAGTTAGGTGAAAAGGAACACTGAAATAGG 
AAGGGGAAATGGACTGAAGLAATAATTTTGAATCNGGGACAGGTGATCCATCAGTCCrAGATGCCT 
TCTTGGTATGGAAA>n'ATCrK3GAATCACATTGTrrTCCTTCTTCTNGA^ 
TCrrCACANGCACTNACATTAAGQTTGCCATTTTGGTNAGGATTCAAAATrrCAATCC^ 
TCAAGANnm'GAATAAATGCCAGGCCTrTNATTTTTACCCATNATAAGGGTT^^ 
TGGACTCCCAGTTITrTAAAACCTTmACAAGCCCTG>mTCKm 

SEQ ID NO: 1 197 GGTACTGArrrCCATCGTTGCAmACAACTGCTACAAAAATGCCAGCACTCC 
ATCGACATGAAGAAGAGAAATTCTTCTTAAATGCCAAAGGCCAGAAAGAAACTTTACCCAGCATA 
TGGGACTCACCTACCAAACAACmCTGTCGnOTGCCTTCATACAATGAAGAAAAACGGTTGCCT 
GTGATGATGGATGAAGCTCTGAGCTATCTAGAGAAGAGACAGAAACGAGATCCTGCGTTCACTTA 
TGAAGTGATAGTAGTTGATGATGGCAGTAAAGATCAGACCTCAAAGGTAGCTnTAAATATTGCC 
AOAAATATGGAAGTOACAAAGT 

SEQ ID NO: 1 1 98 GGTACCAAGGGATGGAAGAAGTAAATATAGCTCAGGTAGCACriTTATACTCA 
GGCAGATCTCAGCCCTCTACTGAGTCCCTTAGCCAAGCAGmcnTrCAAAGAAGCCAGCAGGCO 
AAAAGCAGGGACTGCCACTGCArrrCATATCACACTGTrAAAAGTTGTGTTTTGAAAlTTTATi^ 
TAGTTGCACAAATTGGGCCAAAGAAACATTGCCTTGAGGAAGATATGATTGGAAAATCAAGAGTG 
TAGAAGAATAAATACTGTTTTACTGTCCAAAGACATGTTTATAGTGCTCTGTAAATGTrCCm 
TTGTAGTCTCTGGCAAGATGCTrTAGGAAGATAAAAGTTTGAGGAGAACAAACAGGAATTCTGAA 
rrAAGCACAGAGTTG AAGT TTATACCCGTTTCACATGCTTrrCAAGAATOTCGCAATTACTAAAGA 
AGCAGATAATGGTGTTrriTAGAAACCTAATTGAAAGTATATrCAACCAAATACTTTAATGTATAA 
AATAAATATTATACAATATACTTGTATAGCAGTTTCTGCTTCACATTTGATTIT^ 
mATATTrAOAGATCTATATATGTATAAAATATGTATTTTTGTCAAAATTTGTTACTTAA 
NAGAGACCAAGTTTTCTCTTGGAAGmGGTTTAAATGACANNAAGCCXiTTTT^ 
A 

SEQ ID NO: 1199 GOOA Cn i n il 11 n i U 11 ril ri - ll l - lCri - Ll CAGCATTGNGTTTTAC l ' ri - ] - l 
GGGANANAGGCTAGGAGGAGGAAGGGGTGAAAACAGCGTCTCACTGGAGTCTCAAAAGTGTATG 
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AATCnrCTGGTAGTGCAAGGATGGGATAANATGGCCAGGGAAGTCANATGGAAAATCCCCAANAT 

TCTTTTTGCTACTGAriTCTATAATTAAAATATGACATATGTAAGGGACrAGTGCATGATATTC^ 

AAATGTCAGTTGTCTTTCCTAACTAGGTTCXn-CACAGGCTAGGrrATGCCTANATATCATCATCCTC 

CTTTCAGGGAATGAAGCrCACCTANAAAACTAGGGAACTAAAAGTGCAATATGGTTTGGGTAATG 

CGGTTGGTrAGCTGTCTCCCCATCCTCCCAACTCACTATTCCAGGGAGGGGCTGAAAACANA AGTG 

GCTCCCCrrGAAGTCTA>nTAGCATGTCATGACAGAATCCACATGAAGGGCTGTGGGCTGCAACTnr 

CTAGTGCCACAGTCCTCTTITITGGCGATOATAATTGTAGGGAAAGA ANCCCC ^tCACNCATGm 

ATTTCACGAGCTGTNTTTCAGGATCTTAACAACCCTrcCTGNGCTCAmr^ 

TTCACAGCTTNGAACTTbWrCCCCmCCTTGCAGTCCTGCTh^^ 

SEQ ID NO: 1 200 ACTriTTTriTITITTITrT^^ 

CTAACACATGGNGTTAGAAAATGAATrTTGGCACCGNGATTAANAA'l' TIC 1' 1" riCAAGTTTAACCT 
TTACATTAAAAACAGTAGCTACAATAAGGATATTTCAACCTTACTTAGAGAAGTGATAAAACATC 
AAGTCAACAAGTATTTTTGTTGGAGAATTTTTTTATAAGCGGGATAGAGGGAAGTTAACATAGAC 
ACTCAGAAGAATAAAATCGAAATTATGCCAGGAAGATAAAAAAGCAAATAACCC TCCC CCCAAA 
AAAAGAATAAGGAGCGAGACAAAGGGCAAAACGGAAGAAGCAAGGCrCAACAACrrTGTTTTCC 
TGATATAAAATTCAAGTACC 

SEQ ID NO: 1 20 1 ACGajGGGTTTTnACTTGATATAAATGTATrnTACTGTGATAGTCCAAGTG 
CCCTGGGGGGGCAGGTGTGCTCTATGTGGTTCTTCTTCCATTGGAOAGCTGGCGTAOAGATCTGCA 
GTGTTCACAAGGATGTTGGmGGAGATGTCTGCTGCTAGGACCTGGGarGTGTGACTCAGTCCAT 
ATATAGGGACATCrGGOTGGAGGAGTAAATTCCTGTGCTCTOAAATGCCACTTGGTAGCTCTGGAC 
AATGAAGGACAATTGACTCAAGGGTGCCTGACnCTGCTGCrGCTGGGAAAAAATTCAGTTTATA 
GCATrCCTGCACCTCCCAAAGTAGATAACCTGGAGGTCATTCAGTTAACAACTGTCrCTGAGGACT 
CAGTTTTGGGGGAGGGGrrATCTGGGAGAACnTAACCTGTTCTGAGCCATTAGGAGACAT^ 
AATTGGAGCACTGGAGAATCCTACAAATGGCCTATGTCTCAAAAAAAGCrGGGACCTCCTTCCAG 
CTGCTGCAGATOCTGACCAGGCCCTGGGAGGCTGCTGTGCTCTGGAAAAACITGGACCACrCATTT 
T^^rTGGaTAACCTGGCTGCCTTAAAAAAGAACCAGTCAGGACTTTGAAGGG AAGC ATC^ 
CTATACCCATAAACCTGNAGTTTGGGAAGTCAAGCNTITTNGAAATGTNCACCTTTGNCC^ 

SEQ ID NO: 1202 ACrrrAGTAAAGACArrCATCTCAGTCATTTCTCTCT CCCAG CTTGACCTrAG 
GTTAATATTTCATTTGGGTCAAGAAAATAATACGTAGGAGAGGTATGrrATnTAACAAACAGGA 
AAATCGACAAAAAATTGATAGrrTGCCTACATTAAAGTAAGTTAAATTCATGTATCGATATAATTA 
AATCATGTAAGAACTAAGAGTTCTATATACATTrCCATTGTCTrACTTGGGCTTATTCTAAAC^ 
ATGCTAGTGAACAAAGTGTrAGGAATATACACAGGCTGCTTCTCTGGAGTTATTACCAACTAAAA 
GACCTCAGCAAGCAGCTACCACCAATAAAACAOTCTOAAOCTGCCTCCAAATATAATATrGTAGG 
AGTTTTCAAGGAAATTTTTATACTGTATGCrrn'GTCTXjTGACTATGCrm 
TGTCACATnTAAAAAAGTTCATATACAATACAAAAATCAAAACAAAC ACCC TACAATACACAAA 
TCTAACAAAGTATATGTGGTGAGATTCCAAAAAATGTTTGAAGATGCATTTTCTTTNCTTCAm 
GNATCTAAAATGTGCCTTTTCAGAAGCCCATTGGTCAATATGTACCTCGGGCGG 

SEQ ID NO: 1203 ACTGAACACAATATTTGTGTTmATTATTT ATGCC ACGTCAGTGGGGCAAGA 
AATCTGGAGTGAGTGAAGAAAGCrAAGTTGTGAACAAGAGTGTTTTTATAGCATATGTGTTGAAG 
TAACAGCTTGTGCCCGAGAAACITAACAGATGAGTTCTTGAATCTGGGATG AGAT GACGGATGTA 
AATATTTCTAAATTTTAAATACTACATTACTTGGTGTCCTTTTTrCTCCCAAAC^ 
GGAAGGAGTTCAATTTTrrCTTGTrCTACTTrCCCTATTCTrATGGAG GTAA ^ 
AGGAAAAAGCAGCmCACn-ACAAAGTnCGTGTAAAAATATCTTnTTTOT 
CATACAAATTGGGATGGGAAGAAAATCCmCCTCTTGGGAAAGTAATAC AAGTC TTAAGTTCCAT 
TGTAGGGTGCGCCITCAGAAACCTGCTGCACTrrTCTGATATAGrrCAC^ 

CTITGGGGAArrTTGGCCAGGAGACCCTGAAACATGAACCNAACAGCTTTGATA'ri"! I l l 1 11 1 1 1 1 

AATTACTTTCCCCTTTGCTTGGATTCTOm'CCTTGGNATCATATreAAA ^ 

TGACACTrrAGGGGCCAGGTCACTAANCCAAAATAGGGATTANOTGGGTrrrr^ 

SEQ ID NO : 1 204 GGTACTCACTITITCCAAATGATCCTAG.TAArrGCCTAGAAATATCTrrCTCTT 
ACCTQNTArrTATCAATTTTTCCCAGTATTmATACGGAAAA AATTGTA TTGAAAACACT^ 
GCAGTTGATAAGAGGAATrrGGTATAATTATGGTGGGTGATTATTTTTTATACTGTATOTGCCAAA 
GCTrTACTACTGTGGAAAGACAACTGlTITAATAAAAGATITACATTCCAAAAAAAAAAAAAA^ 
AAAAAA 

SEQ ID NO: 1205 ACCAAAGGATAGCTGTrCTGTTrAAGTAGGGACCTCTCATGGCCTACAGGCTT 
TGACATCTGAGAATCAAACTGGAGAACATTCCGAAGCCGTTCTTATAAGTGTCTCCATCTCrACCr 
GGGCTGAAATGGAATGTGCAAATGTAGCCCAGCCTGGTCCTrGGGTGTTGCCAGTTGATTGATGAC 
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TGGGAGCCAAAGTGGCATCTCCTTTGACCTAAACXjGGCGATGATGAAATAAAACTCAACAGCOT 

TCTCTCATCTTGCArrGTGAGATGCGAAATAGAGCGTGTCTCTCTGCCrCTCATTTTAGGCTOAG 

CGTCCAAAGCGGCCATGCCCCATGTTTCCACTAGATGGCGCTGACACriTCA^ 

GGCCTCTCAACCTTGCAAAGGCAGCCACTTAAAGTCGGTGTCCTGTGTGGGGCACCAAGC^ 

TGCAGACACCCAGTATGCGCGAGQCAAATGCGTCCCATmAAAGAAGCTTGTATTTATGAACTCT 

TTGCTTTCTCCCTTCCACTAACTTTAAAGAAT IXjCT CTrCATCTCC^ 

TTGNCTTATTTITrGTGAAACCCrmCAAGOTATTrTCCAGTCCATTTTGCATO 

ACCGGAAGCNGGGCTCATATGCTArrGGTGTTAACGNGGACTANlATrTATGTGTrrGA 

SEQ ID NO: 1206 ACGCTrnCACCCCACCCCCAAGTCCTGGGAGAAATGCAGGCAACACTGAGA 
CATGGGAGAGQCCAAGATATGCrrGACAGAAAGGGTGATTTTGAGGCrcAGTTAATArrrCAAAA 
TTGTAACCGTAGCAAAACTGCATTGGTATrrAGAAAAATAAAAAATTTCCAATATGTAGTGCTGTG 
TTATACCTGCCICrGCCATGCAGCATCATAGCCTGTGGGAACCGGGAGGGCTrcCCTTACCACrc 
GAGCAGAGGAGGAAGGTGATGGAATATGGGGTGAGGGGAGGAACCTGGTGGCCCCTCX:CTGAGA 
TGGCCAGAAAGCCCTrGGCCTCACCTGGGACTGACCAGGCAGCCCTAGTCTAGGCACAAGGT^ 
CTTTCACCCTTCATGGCTGTGGGAATATTTCCTCITACTCTTTTrCTCCCATACA 
ATGCCCAAACTTGGGCCAAATGTTGCCXIAAACTTNGGCCAAAAATGTTGCCCAAGAGACCAAGAC 
ANGAGGAAAACAAGTTTCCAAATCTATGTANATCATGANCCAGAAATCTQAGGCTTGAATAAAAG 
GGCTTAAANGGCAGGAACTITTTTGGGGGTGTCCAAAACAGACGCCCATNCCAANGACTTCXATT 
GGAATNGGGGGGGAAAGCTTOGTANGGTrmAArnTrATCGNCNATriTAATAAATT^ 

SEQ ID NO: 1207 G0TAClU-J-iUUU4-IUn'lTJ-!-l-rriU'lMU"l"lU'l'lCriUNAAAAATAATATTTAAATT 
TATTGTTTCACATTAGTn'GAATAAAGCa^CAANCCAAGATGGTITAAATATCATTTGCANATGTCC 
AANCTT^mTACAATTACAGGTCATGGGCATCATCAACCTAGTCCT^GACr^GCAGCTAAAAA^ 
CAAAAGGTCTGTGTGAAGTGACANAANANCNOsrCCCTACrCATGTCATGANCAAGGAATCAAAGT 
CGAANAACAGTAACAGTCTTGACATCAGCACTGTAANAATTACTCTGCAGTAAACTCCAACCCAA 
AATGGGGAANAATACTGAATTAGCTGAACACATTCTGGACACAATTCTGACCAATTTCAGTCTC^^ 
GATCrCTGTTGGAGCAGGTTTCATTCTCCAATTCTCTTTTACANAACAGCAGAACT^ 
AAAAAGNGATTAACGGCCTnCCGCTTtnTCATTTTTCCTCCAGCCAC^ 
ANCACCCTCTGNGTCATNArrATNTTCANNATCCTTTTCGCGAGCGTTTT^^ 
CTTAACITCCArmGGGGCAAACCTTrrGGGCTTTCAACNAACCCGTTCTGTCAACANC^ 
CATNATCCCTGGGNn^GGAAAAACCNGGGTAGG'l'l'I'Cri'ircCAATTAAANGNNTTNTT 



SEQ ID NO: 1208 ACCCAAATACACAAAAGGTGTCCCTTTAAGGAAAATAAAGAATTAAGTTTTA 
AATAACATTACATTTTACAATCTGACATCTGGAGTATATTGAACATAGGCTAmCTTGATATAAC 
ACTCATTTAATTGTGGCCATCCAAATGAATATTATTGCAGAATTTATCTTGTTCAT^ 

aatggtgttatagctgaatacctgtgcatgaaaatgggcaatattttcatctgtcrracttgta 

ccatagaggccaatatgcacaatattaactaatgccaagacatggctgtttaaaaaatttaatgtt 

caaacagttatcactgatgcritrccactatitattaataaaatcatatattgtgtacc 

seq id no: 1209 acaagtgtatacaggcacaaacatgttntaacaaaagaaggaggaaacact 
catgacaattacagtcgttatrtctacagctggtcacgtggtcatagctggtatcgatgactacct 
tcttctacracccattctgtatrcccttagtcrrcagtaagcaccccagtaggtcatatgg 
tgaacagggatgtagtgggaatcacctccccn'gcatcmcaagtcctrcatggtgacact 
ccgcaaaccctccagaggtgcgatatrgcttttgatitacaatttttctaggtagaga 
atggcttccatttggcctttcccaccataatagcccrcaccctaccagtcagggaaccaatot^ 
ggttttgtcagctgcraagtatgtctatgccaatratgcattcitgcacrrggggaaatc^ 
gatgagtatggggacccactggacccattgtaaagtcaagacrtaagtaaaactgcattaattac 
ctoacctcacaagcccctattttaactggggaaccacaatgatggtttgagtncctggaaatcagt 
ggcagctcaaaaccagtgtcccitacttcccgaaatngthrkjatca^ 
aagtancctggttaaaaagctggangnattcctrrggggaaaaatngtagnaaaaaattaaca 

SEQ ED NO: 1 2 1 0 GGTACGGAGATGTCTTATGGTGAAATTGAAGGTAAATTCTTGGGACCTAGAG 
AAGAAGTAACGAGTGAGCCACGCTGTAAAAAATTGAAGTCAACCACAGAGTCGTATGTTTTTCAC 
AATCATAGTAATGCTGATnTCACAGAATCCAAOAGAAAACTGGAAATGATrGOGTCCCTGTGAC 
CATCATTGATGTCAGAGGGCATAGTTATTTGCAGGAGAACAAAATCAAAACTACAGATTTGCATA 
GACCnTTGCATGATGAGATGCCTGGTAATAGACCAGATGTTATrGAATCCATTGATTCACAGGTTT 
TACAGGAAGCACGTCCTCCATrAGTATCCGCAGACGATGAGATATATAGCACAAGTAAAGCATTT 
ATAGGACCCATTTACAAACCCCCTGAGAAAAAGAAACGTAATGAAGGGAGGAATGAGGCACATG 
TTCTAAATGGTATAAATGACAGAOGAGGACNAAAANGAOAAACAGAAATrrAACTCTGAAAAAT 
CAGAGATIXjACAATGAA'n'ATrCCAGrrrNCAAAGAAArrGAAGANCTTGAAAAGGAAAAANATG 
GTTTnGAGAACXGTGTNAAGAAATCTGAACCirrrTNNGGAACAArm 
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GGGTCATAATrAATGGNCCNCTTAAANCCTGATTGAAANAAAAAAAAAGATCTrrGrrTAATAAA 
GCTTNTNCCCATC 

SEQ ID NO: 1211 GGTACGCGGGGQTGGCTACCATOCTCnXn'CGCGCGGGTGTCGCTGGGCTGTC 
GGCCTGGACCrrGCAGCCGCAATGGATTCAAGTTCGAAATATGGCAACrrTGAAAGATATCACCA 
GGAGACTAAAGTCCATCAAAAACATCCAOAAAATrACCAAGTCTATQAAAATGGTAGCGOCAGCA 
AAATATGCCCGAGCTGAGAGAGAGCTGAAACCAGCTCGAATATATGGATTGGGATCnTrAGCTCT 
GTATGAAAAAGCTGATATCAAGGGTCCTGAAGACAAGAAGAAACACCTCCTTATTGGTGTGTCCT 
CAGATCGAGGACTGTGTGGTGCTATTCATTCCTCCATTGCTAAACAGATGAAAAGCGAGGTTGCTA 
CACTAACAGCAGCTGGGAAAGAAGTTATGCrrGTTGGAATTGGTGACAAAATCAGAGGCATACTT 
TATAOQACTCATTCTOACCAGTTCTGOTOGCATTCAAAGAAGTGOGGAAGAAAGCCCCCCACTTTT 
GGAGATGCGTCAATCArtGCCCrrGGAATlTACrAAATrrCTNGATATGNAATTTGATGAANGCTN 
CATCATTTTrTAAATAAATrCAAGGTCTGTCATCTCCTATAANAAAANAAAGAAAAAGNCCOT 
TrrTCCCTTAATACCGGTTGCAAAhrrGCTKACCAGCATGAAGTATTT™ 

SEQ ID NO: 1 2 12 actgatacaatggatcaagttcitaaaaattttaaaagaataaatgatgctg 

GAATTCTGGACCTGAGGATCTTTAGTATrrGAACAAATCCACAGATTAAACTGAAGGTAAArrAA 
GACTTCAGTTTTCGTTATTTCCAACTTITCTGTGAGGATGTATATCCAT^ 

ccacactgttactmgtaggtaatcttcaagtmatggtttaaaatccagccaaccagtggctgt 
aacagccgaattagccataattcactggctcacacagtggagatctggctctcattcataccaaag 
camcitggtaatgacctccatatttagagatcaggggaacaacntagtgagcagcaaccaatc 
cactcaaacaotctgccttgtgcrrtatgcctgacn'rjw'lcjlgtgcctacctcagcaacccccaat 
gtaaaactgcttttatcttrgaaaaatttmgcaac^^^ 

ccaj^cattgcrrm'ctgaatccactgggcatcctggaaaanaattctgcgaagaaaagacati^ 
gccttggtcccagngttccatgcnaaaaccatntttttaaaaaatgggncccaat^^ 

GA>m'CTAAAATGGCITrCmGAAATCTTATCTTTrAAACCTG>n'C^ 

SEQ ID NO: 1213 ACGCGGGGACAAACTTTCAGAGACAGCAG AGCACACAAGCTTCTAGGACAA 
GAGCCAGGAAGAAACCACCGGAAGGAACXATCrCACTGTGTGTAAACATGACITCrAAGCT 
GTGGCTCTCrrGGCAGCCrrcCTGATTrCTGCAGCTCTGTGTGAAGGTGCAGTmGCC^ 
GCrAAAOAACTTAOATGTCAOTGCATAAAGACATACTCCAAACCTTTCCACCCCAAAT TrAT C^ 
GAACTGAGAGTGATTGAGAGTGGACCACACTGCGCCAACACAGAAATTATTGTAAAGC TTTCT GA 
TGGAAGAGAGCrCTGTCTGGACCCCAAGGAAAACTGGGTGCAGAGGGTTGTGGAGAAGTTTTTGA 
AGAGGGCTGAGAATTCATAAAAAAATTCATTCTCTGTGGTATCCAAGAATCAGTGAAGATGCCAG 
TGAAACTTCAAGCAAATtrTACTTCAACACTTCATGTATTGTGTGGGTCTGTTGTAGGGTTGCCA^ 
TGCAATACAAGATTCCTGGTTAAATTTGAATITCAAGTAAACAATGAATAGTTmCATTGTACCT 
CGGGCCGCGAACCACGC 

SEQ ID NO: 1214 ACATnTCATAAAATATGAAGGGATAACTACAAACTGGAGTAAAAATGACGG 
TAATTAAAAAAATCCTCAGTATCCCTAGCTTGTXrrATTAACTGTGATAATCTGACTTGAG TCAGAT 
TGAATAmGGAGTGCn'CCCCAGAATAACCACTTATTmGAAGCTATCATGTGAA GTA'rr rrriT 
AAAACAAAACAAAAATTATGOTCATrAAAAAACTAGAGAAnAGCCATATTAAGGATrm 
ACTGCAAATTACTTCTAAAGAATCATCAGTGTATAGATTAGAAGTC 
AAAAAAATTCAGTTATAGCTGCTmGAAGAGGTTTCCATTTTTATTTAAATrACT 
AGAACAATTGTTTATTTmCTCrrraGTTrrAGATAT^^ 

CAAAGAAAATATTmATAATTAAATAAmAATGrriUrCl'l'CCTTTTCATTACCTACT 
GTGTTAGGGTATCTGTTTACCTTrAAAAATGATAAGTCTCACTCAAGATTT TTTATGTATGTATAAA 
AATTTTGGNGTGCTACAAAAGCCmTGCAAArrATCAGTAGTAGTTTnTTT^^ 
TTAAANAAA0AGCC0AT^^^TOGCT^AATGCCCNCTAGGOGGACArrCCNNAGGGAAGC 

SEQ ID NO: 1 2 1 5 GGTACCITCATGAAAACGGTATTATACACCWTGACrrAAAGCCAGAGAATGT 
TTTACTGTCATCTCAAGAAGAGGACTGTCrrATAAAGATTACTGATTrTGGGCACTCCAAGATnT 
OGGAGAGACCTCTCTCATGAOAACCTTATOTGGAACrCCCACCTACTTGGCGCCTGA AGTTCTT GT 
TTCTGTTGGGACTGCTGGGTATAACCGTGCTGTGGACTGCTGGAGTTTAGGAGTrATTCTrm 
TGCCTTAGTOGGTATCCACCTTTCTCTGAGCATAGGACrCAAGTGTCACTGAAGGATCAOATCACC 
AGTGGAAAATACAACTTCATTCCTGAAGTCTGGGCAGAAGTCTCAGAOAAAGCTCTGGACOT 
CAAGAAGTTGrrGGTAGTGGATCCAAAGGCACCGTTrTACGACAGAAGAAGCCrTAAGACACCCG 
TGGCTTTCAGGATGAAGACATGAAGAGAAAGTTrCAAGATCTTCTGTCTGAGGAAAAATGAATCC 
ACAGCTTTACCCCAGGTTXn'AGCCCAGCCTTTCTACrAnCGAAAGCajGCCCCTGAAAGG^ 
NCCGANGOOTGCCGANACCACAANAGCCOCCCCANCTGGTGTGGTGCTGCrrGOTTTGTGAACCT 
CCTGGGTITTGAACACCAAAANNAAATGlNCCrGCCCNGGCCGGGCCCrmGA 
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SEQ ID NO: 1216 GGACGATCGAAGGGACTATGTCTTCATTGAArnTGTGTTGAAGACAGTAAG 
GATGTTAATGTAAATriTGAAAAATCCAAACTTACATTCAGTTGTCT(X}GAGGAAGTGA 
AAGCAmAAATGAAATTGATCTTTnCACTGTATTGATCCAAATGATrCCAAGCATAAAA 
GACAGATCAATTTrATGTTGTTTACGAAAAGGAGAATCTGGCCAGTCATGGCCAAGGTTAACAAA 
AGAAAGGGCAAAGCTTAATTGGCnTAGTGTCGACrrCAATAArTGGAAAGACTGGGAAGATGATT 
CAGATGAAGACATOTCTAATTITGATCGTTTCTCTGAGATGATGAACAACATGGGTGGTGATGAGG 
ATGTAGATTTACCAGAAGTAGATGGAGCAGATGATGATTCCAAGACAGTGATGATGAAAAAATGC 
CAGATCTGGAGTAAGGAATATTGTCATCACCTGGATTmGAGAAAGAAAAATAA CTTCT CTGCAA 
GATTTCATAATTGAGAGAATITCCrcAGTTGATAGCTCTTAAAGGCAGATTTGCTGr 
TITAACCCATTmCAACCCTGTTTGGriTmAAAAGGCTTCNCrrA 
TGGCCGGGCCGGGNCXXrrC 

SEQ ID NO: 1 2 1 7 ACGCGGGGATGTAATTTTAAAATACAATGCAGACGAAGCTAGAAGTCTGAAO 
GCATACGGCGAGCTTCCAGAGCATGCTAAAATCAATGAAACTGATACArrTGGTCCTGGAGATGA 
TGATOAAATTCAGTrrOATOACArrGGAGATGATGATGAAGATATTOATGACATCTAAATTGAACT 
CAACATTTTACATTCCATCTTn-CTGAAGATTGTCCTACAATTTGGATmGATCATGACA^ 
ATTAAAATITCATTAGCATGAATGCAATTTGTTAAAGCAGACTGAmGTTrCTAAG^ 
TTTrTTTAAAACTGATAATAATGCrGAATTATCTTAAGTGAGATGTTAAGCCCACTTO 
ATGTAATGGAGCTTATGGGTAGAAAACCATGTCTACTAATrACAAAAAAAAAAAACCATGCATTG 
CTGCTTTTCCTACCACTTCAGTAAOAAAATGGGTGrmGAANAAATCATTTGCCTTGTCCTCAC^ 
AATCTGATTAAACCCCTGGCCTCTTGATTGNATAGAAGTCATTGNGTATATTCCAGTTACCCTANA 
TATTCCCTTTOGAGATTTTGGATTCCAATmGANGGQANGGCANAA ArrcnGCA ^ 
AAAAAAAATAAAGTCTGNrrGGCCATATTTAAANTAGCCCrGGGGGCl ITl 1 11 lANTNC 

SEQ ID NO: 1218 ACGCGGGAAACACAATTTCAATGCAAGCTCAGTTTCCTGGTGTTCAAAAACT 
GTTGATGTGTGTTGTCACTrTACCAATGCTGCTAATAATTCAGTTTGGAGCCCATCTATGAAGCTG 
AATCTGGTTCCTGGGGAAAACATCACATGCCAGGATCCCGTAATAGGTGTCGGAGAGCCGGGGAA 
AGTCATCCAGAAGCrATGCCGGTTCTCAAACGTrCCCAGCAGCCCTGAGAGTCCCATTGGCGGGA 
CCATCACTTACAAATGTGTAGGCTCCCAGTGGGAGGAGAAGAGAAATGACTGCATCTCTGCCCCA 
ATAAACAGTCTGCrCX:AGATGGCTAAGGCTnGATCAAGAGCCCCTCTCAGGATGAGATGCTCCCT 
ACATACCTGAAGOATCmCTATTAGCATAGACAAAOCGOAACATGAAATCAGCrrCTTCTTCTGGG 
AGTCTGGGAGCCATTAITAACATCCTTGATCTGTCTCAACAGTTCCAACCCAAGTAAATTCAGAAA 
TGATGACGCACGTGCTCTCTACGGGTAATGGTCATCCOTGGCAAGCCCGTCrrTGAACACCTGGA 
AGGGTriTCAACAGCANTTGGANCAATCAGAAG>rrTATA AGCTN CTN ACATTC 
rrrroCCCAANCTTTCbn^GGCNNGGGAAAAAAGCCCCNCCTTTTC^ 

SEQ ID NO: 1219 ggtactgaaggacaaaaacttggatggcctcaaaaggttcttgaacaccact 

CTGATirrOCAAGGACGAATrACGTAAArrATACTTrCATACAAAGGAGACGATAAGGCA GTAAA 

CATCGAGACACGGGGGACAGCGTCCACACTCAGAGGGCCTGGG CCACA GCCCCGATGriiviiii 

CAGAACTCAGCCCCTTTCCrGArrTTACTTCrAAGAGGAAAATTArmGGGGAOG^ 

TCGTGATTAGAAmAACrGATGGrmGTATTATAACrrCTAAGACCTGCCAGAATGCTAGTCCTG 

AGAGTGTCAGACAAGGAAGAAATCCCTGGGOTCTTCCCCTCACCrGGCCCTTGGATTTCATGGAG 

CAGCCACTl'AGCATrGAATTGCACTACCCrGAGCTAAACGTGTCTGTGCmCTAAGATA;^ 

TGATCCCTTrcrTCTGTCTTAAGACAGCACCTCCTGAAAAGAATC GAAGT TGTCACAACTCTCAAT 

TATTTTrrAAATACTGCATAGATTGAGrrTTGGGTTATTACTAACCCTTTCCAAAAT^ 

CTAAAACTACTAGAATCTCATTCCAmCCCATGTTAAATTACCACANAACCGCAATACCTGGCCC 

GGGCNGGCCGCTTCN>WAANGGGCOAAATTCCACNANANTTGGCNGGGCCGTTTCTrAGNGG 

CC 

SEQ ID NO: 1220 GGTACATGGTGGGTGGGTTATAAATATTGGGACrrAAGGCAGCTTOTTCTATG 
TATrrATCmGCTCrrGGGTOACTTAGGOAATGATmAmGATITAACCTrC^ 
CGAGAATACTCGCCAGTGGCGCTTGCAGTrGTAGCATTrACCCCAAGATAACrrrGCCTACGAAAT 
ATTrCGCrmArrATTTTCACATCATICrAGTATATGGACTTrGGAAACAAAAGACATTGr^ 
TTATAGCATT Cnil ' l 1 M 1 1 1 1 AGTAGCGGTATTTCCATTTACAAAATATAGTAACTCTTGATTACT 
GAAAATGTCAAATCCTAGAAAACGTAGCATCCCTATACATGATGTTAACATCATrCTCGAACAGTT 
GTTGGCCGAAGATTCATTTGATGAATCCAATTTTTrcAAATAGACAATrCrrOATGTTCn^ 
ATAACTCAGrrmATCriTmCACATTGAAAATCAGTrAGATTTGCTTAAGCCrrCA^ 
TTTATQTAAATTAGCGCTGGCAA' lUWlU ' rin - l - rrilA AACAGGAAAAGGGTTAAATGAANG GTGA T 
AAAATGGATGTTCAATTGTCTTTCTGAAAAGTGAAGTGGCTTGGAANGGATXjAATAAAAAT^ 
TAATATATITNAAAAAAGNCChnrrGCTITrNTGGGATG^ 
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SEQ ID NO: 1 22 1 ACTTCCCTGAGACATGGATCTGGGATTTGGTGGTGGTAAACTCAGCAGGTGT 
GGCTCAGGTAGGAGTAACAGTCCCTGACACCATCACCGAGTGGAAGGCAGGGGCCT TCTG CCTGT 
CTGAAGATGCTGGACTTGGTATCTCn'CCACTGCXn'CTCTCCGAGCC^ 
GCTCACAATGCCTTACTCTGTGArrCGTOGAGAGGCCrrCACACTCAAGGCCACGGTCCTA/^ 
CCTrCXXrAAATGCATCOGGGTCAGTGTGCAGCTGGAAGCCTCrCCCGCCTTCCTAGCrc 
GGAOAAGGAACAAGCGCCTCACTGCATCTGTGCAAACGGGCGGCAAACTOTGTCCTGGGCAGTAA 
CCXXVVAAGTCATTAGGAAATGTGAATTTCACrGTGAGCGCAGAGGCACTAGAGTCTCAAGAGCT 
TGTGGGACTGANGTGCCTTCAGTTCCTGAACACGGAAGGAAAGACAC AGTCA TCAAGCCTCTGTT 
GGTTGAACCTGAAGGACTAGAGAAGGAAACAAACATTCAACTNCCCACrrTGTNCATCA^ 
GAAGGT^'CTGAAAAAATATCCCTGAAAACTGNCACCCAATGTGG^r^AGAAANAAT^^^GC 
CCTrnGTCTTAANTrnGGGGAAAAAATArrAAGGCTTTTGCCTTGCCA 

SEQ ID NO: 1222 AATAAAGAACCTCTATCAGTGAGACTTCrCATrrrATAGCAAATACATTTTTG 
CAGCTTAAATmCTTGAATrCATATACGCrrCTGTCATrTAAACAAACTrCCAGAGAAAACTGGT 
CTXTrATATATTTAAGTAACAAATTTGACAAAATACATATTTATACATATATAGATCTCTAATATAA 
ATATTAAATTrGAAAAAATCAAATGTGAAGCAGAAACrGClATACAAGTATATTGTATAATATTTA 
TTTTATACATTAAAGTATTTGGrrGAATATAOTCAATTAGGTrrCTAAAAAACACCA 
TCTTAGTAATTGCGACATTCTTGAAAAGCATGTGAAACGGGTATAAACTTCAACrCrrGTGCTT^^ 
TCAGAATTCCTGTrrGTTCTCCTCAAACTmATCTTCCTAAAGCATCTrGCCAGAGACT 
AAAGGGACATTTACAGAGCACTATAAACATGTCTTTGGACAGTAAAAACAGIWn^ 
ACTCTrGGATTrTNCAATCATATCnTCnx:AAGGCATGGNTTCTTTrGGCrc 

ACArrAAAATTTCAAAAACCACAACrrnTACCAGOGGNGANTATGAAAATGGCAGGGGGGCAGG 
NCCCCTGGCnTTCNC 

SEQ ID NO: 1223 ACAAGTTCGGCmGAGCTTCCTCAGGGGCCTCTGGGAACATCCTTCAAAGG 
AAAATATGGGTGTGTAOACTACTGGGTGAAGGCTTTTCTTOACCOCCCOAGCCAGCCAACTCAAG 
AGACAAAGAAAAACITTGAAGTAGTGGATCTGGTGGATGTCAATACCCCTGATTTAATGGCACCr 
GTGTCTGCTAAAAAAGAAAAQAAAGTITCCTGCATGTTCATTCCTGATGGGCGGG TGTC TGT 
CTCGAATTGACAGAAAAGGATTCTGTGAAGGTGATGAGArrTCX:ATCCATGCTGACTritiAGA^ 
CATGTTCCCGAATTGTGGTCCCCAAAGCrGCCATrGTGGCCCGCCACACTTACCTTGCCAATGGCC 
AGACCAAGGTGCTGACTCANAAGTTGTCATCAGTCAGAGGCAATCATATTATCTCAGGGACATGC 
NCATCATGGCGTGGCAAGAACCTTCGGGTTCAGAAAATCAGGCCTTCTATNCrTGGGCTGCAACAT 
CCTrCGAGTTGAATATTCOTACTGATCTATGTrAANCGTrCXn"GGATCCAAAAAAGNCATCCTrG 
CCCTGNCCTGGTAAATTGGCAGC 

SEQ ID NO: 1224 GGTACTATGATCCAAACACCAAAAGCTGTGCAAGAITCTGGTATGGAGGITG 
TGGTGGAAACQAAAACAAAmGGATCACAOAAAGAATGTGAAAAGGTrTGCGCTCCTGTGCTCG 
CTAAACCCGGAGTCATCAGTGTGATGGGAACCTAAGCGTGGGTGGCCAACATCATATACCrCTTG 
AAGAAGAAGGAGrcAGCCATCGCCAACrrGTCTCTGTAGAAGCTCrGGGTGTAGATTCCCTTGCA 
CTGTATCATTTCATGCTTTGATTTACACTCGAACrCGGGAGGGAACATCCTGCT'GCATGAC CrrAT C 
AGTATGGTGCTAATGTGTCTGTGGACCCTCGCrCTCTGTCTCCAGGCAGrrCTCTCGAATACmGA 
ATGrrGTGTAACAGrTAGCCACTGCTGGTGmATGTGAACArrCCTATCAATCCAAATTCCCTCTG 
GAGTmCATGTTATGCCTGTTTGCAGGCAAATGT AAAGT CTAGAAAATAATGCAAATGTCACGGC 
rrcrCrATATACTTTrGCTTGGGTCATTTTrrrrCCCTTT^^ 
AGCCTGTGTTTCX3GGGGAGAAACAAAANAACCAACITm™ 

AACTTANAATITNAAGCTTA'1'lTrn'rn'r Tl'l 1 GAAACCCTITANCAAAATGATCTTGTTNNGAAA 
G 

SEQ ID NO: 1225 A Cl - fl ' l ' l 1 11 1 ' l I ' l I ' l - l 1 1 1 rm i 1 1 1 1 1 i ATACAANAACTrATGTrTATTGCAA 
ACAAACAAACAAAAAAAAAAGGAAAGAGAGGAAAAGANAAAATGGTCANAAGCNCAACATATA 
AGGT^AA^lAAT^TAAAAGCAT^r^^ACATTNTGCa^^AATGGCAGCATAAT^ 
GCCGT^m■GC^GCCTGCCGCANCCGGAGGGT^^ITmGCANACC^GACGAGCAAATm 
ATGTAGTATGAAGGAANAAAGCrrGGCGGGTCnrCACTGCANACriTGGACTCCCAGTGTTTCGGA 
CTGGCATTCCCTGCATGGCCTGGCGGGACACGTGACT^^^'AACACGAGGGTCCT^^^^ 
TAGGAAATAACTTCTCTTCTTCTGACTGGGTGGGCATTrrCAAGCCTCCATATTTTm 
GCCAACAAATTGCACATAATCTACACTGCATATTAGGNGGGCCCCAANAATACCACTGGTGANAC 
TGTGTAACATAACAACTOTCACAGGCrCTCCCTAAAANAGGArrO XiAGG CTGGACCT 
CAAGCTTCATTTACAAGNACCAGGCTTCCCATTTACTAAGGGGAAATTTTGG^^^^ 
TNGCTGNANGGTNGGGAAAATATACCTTGGTTCANNTnACCCTCNANCTTNNGC^ 

SEQ ID NO: 1226 ACTCTGGATCCCAAGGTGACTGGTTGTrTAATCGTGTGCATAGAACGAGCCA 
CrCGCTTGGTGAAGTCACAACAGAGTGCAGGCAAAGAGTATGTGGGGATTGTCCGGCTGCACAAT 
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GCTATTGAAGGGGGGACCCAGCriTCTAGG<K:CCTAGAAACTCrGACAGGTGCCTTATrCCAGC 

ACCCCCACTTATTGCTGCAGTAAAGAGGCAGCTCCGAGTGAGGACCATCTACGAGAGCAAAATGA 

TTX^AATACGATCCTOAAAGAAGATTAGGAATCTTTTGGGTGAGTTGTGAGGCTGGCACCT 

GGACATTATGTGTGCACCTTGGTTTGTTATTGGGAGTTGGTGGTCAGATGCAGGAGCTTCGGAGGG 

TTCGTTCTGGAGTCATGAGTGAAAAOGACXACATGGTGACAATGCATGATGTGCrrGATGCrcA 

GGCTGTATGATAACCACAAGGATGAGAGTTACCTGCGGCGAGTTGTTTACXCTTTGGAAAAGCrG 

^aGACATCTCATAAACNGCTGGT^ATGAAAGACAAGTGCAGTAAATGCCATCTGCT^ITGGGGCCA 

AGAATATGCTTNCAAGGTGTTCrrCGATATGANGACXXjCATTGANGGCAATCANGAAAATrc 

TTm'CACCCCCCAAAGGAAAANCCAATCrcCm'GGGTm 

SEQ ID NO: 1227 A CinU -I'J'] ril U ri l l'i 1 1 rri ' i ' I ' l ' l rJM - i7 'GCCGGTTCCACACCTGCCCTTTATT 
GGTCT^^T^^•ANCANAGTGG^rrcCAGGCCCTmACGCCTCTNAA^ 
GAAGGNGCCKWATTrTTGTGAAGGCCCAGANCTTACCCAAGmiTGGAGCCCAA 
AACCAAAGGGTTGGGANANGAAAAGGAAACAGGCNCAGGGGAAAGGCAAGGCT^™c^ 
GGGGACTGATNTNAANGG 

SEQ ID NO: 1228 ACTGGGACAGTTGGGTGTGTTATGGATACATAACrTGAGGAGCCGGGGGAAG 
CTGGCCTTGGGTGTTTTACXrrCAATCATATATCCACACAAGTGCTTCTCnTGACAm 
GGGAGAAGAAGAATAAAATTGTTTATCCTCCACAACTGCCTGGAGAACCTCGGAGACCAGCAGAA 
ATCTACCACTGTCGAAGACAAATAAAATATAGCAAAOACAAGATOTOOTATTTGGCAAAATTGAT 
ACGAGGAATOTCTATTGACCAGGCmGGCrCAGTTGGAATTCAATGACAAAAAAGGGGCCAAAA 
TAATTAAAGAGGTTCTCTTAGAAGCACAAGATATGGCAGTGAGAGACCATAACGTGGAATTCAQQ 
TCCAATTTATATATAGCTGAGTCCACCrCAGGACGAGGCCAGTGCCrGAAACGCATCCGCTACCAT 
GGCAGAGOTCGCTITGGGATCATGGAGAAGGTTTATTGCCATTATmGTGAAGTTGGTGGAANGG 
CCCCCACCTCCACCTGACCACCAAAAGACGGCAAGTTGCCCTGCCAAAGAATATATTCAGCAGCT 
TCCANCCGGACCATCGTrCACACTCTATGATGAGGAGATTCAGACCTCCACANGTGTATATATnr 
GGCCTTTATTTTNTAAAAAT 

SEQ ID NO: 1229 ACAGACAAAGTGGGAGGTTTTATTTCTrcGTCTCTTCC^^ 
TrGATGATCTCCTOrrTCTTGGCCTGGAGGCGCTCTrCACGGCGC^ 
ACCTGajGGCCTCAGCCTGGTCAGCCAOGAGCTrCITOCGGGCCTTGTCTGCC^^ 
TGTGTTCCATGAGAATCTGCnTGTmTGAACACArrCCCCrrcACCTrcAGGTAC^ 
GCATCTGATGAAGCGAATTCAGAGAGGCXrCAGTAAGAGGTATCTCCATCAAGCTGCAGGAGGAGG 
AGAGAGAAAGGAGAGACAATTATGTrCCTGAGGTCTCAGCCTTGGATCAGGAGATTATrGAAGTA 
GATCCTGACACTAAGGAAATGCTGAAGCTTTTGGACTTCGGCAGTCTGTCCAACCTTCAGGTC^ 
CAGCCTACAGTTGGGATGAATTTCAAAACGCCTCGGGOAOnXJTITGAAriT^ 
TATTATTTTCAATAAATCrGGGACAACAGCTT 

SEQ ID NO: 1230 ACGCGGGGCrrCTCCAAGATGGCGGCGATCGGCGGCGTTGAGGCGGGATCCG 
GGCGAGCCGAGTGAAGATTGTrTTGATGGTGATCATACCTTTGAGGACATAGGAarTGCAGCT 
CCGAAGCCAACGAGAGAAAAAACGTTCITACAAAGATTTTTTAAGGGAAGAGGAAGAAATTGCTG 
CTCAGGTCAGGAATTCTTCCAAGAAGAAGTTGAAGGATAOTGAACTrrACTTCITGGGGACGGAC 
ACACACAAGAAGAAGAGGAAGCACTCCTCTGATGATTACTACTATGGAGGACAGTGGTrCTTGTT 
CTGAGTGCCGATAAACCAGTGTGAAGGAGGAGAGAGGTGGAGGGCTGGTATTTTGGAGGATTCAG 
CATAGAAGAACAGTCTGCATTTGTGATACACCAGATAmCGTCTTTGGAATCGTCACAGAAGAAA 
AAGAAAAAGTCCAGCCCACAGTCTACTGATACAGCTATGGACCTGTTGAAAGCTATCACTTCCCC 
ACTGGCAGCAGGCTCCAAGCCCrtXAAAAAQACTGGGGAGAAAATNCTTTQGCTC^ 
TCGGAGAGTAAAAANGGACCCCCACAGGAAGAAAAGTCAGTGGN AAGCAGTG GGGAACrCCCCT 
AANAGGATGGNGTCrTCCACAAATCGAAAAAAAAATAAAACCCT>l'ri'l'llU"riGAACCACAONNA 

SEQ ID NO: 123 1 A cnTrfrrn ' riTn iTi m i l 1 1 n i u i gggaaanatgoggtitcaccatn 

TTGGCCAGGCTGATCTTGAACTCCTGATCTCGNGACCCACCCGCCTCAGCCTCCCAAAATCCTGGG 

ATTACAGGCGTGAGCCACTGCGCCrGGCCAANAtmATATNTTATCAGTAGCCTOAGGTTTCCX:C 

CTTTCrCTCACrTTCATTACTANAGTCACCANAAGGAACArrTACAACATTITAAA^ 

TGCCCAGCATACTCCTATTTCCrcrAGTTTCAAACATAAAGGGGAACCCAGCCCAAACAANAACA 

AGCCTTGCTGCATGCCrGCCAGCTCGTGCATCCTCCTTTn"ATTTCAAGAGGNGCCAGCTCCAAAC 

AAAGTTACAAGGTTAAGNGCAACTCCAAGTTCCTGACACAGCTA ATCXr CTGCTCGGCT TCAAA AA 

ATAAAGGCTGTAATTCCTTTTTOAACATGCCrrCOAAOTGGAGGTCTrTG^ 

GACATCATCTAATGCCCACTGGTGATCCGTCAGCTTCAAANGCTTCCACTTNTGTmGAAAGCT^ 

GNTTGNNGTrrGCGGGCATTGGCCATGG>rrG>rmCCGGNArrGNTCCTGGGATTAATTCGOGAAT 

NGGGCAAGGGGCATTATITrrGGNCCAAAATNAAAAAAGWAAATGCTTCCNAA 
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SEQ ID NO: 1232 ACCCCTTAACCCCTTCTCCTTCACCCTTAGCAGCAAQTCCCACTTTTCTAGGG 
GOCAAGAAACCCCAAACCCCTTCan"CCGTGTCTrrACGCTCTCTTTT^ 

ACTATGGGCAACCTTCCATCCTCCATTCCTCCTrCTCCCITAGCCTGTGTGCTCAAGAACTTAA^ 

CTCTTCAACTCACACCTGACCTAAAACCTAAATG(XTCATTTTCTTCTGCAA^ 

ATACAAACTTGACAATGGCTCTAAATGGCCAGAAAATGGCACTTTCGATTTCTCCATCCTACj^ 

CCTAAATAATTTTTGTC^AAAAATGGGCAAATGGTCTGAGGTGCCTGATGTCCAGGCA^ 

CACATCTGTCCCTTCCTAGTCTCTGTGCCCAGTGCAACTCGTNCCAAATCrTCCTrC^^ 

ACCTGTCCCCTCAGTCCCAACCCCAGGCGTTGCTGAGTGTGTCTAATCTTNCnTITCT^ 

TCTGACCT^^TOCCTTCT^^mCACGCCAAGCTAGGGCCCAATTC^ 

CCCTGNAATCTTTTTATCACCTrCCCTTCTTAAAACCTGGGTC^ 

CTAAGa^CTNCCCCACCTGOCCAAGNAATTTTAATCnTAAAAAAGGGGGGNTGGGG 

SEQIDNO: 1233 actactgtccagcctcctcaggagaaccaaacatccagtataccitcaccag 

CAACTrrGCCAGTCrCAGCACTTAAACAA(X:AGGTGTTGAAGGACTATGTTCCAAAGAACAGAAG 

AGAGTATGGTTTGCAGATGGTATATTGCCCAATGGTGAAGTTGCAGATACAACAAAATTATCATCr 

GGAAGTAAAAGATGTTCTGAAGACTTTAGTCCTCTCTCACCTGATGTGCCTATGACAGTAAACACA 

GTGGATCATTCCCATTCTACTACAOTGGAAAAGCCAAACAATQAGACAGGAGATATTACAAGAAA 

TGAGATAATTCAGAGTCCTATTTCTCAGGTTCCATCAGTGGAAAAATrGTCTATGAACACAGGAAA 

TGAGGGGTTACCTACTTCTGGTrCATrrACACrAGATGATGATGTTTTTGCAGAAACTO 

ATCTAGTCCTACTGGTGTCTTAGTTAACAGCAATTTACCTATTGCTAGTATTTCAGATTATAGGrrA 

CTXjNGTGATATTAACAAAGTATGTCTGCAATAAGATTAGTCTTCTACCTAATGATGAAGGACAGTT 

TGCCCCCACnTCTGGTTGCATCTGGGAGAAAAAOGGATCANGGGCCTGNANTAANAAGAACATTC 

CTTCTNATTGAGCNGAACATTmGCrrCTTTGAAAGGGGAAGGCIT^ 

SEQ ID NO: 1234 GGNACnTIWrN lUU'lU-n-l NTri"i-l ri'l i-i'l'l AGGGGGTCATCGTCANANCTG 
CTATCTGTGCrGGTGCTACTGCTACTGGAANANCTGGAATNTGAGTCrGANTCANANGANGAGGA 
ANAGGNTGNGGNGGANGCTNACGATGAGGAACTNCAGGAGCmCATCACTGTCACTGTCCTCTG 
AGGAGGAANAGGTAAATGmCTTCACrC TCTGATGA ANAATCACTGGCAGAACTGTCACTGCTA 
CTOCTACTGGAACTGGTTACACTCTTANACC'l I'l l I I'l ClU'GGCCnTClTrCTACATTGGNTTCTCC 
AATGCTTrGTTGCAATAANAANCTGTTTTXnTTTTCTTTTAAAGC^ 

AQGGCCTATOTGAGGNArri-lClllUnCCTGTGCATTCATAAAGTCCAATGTCCAAATTCCAAGCA 
rjTCrGACATCTTACATGTTGCTrATTTGCTTCAACTrGTCTCCGGGCTA 

bm;CCCATCITAGCCCCCCCGTACCTGCCCGGGCGOGCXX}NTCGAAAAGGGCGAATITCCANCCA 

CACTrcGGCGGCCGGTTNCTAA>rrGGGANCCCNANCrCGGGGACCCAACCTTTGGCGGTAA^ 

TTGGNCAANAAACTGGTTTCCTrNGGGNGNAAATTGNTATrCX;CCTNCCCAA^ 

N 

SEQ ID NO: 1235 GGACGTCGGGGACGTCCGCAGCGTCACACAGAAACATATCCAGGAGTGGGG 
CCCATTCGATCTGGTGATTGGGGGCAGTCCCraCAATGACXrrcrCCATCGTCAACCCTGCrCGCAA 
GGGCCTCTACGAGGGCACTGGCCGGCrci^Ll-lUGAGrrCTACCGCCTCCTGCATGATGCGCGGCC 
CAAGGAGGGAGATGATCGCCCCTrCTrCTGGCTCnTGAGAATGTGGTGGCCATGGGCGTTAGTGA 
CAAGAGOGACATCTCGCGAITTCTCGAGTCCAACCCTGTGATGATTGATGCCAAAGAAGTGTCAG 
CTGCACACAGGGCCCGCTACTrcnX3GGGTAACCTTCCCGGTATGAACAGGCCGTTGGCATCCACTG 
TGAATGATAAGCTGGAGCTGCAGGAGTGTCTGOAGCATGGCAGGATAGCCAAGTTCAGCAAAGTG 
AGGAC<>TTACTACGAGGTCAAACTCCATAAAGCAGGGCAAAGACCAGCATTTTCCTGTCTTCAT 
GAATGAAAAAGAGGACATCTTATGGTGGCACrGAAATGGAAANGGTATTTGG'ITTCCCANTCCAC 
TATACTGACGTCTTCAACATOANCCGCTTGGCGANGCAAAAACTGCTGGGCCCGTCATTGGACCGT 
GCCAATTCATTCCNCACCTrNTTTNGNTrCCGTTGAAGGAATATITI^^ 
AT 

SEQ ID NO: 1236 GGACGCGGGGGAGTCAGTCCCAGTCAGGACACACCATGGACATGAGGGTCC 
CCGCTCAGCTCCTGGGGCTCCTGCTCCTCTGTCTCCGAGGTGCCAGATGTGACATGCAGGTGACCC 
AGTCTCCAGCXrrCCCTGTCTGCGTCTTTGGGAGACGGAGTCACCATCTCTTGCCGGACAAGTCAGG 
CCATTACCAAGTTrGTTAATrGGTATCATCAGAGACCAGGGGAAGCCCCTAAACTCCrAATCTATG 
CTGCTTCCATTTTGCAAACTGGAGTGCCATCAAQATTCAGTGCCAGTGGATTTGGGACM. 
CTCTCACCATGAGTAGTCTGCAGCCTGAAGATTTTGGGAATTACTATrGTCAACAGAGTTACACTT 
ATCCTTCCACCrrCNCCCAAGGGACACGGCTGGACOTAAACGAACTGTGQCTC^ 
TCATCrrCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTTGCrGAAT^ 
ACTTCTATCCCANAAGAGGCCAAAGT 

SEQ ID NO: 1237 GGTACCATCCCTCCATATTGCCACCAAAGATGCGTGAATACACCAGGCTCAT 
TTTATTQCCAGTGCAGTCCTGGGTrTCAATTGGCAGCAAACAACTATACCTGCGTAGATATAA^ 
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AATGTGATGCCAGCAATCAATGTGCTCAGCAGTGCTACAACATTCTTGGTTCATTCATCTGTCAGT 
GCAATCAAGGATATGAGCTAAGCAGTGACAGGCTCAACTGTGAAGACATTGATOAATGCAGAACC 
TCAAGCTACCTCTGTCAATATCAATGTGTCAATGAACCrGGGAAATTCTCATGTATGTGCCCCCAG 
GGATACCAAGTGGTGAGAAGTAGAACATGTCAAGATATAAATGAGTGTGAGACCACAAATGAAT 
GCCGGGAGGATGAAATGTGTTGGAATTATCATGGCGGCTTCCGTTGTTATCCACGAAATCCTTGTC 
AAGATCCCTACATTCTAACACCAGAGAACCGATGTG'nTGCCX:AGTCTCAAATGCCATGTGCCGAG 
AACTGCCCCAGTCAATAGTCTACAAATACATGAGCATNCGATCTGATAGGTCTGTGCCATCAGAC 
, ATTCTTNCAGATACANGCCCACAACTATTTATTGCCAACCCCCATCAATAC 1 1 1'l 1 CGGATTNAAA 
TCTGGGAAATGNAAAATGGAGAAGTTrnTACCTACCAACAAACAA>rn'CCTGTAANTGGCAATGQ 
CrrGGNGC 

SEQ ID NO: 1238 GGTACTTXrrGTCTTCCAGTTrrCCTTCACCAAATCGCACTGGCTCCT^ 
TTrCCTATCTTCACCACGAACTGCTGCTTGCTCGCrrGCTCCTCAGTCCTAGCT^ 
GTTCCrGGAATCCTGTCTGCTGCTGTCTTCCTAGATTCACTGAATCCACTTCTGTGTAGCACCTGGG 
TCAGCTGTCAATTAATGCTAGTCCTCAGGATTTAAAAAATAATCTTAACTCAAAGTCCAATGCAAA 
AACATTAAGTTGGTAATTACTCTrGATCTTGAATTACTTCCGTTACGAAA 
AACTAAGCTACTATATTTAAGGCCTTCCAAATTCITCrAACTCTrcCAA^ 
TTTTTTAAATTACACCAGTCCriTrAOTAGCTTTTrGATGTG 
TTCAAGTATTCTTCTAAATTGGTTCTGGTCTAOiTAAACACCCTCATCTTCT^ 
AACTTOraCACCACCAGA/^ 

CTCAl i rrriCAGTGGTA ri riATCCAAi 11 i iGGCTTTATAi i 1 1 1 iCNATCTT^ 
ATACTTGGCNTAAACTTGGGTTTTCATTITCTArra^GNAACCCTTG^ 

SEQ ID NO: 1239 ACATCG3TAATCTGTCCCCATGACCCAAACATCTCCCATAGGCTCCACCTCCA 
GCATTGGGGATCAAATTTCAACATGAGGTTTGGCATGGTCAAACCTCCAAACTACAGCAGGGACr 
1411-inunUU4-lUU-r i-lUnUU 'lUU-rin-lNaTCTTrCACTGNTNGTTCTGAGTCC^ 
TTTATTCATAGCAGCTTTTCCAAAAAAGTTGCTCATNATATTCCCnTTCCCTGGNGCCTTGCT 
GCTGCAAATGCATTTGTTACIT^^ITTAGCCTCTGTTTTCGTTT^ 
AGCAGCTITGGAGGCAAACATTCC(>TAATTCCTITGGGCTGCTGGGAAACCTGC^ 
NGGGCCATGACCATTGGGGGNCANCTCATrGTTGGCITGNGTCTCACTTGACATGTGAANATGTGA 
CTGCTCAAACTTTTTGGAAAAC^JAANAGGATTCANCANGGAGCimAGGGGACCGGCAGCTGC^^ 
ATTGNATANCACTAAATTTGCTGNNATTTGCAAGOTGGCTITTAAAGGATNGNNATAN^ 
TGNAACANAAGGCCCCCNGGCCCTTTNAAAAAGGGT^T^^TGGGATGG^^rGNAC^^ 
ANCNCCX;CTANGGGCGGAATTCCA 

SEQ ID NO; 1240 tcctcccgccgcccaatatgccgaaaggaaagaaggccaagggaaagaagg 
tggntccngccccagctgtcgtgaagaagcaggaggctaagaaagtggtgaatcccctgttngag 
aaaaggcctannaattttcgcattggacaggacatccagcccaaaagagacctcacccgctttgt 
gaaatgcccccgctatatcaggntg 

seq id no; 1 24 1 ccgggcaggtacataggtaaccaaagtatatagcntattixjgtgaatcttcat 
cctcattacgttttctggacaqccgcacacggatrcggtatggcacattcctratrcctttggccca 
gacagctttgttgagcctggtgtcaatgcgcacatctggagttcccatctccttcatggcaaattt 

CCGAATCrcrTTGAGTGCCCGAGGTGCACGCTTCITGAAGCCCACTCCATGGATOCGCTTGTGAA^ 
GTTGATGGTGTATTCTCGGGTTACCACTTCGTTGATGGCAGAACGGCCCi'l'l'l-l'Cl'rCTCGCCACCC 
TTCnTGCGGGAGCCATTCTGCAGCGTCCAAGTTGGAAAGCCCCGCGTACC 

SEQ ID NO: 1242 GGTACTCTGGATCCCAAGGTGACTG07TGTTTAATCGTGTGCATAGAACGAG 
CCACTCGCTTGGTGAAGTCACAACAGAGTGCAGGCAAAGAGTATGTGGGGATTGTCCGGCTGCAC 
AATGCTATTGAAGGGGGGACCCAGCTTKrrAGGGCCCTAGAAACTCTGACAGGTGCCTTATTCC^ 
CGACCCCCACTTATTOCTGCAGTAAAGAGGCAGCTCCGAGTGAGGACCATCTACGAGAGCAAAAT 
GATTGAATACGATCCTGAAAGAAGATTAGGAATCTTITGGGTGAGTTGTGAGGCTGGCACCTA^^ 
TTCGGACATTATGTGTGCACCTTGOTITOTTATTGGGAGTTGGTGGTCAGATGCAGGA 
GGGTTCGTTCTGGAGTCATGAGTGAAAAGGACCACATGGTGACAATGCATGATGTGCTTGATGCT 
CANTGGCTGTATGATAACCACCAAGGATGAQAGTTACCKJajGCGAGTTGTrTACCCTTTGGAAA 
AGCTGTTGACATCTNATAAACGGCTGGTTATGAAAAACAAGTGCAGTAAAATGCCATCTGCTATG 
GGGCCAAGAATATGCTTTCCAGNGGTCTITCGATITAAGGACCGGCNrrTGAGGTCAATCA 

SEQ ID NO: 1243 GGTACATGGCAATTAGAAGTTGTCATGGCAAAAGAAAACCACAGCTGGCCTG 
CCACAGCCAACACAAGAACCAGAAAATGGTAGATGAAATGAAGGAATAAAGGTGGGGTTTATTC 
CTrATTATAAAAGAAAAAAAATAATTOTCAGCAOTCITAACAAAGACATCAAGATACAAAATTA 
CAAGTOTTTTGACTCCAGCCCTGTCCCCATCTCCTCCAAGAGCAGAGGTAGGAGACAGTrGAAGC 
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AAACAAGCAATTCTGTAAAAATTACCTAGAAACCCrACAAATITGATTAAAATCTAAACTTCrATA 

ATTTTGCTITrrAAAAAATTTAATATCAAAAGGCCrGCTTTAGTGACATGCTA 

ATAACCCCAATACCCCCTCrOTCAGTAACATGCrCAAGTTGACCAGCCAACTCTTATCTCTAAACC 

TATGTGGACAAGTGTGTCTTTTAAACCAAAGCCACGGAGATCAAGTGACTGCTAGTAACGTGTCGT 

CTGTTAACTAATCCANTGCCCCCITCTNCAGANGGTGGGCAGGGCAOAAACCCAAAAAGOTCTTG 

GAGGGCCTAATNCCAGATCACCATCTAANAACCACTTCCnTGCTCTOTACTNAAAATGAGAAGOT 

TATITITAATCCGACTATTTGNCTTAAAATTTCGNGCCCCGNGAACCTGGATGGGT^^ 

CCA 

SEQ ID NO: 1244 ACAAGACAAAGGCGAATGAGACTTCTCTTATATCTGAGTAACATTAAGATGG 
AAATCAAATTTAAATGCAGTCCACTCTGCITTrrGAAGAGGCTTTGGTTCAGCTC^ 
TTGCITGACGCAGTCTCCTATGAGAATACTCAGAAGGTGTCTrCTTAAACAACAAACCTATT^ 
GTGGTGGAGCCGCrcrTAGTAGCTGTGTCTGCGTGGGACTGATAACCAATCACTATCTTTGGAGGA 
AGTCCTAACCTTTCCITOTATACCCTCCCTATATGTOTAACAGCITCTCTGTrTTCACATTCAGTAQ 
TCCATATTGCTATCITATCACCmAGCTCrAACATTAACAACAGCGCCACATACATCATCACTGTA 
GTCATCAAAAGATTCTCCAATAAGGCACAGAAGTGTCTCTAGCCAAAAGCGATCGAGGTCACTTC 
GTCTCrGCTGTTTGTTCAATGTAATTAGCCATCGTCCrCCCCGTTTGTTTn'CTCATCT^ 
GGCTCAATACCATCCTTAAAAAGTGAGTAGTCACAGCCAGGCATTAAATTACTAGACAACTGGAT 
ATNGTTGNACCTCGGGCCGCGANCACG 

SEQ ID NO: 1245 ACTGTCTTCAATCCTATGCGTGCAGGTGTCTACCACAGGCAAACAGTTTTCTC 
CCCATTTTGTAGTAATGTGATTTTCCTATTAGCAAAAAGAGGTCACCAGCCCCTGTAGACTTAAG^ 
GACrCAAGTCACAGGATGGGGATTTCCTCTTAATATTTTTTATITrGTTGTTTGAA 
ACATTOTAGAGCAGGGTGTTCAGGACCTGCTGTGCCCAAGGGACTGATAAAGGAAAAAGCTCTAT 
TTArrCTTTTTGTGArrrGATGCACAGATGAAAAACITAACACACAATAACAGAAGTTGGTCG™ 
ATAAATCACATCCTAGTCirrcAGCGCTTCCGTAAGCAGACGACATOTCAGTmCTAGCTCT^ 
AGTTTCAACACTGCAAC ATCAA TGATGCATATGTCCAGAATCAGTTACAAAGACCATCCGATTCTT 
TTTCTGTrAGTTCATCTATTTTTCACTGTCrCTTGGTCCCAAGTGTATCTGAGTGATTACCTTCTC 
ATTCTCTGCTArrGNTO;GrrGGGGTGCICTCGArrGCCCCCGTGTTTTGTGGGCTGGTTGGGA^ 
GGCCCCrrGGGAAAGATGTNCCACTGGCNGGANGGTOTGAGTCACTTGGGATGCCrCCAGGGANG 
ATCCCnrCCATGGGCCCC 

SEQ ID NO: 1246 AATTCGCCCTTAGCGTGGTCGCGGCCGAGGTACATCATGCCGCTGATGAAGG 
TGCAGGCGAGGGCCACCCACGCCAGGATGTAGGAGTAGCCGTAGCTGCCTTCTCTGOTCACGGGA 
TAGAArrTCGCGTTTrrGTCGTGAATGTCrTCACGCCTGTCTGTATAAATGGAGGCCGCAATCATG 
ACACACAGACATGACATTAGCTGGATGATGGAGGTTAGGACAAACCTCTCTCCCTGCTTCAGGCG 
GAAGAGCTGGAGCACGAAGATGAAGAAGGCGATGCAGCAGAGAATGGTGGAGAGGATCATGGTG 
GCCTGGACCGCCTGCAGCGTGGAGTACTCC 

SEQ ID NO: 1 247 ACGCGGGCACCAGAAAAACTAATGAGATTTCTCTGGAATACAAGCTGATATT 
GCrACATCGTGTTCATCTGGATGTATTAGAAGTAAAAGTAGTAGCTTTTCAAAGCmAAATTTGT 
AGAACTCATCTAACTAAAGTAAArrCTGCTGTGACTAATCCAATATACTCAGAATGTTATCCATCT 
AAAGCATTmCATATCrCAACTAAGATAACirrrAGCACATGCrTAAATATCAAAGCAGTTGTCA 
TTTGGAAGTCACrrcTGAATAGATG'rGCAAGGGGAGCACATATTGGATGTATATGTTACCATATGT 
TAGGAAATAAAATTATTTTGCNNAANAAAAAAAAAAAAAAAAAAAAAAANTGTCGCGGGGACAT 
GTGNGGAAACrACAAGAGGTANAAACATTTGTTGATTTACCAGTGTTTTTAACTTaTrGCTGGG^ 
NAAAACTGCTTGTTTCGNGGAAAAGCAAAACTTACAGCAAACAT^^"AAAATGAAGAGCTCCC^ 
CTTITGAOGAACAAACCGGAATGCATTGGGAACCCTCrACrCATGGGCTTTTTGAGCCCCACATO 
CAGGGTOa;ANCCCGGAACCCTATGCrGNGGANAAGAAAGAAAATrCNGGGAAACCCITGGTNTr 
TTTGGAGGGGGTTANCATGNGNGGGCNCCCITCGGGTTGGNGGGGGGCCCCNKmCACCGOaAAA 
ATTT 

SEQ ID NO: 1248 GGTACm-N rrri rri-ri-i'lTri'l'ri'ri'rri'l'GGGGGGACATGCAAAAAGACAA 
TATTTATTITACTCAAGCmCCTATAAGAAGTTAAACAGAAACAATCCCTGGTTGAANACTACCA 
AAAATGTTAACATTGTTTCACATATCCCTTCAGTGCCCANATGAAGGGAAAAACTTAACTGAACG 
AAAGAATAAGAATTACCATCCTGACACrGGGTrTGCTTAQGTATGTTAGGAAAAAATCCACATAG 
GCTAGACCTAAAACAGTAAATGAGCCTACTAATrTCCCACAGGGGATGGGAAATCCACTGATACA 
GACACCTCCAAGAGCrCCCAAGACCTAGGGATCGCAACCACCACAGGTATTCCCATAACITCCTn 
TCCCTCCCirnCCCrCAGCrATTTTACTGCrAAAAGAAGAGGATGGGAACAAAAGATCACTTCCC 
CACTCCCAGACTGCATGACTGTCAGCATCAATGTrrGTTGGGAATTTCACAGCATCCTCTTCATGG 
CCATCTAAATAGGATAACAGTAAAAATGGCTNGGGOQAGGCATGGGGTTTTOCITNCCANAATTA 
TAAAATCCCTGGAAAAGGAATTAATCITATnCTATTCCAGGGAAAANCCAAANrrTNriTr^^ 
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GAACNCGCNQAACCAAAGTTGAAACATmCTNCGGGTNNTTTrCCANCCCTCAA™ 
ANAAAAACT 

SEQ ID NO: 1 249 ACAACACrATTrCATTATCTGGATrGTCCATGCTAGAAATAAATTCATCCITG 
TTAATTGTATCCAA AOCA TArr CTTC AGCTAAGGATCGACATCTTACCACTCGAAGAAATGCAGAA 
TTGCTGCAGAGTAATmAATTCmCTCTGAAATCGACTCTGGTGCCTOGCCAATGGACTGC^ 
AATTrGGCAACATGATTACCCACAGCGGCAGCATCCl TCI Tl GCmTTGACGGTAAACGnTTGCA 
GTm ATAT ATTTGCCTGAATCTGCAATCATATCAGGAATTGTGCCTCGAACAGGTAAATTTOC^ 
ACCCTCrrrGGCCACAAArrCCrrTAAGGCACGAGCTAAAATCCAAAATGATGGAGTCTGTTTGGT 
GATATTTATGCAGCGATCATCATTAAATATATCnTCAATACTGCITGGGATCTGAG 
GCTGTGlTCACATrmAATAGOTCTrCAAAATTCTCTTCATCTrCTGGAGCCCC^ 
TAAAATT CCTTG NCTAATCAAATCTCTGAAGTCCTCl'l'l'! ICl'l lATCCGTTTAAGGATTCGGCCAT 
TTGTrrcACTTTCCACTGGGCTAAATATm 
CCCTTrTTTCCATTTNANCCAAATNTTNGACNGAAAATGTT 

SEQ ID NO; 1250 ACTNTN'l 1 1 1 11 1 11 1 1 1 1 111 11 AACANCAAAGANCGGGAGCACGATTGGGT 
AATAAAATGT^^*ATTGAGAATAANACNGCCTTITNACC^TrTAGGGTCTAGGGCTGTAAAGTGTCT 
CAGGGTTGCTGCC^^AACNACCCATGAACTGGGCTCGGTTTTTATATTTGATGAAAAANAGCCT 
ACGCT rCTGATTTGGGATAAAGAAAAAGGAGCATTAACCTroACTATGTCTTTAGCTCCAGCCACC 
TTTTTAAGAGTAAATTGCTGGGCAGGTGGGGGAGGGCrAOTNrACGGAACCAAACTNTANGCCNGA 
CCAAGTGTGAGGAGGGGAGGTGATAAAAAGATTACAGGGTGGAGGAGTGGACCCTGAGGAAAAA 
TTGGGACXn'ANCTTGGCGTGGANAGGAGGGGAGAAGGTCANATOGO'nTGTNGAAAAGGAAAAT 
TA . 

SEQ ID NO: 125 1 ACAACACCATCTGGCTTCTCrATTTTGAAACTCCACACCAGGTTAATCrrGTT 
CATGGCrGTGGCTTAAATCTAAAGTCAGGTTCTGGTGTTrrCTGGAATCCATGCCTAGCTCm 
TCTGAAACCAAATCTQGATTCTGGACTCTTCTGCATTGATTGCTAAAGCAAGTITTTGrn'GGTAGC 
ATAACCTCGGTAAGGrrnTGAGTGAAGGTATTGATGAGGATTrrCAATTGATCTTCTGTGAAm 
GGTGCAATTGCACCTATGATTrGTTGCTACCATCTTGTGTGAAGAGGGGTCTTCAACTATGCAGAA 
AQAGAGTTCTGGAGGCTGAGCTACTGTCTGGGAGACTGCrrACAGCrCATTTTTTAAAAAGAAAG 
GTCACATATAATACAATATAAAATTCACAAATGTAAAGCrrATGGTTTTCrGGGTTITGACA^ 
AAAACTCCTGTGCAACTTCAAGCCTTACCAGGATATGGATGAAGGGTTTCTATCCAAAAG^ 
GTATOrrAACAAACAACTTTACGTTTTITGAATTGTCCAGTGCCCTGTGAGGATTCCTTACTrc 
TTrmXlACAAACATATTCAGAAGGCTANGCrNCAATTANGGrrGGGNAAGArnC 

SEQ ID NO: 1 252 ACATrTCTACCGAAGACTTCCCGCCGAACTGTCTGCCAGTGAGATAAGTGTTA 
TGGGAAGAACTGATGAAGTAGTGAGCCAGAGGATGGTCCATTTCTTGQTAAAGTTCTAAACGATC 
TAGGAA GACTGGGGCGTTTTCATCTGACATCAGATATCTGCV^AAACCCATCACTTGATATAAGG 
TTTTITCrrCAAATCTrCATCAGGlTCATACATCTCAATGATCTGCATTGCCCTTTrGG^^ 
AATGGAAATAAAATirCATTCAATCGAGGATCTCGTTQATGTTCATTrAGAAAGCTCACTAATTGG 
TCTACCG TTAAA TAATCAGTTTTGTCTCCATTGATTTTTTTXjA^^ 
GACAAATCTTTTGTGTCAGTTCATAGAACrmCATAAGAAAATO 
CTTTCCACTGGOAAGACXTAACTCCTrGAGrGCTTGAAAGATCACCTTTTCTGT^^ 
AATGTTCTAhTTAATACTCXn'AACnGGAATITrACCATTTGTGTTGGGCATAA 
CAATGGmcmGAGGCATGTCATTTGGGACTGACCTT 

SEQ ID NO: 1253 ACGCGGGGGCAATGCAACAGTCTCAmTCCCATGACGCATGGCAACACCGG 
ATTCAGTGGCATTGAATCCAGCTCTCCAGAGGTGAAAGGCTATTGGGCAGGTITaGATACATCTGC 
TCAGACTACTTCTCATGAACTCACCATTCCAAACGATITGATTCGCTGCATAATCGGGCGTCAAGG 
CGCCAAAATTAATGAGATCCGTCAGATGTCTGGGGCGCAGATCAAAATTGCGAACCCAGTOGAAG 
GATCTACTGAT AGGC AGGTTACrATCACTGGATCTGCrGCCAGCATTAGCCTGGCTCAATATCTAA 
TCAATGT CAGG CTITCCTCGGAGACGGGTGGCAlX3GGGAGCAGCTAGAACAATGCAGA'rrCATCC 
ATAATC CCrr rCTGCTGTTCACCACCACCCATGATCCATCTGTGTAAGTTTCTGAACAGTCAGCGAT 
TCCAGGTTTTAAATAGTTIGTAAATTrrCAGTTTCTACACACTTrATCATCCACTCGTGAm^ 
TTAAAGCGTTTAATTCCTTTCTCTGTTC 

SEQ ID NO: 1 254 ACCCAGACACAGAAAGnTTAGOGTAAATAGTAAACTACAAATACCCrCTTG 
GTTAAGTTAATTCATCAAGTTAATAAAGGTCATArrATCTATClTCTGCTGGTGACAACTTGTTO 
TCAGTATAGTCTGTCTCAAGAAAGAACrGGTTCAGGrrGGGrrTTOGAAAAGGAAAAAGACm 

CTTGCATAAATCArnrCAGTGATCAACATCTOCATCCrCAAACTGTCCAGCAACCGTTGOTC^ 
TATCCACCTCCATCCCATCrrCATAATCTOTATTQAATCTTCTGTCCTGACCCCAGCCATArrATA 



185 



wo 02/29086 



PCT/USOl/30732 



CTGGTrGCTCACAGACTGAGAAAGCATrCCTTCTAATCTCTCCAGTGTGGCTTGGCXnTCTGCT^ 
AGATGGGATAATCCTTCTTCATAGOTOTAAAATOTANGGGATGTCCCCCTGTCCTTGTTCATGTGC 
TrCACATCATATr Cl 1 1 11 CATCGTAGTCATCTGAATCCTCATCCTCAGGATCTGGATGCAAGGCCr 
GGCATTCNCACATTGGCAGNGGAACATTGCCTTCAACGCTGATTTATTAACCCGCGTACCTTGGGG 
CCQ 

SEQ ID NO: 1255 GGTACCGGATICrGTCmAACCCTCOCCnCGTGTTTCCCCCAATGTrTAAAA 
TGrnGGATGGTTTGTTGTTCTGCCTGGAGACAAGGTGCTAACATAGArrTAAOTGAATACATTAA 
CGGTGCTAAAAATGAAAATTCTAAC CCAAG ACATGACATTCITAG CTGTA ACTTAACTAl^ 
CTITTCCACACGCATTAATAGTCCCAl 1 1 1 iCTCTTGCCATnGTAGCTTTGCCCATTOTCn^ 
CACATGGGTGGACACGGATCTXKnXjGGCTCTGCCTTAAACACACATTGCAGCTrCAACTm 
TrAGTGrrCrGTrTGAAACTAATACrrACCGAGTCAGACTTrGTGTTCATTTCAm 

gctgcctgtgggcttccccaggtggcctgnaggtggocaaagggAagtaacagacacacgatgit 

gtcaaggatggttttgggacraaaggctcaatggtgggagagatccctgcagaaccccaccaacc 

anaacgtggtttgcttgaagctgtaacmanaanaaaaaattctggggcrggtctta^^ 

ataacattctnacataaagccccaa>rrrcattcaccatttcctrctt^ 

tntttttcncattaagggctg>n'gggtccaaacttmggggaacarc 

seq id no: 1256 acccactgggagatgattgaaitatgggagcgggtcntcccatoctgttctc 
atgatagtgaggtctttctgttcrrrcccatgctgtcctrgtgatagtgatrgggtgtcat^ 

TGATGArnTTAAAATGGGAAATGCCCCACACAAGCTCTCrrCTTrGCCTGrrGCCATC^ 

ATGTGACTTCCTCCTCCTTGCCrrCCACCATGATIXiTGAGGCTrCACCAGCCATGTGGATCT^ 

TACCCTTTACATGATl'CCCCAGACCTCAAATGGGCrAACACGCTTCTCrrCTCCAGCAGTOT 

TCCGTGAAGTTTanrCCAGArrGTTACATOGAACTOAAAACAAAGGGAGCCTCAGCTGGAm 

ATCTGGAGCATGCCACAAAGTCTIXrACriGGCArnTCGAGAAGAACCCATCAGAGATCATAAG 

AAATACAAGAACAGAGGTTGCATAACTCGAGGTGCITGGATATTCATTCTAAGOTTCTTGTTC^ 

AGAAAAOTGGGGTTAGGGTGGGTGCATGCCCAAGAATCCAAGTNGGTCAACCCTTrCANCCTTAA 

CCTACTANTnTGACAT 

SEQ ID NO: 1257 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTTTATTTGGAGAAGGAACGAGAGAAGGAAATTAGTGATGATGAGGCAG 
AGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATAAAGATGATGAAGAAAAGCCCAAGATCGAAG 
ATGTGGGTTCAOATGAGOAGOATGACAGCGOTAAGGATAAGAAGAAGAAAACTAAGAAGATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGA(XAAGCCTATTrGGACCAGAAACCCTGATG 
ACATCACCCAAGAGGAGTATGGAOAATTCTACAAGAGCCrCACTAATGACTGGGAAGACCACTTG 
GCAGTCAAGCACl VVi CTGTAGAAGGTCAGTTGGAATrCAGGGCATTGCTA'rTTATTCCTCGTCGG 
GCTCCCTTTGACCTTTTTGAGAACAAGAAGAAAAAGAACAACATCAAACTCTATOTCCGCCGTGTG 
TTCATCATGGACAGCTTGTGATGAGTTGATCCANAGTATCTCAArnTATTCCGTGGTGTGGGTGA 
CTCrGANGATCTGCCCTGACATCTNCCGAGAAATGCTCCACCAAACCAAAATCTITGAAAGTCAT^ 
CGCAAAAACATTTGTTAANAAAATOCCTrcGGCrrCTTTTm 

SEQ ED NO: 1258 ACAGTTGGAGGTGTTCCTAGCrrGGGCGGArrGCTTGAGGTCAGGAGTTCAAG 
ACCAGTCTGGCCAACATGACGAAACCCCGTCTCTACTAAAAATACAAAATTCAGCCAGGTATGGT 
GGCATACGCCTGTAATCCTAGCTACTIX3GGAGGCTGAGGCATGAGAATCAATTGAACCCGGTAGA 
GGCGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTAGGTGACAGAGGGAGACTC 
TGCCTCAAAAACAAACCATCCCTGTCGCTCCCATCCTAGACCAGTCTAATTGCAAGTATCTCACAG 
GGAGCCCAAGTATCAATGXrriTTTGAACAGCTTTATGGAGGTATAATCTATATACCATAAAATC 
ACTTATTTTAATATATGATTITAGTAAATTTAAAGAOTGTGCAGCCCTTACCACATCATACAA^ 
AGAATGTTTTCATCCCCAAAAAAGAAACTTCATGTCTATTTAAGCACTCXn'GGTTCTACTTCCAOT 
CTAGGGGGGCATTAATCTGA 

SEQ ID NO: 1259 ACAAAGTCCAAAnCTTACTTTATGGATGTAAAATGTCCAGGTTGCTACAAOA 
TCACCACGGTTTTCAGCCATGCTCAGACAGTGGTTXnTTGTGTAGGTTGTTCAACAGTGTTGTGCCA 
GCXTACAGGAGGAAAGGCCAGACTCACAGAAGGGTGTTCATTTAGAAGAAAGCAACACTAATGA 
TrCAAACAGCTTCCTGAATmAATTTTGTGTraTCTCACAGAAAGCCTTATCATAAATO 
TCTA ATTAATTT ACCAA GATAATGTAATTACATTrGgrrTTGTAAGGTATACAGCAGTAATCrCCTA 
TmGGTGTCAGTTTTTCAATAAAGTTTTGATTATGGGCC 

SEQ ID NO: 1260 AciTn'rrrn i"icaaaccatgctattgaatcaagaaaagtagaaaaactgaa 

TGTTAAAATTAAGGTmAAAGCTCATTCTTmGGTAAAAAAATAAAATTGAGATACCTrATG 

TGAAGTTITGGTAGGGAATGCCACCAGATTAAAAAACAACAACAACAACAAAAAAGAACCrTACT 

GrTGCCAGTATrrTCCCTCCCCTCAAAGACACrGGCAAAGTGCATCTGTTTAGACAAGGTTAGAGC 



186 



wo 02/29086 



PCT/USO 1/30732 



CAAACCATATCAGTTTTCAACCATGCTATTAAATTGTTTAATGGTTTCTAAAGCCITAGCCC/^ 

AGTGACCCAACrrGTrGTGTTCCAACTAAATCCCTCTGCCAAGAAATGAAGTATAAAATGCCTTTC 

TCTCAACAGAAACAOAGCAGACACAGAAGCAGCTACATAAGGAAAGGAGCTGAGAACATTAGCA 

TTATmGATAGCGGCrATTTGCAGGOTAAGACAATTAGCTAGCATTTTACACTAACTGGAATGAG 

AACAGCATGGCATNCAAAGCAGGTGATCAGTCATCAGTCATTGGATTAAAAACCAGGCACXACCA 

AAAACAAAAACX^NCAACAGCAACCAAAATACCCrGGCTTACC^nNrrACATAACCAAACC 

GAATmCCAGAAGTTTCAAAATTTCTTAATGGAAGGGGAAATTNGAGAAACCAAT^ 

SEQ ID NO: 1 26 1 GGTACriCOGGGGGOAOTGATCGAAAOCATGGCGTCGGTGGTGrrGGCGCTGA 
GGACCCGGACAGCCGTTACATCCTTGCrAAGCCCCACT(XGGCTACAGCTCTrGCTGTCAGATACG 
CATCCAAGAAGTCGGGTGGTAGCTCCAAAAACCTCGGTGGAAAGTCATCAGGCAGACGCCAAGG 
CATTAAGAAAATGGAAGGTCACTATGTTCATGCTGGGAACATCATTGCAACACAGCGCCATrTCC 
GCTGGCACCCAGGTGCCCATGTGAGTTGCTCCGTTGCTGCCCCCCTTTTTCCTTTTCT 
TCTCCTTGCCCCTAAGCATGGTAATAACAGTTGCATGTATTGAGTGCrTACCAAATGGCAAGCATT 
GTGCTGATTCCCATGCCTACACGATCTCAmCITCCTTACCACATCCCTGTAAGTAAGGTGAAATG 
CCAGAGACCNAGACGGGGTGGAAGOACAAGTGACTGCTGGGAnTGCACCANGTCTGCCTAACTC 
CANATCACTATGATTTNCrCCGGTGTTGCAnGGCCTGGTATTCrrrGGTTCTC^^ 
ANTTCTTGGAGGACAANAAAACCTGGNGAATTTAAAATATTATriTAATC^ 

AAAirr 

SEQ ID NO: 1262 GGTACAGACATTTTCAAAGTTGCCAGTGTTACnTAATrGGACTGCCrrCGTA 
ATTCATTGCCTCTGCTTCAACAATGTGCAACTCATCCmGCACCAGCCCCTAAACTG 
AAAGATAACTGGTGCTCATTTTTCATTATCCACCTTAAAGTGATAATCTTrGTCGGCCm 
CAACCGAAAAGATAGTTCTGGOGCCTCAGGGGGCTCATGTCCATGTCCATCGAATCTTCCATCGGG 
TGGCGGCACGCACTTAGGTAQGAGAGAAGGCGGACGGAGATAAAAGAACGCTGCrCCAGAGAAC 
AACCGCGCAGGACGGAATCACACCAGGGACCCCGCGT 

SEQ ID NO: 1263 ACTGTGTGCAGCAGCTCAAGGAATITGATGGGAAGAGCCTGGTCTCAGTTAC 
CAAGGAGGGTCTGGAGCTGCCrGAGGATGAGGAGGGGAAGAAGAAGATGGAAGAGAGCAAGGC 
AAAGnTGAGAACCTCTGCAAGCTCATGAAAGAAATCTTAGATAAGAAGGTTGAGAAGGTGACAA 
TCrCCAATAGACrrGTGTCTTCACCITGCTGCATTGTGACCAGCACCTAC 

TGGAGCGGATCATGAAAGCCCAGGCACTTCGGGACAACTCCACCATGGGCTATATGATGGCCAAA 

AAGCACCTGGAGATCAACCXn'GACCACCX:CATTGTGGAGACGCTGCGGCAGAAGGCTGAGGCCGA 

CAAGAATGATAANGCAGTTAAGGACCTGGTGGTGCTGCTGTTTGAAACCCGCCTGCTATCrrCTGC 

TTTrCCCTTGAGGATCOCCANACCCACTCCAACCGCATCTATCCATGATCAAGCTAGG'nTAAGTA 

TTGATGAAAAATGAAGTGGCAACCAGAAGAACCCAATGCTGCANGTTCCTGATGAAGATCCCCCC 

TCTTCAAGGGCGATTAAGGATNCCTXniXGCrrGGAANAAATCCArrAAGGTTNGGA>^ 

TTGGAAAACTTGTNCCrcrrrNTATATTGTTCCCAATGGGCTTCCANCrGC^ 

SEQ ID NO: 1264 actggctggcctatgtcctgggtcotccaagagcaataattgttcacaatgac 

AGGTCAAGTCACCTCCAGAGTGTATAACTAATGTAACCAAGTCTGTGGTATTCTGTTACAGCAGCA 
AAAAGCAGACTGAGACAGrrGGTCrrCAGAGAGCAGACCITITCATATTCGAGGATATITCATCCT 
TCAGGGATACTTCTCTTATmCrCCAAGAGTGGGCGAATCGCATGGCGTTGTTGCAAGCATCGTC 
TGAGAACCAAGGTAACGAACTGACTGCTGATCCAAGGCAGGCACTCAACGGTAGGATTTTAAGGT 
CCACAOATATGGGAATGAAGAACTCCATAGAAGATAATGATGGAAACTGTAATCCCAGCTACTTG 
GGAGGCTGAGGCAGGAGAATCCTTGAACCCAGGAAGCAGAGGTCGCAGTGAGCCGAGATCATGC 
CATTGTGACAGGAGAGAAACTAACAGGAAGTTGGGTGOAAGCAGCGCGGACCCACGGCGCACCG 
AACGCACTTCCCCCCGCGTACCTCGGCCGNGACCACG 

SEQ ID NO: 1265 GGTACGCGGGGAGAAGGGGAAGAATATGGAGGATAGGAGAGGGTGATGATG 
CTTTAGCCCATTAAGGAAGAAAAGTTCCCCTTACTCTTACAAAAGGTGATTGCACCACTTAAAAA^ 
GGATAGGTAGAAAATCTGGCCATTTTATGTGTTA'll rri ri'CCTGCATACCAGTGCCCCCTGGA^ 

gatggggatgctcatttcagagacagcctcttcacctgtcctccgtctctcx:attacagaoccac^ 
agccaagcttgagtractggttaatttccttgaagtcaatagagacrctctgggtcctcatcccca 
tgcatatatgtctgggagaagcaagcattctccaaaacagggctgttgctttctrcctggccgcat 
atgtgactotcrrcccctagactgatggtatacttitgagaggggcggataaaggtngcaaactt 
taa'rgggcataaggatcccctagaagcatatgtaaaatgcagattcctgcaccccaactgcagac 
actgrraagtgggactggagtggggccccnggtctgcattcrmactggtacctgcccnggccggc 
cgntcgaaagggcgaattccacccactrggcgggccgtracritatggattcccaactcggt"a 

SEQ ID NO: 1266 A Ci ^ lU ^ ll ^ l ^ ll ^ ^U ' ^l ^ ll ' 1 ^ ^11 ^1^14U^ll ll'iNCCAAAANATANAATCATANAATTTA 
AAATCTGGAAAAACWrCAGGGATANAa^CTOTCACATAACCCThrrGCCACTGCCCAANATTC 
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AAGCTGGOGCANAGCTCKKlACAACrCATAAGCAOCANACACGGhrrACCTGTCCTNT^ 

>n'Cn'GTCAGCTG(::ACTCATGGAAGGAGGGGGTCTNAAAGGCTATCCCAAANACAAAAGGOT 

GCCCAGNATTCCAACACTNTCATTTTCAACAATCCTATTTATTITrcATCra 

CAGGAACTTACATGTCCACACTGCACGCCAT^TOATT^r^CTCCAAGTAGGAAG^ 

GGNAGCCTCGTCCrcCCCTACCAAGCCCGCGCCTrAACATCTrTGCOATOTTAGAAGTCAC^ 

CTOATCAACTGGOGCTTCAGQATTCTTACCTGGAAACAGGGAAAANTGGGAGGGAAAGGAGGTG 

CCCCTCAAAAAAlTGlTANAAGGTTTAAAACriTCCATGGTITITITAAAAGGNGGNCCC^ 

CC^XJ^INTCANAATCCTGNGGGAAAAGTTAC^CAAAATaGCAQGG^rmCAAAANCCCCCCCCCAAA 

NATTCTTGGAAAAAAAAAAGGTTn'GGAAAGGGGCrrGCATTT^ 

SEQ ID NO: 1267 GGTACACAAAGAGGGGGTGGGTGTCGGATGCAGAGTGTGTGGCCTGATGCTC 
CACGGCGTGCAGGACGOGGGGCrAATAOTAGGTTTCCTTCTCCACCCAGCCGCCAGGGCGTCGCC 
TGATGATGAGTrrrCTCACTTCGTCATATACGAAGATGAGAAGAGAGTAGGGGAAGGCACAGAAC 
CACCGGGTAGGnTGAGGGGATACATCCTAAGAGCAACACCCArrCCAOGGCAGTAGGAAAGGA 
AAGCAGCCAGGGCTGTCrCTTCAAAGAGGCXAAATATCAAGATCTTGTTCTTCATCCOCrGCT^ 
AGACCGvV^TTCCTCCrGGTCTTACAGATGACCAAGTCGGCCCACrGCACCACCACOATACrrGACG 
AAGAAOGCTOTQTOOCAGGTGAACTCCACGATTTTCCTCTGCTCATAGGTCCACTC 
CTGTCrrcC^CATCGTTGATCCACGGTCATCCCAGTCCACrCGGAGGCCCAACANGTGAATTGGGA 
GGAAACCGTTCTCANCCANAATCACAAAGTAAGTAAAGAAGCCTNCCAAGGCCTGGATCATTCCA 
ATCTTCCCCATAAGCCATGCTGATCANCCGTTCATrCACAAATmGTCTGTTTrGGGAATTCTO 
CCTGGCnxnTTATGAATGTCACTCrCAACXrrACTCATAANCCNAGGGAAAiNrrGGCNGGO/^ 
TTNANTGC 

SEQ ID NO: 1 268 ggtactagccggacttggattttctggaaagatttcagttgaggaacgggaa 
caaagattatgatagcmccgaccaccaccaacrrcaatttccttagcrqccgtaatatrca^ 
ccctgagctgagccttgaggtccgagttcatctccagctccagaagagcctgggagatgccggac 
tcgaactcgtccggcttctcgccatrgggcncacgatcttggcgctcgaactga^ 
cctgooaoaactrgccgagcgccggcttaggaagagccccgcgtacraactttataatcattttca 
attaactcttcttctgtttcctgttrcttcaaatcatggaaagaatccagttcatc^ 
tcacatttacgtgcttgaataagctgcccaatggtgttctacaatgcttgctrtcatc^^ 
taacgatgaaccaacrrratctccagagtaagatatccaaatcacctggggaggttagagtttcrr 
ctcttatacrragacctgataaaaattrggaaaaatcatactcgtcgggggcaaoctgcccccgag 
tcataagcggnt^mgagactctccacta^^taaat^ct^gccccx}g^^^ggc^ttgac^^ 

seq id no: 1 269 ggtacccccacgattacacagcntgaacaagtgaaactatggttaccaaac 
agctatititattatagcatctacagtgtcrggaaaaggatgtaacaataaataactgtagacgca 
tcacoagacatccatttacrraatcacagaagtggatcttgctacatagttttcrcaatgtcttca 
tcttcagattctccaggcttacgaatgtcatcaatcttcaaaatcattctaaccatttc 
gagatatctgttgcmttgccaatcaagotn'cratgacatgctgttgcttcatatcat^ 
cttgtgcaaacagtcgatgccaagagcagggttcatctccttcaccnx}tcrggct^ 
catagtctggatgggattcatgccactgtnrcagagagggccatggggatgacctccagtgcgtc 
gocaaacgctctcatggcatactgttctaaggtggggcacttatcccgctctrggct'aactggcag 
ggcacaagatatctaacagcccctncttcatacaccacacgattatcgcggatgaggttccggat 
gacacacaaaocatcotoaaaggoatcgtitcgcctcctcaatgatcatctrattrcctcctct 
aaaaaatgggtaccacirrtggagttotacanctgctcgangacccannatm 

T 

SEQ ID NO: 1270 ggtacaagacactacgggaacagtttgcctccctcccagcctcaaccacaat 

TCTrCCATGCTGGGGCTGATGTG<KKn'AGTAAGACTCCAGTrCTTAGAGGCGCTGTAGTA*l"l"iri"i"l 

TTTrrGTCTCATCCTTTCGATACTTCTrrTAAGTGGGAGTCTCAGGCAACTCAAGm 

CTCTTTTTGTTTGTTTTITGAAACAGGATCnTGCTCTGTCArc 

CACAGCCCAGTGCAGCCTCGACCACCTGTGCTCAAGCAATCCTCCCATCTCCATCTCCCAAAGTGC 

TGGGATGACAGGCGTGAGGCACAGCTCCCAGCCTAGGCCCTrAATCTTGCTGTTATrrrCCATGGA 

CTAAAGGTCTGGTCATCTGAGCrCACGCTGGCTCACACAGCTCTAGGGGCCTGCrCXrrCTAACTCA 

CAGTGGGTITTGTGAGGCTCTQTGGCCCAAACAAACCTGCATATCTGAGCAAAAATAGCAAAAGC 

CTCrCTCAACCCACTGGCCTGAATCrACACTGGAAGCCAACTTCOTGGNACCCCCGGTTC 

CTirmGCrGGGTAGGAGANGCTAAAGAACACCCTAAAATTTACTrAATCm^ 

TACArrrGGGGCCTTAACAACTNCCCACNCCAATTTNAAANGNCACCCC^ 

N 

SEQ ID NO: 1 27 1 GGTACn 1 1 lUU'l"rilU"IUUU-l'Il"lU-l"n-inMU'llUNCNAAAGGTTTTATGAAATA 
TGGNCTAATAGTATGAAAAAAATCAAANCCCATGTKrGTTTTAKTTAACAAGGAAAACNCAGNGA 
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TTTAAATGCTGCACATAAACmnmTAAAATNATGNGGGCATTGNGCrCCCAGNGGTCCACAGC 
ATTTACATATAAAAATCTAAAAAGTTTrO'CATAGTCCAAANCACTGNQGTCAGGGTANCAATCAA 
TACCTATATATAATAATGATCTGTTTAGNCAGGTAAATGGAAAGCATTACAGTCAATCNTCCAAAA 
TATbnrCATCCAAATTGATATCTCNNGNGTCAATATCTTCTAATTCCAA^ 

TAGrrcNAGTAAAcrc'iTj- 1 cr re i 1 1 gttggctgngggggaggcccaataacccggagtaggaaa 

AGCAGGTTTCXICATCAANGTrCCrGTGTGAGCTGTAAATCTrGAGGGCTGAAAGTCTmCC 
CCAT 

SEQ ID NO: 1272 ggtacaacttatagaaaaggtaaaggaaaccccaacatgcatgcactgcctt 

GGTGACCAGGGAAGTCACCCCACGGCTATGGGGAAATTAACCCGAGGCTTAGCnTCATTATCAC 

TGTCTCCCAGGOTOTGCrrGTCAAAGAOATATTCCGCCAAGCCAGATTCGGGCGCTCCCVvTCTTGC 

GCAAGTTGGTCACGTGGTCACCCAATTCTTTGATGGCriTCACCTGCTCATTCAGGTAATGTGTCrC 

AATGAAGTCACACAAATGGGGGTCATTrnOrCAGTOGCCAGTTTGTGCAAGTTCCAGTAGTGACT 

GATrCACATTrTTTTCCAAATGTAATGCACACTCCATTGCArrCAGCCCGCTCTCCCAGTCATCACA 

GTCTGGTTTCTTGATATCCTGAAGGAAGATrCGGCCACCTCGTTGGTTCraCAGCTrCATC 

TCACATGTTCCTCTCCTCATGAGATTGGGTGAAGAAAAGTNTTTGGCCCCCGCGT 

SEQ ID NO: 1273 GGTACTATGATCCAAACACCAAAAGCTGTGCAAGATTCTGGTATGGAGGTTG 
TGGTGGAAACGAAAACAAATTTGGATCACAGAAAGAATGTGAAAAGGTTTGCGCTCCTGTGCTCG 
CCAAACCCGGAGTCATCAGTGTGATGGGAACCTAAGCGTGGGTGGCCAACATCATATACCrCTTG 
AAGAAGAAGGAGTCAGCCATCGCCAACTTGTCrCTGTAGAAGCTCCGGGTGTAGATTCCCTTGCA 
CTGTATCATrrCATGCriTGArrTACACTOGAACTCGGGAGGGAACATCCTGCTGCATGACCTATC 
AGTATGGTGCTAATGTGTCTGTGGACCCTCXjCTCTCTGTCTCCAGGCAGrrCTCTCGAATACm 
ATGTTGTGTAACAGTTAGCCACTGCTGOTGTTTATGTGAACATTCCTATCAATCCAAATTCCCTCTG 
GAGTmCATGTTATGCCTGTTTGCAGGCAAATGTAAAGTCTAGAAAATAATGCAAATGTCACGGC 
TTCrCTATATACrrTTGCTTGGGTCATnTITrrCCCT^ 

AGCCTQTGTrrcGGGGGAGAAACAAAANAACCAACTTITrmnrrCCCITGCC^ 

AACTTANAATTTNAAGCTTATTTTTTTITI^^ 

G 

SEQ ID NO: 1274 ACACAGGCTGCTACCCAAGTTGTrCTGAATGTTCCrGAAACAAGAGTAACAT 
GTTTAGAAAGTGGACrCAGAGTAGCTrCGGAAGACTCTGGGCrCTCAACATGCACAGTTGGACTCT 
GOATTGATGCTGGAAGTAGATACGAAAATGAGAAGAACAATOGAACAGCAC^CTrrCTGGAGCAT 
ATGGCTTTCAAGGGCACXrAAGAAGAGATCCCAGTTAGATCTGGAACTTGAGATTGAAAATATGGG 
TGCTCATCTCAATGCCTATACCTCCAGAGAGCAGACTGTATACTATGCCAAAGCATTCTCTAAAGA 
(HTGCCAAGAGCTGTAGAAATTCrrGCTGATATAATACAAAACAGCACATTGGGAGAAGCAGAGA 
TTGAAOGTGAGCGTGGAGTAATCCTTAGAGAGATGCAGGAAGTTGAAACCAATTTACAAGAAGTT 
GTTTTTGATTATCTTCATGCCCAGCTTATCAAAATACTTGCACTTGGACGGACAATT^ 
AACTGAAAATATCAAATCTATAAGTCGTAAGGGACTTAGTXKjATTATTATAACCACACATTTTT 
GGGGCCAAGAATAGTGCTTGCTGCTGCTOGGAGGGOGTTCCCATGATGAATrGCTTGACTTAACC 
AAAGTTTNATTTCGGTGACTCTTTATGCCNCNCCCAAAGGAGAAAATACCAGCnmTGCOT 
GCA 

SEQ ID NO: 1275 ACCCTTGGAAOATGGGAAAGGTGAOGGAAATATTTGAAGCAGGGTCAGAAC 
ATCCACTAAGAACATAGCACCTCAGTAGAGCTTACATTATAGTGCCAGGGTAGAGTTATTACTGA 
ATAGCTTAGGATGATGAACATTAACCTTCCTACAGGAGTAGTAGCAGCTGATrTGGTGACCATCAT 
TGGTCACCrmAGTGTAACTGCAAACTAAGAACAATTATGGCTTGACATATACTCCATGTAGGGA 
AGTGATGGGAGAGGCAGCCTCTGTGTGGTCCATCCTTGGAAOCACTGCATCGATmGCTCCCCTC 
TGGTTTTAAGGAGATCTTTTAGACCTTTCCTTGCTAACTGCm 
ATICTGGATTCAGACATCnXrrTCrcACCCrJ'ClJU'llCATTGTAGCAATGATCT^ 
AAATTGGCITGCAGGAATAAmCAAGriTmCTAAAGACOTGGATTAACAGGTTGATrACT^ 
GCTGATCCAAGGTACCTCGGC 

SEQ ID NO: 1276 A CrrriTlTn - lTl - lTl i - J ' fl ' l IT l UNGGTANAATATAAAAGAAITGAGAGCA 
TAGGmAATTTCATTTTTATAAGGAANACTACATTTACCrGAAAAANAAATATGGCATCACACAC 
AAACACNCrrrcAGGGAAGGAGTATATTTAANAATATGCNGTCATTCrGAATGCrAA^ 
GATCCCTGTTTACTATTAAAATATGTAGTOTAAAATTCTITAAAGGAAACAACrCACTAAATCnXr^ 
ACTAGGTAAGTTATTGNGAATITAGTTTTCAAATCACAGTAATCCACATAACCATTATTj^ 
NTGTTTCATANAACTGCAACAAGCCAACAAACATTTAAATGCACTTCACATCAANAGTATGTAAA 
CTTTATTTTrrcCTCTTCAAGCCrrCANAAAGTTAGGGGCTATTO 

CTCTGAAGGCAAAATTTCAGTGCCTCACAGTGTATATGACCATCTGTTGCACTGCCACATNrrTCCG 
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CGTACCT 

SEQIDNO: 1277 ACGCXJGGATTGATAATAGGGTGCATTGTtGCTTTTACTTTATTrcACClTrrC^ 
ACATGGACTTAAGCCCITAGATGTGGCXiTTTATGAAGGCAATACACAACAAGGTGAATA 
CTGTCATTGCAAAAGCTGACACTCTCACCCTGAAGGAACGGGAGCGGCTGAAGAAAAGGA'nrCTG 
OATOAAATTGAAGAACATAACATCAAAATCTATCACTTACCTQATGCAOAATCAGATGAAOATGA 
AGATn'rAAAGAGCAGACTAGACTIXnx:AAGGCTAGCATCCCATTCTCrGTGGTTGGATCCAATCA 
GTTGATTGAAGCCAAAGGAAAGAAGGTCAGAGGCCGCCTCTACCCCTGGGGTGTTGTGGAAGTGG 
AGAACCCAGAGCACAATGACTTTCTGAAGCTGAGAACCATGCTCATCACCCACATGCAGGATCTC 
CAGGAGGTGACCCAGGACXnrCATTATGAAAACTTOCGTTCTGAGAGACTCAAGANANGCGGC^^ 
GAAAGTGGAGAATGAGGACATGAATTAAAGACCAGATCTTGCTGGAAAAAANAANCTGAACTCC 
CCGCATGCCAAAANATGATTGCAAGGATGCAGGGCCCANATGCAAATCCAAATGCAGGGCCGGG 
GATGGCNATGGCX}GGGCriKITCGGC(XCACCTGTAAGGNGATGGGCCCTOrrCAAG>^ 
AAA 

SEQ ID NO: 1278 GGTACTTTTTITITITnTITnTTT^^ 

GAAAAATTANCANANATTCTGCOTCACAATTA^L^GT^^AACATGAGTCTTCCAGTGTmAAAi^ 

NAACTGTAm-ATACATAAAGCAAAATArmGGCATCACCTAGTGAAAACANACGATGNGGTNTT 

TTTrCANCCATCTCATATAGTTAOOCCCCOAGTrACAATrTOANATAAGTrGATTCCTAAAC^ 

TGNGCTTTACANAATGNGACCAATTITATTACTCCCAAGGACCTTGTnrAA^ 

ANAGTTCACIXJATCTCACATTTTAAAAAGAAAAAAGTTTGTTAACTGTTTTAATCAAT^^ 

TAACAAAC 

SEQ ID NO: 1 279 ggtacgcatcgaaaggattgacggcgtgagtttactggtgcagagaaccatt 
gcaagacttcgcctaaagaaaatttitagtggtgtcmgtaaaagtcaccccccagaatctaaaa 
atgctgcgtatagtggaaccttatgtgacctggggatttccaaatctgaagtctgtccgagaactc 
attttgaaacgtggacaagccaaggtcaagaataagaccatcccrctgacagacaatacagtgat 

TQAGGAGCACCTOGGOAAOTTTGGCGTCATTTGCTrGGAAGACCTCATrCATGAAATTGCCT^ 

AGGGAAGCATTTCCAGGAGATCTCATGGTTCITGTGCCCTTTCCACCTCTCA 

TACCAAAAATAGAGTGGGCTTKOTAAGGAGATGGGCACACCTGGCTATCGGGGTGAACGCATCA 

ATCAGCTCATCCGTCAGCTGAACTAGACCCANGTCANGC^.GGGGCTGAAAACTGCCCTTGGGCTG 

ACTTTTGATANGCCATGCCmGCCACTTTACAAGTTCTrTTrG 

AACCCTrGANATTTGGGAAGQAATAAANGGAGGCTGOTNCCTGCCCGGGGCCGGGCGGTNeAAA 
A 

SEQ ID NO; 1280 ACAATTACCCACCACrGGATrTGACTCAGAGAGGACCCCCAGAGGGTGTCTC 
CATCTTCCCTATTrATTTTCAGCCCTTGAGGGCTTCATTGTAGATCAAAGCCAAGGCCCCCAGGAA 
GGTGACATACTCCTGGAAGrrCACCTCCTGGTCCrrGTTCCGGTCCAAGTXriTC 
ATTTCAGCATCXrrGCAGCTTCTAATGTGTTAQAATGTGAAATCCATACrCAGTGOTGATGACAACC 
CTGGATTCTTCCCCTrCCC<XTCCCAGGCV^TCCTCTCTGCAAGTGGCr^ 
AAGGACCCATGTCACTITGGCATrGCnTCrrCCTCAGCTAmCTCAGTTACT 
AGATGGATATCCGGCTGGAAGCATCCCCTACrCGCTGGGAGAGTGGGTCTACAGCTCAGGGTCTA 
CATGTGGACCANGGCCTCAGAATGTGGGTAAATGTGAGTCCCGCGTACCTCGGCCGNGACCACGC 

SEQ ID NO: 1 28 1 A C lT lU l- lU ' rriU - lU lU - i - l i 1 I GGGCTITA rri - rci l i GGAGGGAGGQTCTrGCr 
TGTCATCCAGGCTCTAGTGAATGGGCATGTCATAACTCACrGTATCCACAACCTCCrGGGCTG 
TGATCCTCCAGCCTCTGCrrcCCAAGTAGCIXKKjACTACAGGTAAGTGCCAGCAC^^ 
TTnTTACATCrrTGTAGANATGAGGTCTCACTATTTTACCAGGCTGGTCTCAAATTCGTGC^ 
TGTGATCCTACCAGCTCAACCrCCCAAATCGCTGGGATrGCAGACATGANCCACTCTGCCTGACTC 
AAAAATCATACTTTATTCTTGAGCCCATTGGTCAGGGTANATTCGCACACTCCCTCrcCA 
CX:AGGAGC(XCTCTCCAAGCTCAAANCGGAAGCTGAGCTGC AAGC CrGCCATCTTCTCCGCA TAG 
TCCGGATNGAAGCGCTCATTCTGCGCCACGGTGAAAGNGGTCirrCCANTCCCCANCCATGCCTT^ 
NCTCATGAAAGGGGAGATGCTGNGGTCCATGAACNCCTGGGGGGACNGGNGGGNGTAN^ 
AATANGGTTCNTmTNATITTCCTTimACGACCGGGNCTTGAACCAT^ 
CTTGGCANOGGANCNGCCCACNAAKrCCAGGATCTTTTrGAAANCTCCCITrTGG 

SEQ ID NO: 1 282 A CI l ' l 1 ri i ITn - i ' l ' l i l l I ' i M I U 1 1 1 1 J GGANATGGAGTCTTGTTCrGTANC 
CCAAGCTGGAGTGCAGCGGTGCAATCTCXiGNTCACTGCAACCT^mKXn'CCCAGGCTCAA GGA^ 
TCTOn'GCCrCAACCTm'GGAGTAGTTGGGACrGCAGGCATGCNCCACCACACCCAGCTAArrr 
GNGTITrrAGTAAAAACAGGAmCA<XATGTrGGCAGGCrrGGTCTTGAACTCCTGACCTCA^ 
ATCTtJCCCACCTTGOCCTCCCAAATTGCTGGOATTACAGGCCTGAGCCACrGCGCCTGOCTOAAAT 
TTTTGATTITTAANACCTTrAACTTCCCTAATCTGCCTATCTAGGNGCATATO 
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CCGNGACCACGC 

SEQ ID NO: 1 283 GGTACmrriTrTTnTITITITIT^ 

TTCTANCANAGTGGCTCCAOGCCCTTCACGCCnWCAAACACCACCCATGAGGGTTTAGGAAGGN 

GCCATCATTCrGTGAAGGCCCANAGCTTACCCAAGTCTTGGAGCCCAAGTTGAATCACCAACCAN 

AGGGTTGGGAGAGGAAAAGGAAACAGGCANAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGOAC 

TGATATCAAGGGAATGCTGAGGTCCAGCAGTGTCTCCTGAAGGCATGCTACATCCTAAGGCTCCTC 

AGGACTGGATGGAGTAGGANATCTGTGTGTTGAGCANTTCACATCTATATGGCAACTTTAAGGAG 

GCNCTTGATGTCAGGCTCAATGTTTATOGNTGGGAANGTGCGGNTGTAJ^CGTCGGAAGGGCTCTC 

CCTCCGGCCCTATGAGGAACTTrm;AAAGTTCCAGGCCACATmTGAAGCGGNGCACAGGGCTO 

CAAANGATTGAGCCrrGGNGAATCGGNCATGAAGGGAAAATNGGGTCATTATAAGGGNANGGGA 

ACTTTNGCCTTCANGTT 

SEQ ID NO: 1284 ACGCGGGGAGCGTCCTCGAAACCACGAGCAAGTGAGCAGATCCTCCGAGGC 
ACCAGGGACTCCAGCCCATGCCATGGCGGATTCTGAGCGCCTCTCOGCTCCTGGCTGCTGGGCCGC 
CTGCACCAACTTCTCGCGCACTCGAAAGGGAATCCTCCTGTrrGCTGAGATTATATTATGCCTGGT 
GATCCTGATCTGCTTCAGTGCCTCCACACCAGGCTACTCCTCrCTOTCGGTOATTGAGATGATCCTT 
GCTGCTAlI'l'rCri'lOTTGTCTACATGTGTGACCTGCACACCAAGATACO^TTCATCAACTGGCCCT 
GGAGTGATirCTTCCGAACCCTCATAGCGGCAATCCTCTACCTGATCACCTCCATTGTTGTCCTTGT 
TGGGAGAGGAAACCACTCCAAAATCGTCGCAGOGGTCCTCGGCCGCGACACGCTT 

SEQ ID NO: 1285 actogaattttgcatatctgtagagtgtatctaaatattcctgcctaaaacca 

TGCnTGTCCGCCAGGTAGTCAAAGAGCATCCTACCATCCCIXJGTTGACTGCATTTGCCTTGTAGm 

CTGGATCTTCAAACATCnCACAATrOGTTCTGTTTCTGCCTGAAGCrcriTC^^ 

TGTGGTTCriTrCTCTCTCAAAGCATGAGGAATATCATCAGAATAAAGGTTTTTGTATAC 

GCAAAGTCTACCATGTTGGTATCACTAAGAAGGTCCAATTTACCTTGTAATAATTCCrm 

ATATCTCCnTACAGAGAGAAATTCAAGAAGCGGAAAGACTAGATGCCGATCCAAAAAGTGCGCG 

ATGCGAGTAGTCAAGTCGTACC 

SEQ ID NO: 1286 GGTACAGACCAGTTATTTTGTAAAATGTrCCCCAGTGTGGGTTTATCTGAOGT 
rrCCrCATGATTAGATTCAGACCATGCATTTITCGGAGGATTACCACTAAAACAATGTTGTGTTCTC 
AGTGCATCCTATITGAGGCACATGATGTCAATTrGTrCTCATATTGATGAGGTTAACTTITATTATT 
TGAATAAGGTGGTATCTGCCAGGTTTCTCCACTATAAAGTTACTGTTITrCCCTGTTA^ 
AGTCATrrGTGAGGAGTTTCTTrGAGACTATTTAAATGTTGTGTTCTCCAAAAAAAAGA^ 
GAGAGAGAGATCAGAAAGATTIXlGTGGCTAGGTGCAhrrGGCTCAOGCCTGTAATCCCAGCACTTT 
GGAAGTCCAAGGCAGGCGGATCACCTGGGGTCAGGAGTTTGAGAACAACCTGCCAACATGGTGA 
AACTCTGTCrCTACTAAAAAATAACAAAAATTNCCNGGTGTGGTGGCGCACACCCATATGTCCCA 
GCTACIXriTGGACGCTGAAGGCAGGAANAATTGCTTGACCCCANAAAG 

SEQ ID NO: 1287 ACCCAAATAGCTAAGTTTAAAATTTTAAGAAATCCGAGAATI AGTTrAAAAT 
TGTATAAATTCTCCTTTTAACTTTCTAAAATGTCCATATGCACTCTCATTCTGTm 
ATTAGAAAAACAGACTGCAGGCCGAGTGCAGTGGCTCATGCCTGTAATCCCAGCAGTTTGGGAGG 
CCAAGATGGGAGGATCGCTTGAGGCCAGGAGTTCAAGACCAGCCTGGTCAACATAGTGAGACACC 
ATCTCAAAAAAAAAAAA 

SEQ ID NO: 1288 GOTACi'iM"iuu'i'riMM-i'nM-rri'i'ri"i"i i gggtccttccccaacagcagttggaa 

'I'l'lUCl'lllGAACACAAAGTAAATTAATGTTTATACTGrmrrCACCTGAGTCATGTAAAAGGNGA 

ctcctttcattttaaaaagttatatitaatttttoggggcctt 

gtg11'1'1'1'j]1"11gtaaacagtctacatgtcaacaaatggataagggttaacaaaggcaaatactg 

ACTrCATTTGTGTTTTAAACACGATrATATGAATrmCTrTm 

CATTCATATAGGNCCTCTTCTCTCAACTGCrrrGAGATATANCrrTTAAATATGGGTAGATCAAGA 

AAGTAATGTTGGTAATCTXm'ATCTrGCATAGAAAAGAAAAAATAAAGGAACTTATrrCOT 

AGGTCTCANCTAGTTTCTTAAAGTCMU'ri'Cri'CANCTCCAATOGAAATTTCTCATAGCAC™ 

AGACTGGCTTCATGTCAAACTCCA(:V\AAACTTATTCTTGAAGTGTTAATTTANTGGTGC^ 

ACAGGGCAAAGCCANTTCCGCNCCANQCCNTrriTAGAGCCGANACCNCATCCCCTO^^ 

ACGATTGCAGNGGGAAGCCAACmrCNCCCAAATAGCKTGGGTNTAGTGNGT™ 

SEQ ID NO: 1289 GGTACCATAAGGAGACACAAGAAGAAAGGTGACACTAAGGCTACAGTGCAC 
AGAAAACAGACCAGGTGTGGCTTCGACTGTGCGGACCTOCCCACTAGCCTATGCrACAGATTrGA 
AATGTCTITCACnTTGACATGACACACGGTTTATATTACACAAAATGAATGAAACGACAATGGCTA 
AAAATAAATGAGACAGCCTGCACACAAAAAGATGATGACTGCTACnTTCCTCCCATCAGAAAAT^ 
ACTCAAAAAGOGAGTATTTAAAGGAAACTCAAATCAGGAGAACCCGGTAGGCATCAGAGGTTCA 
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GGOCACCAAGGCCTTANGGCGGGAACACnTITCAACCCAAGCCAGGCTTCAGGCKjCA^ 

AACAGACCCCAATTTCCACAGGGGAGGCAGATCrrCTATACCTACAGTGACAGAAAATACACTAA 

AGTGCAGTATAAAATATAAAAAGGTrrGATTCTOAATAGACCAACTGCTAATmCCTrAAAAAAA 

TTmAATrTGGTTGAGTAAAAACCAAATTAGTTCACTGAATCTCATTTTGTAGGTA^ 

ATTTTGCAATACGAAAAACTGGAGCTTATGACrcNTITrOATTTTCTOT 

CAGTATTANTGGGAGAACACTTACAAAANOGGGGCriTGNGGGNGAGTTChnTrGCATTAGTGG^ 
TmTAAAAAACAA 

SEQIDNO: 1290 ACACACCCAGGAAATTrGTCATCCACCCTGAGAGTAACAACCTTATTATCATT 
GAAACGGACCACAATGCCTACACTGAGGCCACGAAAGCTCAGAGAAAGCAGCAGATGGCAGAGG 
AAATGGTGGAAGCAGCAGGGGAGGATGAGCGGGAGCTGGCCGCAGAGATGGCAGCAGCAITCCT 
CAATGAAAACCTCCCTGAATCCATCTTTGGAGCTCCCAAGGCTGGCAATGGGCAGTGGGCCTCT^ 
GATCCGAGTGATGAATCCCATTCAAGGGAACACACTGGACCTTGTCCAGCTGGAACAGAATGAGO 
CAGCTTITAGTGTGGCTGTGTGCAGGTTTTCCAACACTGGTGAAGACtGGTATGTGCTGGTGGG 
TGGCCAAGGACCTGATACTAAAOCCCCGATCnGTGGCAGGGGGCTTCGTCTATACTTACAAGCI^ 
TGAACAATGGGGAAAAACTGGAGTITITGO^CAAGACTCCTGTOGAAGAAOTCCCTGCTGCTAT^ 
GNCCCATTCCAAGGGAGGGTGTTGATTGGTGTGGGGGAAACTGTTGCGTGTCTATGACCTGGGAA 
AAGAANAAGTTACTCCGAAAATGTGANAATAAACCrrATTrGNCAATTATIWCTIWG 
AAAACTATTTGOACATAAGGGGNAATITGTATTTTGATrGTCCAAAAAAANTmATrCT^ 

SEQ ID NO: 129 1 ACAGGGGCAGTCAGTGGAGGGCGAGTGGnrCXjGAAAAAAAAAAAGAAAAA 
AAGAAAAAAAAAGAAAAAAAAAAGATTTTTTTCTTCTCTTAATCGGAATCGTOATGGTO 
TArnCAATGGTGGGGTTAATATAGCATGTTATCCTGTCTATCrmAAAGATTTCTGTATAANACT 
GTTGAGCAGTITTTAAAATAGTGTAGGATAATATAAAAAGCAGATAGATGGCGCTATGTTTGATTC 
CTACAACGAAATTATCACCAGCTITrmCATTCTTAACTCm 
ATCTGTGCTGGACTTTAAAAAAACAATTCAGGACCAAATTITITCTCAGT^^ 
ATAGG 

SEQ ID NO: 1292 GGTACAAGCTTTTGTCCAAAATGOCACAGTGAGCACAAATGAGTTCCTGTGT 
GATAAAGACAAAACTTCAACAGTGGCACCCACCATACACACCACTGTGCCATCTCCTACTACAAC 
ACCTACrCCAAAGGAAAAACCAGAAGCTGGAACCTATTCAGTTAATAATGGCAATGATACTTGTC 
TGCTGGCTACCATGGGGCTGCAGCTGAACATCACTCAGGATAAGGTTCCTrCAGTTAlTAACATCA 
ACCCCAATACAACTCACrCCACAGGCAGCTGCCGTTCTCACACTGCTCTACTTAGACTCAATAGCA 
GCACCATTAAGTATCTAGACTTTOTCTrrGCTGTOAAAAATGAAAACCGATmATCTGAAGGAAG 
TGAACATCAGCATGTATTTGGrTAATGGCTCCarmCAGCATTGCAAATAACAATCrCAGCTACT 
GGGATGCCCCCCTGGGAAAGTTCTTATATGTGCAACAAAGAGCAAACTGTTTCAGTGTCTGGAGC 
ATTTCAOATAAATACCCTTTGATCTAANGGGTTCAAGCCTTTCATGTGACACAAGGGAAAGm 
ACANGCCCAA 

SEQ ID NO: 1293 Acri'ri'i'rri'rrri-rrrrrn'i'rrrnGGGGGAAAAGATATATATATATATATAT 

TCANAATTAGGCAGCTGGACTCAGTTTAAATGATCCCAATmGTTGGCAACATC 

AATCAGGAGCCAGTCGAACATATGCCrrCrrrnmCCATCAGGCCGAATCAGGGTGTTGACOT 

CCACATCAATGTCATACAGCTrCTTCACAGCCTGTTTAATCTGGNGCTTGTrGGCTTTAACATCCAC 

AATGAACACAAGTGNGTTGTTGNClTCTATCTrCTrCATGGCANACTCAGNGGTCAGCXjGAAACTr 

GATGATAGCATAGTGGTCAAGCTTGTTTCTCCTGGGAGCGCTCTTCCGAGGATATTTGGGCTGra 

CCGGAGTCGCANTGTCTTCGGCCGCCGGAAGGTGGGTGACGTGCGGATC'll Cl'l'Ciril'i GTGGCN 

GNGGGACACCrTTCAACACrcCCTTTTTGGCCTTTAA^ 

GGCANGAGC14 I C r ThrmNGTrrrNGGNGCCAT Cl i I I GAAAAGGCCCCNCGTACTTGGCCGNGA 
CCA 

SEQ ID NO: 1294 A Crn ' nTm - n ' iTl ' n ' l ' i i 1 U 1 11 ITl ' l AGGTATATCCATAGTCACCTTTATT 
CTGAATTAACCATTTATCAANAGTGCNCCTGAAAAGAGTANAAAAAAATAAAGGAGCCCATCAAA 
AAAAAGTTCCCTGGCAAGTGGGAGGGAGGACATGATGTTAGGAGCCCTGTTTGGGGAAGGAAAT 
GTTATCCAGGTCATGGATGCCATTTTTGTCAATGATTCGAACACTGAAGGTTGGCANATTCAGGAT 
GAAGCGTTTNTGGAGCTCCrCCAAACATrrCCTAAGGAGTTCCACTGCCCTCTCACGTGANATAGT 
CGGTGTGTAGTATCGGTCGAGGATACTGAGAGTCAGGAAGGCNCCATAGCCCGTOGGCTGCAAAA 
OGGGCCrrGGCCAAGGCTGCCAGGTAGTCCATGTAATACAGCCGCTGGCCCTrCATGCrCATCATA 
ACCAGCCAGGAGGAGGTTCACATGATATGGGGTCCCGACTCCCAAAACAGTCAGCCAGGnTCGG 
CGTGTOAANTTAACTNCTGCCOTGGGAAAAAATTCATmrCATTTCGCATTm^ 
Ci rn 1 1 1 IGAATATATmTGAAAACTGGGACCTTNGGCCCGGGA 
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SEQ ED NO: 1295 GGTACGCGGGGAAGTCGCTTGTGTATGAACGCAGCGGCGGACCTGTGAGGGG 
ATCCGACTTGCCGGCAOA.\CTrTAC0CTQC00GACCCCGGGCACTGTTGCrGCrGCGGGAGGTCAA 
TCATGCGAGGClXjTGACAACATGTGTAGCTACTGCArrGTTCXrmCACCa^ 
AGTCGGCCTATTGCCTCCATTCrrAGAGGAAGTOAAGAAOCTTTCTGAGCAGGGGCnXjAAAG^ 
GACACTTCTTGGTCAGAATGTTAATAGTnTCGGGACAATrCGGAGGTCCAGTTCAACAGTGCAGT 
GCCTAOCAATCTCAGTCGTGGCTTTACCACCAACTATAAAACCAAGCAAGGAGGACTTCGT^ 
TCATCTTCTGGATCAGGTCTNCAGAGTAGATCCTGAAATGAGGATCXGTriTACCTCT^ 
CAAGGATTirCCTGATGAGGTTCTGCAGCTGATTCATGANAANAGATAACATCTGTAAACAGATCC 
CXrCTGCCANCCCAAAATGGAAACNATCCCGTGTOTITQAANGCCATTGCNGAAGGGGATATTCAN 
AAAAA 

SEQ ID NO: 1296 ACGTCATAGAAATAGCAGCTCCCACCTTGTCATCAATGGTATCAAATTTGGTA 
AOAACAATGCCATCAATGAGCCGAGGTGTCrGAGCCATAGAATGGTCAGCCAAGGCTCTGTTGAA 
CTTGACCAGCnGGTCCACGGCTrCATTGCCTACTAAGGCTTCTCCTACAAACAGCACCAAATCAGG 
TGTATTGACAGTAATGAGTTTGGCCAOGGCAGTCATCAGAGGGGCATTaTCTTaCATQCGGCCTGC 
CGTGTCCACCAGCA<XACGTCAAAGCrrTGGTTACGTGCVU^GCAATGGCrrCCATGG 
CAGCAGCATCCTTGCCATAGCCCTTTrCAAACAACTGCACCATGGTGCGGCCACCATGCriXrrCT^ 
GAGGGTGTAGGGCACTCAAACGCCGGGTGTGTGTACC 

SEQ ID NO: 1297 GGTACTTTTmrmrrriTiTrr^^ 

ANCAAAGNGGCTCCAGGCCCTTCACGCCTbrrCAAACACCACCCATGAGGOTTTAOGAAGGTGCCA 

TCATTCrcTGAAGGCCCANAGCITACC(>AGTCTTCGAGCCCAAGTrGAATCACCAACCANAGGG 

TTGGGANAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGAT 

ATCAAQGGAATGCTGAGOTCCAGCAGTGTCTCCTGAAGGCATGCrGCATCCTAAGGCTCCTC^ 

ACTGGATGGAGTAGGAAATCTGTGTGTTGANCAGTTCACATNTATATGGCAACTTTAAGGAGGCG 

CTTGATGTCAGGCTCAATOTTGATGGTTOGOAAGGTGCGGCTOTAKCGTCGGAAGOGCTCTCCCTC 

CGGCCCTATGAGGAACTTNTCAAAGTrCAGGCCACATNTGAGCGGGCCNCAGGGCTCCAAATGAT 

GAACITGGGATCGG 

SEQ ID NO: 1298 AATTCGCCCTTAGGCGrOGTCGGCOGCCGAGGTACAAAGGCAAAAGGACCA 
CAGCACCACTTAGGTGTAGCATGGATTTTAAACTGCAGTCAGTATCAGATCCTGTrTGATAAATAA 
GCTGACTGTTCTCTCTTGAGAACCTGTGGCCTCAACXAGGCACCAAGCTOATGTGGCCCAAGTC^^ 
TCTCTTGGTCTTCTCCTTTGAGGCACAGCCTATTrcrGAGCCAAGGGTTGGGGAAGCCTGTCTAGA 
TGTGGGACTCATTGCCCCAAACCAGGGAGAGGAAGAGCTCCCACAGGGAGAGCCCAGGCTCTCrT 
TGCAOiXTTrCCCAOTTTGGTGTITAAGCAOTGCCATGTTCCTTGTrTGACAACAAGACAGTCT^ 
AAGTATTGCTCrrAAAAACAATTAAAAAGAACCCTTrCATATTGGCACCATrGCCTTAhrrCCT 
TGGGTTGGTCrrCAGCCAGCATTCTGGTOGGAGTGACTGGCArrAACAAGACTGGAAATCGGGGG 
TCAAAGTAAAATATCrrrcrnTGCmCATTCCAAAAGTAATGAACCAGCTT^ 
CCCAACAACACTirrGGTCTGNGGACTGCTGGGTGAATATrCANAAAGGGAAGTAAGTNTTCAGG 
GGGGTAAACANGNCmCCAANArrmGAATGGrrCCAAACCAArrAATNCACATrGCCAATTTCA 
AAA^TAAAACA^^^CCCT^GCTrNGAT^T^ACCC 

SEQ ID NO: 1299 GGTACAATGTGGGGCAGGCACCAGACTGGGCAGGGAAAGGCCTCGGATACX: 
ACCATCATAGCAGAGACCAGACGCTCATCCTTGGTTTGAGACATGAAGTCACAGGAACTGAGATG 
GGdTCCCACATACCAACCACTGGGAGGGCAAAGGTGGGGAAGGGCACAGGCTAAAAATTAACA 
AGGTGCCCAAGGTAAAGGGCAAGCCCrraTCAGCCTGGGATACTGTCTCCTACTCCCAACCnTGG 
GCCCAACAGAGGAACCAGTTGAAAAGGAGGGCCAAAGACATTGCAGTAAGTAAGCAACAGGACA 
ATGAACTCCATGTTGCCCAGATCCCACTGAGAGTGAACGTGCAGTCATGCCCATAACCGACACAC 

atcccagtccatgtgggtcagtccttcatcacxctccctgcttctgacaacagcagactccaccat 

tccattatcattcacagccx;aacccaaacagtcaagtggcttgaagaaaganaaatcanotatac 

tctatotccacatataccnrmctgccacaaggotcaccaactggcangatrccccccanccccca 

acttctnctnattcctttgrrcccaagcngggtcaattgggagahntgaccaaaaatj^ 

tncngaaagggtattccaaaatgggggaoaactccaocaoaccttaagogagaagaaatggcct 

TNTT 

SEQ ID NO: 1 3 00 ACACTCCAGATATAACTGGGACTTCCrGTGTAGATCTGAACGAGTGCAACCA 
GGCTCa:AAACCCTGCAATTTrATCT0CAAAAACACAOAAGGGAGTTACCAGTGTTCATGCCCG^ 
AAGGCTACATTCTGCAAGAGGATGOAAGGAGCTGCAAAGATCTTGATQAOTGTGCAACCAAGCAA 
CACAACTGCCAGrrCCTATGTGTTAACACCAnGGCOGCTrCACATGCAAATGTCCrCCCGGATTT 
ACCOW\.CACCATACXiTCCTGCATTGATAACAATGAATGCACCrCTGACATCAATCTGTGCGGGTCT 
AAGGGCATrrGCCAAGAACACTCCTGGAAGCirCACCTGTGAATGCCAGCGGGGATTCTCACrrG 
ATCAOACCGGCTCCAOCTGTGAAGACGTGGACGAGTGTXjAGGGTAACCACCGCTGCCAGCATGGC 
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TGCCANAACATCATTGGGGGCTACAGGTGCAGCraCCCCCAAGGCTACCTTCA(X:ACTACC^ 

GAACCAGTGTGTTGATGAAAACGAATGCCTCACCGCTCACATATGCGGAGGAGCCTCCTGTCACA 

ACACCCrOGOOAGCTACAAGOTOCATTGTGTNCCCCCNGCrrTCAAATATGAACAGTTCAGTOGA 

AGGATGCCAANANATCAATGAATGGGGGrn^GACCAAGNCCCXn'GCANCTTTrGGNTGGTrCC^ 

ATACCG 

SEQ ID NO: 1301 acgcggggctctttccctgccgccoccgagccgcgcggaggcggaggcttgg 

GTGCGTTCAAAATITCAACTTTAACCCGTAAACCCACCGCCTTGGCCXjAAGAAAGGCTT^ 
GGAQGT 

SEQ ID NO: 1 302 GGTA crn ' i ' iTri ' rn 1 1 itititi riTrrfi Ti AGGA AAATCAATG'n'i'ii;ri'iA 

ATGCTGAGAATTTTTGTTAATATTTCnXjACATTTCATAAAACATCTmGTTGA 

ATAGCATCCAATAATOTCCCAATACTrrCTTCITOTa^AGAGAGTTTATATATC^^ 

TAATATTCATTATTTCAGGAAGATTmCANANAAACCTGATGACCTTCTACTTGAGATATTAGGT 

ATTCTrGATTTATTTGTTTTTCnx:CCTCTrCAATGTAAACAATTAGA^ 

NATACCAAATTTTTTCAAAGCCTCTGAAATATTGTTATrrGGGGAAAGOT 

ANATAGAGTTCTTCTCTTCATTTTTCCCAmTTGTANAGGTGAAC^ 

ATCTGAAATGGATCAACAATCCTOTAOOATTTATCAGTGATCCATCCGATGGTGCCNTCCATGGCC 
TTTCrrCTCAAGTCrCCCGCATITTTACATCmAAANAACAGAAGGG^ 
AAAAGGCCAGCTGArrGTGTTNAa;GCATTTCA(XANANTANCCANGGATITrrCC^^ 
AAGTGCGTCCCCCCNG>nTCCrrnjCCCCGGGCGGGCCGCTITGAAAN 

SEQ ID NO: 1303 GGTACGGGAGTTTCITGGTAAATCCAGAATCAGGATACAATGTCrCTTTG^ 
TATGACCTTGAAAATCTTCCGGCATCCAAGGATrCCATrGTGCATCAAGCTGGCATGTTGAAGCGA 
AATrGTTrrGCCTCTGTCTTTGAAAAATACTrCCAATTCCAAGAAGAGGGCAAGGAAGGAGAGAA 
CAGGGCAGTTATCCATTATAGGGATGATGAGACCATGTATGTTGAGTCTAAAAAGGACAGAGTCA 
CAGTAGTCTTCAGCACAGTGTTTAAGGATGACGACGATGTGGTCATTGGAAAGGTGTTCATGCAN 
GAGTTCAAAGAAGGACGCAGAGCCAGCCACACAGCCCCACAGGTCCTCTTTAGCCA CAGG GAACC 
TCCTCTGGGGCTGAAAGACACAGACGCCGCrOTGGGTGA(>ACA7TGGCTACATTACCTTTGTGCT 
GTTCCCTCGTCCACCAATGCCCAGTGCTCGAGACAACACCATCAAACCTGATCCACACCGTNCNG 
GGACTACCCTGCCCTACCACATNAAGGGCTCTANGGGCCTATNTTCACACACGTTATGC 

SEQ ID NO: 1 304 ACTGGATGGCCCCACAAOATGCTGCCACnTAATAAOGCTGCAATACACTOT 
GTATCTTACAGGAGTATTCITATCCATCCajTGGAAAAGGTTGCTrAACAACTGCAGTCTCAGAGA 
CGGGCGTTCACCrrCGCGAAATTTGACCAGCrmCACATAGGCTTTCAATCAAAGOT 
TCTGGTTCCAGGATCAAGAGTAGGGATACCACACTGTTCATCACACrrTCAACATCTrrATCATCC 
TCCrrCAGACACACATCACAGGCTrcAATAATTTGAGCTAAATCAACATGAAGTCCACCTTCCGAG 
TICrCnTCTOAAATCTCAGCTCCmAGATITCAGATAAGCACGAAGCTCAGCAGCCTOA^ 
CACTGATGTCGATGAAGGCX;GGGACGCTCATGGTGCAGGCCCGGGCACAAGCXjGACACTCCACTC 
GCAAGACCACGCCGACCGGAAAANGGAACTGCCCCGCGTACC 

SEQ ID NO: 1 305 GGTA Cl ' l - lTrrrrn - ri ' l ' l I ' l l 1 1 11 lOGTNTTAATGATGTTTTTAATTGACAAT 
AATIGTAGATATTCATCGGATACNCAGTGATGTTTTGATACATGGGATGTCTAGTGATCAAATCAG 
GOTAATGAGCATATCATCATCTCAAGa}mATTATGTGTGTTOAGAATGTTCAATATCCTCCITCT 
TGCTGCTTGAAACTATATAATATATGGTAAGGAGTrTGTTTITCCG^ 
ATTAGGATCTGTGAGCNGNGTTm'CTAOTTATTCTGCTTTATmCCrCTGCTCCT^ 
GATAGCTGANATrTCCTCCGGATGGGCANAAGCrCCCCTGCCTTATGGCTTCCGNAGCTGAACACT 
GGGTCnrCACTGAGCNGGATNAAATGANTGTGGGTNGTTGCTTGGA 

SEQ ID NO: 1306 GGTA Cri ' l ' 1414H ll tlll41Unil ' lUlll ' l GANACTGTTCTnTA'n'CTGAGTCA 
ATAAGTAACATGTGATACAAACTCAAGATTTTATTAAAATCATTTGTAAGTTAAATTANCATATCT 
CGGNGGTGAAGGAAGACmTAATAAAGCAAAATAATGAGTTAGGN GAATC AACAGAAAGAGTG 
AAAAGNGTTAACAGTGACATTTAAATGGTTNCATGATTTTAATTATTCTITrcATA^ 
TTG'rGATATCTATTCAANAAAAGCCAGAGCCCACTATGGAATTCGCAGTTGAATAAAAGCTITGTr 
CCCTGCTTTTGGTAATGTTAAAGAAACAAATGGAAATGGTTTAAAGACCTTATAACAATCT 
CTAGCAAAAGAATCTAAAAGGTAAGGNGGGAAGCATTAGAAAGAGNGTGAAAAGAAAAATNAA 
AAAGGCCTNACACGCANCATTGGGTTrrATTTOATA 

SEQ ID NO: 1 3 07 GGTACTTGCCCACTTTTCaxnTGTGGGCCTGTrCCTAGAATCCA 

GTGGGTGACATCAGCCroCTACTGCGGGAAGAAACAGAATTTTATGGTGCCCAAGATAAAATCGA 
CTATGATGATGCAGAATGAAGACCTGGAGAGGAGTCAGAGATTGCAGGGGATAAAAAGAGAGAA 
ATCAGAGATGCTCCCACAAACAGAGAAAACCTTCAOAAGCAGCAGGAAGGACAGAOTAAATGGA 
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GAACAAAAACTCAAGTCAAAATAGTCAATGGAAATAAACAGTmGAATTCCAGTTATAATTTGG 

AAAAOAAGCAAACOTTrATTTTAAATTAAOAAAAAAGTCGCTTATAATTTCCrGAAGT^ 

AGTOAAAAATAGTTAAACCTCXn-GTTGTTAGCTAAGAAGTGTCrArrArrATAAGTTTGTCGCTGG 

AGAGAACATAAAAATCTCTATCTATTGCTA'l'rri-l'rrrrri-l'i'ilGCATAAGGGGATAGATAAAAG 

AGTGGTTTATTTTATn'AGCCmGGCCrGTAAAGTAAGATTrArrGAACCAAAGGGCCATCNGGA 

AATGATAAGAATTTTTTCCCTAATTTCXn'AATmmA AGCTAAA OCT^ 

CTAGTGCCCCATAT^A^^^AQCAACAGTCA^fNCC^T^CAT^Tmcf^^ACCT^ 

AAATCC 

SEQ ID NO: 1308 ACTAGGAAAAGATACAGTrACCAAAGATAATGCCATCCGTCCTTCCTCACTG 
GAGCAGATOGCCAAACTAAAACCTGCATTCATCAAOCCCTACGGCACAGTGACAGCTGCAAATrC 
TTCmCTTGACTGATGGTGCATCTGCAATGTTAATCATGGCGGAGGAAAAQGCTCTGGC^^ 
TTATAAGCCGAAGGCATATTrGAGGGATmATGTATGTGTCTCAGOATCCAAAAGATCAACTATr 
ACTTGGACCAACATATGCTACTCCAAAAGTrCTAGAAAAGGCAGGATTGACX^ATGAATGATA'n'G 
ATGCTTTTGAAriTCATGAAGCTTTCTCGGGTCAGArmGGCAAATTTTAAAGCC^^ 
TTGGTTTGCAGAAAACTACATGGGTAGAAAAACCAAGGTTGGATTGCCTCCriTGGAGAAGm 
ATAACTGGGGTGGATCTCTGTCOCTGGGACACCCAmGGAGCCACTGGCTGCAGGTTOGTCATGG 
CTGCTGCCAACAGATTACQGAAAGAANGAGGCCAGTATNGCTTAAATGGCrOCGTOTGCAACCTG 
GAAGGGCAGGGCCATGCTATGATANGTGGAAGCTTATNCAAATAATTAGAATCCNANAAGAAAG 
TGACCTGAAGTTTCTGTGCAACACTCACACTAAGGCAATGCCTrrCATNCCTTACTAAATGACA 
GGT 

SEQ ED NO: 1 309 A CIlU ' llll i nU - ril - i - l^ - l - l - ll ' IU lN GCTGGGGTATTCA TITCTG CATGTATAG 
CTTTATrAATTGCTAATGAAAATTANAACTmCTGGGATCTTCTGACAAGA'in'lU 
AAAATGCCTTITCTrCAGTGAAGCCV^TCTTTGGAGTTAGTCATrACTCTCACOT 
GACTTCAAACTGATATrCCTCTTCTTTTGGTCCA>M.CCCTCAAATmAAAAGTAGOT 
GGAAAGGTCATTTTTCCACAGTTCAGTTCTCTGAAAAACTTCCATCTCCCACTGAAAGTCACAGTC 
CAGGAGTGAAGTAATCACATGCTAGAACATCAGGGCCAATTGGAAAGTCATTATGAACACTTGCA 
TTOGTCGATCTTATTTATCACCACAAGCXTGAAAATGCAATOTCCTGAAAAAGGTGACCrCTCTGT 
GCACACCGTAATTTTTAAAAAGGAGAGGGTAATATCAAGGGGACTGAGGCTTGATCACCAAAAAT 
CAGCNCATGGAAACAAACAATAATGAATAATGAGCNCTANAATTCAAATTNfCCAGAT Gm 
NAAGATGGGGGNGCCAGNTTTmATTCCGTITGGACCCCNCATrTAC AAAA AAA 
AAATAAAAAANGGTTGGGGGGGAAAGGAAAAAAAAnAAAANGAATTTTCrrAACITGrrG 

SEQ ID NO: 1310 GGTACGAAAACAGAACCAATCTAAAAATGGCTGATGTTACTTTAGGA GCCT G 
AAAAAAACAGGAGATCCITGAAGACCCAGCCACCCCTTCTAGAATGTTCAATAAGGGCACCmC 
CAAAGCTACTAAGCAGGCACTTGGCATTTCAGGAGTTTGTCrrATGGTTGCATAAAAGTATCCOT 
TCACCCAGACCTGACACCCrrrATGGTTCAAAGTTAAGAACGGOAAGAATGGGTGGCAAGGTGGC 
TCCTGGAAGAGCTCACCCAGCACAGCTGCCCTGAGCTCGGGGCCTTGGTITCrcTCCCTGGGGATA 
TTTATATTTAATAAATTmATATAAATACACAGAGAAATAGAAAATATAAAATCTGAGGGOTrGG 
GGAAGAAACGTGAGGGACTCAGCAGGAAGCCAGAGCTGGAGAGGTCCATTCAGGCCAAC TTGA C 
CTCCTCCTTGACCCCmGGCITCTGGCTTCmXrrCCTrGGT^ 
TCTCCrCTGGGTCTTTCrCTITmCATNCTCTGTTTTGACTCrC^ 

GAimrmGCGGGCCCCCNTTTNCCl'lC'l 1' 1 ICAGAAATCGGGAGAACTNTTTC CTCAC ANGCCA 

ArmC>mTGTCAAAAAGOANCCAGAATCGANAATGCNCCTT0TCANGGGTCGNTm 

TN 

SEQ ID NO: 1 3 1 1 Acr rn T i Ti- nTi - n - i ' iiTi - fi itiiin ggaaagngtaaatttatttaatacca 

AATGTTTCCTAAGTTTACTTTTTGCACAATGCACANACACTITTGGOACAATGTATGOT^ 
TAATATTCTACC^ACAGCTCCATGCTGCTCTANATCATCCTGAAGGAAAAAGTAATGCAAAG^^ 
GCrQGGTCTCCAAOTCCCAGGACTGGCCOATAGCCTCTCCCAGTGGATAGCAOGTCCTAATIXrrOT 
TCrrCGAGCANATTCTCTGATTTCyiGCTGGGTCTrCCTGATGCTCrCC^ 

TTCTTTTAAAAAAGCCACACTrGAACAGGATGACCAGAATCACGATCAACACCAGAAGTCCACCA 
ACGCTGCCTTTAATGATGATAGGCAAAGAATGGTACC 

SEQ ID NO: 1 3 12 GGTA Crn - nTrn ill 1 1 1 i n n ) i'l n U l U i -NGCATnTANACTCCCACAT 
TAACTTGTOTAGCANANAAATAAAATCTTAAACTCTATGCTAATAGTTATCCAAAAmA 
ATGACnTKKjTTTATGGATAGAACmAACArrAAATGAO^CTGAAAATACrCAGGCCC^ 
GCCrrArrNTT^AACANACmAATATTTTTCTCANAAATTAGATGAGGGCAT^ 
AAAATTCTCAAATAATACAGTNAATAGAATTTTCrAAAAAATACCAmATOACT^^ 
ATATAACATCTCAAGCACTANAATTAACAAATGCNGAATTAATGT 
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SEQ ID NO: 1 3 1 3 ACAAAGTAATGACCATAATACCrTGTCTAAATCAGTAGCAGGGCnTCATQGC 
TAGGAAACTTTmAAACAGTAAAATGTCATTAGGATAGATCACTCrCTCACCAACATTAAATGTG 
GCAGAATTAACCAAAAAAAAAAAAAAAAAANGAAAAAAAGATTTACACGTAGATATAGCAGTAA 
CAGAAAATTACAAGGCAGAGAACACAAAAGAATCAAGACAACGCTATAGGrrCAACAGTATTCC 
ATGAGGCTAGAAAACCGGTrAACrTTGAAAGCCCTAGCAAAATATCACTCTTGTTTATAAATC^ 
TATTTATCATATGACCACACGTTGAATCCrnGTCTITCCACATACrGAAGAATCCTACA 
TTAAGCATTCTGCATTTCnTrGCCAATCrrGTAGCTGAGTATCTTGTTCCAG^^ 
GTAACTGGAAGCTGCCG GCGCA AGGCCTTGAATGT GGTAT CTGACATTGTTTGATA GTTTT CACTA 
ATTGCTGGCTGATACrCATTTrCTGGATTCTCTATGATTTTAAATAAACTCCI^^ 
TTTCATTCGAAAACAGTTAGTGAATCCTGTACCTTNGGCCGCGAANCA 

SEQ ID NO: 1314 ACrnTl'lTi-l I rJ'l i rn'i-l"lTl'l"l rrriGGGNGGTCAGGGnTATTAAGTANA 
TTCAGTCTTATATATGATATACAGATTCAAAAGGAAAAGGATTCCACATACTTATGAAATAAGTGC 
ATTTAGCAAGTAATACCATGGANATAACAGCTCX3ATTAACAGCTCATTGGTCTriTGTrAAGTG 
GGANANAGGGAGTCGGCCTGIGCTAGTCATTCCACAAACAAAACAGAATITAOTCACATNACOTA 
GANAANACACATTCTAAGAATTAAAATATTAAGCCACATTTACTGGNAAAACrCACTATNTAGAA 
CACAAATAATTmATAGAGAGCATTGAGGAGGAAGTCCTCACCTGCTTANAANAGCCCCQTrTAA 
GTTGCATAGGNGCATAT^rIXlTAAAAATATAATTTAGAGGTGAAGCCATTAATAAGAGTCCCm 
AAAGTrGGNGCCATCCACrTG-nTGCCTTAAGNCNTACTATCrGGA 

SEQ ID NO: 1315 ACGCGGGAnrCGCCATGGATGAGG ATGGGGACGAGAGCATTCACAAACTGA 
AAGAAAAAGCGAAGAAACGGAAGGGTCGCGGCTrTGGCTCCGAAGAGGGGTCCCGAGCGCGGAT 
GCGTGAGGATTATGACAGCXiTGGAGCAGGATGGCGATGAACCCGGACXACAACGCTCTGTTGAAG 
GCTGGATrCTCTTTGTAACTGGAGTCCATGAGGAAGCCACCGAAGAAGACATACACGACAAATTC 
GCAGAATATGGGGAAATTAAAAACATTCATCTCAACCTCGACAGGCGAACAGGATATCTGAAGGG 
GTATACTCTAGTTGAATATGAAACATACAAGGAAGCCCAGOCTGCTATGGAGOOACTCAATGGCC 
AGGATTTGATGGGACAGCCCATCAGCGTTGACIXKjTGTriTGTTCGGG^ 

AGGAGAGGTGGCCGAAGACGCACCAAAAAGTCCAGACCGGAGACGTCGCTTGACAGGTCCTCTG 
TTGTCCAGGTGTTCTCTTCAAGATTNCATTTTGACCATGCACCTTTGGACAAATANGACTGGGGGT 
GGAACnTGCTGGGTTATATTTAATCCTTTTACCGGATATGCCTAGTATTITGAGTTGCNAAATAAAT 
GTTCCANTnTTGTTTTAATAAAAA 

SEQ ID NO: 1316 ggtactgccagattcgtctaaatgtctgtcatgtccagatttactttgcitctg 

TTACTGCCAGAGTTACTAGAGATATCATAATAGGATAAGAAGACCCTCATATGACCTGCACAGCr 

CATTTTCCrTCTGAAAGAAACTACTACCTAGGAOAATCTAAGCTATAGCAGGGATGATTTA 

ATTTGAACTAGCnCl i rGTOACAATTCAGTTCCTCCCAACCAACCAGTCTTCACTTCAAGAGGGC 

CACACnXjCAACCTCAGCTrAACATGAATAACAAAGACTGGCTCAGGAGCAGGGCTTGCCCAGGCA 

TGGTGOATCACCGGAGGTCAOTAGTTCAAGACCAAGCCTGGCCAACATGGTGAAACCCCACCTCT 

ACTAAAAATTGTGTATATCrnGTGTGTCTTCCTGTTTATGTGTGCCAAGGGAGTATI^ 

TrCAAAACAGCCCAATAATCANAGATGGAGCAAAACCAOTOCCATCCAGTCTTTATGCAAATGAA 

ATGCTGCAAAGGGAAGCAGAATCTGTATATGTTGGTAACTACCCACCCAAGAACACATGGGGTAG 

CAGGGAAGAAGTAAAAAAAGAAGAAGGGAGAATACTGGGAAGATAATGCACAAAATNGAAAGG 

GACT^^mTAAGGATTAACCTAGCCCCTT^AAGGGATTAACTANTT^AAGGATTAATAGCCAA 

SEQ ID NO: 13 1 7 ACTGCACCAAAGCCTGGGCCTTGGCCTTGAGCATTCXJAAAGCCCACGGTCrC 
CTTGGCATACATACACAGCAGAAGGTTGGCCACrCGGGTGATGGCrACACGGCCCTCCATGCAGT 
CCATGAGGATGAATTTGAGATIOTCTrCATTAAACGCTTGGTTCCCGTTCCGGTCGTAGGCGGrc 
AGATGTTACTGGCTATGGCAGCXKjTGACCCGGGCGTCAGTGTCCCCGTAACCAGAGTAGGCCAGC 
AOTGATCCCTCGTTATTCAGCAGCAGGGTGCTCTGQACGCCTCCAGTGTrGGCTIGGCTTANCACC 

tgggtcaaagccttggggcgcagcatgcctacggttcctaaccctgggcttttgcacccagatctc 
cgaggtgcctcccgccccgcgtacc 

SEQ ID NO: 1 3 1 8 AC'l'l'l'l'l i ri"ri'l"n"rrnTi-|'l-l'i'lTriCGGGAGCANATrGGGTAATAAAATGT 
^^^NTTGANAATAANACGGCCTmGACCTTTTAGGGTCTAGGGCTATAAAGTGT^ 
GCCGAACGANCCATGAACTGGGCTGGOTTTrTATNTTTGATGAAAAANANCCTAAACNCTT^ 
TTTGGGATAAANAAAAAGGAGCArrAACCTTGACTATGTCTTrANCTCCAGCCACCTTITrAA^ 
TAAATTGCTGGGCAGGTGGGGGAGGGCTANTCACGGAACGAAACTGTAAGCCGGACCAGGTGTG 

aggaggggaqotgataaaaagattacagggtggaggagtggancctgaggaaaaattgggacct 
ancttcgckrggaaaggagggganaggtcaaatgggtttgtaaaaaaggaagattanacacact 
cccaacgcctggggttoggactgaggggacaggtoggaggqaaaaaaggaanatttgggacnag 
ttgcactgggcacananactaggaagggacx:ggatgtgtnaaaaaatgccttggacattaagccc 
ctcaaaccattttccccatititrgacaaaaantatttanogtcrttgtagggat 



196 



wo 02/29086 



PCT/US01/307J2 



AAAAGGGCCmTirrrGCCCNTTTAAAANCCAATrGGCAAANTTTTf^^^ 

SEQ ID NO: 1 3 19 GGTACGCGGGGAGGAGTGAGAGAGCTGCTGGATATGCGGAGGOACTGGGCG 
GGTCGGOTCmAATGGAAGAGGTCTGTGAGAAGTTAACCTGGTGATACCGATCCGAAGAGCCTA 
TCAAGTGAAGCCCCCTGAAATACGGAGAATAAGAATCTTAGAGGTTGTTCAGCAGAAGTCrTGGA 
GTGCATTTTCAQTGGTrAAGGTGAAAAAATGACTACTAAAAAmAGAAACCAAAGTCACCGTTA 
CTTCATCCCCAATOCGAGGAGCAGOAGATGGAATGGAAACTGAGGAACCACCTAAATCTGTTGAA 
GTTACCTCCGGAGTCCAATCTAGAAAGCATCATAGTCTTCAGAGTCCATGGAAGAAAGCAGTTCC 
ATCAGAGAGCCCAGGAGTTCTTCAGCTAGGGAAAATGCTCACTGAAAAAGCAATGGAAGTTAAAG 
CTGTAAGAATATTA^^^TCCCAAAGCTGCTATAACTCATGATATCCCCAACAAAAAATACAAAAGG 
TTAAGTTCmrrGGGACATCATAAAGGANAATTTCCTTGGTCCANTrCAGAAGGGAGTTATOGAANC 
CTTATTANGGAACTCTCAGAAGTTAAAGAAATGTNmGGAAAAANChrrCAAANAATTCTNA^ 
AAAGTTACTACCGGAACAAAAAAANGTT 

SEQ ID NO: 1320 GGTACTOACAAAGTCCTGAAACTACAATGAGAGGAAACACATTGCCCTACTT 

cgggataagtcatgactoagactcaatrrcagagacgctctatgaacagaggtgcttgaagccnc 
agtggcagaanggaaaqatggggaagtgtgccnaaaagcctccaggcatgncagacagtccctg 

ACCAANCNCATGTAACANGCCXnTrGGGTCimGCITCrCACrGNAANATGATGAAGC 

gatt 

seq id no: 1 32 1 aattcgcccrragconootcgnggcgqagatacnqcggggctcactctgcgc 
trcaccatggntotcattgccaagtccttctatgaotcagtgccatcancctggatgc^ 
ttaaatttcaatacgnccggggcagggccgngctgattgaoaatotggcttccctctgaggcaca 
accacccgggacttcacccagcmaacnanctgcaatgccggtttnccangcgc^ 
tggcttnccitgcaaccaamtggacatcagganaantctcagaattgaggaaatcctgaacan 

CTCAA 

SEQ ID NO: 1322 ACGCGGGTCCTTGTCCAGTGAAACACCCTCGGCTGGGAAGTCAGTrCGTTCTC 
TOCTCTCCTCTCTTCTTGTTTGAACATGGTGCGGACTAAAGCAGACAGTGTrCCAGGCAOT 
AAAAGTGGTGGCTGCTCGAOCCCCCAGAAAGGTGCTTGGTTCTTCCACCTCTGCCACTAATrCGAC 
ATCAGTTTCATCXjAGGAAAGCTGAAAATAAATATGCAGGAGGGAACCCCG'nTGCGTGCGCCCAA 
CTCCCAAGTGGCAAAAAGGAATTGGAGAATTCTITAGGTTGTCCCCrAAAGATTCTGAAAAAGAG 
AATCAGATTCCTGAAGAGGCAGGAAGCAGTGGCTTANGAAAAGCAAAGAGAAAAGCATGTCCTT 
TGCAACCTGATCACACAAATGATGAAAAAGAATAGAACmCTCATTCATCTTTQAATAACGTCT 
CTTGTTTACCCTGQTATTCTAGAATGTAAATTTACATAAATGTGTTTGTTCCAATTAGCm 
AACANGCATrrAATTAAAAAAATTTANGTTTAAATTTAGATGTTCAAAAGTAGTTGTGA 
GAA TTrTGTA AGAACTAATTAhrrGGTAACCTTANCmAGTATTCAATATAATGCAATO 
GTTTCTTTTTACCAAAA 

SEQ ID NO: 1 323 ACCAAAAGAAAAAGAAAAGGAAAAGGTITCTACTGCTGTATTATCTATAACT 
GCCAAOGCTAAAAAGAAGGAAAAAGAAAAGGAAAAAAAGGAGGAGGAGAAAATGGAAGTGGAT 
GAGGCAGAGAAAAAGGAGGAAAAAGAGAAGAAAAAAGAACCTGAGCCAAACTTCCAGTTATTGG 

ataacccagcccgagttatgcctgcccagcrraaggtcctaaccatgcxxigagacctgtaoatac 

cagcctttcaaaccactctctattggaggcatcatcattctgaaggataccagtgaagacattgag 

gagctggtggaacctgtggcagcacatggcocaaaaatcgaggaggaggaacaagagccagaac 

ccccagaaccatttqaotatattoatgatraagggccaaaggatctccitgcrratctgaagaan 

a rrgt ccagctcatatrgggaatgcttttgagggaaattcatgccgagacctgcttttcaatgcat 

GTTTTCGTrGNCCTCT 

SEQ ID NO: 1 324 GGTACACAGTATTTTTTrATATCTATGTrrTCTGTTCTCTGGa3CA^ 

TGCACCTGGGTGCrn'CAGGCCTTGACCAAATAGCATGAATAGTAGGAAATGAGAACnTCCCrCT 

GTCAGATCTrCACAAAAACTTTTQTTTTCACTACATTCTTTOOAGTGTAGATTAGC^^ 

TAATTTGGAAAAAGAGCCCAAGTGTATTAAGTAGCGGTTITAAATCTTCTTTGTAATCAGAGAACA 

ACTGCATGAGACCTACTGCTAATCCAAACAGTCCACCTGrmXrrOCAGCACCATAGCriTATATT 

CTTCTTCAGTGGGACXAGTGTAATrATarTCXAGTAAATATCTAGGCCTTGTC 

CCAAAAGCTGGCGGGTAAAAAGCTTCACTGCATCTTGGGTGATCAAGGGGTTAAGACTTTCTCCA 

AGCCAAGQAAATACACCOTAATTrGGCAGAAATTGATGGACAGATNGGATTCCATAAGAATGCTG 

GNGGGCCACTGGAAANCCACNTCGGAAGTTnTGAGTTG 

SEQ ID NO: 1325 ACGCGGGGGGCAGCAGTGGACCTATGAGCAGAGGAAAATCGTGGAGTTCAC 
CTOCCACACAGCCTTCrrCGTCAGTATCGTGGTGGTGCAGTGGGCCGACTTGGTCATCTGTAAGAC 
CAGGAGGAATTCGGTCirCCAGCAGGGGATGAAGAACAAGATOTGATATTrGGCCTCnTrGAAG 
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AGACAGCCCTGGCTGCTrrCCTTTCCTACTGCCCrGGAATGGGTarTGCTC^ 

CAAACCTACCTGGTGGTTCTGTGCCirCCCCTACTCTCrrCTCATCTTCOTATAT^ 

AAACTC^TCATCAGGCGACCGCCCTGNCGGCTGGGTGGAGAAGGAAACCTACTATTANCCCCCC^ 

TCCTGCACGCCCGTGGAGCATCANGCCACACACTCTGCATTCGACACCCACXCCCrCTTTNTGTAC 

CTTOGGCCGCGANCACXKrrTNAQGCGAAATTCCAACACACrTGGCNGGNCGl^AC^ 

CCCANCTCGGGA 

SEQ ID NO: 1326 GGTACC ACCGCAAA GOXrrG TGAGCGTCTACAGACAGCTCACCATTnTGTCC 
TGTATCTGTAAACAC IT ITl GTTCTTAQTCM'l'i'l4 Cl'l GTAAAATTGATGTTCTTTAAAATCGTTAAT 
GTATAACAGGGCTTATGmCAGTTrcTTITCXGTTCTGTTrrAAACAGAAAATAAj^ 
AGCTCCTTTTXrTCAmCAAAGTTGCTACCAGTGTATGCAOTAATTAGAACAAA 
AGTAGAACATTTTATrGCCrAGTTGACAACATTGCTTGAATGCTGGTGGTTCCTATCCCrrTrGA^ 
TACACAATTITCTAATATGTGTTAATGCTATGTGACAAAACGCCCTGATTCCTAGTGCCAAAGGTT 
CAACTTAATGTATATACCTGAAAACCCATGCATTTGTGCTClllllHi-ll'riATGGTGCTTC 
AAACAGCCCATCCTCTGCAAGTCCATCTATGTTGTCrrAGGCATTCTATCITITGC^ 
AAQGATGGNOATTTGTITCATOQOTTmGNAriTGAGTCTAAANGCACGrrCrAACAi^ 
GGCAATGCATTTANTTGTGTTGCCC 

SEQ ID NO: 1327 GGTACi'1'1 ri rrii'riTrj rriM'iTiu'ii'rivi'i'i'ri CAGCAAAATAATmATrr 

CC^AACATATGGTAACATATACATCCAATATGTGCTCCCCT^GCACAT^rrATTCACGAGTGACTTC 

CAAATGAOU^CnXjCTTTGATATTTAAGCATGTGCTAAAAGTrATCTTAGrroANATATGAA^ 

CTTTANATGGATAACATrCTGAGTATArrGGATrAOTCACAGCANAArrTACTrTAQTrANATGAG 

TTCTACAAATTTAAAGCTTTGAAAAGCTACTACTmACTTCTAATACATCCAGATG^ 

TAGCAATATCAGCTTGTATTCCANAGAAATCTCATTAGTTTTTCTGGNGATGGAACCACTrATCC^ 

CGTTTGTTGGT 

SEQ ID NO: 1328 AOCAGCACACCGGCGOXjTCXTGGACrcCGCCTTCTACGATOCAACG^ 
CIXKjAGTGGAGGACTAGATCATCAATTGAAAATGCATGATTTGAACACTQATCAAQAAAATCT^ 
TTGGGACCCATGATGCCCCTATCAGATGTGTTGAATACTGTCCAGAAGTGAATGTGATGGTCACTG 

gaagttgggatcagacagttaaactgtgggatcccagaactccttgtaatgctgggaccttctctc 
agcctgaaaaggtatataccctctcagtgtctggagaccggctgattgtgggaacagcaggccgc 
agagtgtrggtgtgggacttacggaacatgggttacgtgcagcagccgcanggagtccagcctga 

AATACCAGACTCGCTGCATACGAGCGTTTCCAAACAAGCAGGGTTATGTATTAAGCTCTATTGAAG 
GCCGAGTGGCAGTTTGAGTATTTTGGACCCAAGCCCrGAGGTACC 



SEQ ID NO: 1329 ACTGGCCCCAAAGGGAGCTTCAGGCACCAGGGAAGACCCTAATTTAGTCCCC 
TCCATCTOCAACAAGAGAATAGTAGGCTGCATCTGTGAAGAOGACAATACCAGCGTCGTCrGGTT 
TTGGCTGCACAAAGGCGAGGCCCAGCXjATGCCCCCGCTGTGGAGCCCATTACAAGCTGGTGCCCC 
AGCAGCTGGCACACTCAGCACCTGCACTAAATTACTCAAAATGTGCTG 

SEQ ID NO; 1330 ggtaccggaagaagcagctggcaaagcagctccctgcacatgaccaggacc 

CITCAAAGTGCCATGAGTTGTCTCCCAGAGAGGTGAAGGAGATGGAGCAGTTTGTGAAGAAATAT 

AAGAGCX>AAGCTCTGGGAGTAGGAGATGTCAAACTTCX:CTOTGAGATOGATGCrCAAGGCX;CC^ 

ACAAATGAACATTCCTGGAGOGGATAGAAGCACCCCAGCAGCAGTGGGGGCCATGGAGGACAAA 

TCTGCTGAGCACAAAAGAACTCAATATTCCTGCTATTGCTGTAAACTGAGTATGAAAGAAGGTGA 

CCCAGCCATCTATGCCQAAAGGGCraGCTATQATAAACTGTGGCACCCAGCTTGrmGTCTGCAG 

CACCrcCCATGAACTCCTGGrrGACATGATrrATTITnjGAAGAATGAGAAGCTATA 

ACATTACTGTGACAGCGAGAAACCCCGATOTGCTOOCTOTGACOAQCTGATATTCAGCAATGAGT 

ATACCCAGCANAAAACCAGAATTTGGCACCTGAAACACTTCTGCTGCCTrcACrGTC 

CTAGCTGGGGAGATATACCTGATGGTCAATGACAAGCCCCNGTGCAAACCCTTGCThrrriTO 

AATCACOCTTQQOOTOTGTCAANGGATGCCCCCAATGCCCCTCGANCCCANAAAhrrGCAGCG^ 

NGACCTATACAA 

SEQ ID NO: 133 1 GGTACCTGGAGGCTCAACGGCAGAAGCTTCACCACAAAAGCGAAATGGGCA 
CACCACAGGGAGAAAACrGGTTGT(XTGGATGTritfAAAAGTTGGTXXi^ 

TCATCCTATCTATCArrAACTCCATGGCACAAAGlTATGCCAAACGAATCCAGCAGCGGTTGAACT 

CAGAGQAGAAAACTAAATAAGTAGAQAAAOTmAAACTGCAGAAATrGGAGTGGATGGGTTCTG 

CCrrAAATTGGGAGGACrCCAAGCCGGGAACGAAAATTCCCrmCCAACCTGTATCA^ 

AACTTTTTOCTGAAAGCAGTTTAOTCCATACTn'GCACTGACATACT^^ 

TAAGGNATCCACCCTCGATGCAATCCACCTTGTGTTTTCrrANGGTGGAATGTGATGrrCAACAAC 

AAACTrGCAACAAACTGGCCTTCTGTTTGTTACmCAAAANGGCCACATGATACAAAT^^ 

TTCCCACCGCCCAAAAAAAAGTTCCTAA^^^ATGTTTAAAATATGTCAAAGCTTTTT^ 
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ACAAAAAGAATGGCTTTOGTmCCTAAGTCATTCAAAAANGTATTNTAAAATTAWANAATTGG 
GATAAAAAGCCTITGCATNGTTTAiVrAATGTTrACNAArrmAATArrTCATTCCTQC 

SEQ ID NO: 1332 acctaqaagagaggcgggtcaaagaagtagtgaagaagcattctcagttcat 
aggctatcccatcaccctitatttggagaaggaacgagagaaggaaattagtgatgatgagtcag 
aggaaoagaaaggtgagaaagaagaggaagataaagatgatgaagaaaaacccaagatcgaao 
atgtgggttcagatgaggaggatgacngcggttngglsrrttagaagaaagaaaacttagaagatc 
anagagaaat^^cnttg^^^cngggaagaacttnaccagacccancctattttngacc^ 
ttgttgacatcacccaagaggg 

SEQ ID NO: 1 333 ggtacgagcaactcaagggcgagtggaaccgtaaaagccccaatcttagca 
agtgcggggaagagctgggtcgactcaagctagttcttctggagctcaacttcttgccaac^ 
gggaccaagctgaccaaacagcagctaattctggcccgtgacatactggagatcggggcccaatg 

GAGCATCCTACGCAAGGACATCCCCTCCTrCGAGCGCTACATGGCCCAGCTCAAATGCTACTACTT 

TGATTACAAGGAGCAGCTCCCCOAGTCAGCCTATATGCACCAGCTCTTGGGCCrCAACCTOCTOT 

CCTGCTGTCCCAGAACCGGGTGGCTGAGTTCCACACGGAGTTGGANCGGCTGCCTGCCAAGGACA 

TACAGACCAATGTCTACATNAAGCAOCCAGTGTCCCTGGAGCAATACXrrTGATGGAGGGCAGCTA 

CAACAAANTGTTCCOTGGa;AAGGGTACATTCCCGCCGAGAAGCTACACCTnm*CATO 

TGGTCGACACTTNNAAGGGATGANGATCNTGGGGTGCATC 

SEQ ID NO: 1 334 ACTTTGATGACTGCATGCAGCrmGGCGCAGACATTCCCGTTrGTAGATGAC 
AATGAGGTTTCrrCGGCTACGmCAGTCACTrGTrCCTGATATTCXXGGTCAC^ 
GTCTTCATTGCrACTAATCAGGCrCAGTCACCnX}AAACTTCTGTTGCTCAGGTAGCCCCTGTO 
TAGACGGTATGCAACAGGACATTGAOCAAGTrrGGGAGGAGCTATTATCCATTCCTGAGTrACAG 
TGTCTTAATATTGAAAATCACAAGCTGG'ITGAGACTACCATGGTTCCAAGTCCAGAAGCCAAACT 
GACV>LGAAGTTGACAArrATCATTmACTCATCTATACCCTCAATOQAAAAAGAAGTAGGTAACrG 
TAGTCCACATmcrTAATGCrmGAGGATTCCTTCAACAGCATCCTC TTCAC AGAAAAAC 
CAGTTGACAGTGAACTCATTAAATCAGATGCCACAAGTCAACACAGAATTITGGGGATGAArnT 
ATTCTGCTTTCATTAGCTTGAGCCCANTATrCAGCAACCAGCATGCCCCTACCTGC^ 
CCATTCACTCTTTTGAAACrmAAAANGGGCCCCATGGATGTrcrG 

SEQ ID NO: 1 33 5 ACGCGGGGGCCCACTCTGCGCTTCACCATGGCTTTCATTGCCAAGTCCITCTA 

tgacctcagtgccatcagcctggatggggagaaggtagatttcaatacgttccggggcagggccg 

tgctgattgagaatgtggcttcgctctgaggcacaaccacocgggacttcacccaactcaacgag 

ctgcaatgccgctttcccaggcgcctggcggtccttggcttcccttgcaaccaatttggacat^ 

gagaactgtcagaatgaggagatcctgaacagtctcaagtatgtccgtcctgggggtggatacca 

gcccaottcacccttgtccaaaaatgtgaggtgaatgggcagaacgagcatcctgtcttcgot 

CCTGAAGGACAAGCTCCCCTACCCrrATGATGACCCArrTTCCCTCATGACCGATCCCAAGC^ 

CATTTNGAGCCCTGTGCGCCGCTCAAATGTGGCCTGGAACTITGAAAAAGTTCCT^ 

GGANGGAGAAGCCCTITCCOACOCTACAGNCCOCACCTrCCAACCATNAAACATTrGAOCCTGAC 

ArrCAAGCGCCCTCCTrAAAAGTrGCCATATTAGATGTGAAACTG>m'AAACAAACAGAATCT^ 

TACTCCTTCCCAGTCCTGAANGAAGCCmAGGATGCANCAATGCCCTTCAAGGAAANACTTOGT^ 

G 

SEQ ID NO: 1 336 GGTACTGCrGCTGCTGCTGCTGCrGCTGCTGCTGCTGCTAAAGTrCCAGCAAA 
AAAGATCACCGCCGCOAOTAAAAAGGCTCCAGCCCAOAAOOTTCCTGCCCAGAAAGCCACAGGC 
CAGAAAGCAGCGCCTGCTCCAAAAGCTCAGAAGGGTCAAAAAGCTCCAGCCCAGAA AGCACCT G 
CTCCAAAGGCATCTGGCAAGAAAGCATAAGTGGCAATCATAAAAAGTAATAAAGGTTCTTTTTGA 
CCTGTTGACAAATGTATTTAAGCCTTTGGATTTAAAGCCTGTTGAGGCTGGAGrrA 
TGATAGTAGGATTATAATAAACATTAAATAAGCAAAAAAAAAAAAA 

SEQ ID NO: 1 337 ggtacagagagatccccgaggggaatganaaagccctgaagagggcagtgg 

CrCGAGTGGGACCTGTCTCTGTGGCCATTGATGCAAGCCTGACCTCCTTCCAGTnTACAGCAAAG 

GTGTGTATTATGATGAAAGCrGCAATAGCGATAATCTGAACCATGCGGTTTTGGCAGTGGGATATG 

GAATCCAGAAGGGAAACAAGCACTGGATAATTAAAAACAGCTGGGQAGAAAACTGGGGAAACAA 

AGGATATATCCTCATGGCTCGAAATAAGAACAACGCCTGTGGCAlTGCCAACCTGGCCAGCTrCC 

CCAAGATGTGACTCCAGCCANCCAAATCCATCCTGCTCrrCCArnCrTCCACGATGGTGCAGTGT 

AACGATCACrrrGGAAGGGAGTTGGTCTGCTATITriTGAAGCAAATGT GGNGATACTGAA AATrG 

TCTGTTCAGTTTOCCCATTTGTTTrGNGCTTCAAAAGGATC 

SEQ ID NO: 1338 GGOA CriUTl l - l I ' ri4 n - 14 1 11^ - 1^11111 CATGTATACTTCAmATnTATTA 
ATAAGTAAANCCCTCTAAGGGGAGCCrmGCCTAATCCTCCNACTCTGATTC^^ 
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TAATCTGGAAGTAACNAAGTTCGTAGGTCTCCTTGTCAAATGCAACCNCTCGAAGCCAATCACNA 
AAATTCrrCTrCTrAAGGThriTrcrrGGTAAGGTArrrCAAATACCTTITA 

AAACAACTGNGArmATTNTTGAAGCOm:AATGNGAACAACATrCCCNAAATTrCCAOTmQ 
C>riTGACTTTAACXTIWCCCGTAAAAAlTGCTCCTATTITAA^ 

GOAANANAACCCTAAATTTNCCTAGGGAAGGTNCCTTGCCCGGGCGGGCrOTO^rAAAG 

TTCCANCCCACTGGCXJGGCCGTTACTANTGGGATCCCAANCrCNGTACCAAAGCCTTGGGCGG 

TTCATGGGNCATAANCNTGnrCXrrGGG 

SEQ ID NO: 1 339 A ClTi - n M i l - l ' lM ' l - rri - ri - ll -lM-ill M nnr i XjGGGCTCCAAAGATCCTTTACTG 
AGATCCACTrGAAACACTrCGGTCCTrAACTTGTTAACTGAGTrGACAGGCTGATGGCrGATCTAG 
GTAAAGGrrrCACGGTAGCAAAAACAOTCCTGCCTATAATACATCAAAACAGGTTCCTGAATGTG 
TGTGAATATCATCAAGGNCTCTGTCCCATCTnmCAGGGTTCGATTAAGCTCCTCAAATTTGTGQ 
ATCTGACGGATGAANAAATCCCCATAACGCCTGGCGACCTTGGTGTCCGCGATGTGGATCATGTCC 
TTATCAATGTANAAGTCAACCTCTGAaOCTOACTGAAATTCACCATCCAGOGCTGAOATACCGCTG 
GTCCACACGAGGTrCTTCATCTrGTGTTTGAAGACAAGAAGCrGGATCCGGAACTCCTGCTC^^ 
AOGTCCANGAAGCCAKCCAGCTTGGCCCCANGCATGGTGGTGTAAANCTTCANGAAACTGCCGGA 
TGOTTGAAAGCTNGGCCnTGCrGCTTNACCCTCGGGCCGGGANCACCCTITAGGGCGAAr^ 
CNCTGGCGGCCCGTTCTTA 

SEQ ID NO: 1 340 ACGCGGGGCTCrrcCTAAGCCGGCGCTCGGCAAGTTCTCCCAGGAGAAAGCC 
ATGTTCAGTTCGAGCGCCAAGATCGTGAAGCCCAATGGCGAGAAGCXGGACGAGTTCGAGTOCGG 
CATCTCCCAGGCTCrrCTGGAGCTGGAGATGAACTCGGACCTCAAGGCTCAGCrCAGGGAGCTGA 
ATATTACGGCAGCTAAGGAAATTGAAGTTGGTGGTGGTCGGAAAGCTATCATAATCTTTOTTCCCO 
TTCCrCAACTGAAATCrTTCCAGAAAATCCAAGTCCGGCTAGTACC 

SEQ ID NO: 1 34 1 GGTACGAGGACTGGATGGAAAGGTGATTTGTGGCTCCCGAGT GAGG GTTGAA 
CTATCGACAGGCATGCCTCGGAGATCACGTTTTGATAGACCACCTGCCrGACGTCCCnTGATCCA 
AATGATAGATGCTATGAGTGTGGCGAAAAGGGACATTATGCTTATGATTGTCATCGTTACAGCCXj 
GCGAAGAAGAAGCAGGTCACGGTCTAGA.TCACATTCTCGATCCAGAGOAAGGCGATACTCTCGCr 
CACGCAGCAGGAGCAGGGGACGAAGGTCAAGGTCAGCATCTCCrCXjACGATCAAGATCTATCTCr 
CTTCGTAGATCAAGATCAGCTTCACTCAGAAGATCTAGGTCTGGTTCTATAAAAGGATCGAGGTAT 
TTCXIAATCCCCGTCGAGGTCAAGATCAAGATCCAGGTCTATTTCACGACCAAGAAGCAGCCGATC 
AAAGTCCAGATCTCCATCTOCAAAAAGAAGTCGTrCXCCATCANGAAGTCCTC GCAG AAGTGCCA 
AOTCCraAAAOAATGGACTGAAGCTCTCAAGTTCACCCTITANaGAAAAGTTATm 
TATTTATAAGGGGATTTGNGGATGTCTGTNAAAGTGTACCTAAGGAAAGATAATTCAACC CTTC TA 
ATTCAAAATGGGATCTGGATTACTATGNTAAATTCACANCAGGTAAAKAATAATATTAAATTrTGT 

SEQ ID NO: 1 342 GOTA ClUTri - i4 - I4 - lU - l ' r i' rilU 1 ll^ CTGGACAGNOGTr TTATr GGTAAAOATA 
TAAOACATAlTGGCTCTATrAAAAACTCAGGTAATAAAGCACTAAGCrTGATTTTTGTATTGC^^ 
AGTCTCTTTCTTCTAAGGGGAAGAAAATCTCCCCAAGAATAGGATGCTACCTGAGGAATTATGCCG 
AATAAAGAAAAGGAATGGATGGTCGGCAGTGAAA I ITIX; 11 CGGGCATCAACATGCANAAAGTTG 
CGATGCCTGCTGTGGCAGCTGCCGCCTCTGTTCCCTCTTCATrCACTTCCACAy^ 
AATTTTTGATATAAAAATATCTCTGGCTCCTOACATGCCAGACAOATCAGCCTTGCTACT 
GAGATCCTGCACACCTAGGCGGGCGAGGTCGGAGTI'GANAGTGTAACTXn'CTrCrAGTTT GAACCT 
OGGCAAGCTGACATTAACrrCAATGAAATCGAGATTCTCAGGTrrAAGTCCCTCATGCAACTTTTC 
CAAAGTCAACTGTTCCrCAATCTITCITCANGCCCGTGGACTCGTCCTCAA 
GGATGACCTrCTGAGCTCCTCGCCTTGGGTAAAGGCAAGTTNCCAGCACACCGGCCCTTAA 

SEQ ID NO: 1343 GGTACAGTTGAAATAACTGGAACAAATTATTrn'GGTGTGTGTGACAATTCTG 
ITmAATGCTATTTGAACAAGTGGGCCATTAGCCAGATTT GTCnT ITTC 
ACTAATTirACATGTTTATAAATCTTATGCTCTCACTGTTTGrrTTTAm 
GTrTCCIX}ACATrGTCTCCTATATAmCTATTATTAATTGC>AAAACATAGAAATC 
TATCAACAATAAAATTirmAAAGTAGTGAGTGCTATTTTGGAGTrCCAAATTTrCAGT^ 
TATCTAAAA Cl - 1 - 1 - 1 i 1 1 AATACOTGCCATrATCTATAGAAAACATTACrTCAGGTTGTOAOATTGAO 
TTGCATTTCTGGATGGACTGATGAATITATCCCGACATGAAGAAGATTGGCATATTAGCTrTAAAA 
ATTTTTAAAGATTGGATTTITmAGTATAAGCCACTTTCTAANGATTATAGAGAAATO 
CX:AATGCATAGCAAAAATAGTGGTGTTAGAAAAGAAAATAGGGTTA CATTTAA NGGAAGNGCTn' 
TAAAAAACCAGAAACCAGACITITAAAATTTAAAmGTNGGACACCCTITrTAAAAAm 
CAAAGATrmATTTTANATTTTNCAATTACCCCCCTTTTT^ 

SEQ ID NO: 1344 acgcgggggacgccgagacaaaccggacccgcaaccaccatgaacagcaaa 

GGTCAATATCCAACACAGCCAACCTACCCTGTGCAGCCTCCTGGGAATCCAGTATACCCTCAGACC 
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TTGCATCTTCCTCAGGCrCCACCCTATACCGATGCTCXACCrGCCTACTCAGAGCT^ 

GCnrraTGCACCrAGGOGCTGCCACAOTCCCCACCATGTCAGCCGCATTTCCTGGAGCCTCTCTGT 

ATXrrrcCCATCGCCCAGTCTGTOGCTGTTGGGCCTITAGGTTCCAC^ 

AGTCGGTCCCATCTATCCACCTGGCTCCACAGTGCTGGTGGAAGGAGOOTATOATGCAGGTGCCA 

GATTTGGAGCTGGGGCTACTGCTGGCAACATTCCTCCTCCACCTCCTGGATGCCCTCCCA ATG 

CTCAGCTTGCAAGTCATGCAGGGAGCCAACGTCXrrCGTAACTCAACGGAAGGGGAACTTCn^ 

GGGTGGTTCAGATGGTGGCTACACCATCTTGOTGANGAACAANGCCACCrCTGTGCCGGGAAAGA 

CATCACATACXrrTTAGCGCTTOTACAATGTAACTGCrrrAATCA^ 

TAAACACATGTTGGrrcGGGGGOTCmCTGGNGCCQ^AACITITANGCACTrm 

SEQIDNO: 1345 ACCACATATCCCArrTCACTTCTATrrGTGTGAAATGGCCTTTCXrCCGGGT^^ 
AGCCAGCACCTGATGAAACTTCCTrCAGTOAGGCCTTGCTGAAGAGGAATCAGGACCrGGCTCCX; 
AATTCTGCTGAACAGGCATCTATCCTrTCTCTGGTQACAAAAATAAACAATGTGATTOATAATCT^ 
ATTGTGGCTCCAGGGACATTTGAAGTGCAAATTGAAGAAGTTCGACAGGTGGGATCCTATAAAAA 
GGGGACAATGACTACAGGACACAATGTGGCrGACCTGGTGGTGATACTCAAGATrCTGCCAACGT 
TGGAAGCTGTTGCTGCCCTGGGGAACAAAGTCGTGGAAAGCCTAAGAGCACAGGATCCTrCTGAA 
GTmAACCATGCTGACCAACGAAACTGGCTITGAAATCAGTTCTrCrGATGCTACAGTGAAGATT 
CTCATTACAACAGTGCCACXXAATCTrCQAAAACTGGATCX:AGAACrCCATTTGGATAT CAAAA GT 
ATTTGCAGAGTGCCTTANCAAGCCATTCCGACATGCCCGCTTGGTTCGAAGGAAAAATGC^^ 
AATCCCACAGTrAAAGTTCTCATCANACTACTGAAAGGACTTGAGGATTCGTTITCCT^ 
ANCCCCTNACACCXrrTGGATCCTTTGACCTACrAGGCCATTTTGCrGNGATGAACAACCCCNCC^ 
A 

SEQ ID NO: 1 346 ACTCAAAGGTGATATTTGCITITrrCAATGCTTCAOGGGAAAAATCCT^^ 
TTACAAACTTCCATCAGTTTAGGAGTCAGTCTGTATGCCTTTAGTGAGAGAGATCCrrcGGC^ 
mATCGGATCATAAATGAGAACGACAGATTOTCAATGGCATGCTGGTAACTAAACTGAGAGTC 
CAGGAGTGCCCGGGTAACGAATGAGCCATAGTATGTGGACTGATACCAGCCCACGTGAAGATGAT 
CAATGTTTACATGGCGAAGGCTCCGCATCATTTCCATCTGATATTGGACITCATCAAAGTC 
CATCCTCTQT0TGCTGAO0GAAAGGAAAGCAGTTGOTAATTTCAAGCCX}ATCTTCTACAA CCAGA 
CCCAAAAGCACTCXTITGAACAACTTCAGTTCCTTGTCCTrCTTCTTGATAATGm 
ATACCACAAGGCCATCTATCTGCACITGCTTCACGGCTGAATCTCCCGAGCCCOCCTTTO^ 
CCTTTTCCTGCrrGCGCCCGCCGGTGGAGCTGGAAAAANGTGGGCAGTANAGCCCGOT 

SEQ ID NO: 1347 GGTACGTTTTGCCAACCTATGAAATGGCCGTGAAAATGCCTGAAAAAGAACC 
ACCACCTCCTTACTTACCTGCCTGAAGAAATrCTGCCTTTGACAATA^ 
TTTGTTTATGTTACAGAATGCTGCAATTNANGGCTCTTCAACTTGimTGA^ 
^mTrcTTAANCAATTTATmCNAACACTAAAGAGC^^^^m^ACAT^r^ 
TTTTTGTTAAAGNCKmACANTrTTAATAGrrTTTTG 

SEQ ID NO: 1348 GGTA Ci ITi i 1 1 1 1 1 1 i 1 1 1 ' l 1 1 i 1 1 1 1 t GGANACGGAGCTTTGCTNTTGTrACC 
CANGCTGGAGTGCAATGGTGTGATTrrGGCTCANCGCAATCTCCTCCrrCCAGGTTTAAG CAATO 
TCCTGACTCANCCTCCnBAGTAGCTGGGATTAAAAGCATGNGCCATTACNCCTGGCTAANTTTm 
1 - 1 - 14 ' rilJ - l ' i ' iri GNATITrrAANAAAAACAGGGmCTCCATGrrGGTCACGCTGGTCTCAAA^ 
CCGACCTCAOQNGATCTGCCraCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACT^ 
NGCCCGCCCACCAGCCCGGT 

SEQ ID NO: 1349 GGTACAAATAGTTACTGAAATGACNAGACCCTTG'nTGCACAGCATrAATAA 
GAACCrrGATAAGAACCATATrCTGTTGACAGCCAGCrCACAGTTTCTrGCCTGAAGCTTG 
CCCItX:AGTGAGACACAAGATCTCTCTmACCAAAGnGAGAACAGAGCTGGTGGArrAATTAAT 
AOTCrrCGATATCTGGCCATGGGTAACCTCATrGTAACTATCATCAGAATGGGCAGANATGATCTr 
GAAGTGCCACATACACTAAAGTCCAAACCTATGTCAAGATGGGGGGTAAAATCCNTTAAAGAACN 
GGAAAAANTATTrm-AAGATGATAAGCAAATGTTTCANCCCAATGTCAACCCAGTTAAAAAAAAA 
ATTAATGCTGNGTAAAATGGnrNGAATTAGTTTGNAAACTATATAAANACATATGCAGTAAAAAAG 
TCTGTTAATGCACATCCrGTGGGAATGGAGGTGTTCTAACCAAATTG CCTTT r CTTGTTA TCTGAGC 
TCTCCTATATTNTCATACTCAGATAACCAAATTAAAAAOAATTAGAATTTTGATTTT^ 

TANA^T^AAAACTX^T^^AAC^mc^T^mT^^'GGNGA^^ 

TCAATGGCCTChrrGNGTCATTGTTTTAAAAAAATCAhrriTrNACTTTTACC^ 

SEQ ED NO: 1350 OGTA CriTr f rn -t Tl - riTn - H ri 1 ll U i IN GCNGATAAACAGACATGnTAA 
TGATAGCTTGCTCTTCACAGAGATGTCTACAGAGACTTTrAATCTATAAT(XAGGAGTAT^^ 
ATOCAGCACANACCAATTAGCCAAATGCAAAATAAACTAOATTCTTACCACAACrATCCTATAAA 
CACTGCAAC^ATT^^rTrcCAAAAGGACa^*AA^C^rATGTGAAAACACCT^ 
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NAAACCTAGCTAT^^X}GCCAAAAGGAAAAAATACK;AAT^CAGTGGTTAAT^GGTGAAAGCAGGA 

AATQTATGCCCnTrACTGTGCTGCTGTTGCTGCTGTTGTTGCrGCrG TO 

GCGGCAGGTGCTGCTCCTGCTGCTGCTGCntSCTGCTTTGGTCTTCTTAGTm 

CTTCTTCAGCCCATCCATrGTTACTrCTAAGGGAGGmTGAATAAACCANCTGAAACrrATrC^ 

TTCCTTGGAGCTCAACC^CANrGCTGATmTCrrn^ 

TGTCTrGCGGGCTCAAAACCCCTNCCAAATACCTGGCCCGGGa:GGGCCGGrrCTAAAAGGGCGG 
AATTCCCACCANAOTGGNGGQCCGGTrACTrATTrGGGNTCCCANNCrCGGGNACCAAACC^ 
GG 

SEQ ID NO: 1351 GATACACAGAGTAAAATGlTl'ri CI' I 'I'l 1 1 CAGGACCTTGAACTGAATCTTGC 
ACTGCrTTGGTTTCTATCTAGGAAGCrCAGCGACAOTAGAGTCTGTAGAGGCGGCCACTGAriT^ 
CACACCCCGGAGAGGGACTCACGGGTAGCACAACGGCCAGTTCGGCAATAGCAGGTGGCrCTTGC 
CTOAGAACCTCAOGTTCTAAGAGCAGAGAGTCCATrrCCTGCAAAGGAGATAGCAAGGTCCTGGT 
TGTCTTCCCCANACTGC^^^CTGGGr^GNANCCTCATCAGCr^CTITCCTGGAGT^^ 
GGCTGCATGGNCACCAC 

SEQ ID NO: 1 352 ggtacaggagatctcatttgggacaactaaggataaaatgctggtcatcgag 

CAGTGTAAGAACTCCAGAGCTGTAACCATTmATTAGAGGAGGAAATAAGATGATCATTGAGGA 

GGCX}AAACGATCCCTTCACGATGCTrrGTGTGTCATCCGGAACCTCATCCGCGATAATCGTGTGGT 

GTATGGAGGAGGGGCTGCTGAGATATCCTGTGCCCTGGCAGITAQCCAAGAGGCGGATAAGTGCC 

a;ACOTAGAACAGTATGCCATGAGAGCGTTTGCCGACX3CACIXjGAGGTCATCCCCATGGCCCrCT 

CrGAAAACAGTGGCATOAATCCCATCCAGACTATaACCGAAGTCCGAGCCAGACAGGTGAAGGA 

GATGAACCCTGCTCnrrGGCATCGACTGrrrTGCCAAGGGGACAAAATGATATTGAAGCA^ 

ATGTCATAOAAAACCTrGATTGG(^\AAAAGCAACAAGAWCTCTO 

TGATirTTAAAGATTGATGAC^rrCGTAAGCCCrGGAGAATCTNNAAGAATOAANAA^^ 

AAAAANCTTTGTTOCAAGATCCNCnTnTGTGANTrAAANTAAAATGGGATNGCCTrc^ 

TCT 

SEQ ID NO: 1 353 ACTTCTCATTTTCATTCCrrCAAAGCrr rCi' ri CAGCATAGGCTCCmTGTTC 
ATCCTCCTCCTCrrCCTCAGACAATGGTGAAAAGGCAAACCTCTGGTTGTTTGCAGATGGTCGCCA 
GAGAACCATGATGACAAAGAGGATCATGGAGAACAGCAAGCGCCAOATGGCATCGTCTACCCAC 
AGCTCCCGCCAGTCCGACTCACATG1X:ACTATTCTGAACITCATXiGTrGTCCAGATG ATAA ACACA 
ATOGATGCTOCCACTGCCAAAATAAGCGTGTrGGTGAAATGCCGATACAAAGAGAGrnTACAAT 
GTTCXrrCCGAAGTTTrAATAGCrrCATTGTrrGAGTCAGGCTAATAAATATCCATAAAATAACACA 
GGCGTCAACTGCTGAGAGGGCCAGGTTTACrATCAGAGTCAAGGGGATAAGAAAAATACCCAGTA 
ACnXTTGAGGACCCCrrCCATGCCAGAGAACAAAAGATANAGGGCTCCTGCTACrrACAACCTTATG 
AAGAGTGCTCCCANGCGTGGGCnTGACGATGCCCATATCCCAGACCCCGCGTACCrrCNGCCCGG 
GACCACG 

SEQ ID NO: 1354 ACTGTGTGCAGCAGCTCAAGGAATTTGATQaGAAGAOCCTQGTCTCAGTTAC 
CAAGGAGGGTCTGGAGCTGCCTGAGGATGAGGAGGAGAAGAAGAAGATGGAAGAGAGCAAGGC 
AAAGTITGAGAACCTCTGCAAGCTCATGAAAGAAATCTTAGATAAGAAGGTrGAG 
TCTCCAATAGACTTGTGTCTTCACCTrGCTGCATrGTGACCAGCACCTACGGCTGGACAGCCAATA 
TGOAGCGGATCATGAAAOCCCAGGCACTrCGGGACAACTCCACCATGGGCTATATGATGGOCAAA 
AAGCACCTGGAGATCAACCCTGACCACCCCArrGTGGAGACGCrGCGGCAGAAGGCTGAGGCCGC 
AAGAATGATAAGGCAGTTAAGGACCTGGTGGTGCTGCTGTnGAAACaSCCCTGCTATCrrCTGGC 
TTTmCXnTGAGGATCCCCAGACCCCrCCAACCGCATCTATCGCATOATCAAGCTAGGTCTAGGTA 
TTGATGAAAATGAAGTGGCAGCANAAGAACCCAATGCrOCNGTrTCrGATTGA 

SEQ ID NO: 1355 acacgtcaaataagaaactactcttagcacagaaataacagaaaatatgctc 

ACATCCTATGGTGTGAGGCATGGTGGGCCACTCTCCACATAGAGCGCATTCnTGCCACTGGTGGC 
TAATGTArrGTCACTATTAGGTGCACCAGTAAGAGGAATACACCATGAAGACAGCTrGGCTTTCAA 

cttctggacattgataagtgqtaagagaaaaatcagaaattcagcaaaaccatgccagagaagtt 

CCCTATrCATOTATl'CAAAGCCAACnTCACGThn*OTmGAGGCTTGCAAAATACAGAATGA^^ 

CTAGQAQACGTTCTGTCAAAGTmGCNAACmCCCCTCTGAAGGAAAATCAAAAAATrAATCAG 

CCCACCTAATTTCAAAAGTCCAATCACAAAATTCACACACnXjCrrGAC^ 

ATGATGGTTTTCGAAACAAATCATTAGCATCGTTCTTCTAACCACCTGCCNCAATTTGTACC^ 

GCCGGGAACCCCCNTAAGGGNGAATTTCCCTC 

SEQ ID NO: 1356 ACGCGGGGGGGCCTCCATCAGCAAGCTCCAGTGCTACGTGTCCCTGGCATIT 
TAGGTGTCGGTrGGGTAGGCAGTCATGGATCAGGTAATGCAGTTTGTrGAGCCAAGTCGGCAGTrT 
GTAAAGGACTCCATrCGGCTGGTTAAAAGATGCACTAAACCTGATAGAAAAGAATTCCAGAAGAT 
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TGCCATCKKTAACACTCAATAGGATITGCTATAATCKjGATrCApGGCrTCT^ 
TATrCCTATTAATAACATCATTGTrcGTGGaXjAATACATTTTGGAAOAQAGTTm 
ATTGGTGAACAAGTGTGAGGGTGTGAGAAACTCACAGAATACAAATTTQCCTGTATGnTTGTGG 
GTTTTTITITrCCTTTCAAGATGTTTTCTArrTCT 

SEQ ID NO: 1357 ACCITITGGTGCCAOCGTTTCAAGOAOCCCTCACCATGAAACAAGTCAACCC 
CAGCAAGCGTCTAGATCATTTGCAGCGGGCTCGAGAACACTTTATAAACTACrTAACTCAGT^ 
TTGCTATCATGTGGCAGAGTTrGAGCTGCCCAAAACCATGAACAACTCTGCTGAAAATCACACTGC 
CAArrCCTCCATGGCTTATCCTAGTCTCGTrGCTATGGCATCTCAAAGACAGGCTAAAATACAGAG 
ATACAAGCAGAAGAAGGAGTTGGAGCATAGGTTGTCTGCAATGAAATCTGCTGTGGAAAGTGGTC 
AAGCAGATGATGAGCXJTGTrCGTGAATATTATOTCrrCACCTrCAGAGOTGGATTOATATCAGCT 
TAGAAGAGATTGAGAGCATTGACCAGGAAATAAAGATCXrrGAGAGAAAGAGACTCTrCAAGAGA 
GGCATCAACTTCTAACTCATCTCGCCAGGAGAGGCCTCCAGTGAAACCCTTCATrCTCAC^ 
CATGGCrCAAGCCAAAGTATTTGGAGCTGGTTATCCAAGTCTGCCAACCTTrGACGGGTGAGTGAC 
TGGTATGAGCAACATCGGAAAATATGGAGCATTACCGGATCAGGGAATTACCCAAGGCAGCCCCC 
AGAAGGAATTCAOAAAAACNGCTCA 

SEQ ID NO: 1 35 8 ACGCGGGGGCAGTTTTCAACrGACCTCTGGACX3CAGAACrTCAGCCATGAAG 
GTAACAGGCATCni fl' t CTCAGTGCCTKlGCCCTGTrGAGTCTATCTGGTAACACTGGAGCTGACT 
C(XTGGOAAGAGAGGCCAAATGTrACAATGAACTTAATGGATOCACCAAGATATATOACCCTGTC 
TGTGGGACTGATGGAAATACTTATCCCAATGAATGCGTGTTATGTTTTGAAAATCGGAAACGCCAG 
ACTTGTATCCTCATTCAAAAATCIXjGGCCTTGCTGAGAACCAAGGTTTTGAAA 
CCGCGAGGCCTGACnXjGCCTTATTGTTGAATAAATGTATCTGAATATCCCAAAAAAAAAAAAA 

SEQ ID NO; J 3 59 ACGCGGGCAATGATGGGCrTTATGATCCTGACTGCGATGAGAGCGGGCTClT 
TAAGGCCAAGCAGTGCAACGGCACCTCCACGTGCTGGTGTGTGAACACTOCTGGGGTCAGAAGAA 
CAGACAAGGACACTGAAATAACCTGCTCTGAGCGAGTGAGAACCTACTGGATCATCATTGAACTA 
AAACACAAAGCAAGAGAAAAACCTTATGATAGTAAAAGTITGCGGACTGCACTTCAGAAGGAGA 
TCACAACGCGTTATCAACTGGATCCAAAATTrATCACGAOTATnTGTATGAGAATAATGTTATCA 
CTATTGATCTGGTTCAAAATTCTTCTCAAAAAACTCAGAATGATGTGGACATAGCTGATGTGGOT 
ATTATTTTGAAAAAGATGTTAAAGGTGAATCCTTGTITCATTCTAAGAAAAAAAAA 

SEQ ID NO: 1360 GGTACACCAGACTTTCTTGTAAACTTCAGCCACAGGAAGGTCCAAACTAATG 
ATTTTATTGTTCACTAGAAGCTCCATGCCACTGTCATCTTCCAGGAGGGCCACTAAGTCACAGTCC 
TGGCAAATCTTGTTCrmATATCCCTCATCAOCGGCCCGATGCCTOGCTCATI^ 
TCCCAGGCATCCTGC<XTGTAAGAAGTCTTCITGTrGGGGATOCTTCrCXAGGGTC^ 
CAGTGACTTCATTCTCCTCAGGATAAATGATGCTGCAGAGCCrCTCGAAGATGAACACCGGGGTCC 
GGTAGTCATCCAGATTOTAGCGCTTGGCTGTCTCAATGCACACA GCCA TGAAGGCCTTGGTTTCTG 
ATTCTGT 

SEQ ID NO: 1 3 6 1 CGAGGTACTrN i'iU"rrri"i'rri'rri'ri"i"riGGGGTcrcACTcrGTCACcx:AGGCT 

GGAGGGCAhrrGGCGGGATCTNATCTCACTGCAACCnjrcCCTCTCAGATTCAAGCXJAT^ 

CTCAGCCTOCTGAGTAGCTGAANACTACAGGCATGTGCCACCNCACCCAGCTAATrmTO 

TTAOTAAAANACGGGOTTTCACATTGTTAGCCAAOATGG 

SEQ ID NO: 1362 . GGTACTTGAAGCraATOCAGAAAAATAAAAATTTAATATAATTAAAAGCAAT 
TTATATATCATCATGAAATAAAGAGAGAnTGAAACTATGTAGTCAACAGAGGACAAAAAAAAAT 
CTCTCAAGCAAGAGAATTrCTGATCAAATCTGTCATATCACrrGCTG<XGAAAA0OACCATTA^ 
GCAATACACATCAGTGCCTTIXrrrAAATGTATAGACTCAAAATAGGTGTTTTTTrGAGGCACTA^ 
AAAAAAGATTCCTCTAAGCAATCAAAAATCAGCATITGAGATrTAATCACTAAATCCAAAATGAG 
ATCAGCAAAACCAGATNCTAGTAAGGCACTAGGCTCACAAAATTGACCrrTGGAGAAANAGAGAT 
GCCCAAGAGTAATArmCTrcANAATGNGNGAAANCTGAAAACTTTATCCCCTTTT 

SEQ ID NO: 1363 GGTACATAGACAAGnrCrrGTAAGACAGAAAACAGAGAAATCCACAGTAAC 
TCrAACACATCCCTTAAGGAATAAGCATGTATITGTAGGAAGCAAACAAAGCrTTCCATAOAGAA 
ACCACTTTCACAGGATGArrAGGTGGACCTGCAATGAAGAAAATACATTTCAAAAGATGGGTrCA 
GACTTACACCAAGTmCACTGAAATACTrAAAAAAAAAAGACCCrrrCTCTGTTCAAAAGAATTA^ 
AAAAAATACATCCCGGATTAAATAAATTTCAGGAAAAGGTCAAGTCCTACTCAGAGACATACATr 
TGCAAATTNATATAAATTTrTAAGOTTGGCCACCAAAATNCCTATTrGGGACCTCCTTA 

SEQ ID NO: 1364 GGTACT^GCCACACCAATCTCAGGACCAGCATTAATATGAACTCCACAATCT 
GTCTCX;CGTGATATGGAACTGCCAACTGTGmGTGATCCCCACAGTTAAAGCTCCTCrCTCCTTAC 
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AGTAACGAAGACCCATCAAACn'ATCrGCIXjTCTCACCTrGATTGACTAAGGAAAAAGCAAACATCA 

TCTCGAAAGACTGGTGTGTITCTGTCCAGGAAGTCAOTGCTAGTTCCACCATCACAGGCAACTCA 

GTCAGCTCCTCAAGAACTTOACGTOTTGCTACACCAGCATGGTAACTrGTTCCACAAGCAATAAGA 

ATCAAACGCCGGCATCTCTGGATCTCCTTTATGTGATCCrrCAAACCACCCAAATTCACAGTATAG 

CATCAAAGTTGACTCTTCCnKrcATTGTrm^ACGACAGACTCTGGCTGCTCAAATATTTCC^ 

ATAAAATGAACTGAAGTTGCCCNTCATGATCTGCTGGGAGTn'CCATCNGGAAGTGGTTrGNCAG 

NCTCGTCCNGGGOGTOATCTCCTTCAGTrCX)rrTAAATTCOAhrrGGATAGAAAAGAACC 



SEQ ID NO: 1365 GGTACTGGATOAAGCTGACGAAATGT TAAGC CGTGGATTCAAGGACCAGATC 
TATGACATATTCCAAAAGCrCAACAGCAACACCCAGGTAGTTTIXjCriXjTCAGCCACAATGCOT 
GATGTGCTTGAGOTGACCAAGAAGTTCV^TGAOGGACCCCATTCXjOATTCTTGTCAAGAAGGAAGA 
GTTGACCCTGGAGGGTATCCGCCAGTTCTACATCAACGCGGAACGAGAGGAGTGGAAGCTGGACA 

cactatgtgacttgtatgaaaccctgaccatcacccaggcagccatcttcatcaacacccggagg 

aaggtggactggctcaccgagaagatgcatgctcgagatttcaccgtatccgccatgc atgga ga 

tatggaccaaaaggaacgagacgtgattatgagggagtttcgttctggctctagcagagttttga 

ttaccactoacctoctgoccagaggcattgatgtgcaocaggtttctttagtcatcaacratoacc 

ttnccaccaacaggggaaaactatatccacagaaatcggtcgaggtggacggittggccgtaaag 

gtgtggctattaacatggngacagaaagaaagacaagaaggactcttcgagacattgagaccntt 

tacaacanccitccatttgaggaaatgcccxn'caatggttgcrgacctcatctgangg 

ctggccaccc 



SEQ ID NO: 1366 A C ' IM ' ! l ^ l ^ l ^ iH ^ lUHU ^ ll ^ I ^ iM ^ ^l ^ l ^ t ^ l ^ l ^ ^ ^ l IN GCCATGCAACAATGTCrnTATTATGT 
ATGCGG^^mAAAATTATT^C^TGAATCTCTCCATACACAGGCAAAAATAAGTGNGTTACTT^ 

tactggaaattgcctaacttaatcattgcctaaanaananaaaattatcccx:aaaacgtgcttaac 

caggaqgccaatgcattrgccoacctccaanaacatggagatgaacgtgatanacanactgtcca 

ccatctgaaccircattcaccaccattcgataaccctrattcaggcccanatcagcagcacatrm 

ttgccaacaatcattaagtgtccaanaanactttcatcatcatcttctgccacanaaatct 

atatgttrcttgggtatca(xaaaaaatgtgttggtgctrgaggggaaatgtcatggaaagcaag 

gcaccxkjtcatcctcaaaaatgattttggcrggtatttccrtgcggatgatot 

gtcgcccccaggccgagcgacctgagccn'ggcaatcttatttrgccattttgggct 



SEQ ID NO: 1367 ggtacgcgggggagttcgacacaccatgccgactgtcagcgtgaagcgtgat 

CTGCrCTTCCAAGCCCrGGGCCGCAOTACACTGACGAAGAATTTGATGAACTATGrr^ 

GGNCrGGAGCTTGATGAAATTACATCTGAGAAGGAAATAATAAGTAAAGAACAAGGTAATGTAA 

AGGCAGCAGGAGCCTCTGATGrrGrrCTTTACAAAATTGACGTCCCTGCCAATAGATATGATCTCC 

TGTGTCTGGAAGGATTGGTr<XAGGACTTCAGGTCTrNAAAGAAAGQATAAAOGCrCCAOTGTAT 

AAACGGGTAATGCCrGATOGAAAAATCCAGAAATTGATrATCACAGAAGAGACAGNTAAGATAC 

GTCCTTTTGCGGGTAGCAGCAAGTTCTCCGTTATATAAAGmACTTAAAGATCGATATGACAGCT 

TCATTCAACTTCAGGGANAAATTACATTNNAATAriTGCAGGAAAAGAAGCNCrcGTTm 

GGTCCTGCCCGGCGGGCCGCITTAAAGG 



SEQ ID NO: 1 368 GGTACAGACATmCAAAGTTGCCAGTGTTACTrTAATTGGACTGCCTTCGTA 
ATTCATTGCCTCTGCrTCAACAATGTGCAACTCATCCTTTGCACC^ 

AAAGATAACTGGTGCrCATTTrCATCATTATCX:ACCTTAAAGTGATAATCnTrGTCGGCC^ 

CACAACCQAAAAOATAGTTCTGGGGCCTCAGGGOGCTCATOTCCATGTCCATCGAATCTTa^ 

GGTGGCGGCACGCACTTAGGTAGGAGAGAAGGCCGACGGAGATAAAAGAACGCrGNTCX^AAANA 

ACAACCCGCGCANGAAGGAATCACACCAGGGCCCaK:GTACCTGCCCCGGGCGGCCGNTNAAAA 

GGGNdstAATTCCA 

SEQ ID NO: 1369 ACACACCTTTGTCCACTGGGTAAATTATATTCATTATGCCCACTGCTGCAGCA 
CGCATAJ^CCAACACCCCTGCATGGCTGAACAGOGCCTAATCTAGOACTGATGGGAGAAGGGCTT 
GCAAACTAAGATCAAGGTGTrrCTCCGCTAATACTGTCTACCAAGCnXjATCCCTACAAA^ 
ATAAAXGCAGGCAAGTTTAGCTACTGTGTTGCAAGAGAAACCAGGACCTTGrrAAATANGNn^ 
TCCATTACCATTTATTCTCTCAAGGGAAGCTTAAAAAAAAAACAACAACACAACATC ACAT TGGTC 
TGGCCACCTNATGAATTCCAACAAGCATTAGNGTGGCATTTCAmTGGANAAGGAAACIT^ 
GGAAAAAATAC 

SEQ ID NO: 1 370 AAGTGTAAAAAGGTGAAGCCAACTTTGGCAACGTATCTCAGCAAAAACTACA 
GCTATGTTATTCATGCCAAAATAAAAGCTGTGCAGAGGAGTGGCTGCAATGAGGTCACAACGGTG 
GTGGATOTAAAAGAGATCTTCAAGTCCrCATCACCCATCOCTCGAACrCAAGTCCCGCTCATTACA 
AATTCTrCTTGCCAGTGTCCACACATCCTGCCCCATCAAGATGTTCrcATCATGTGTTAC^^ 
GrrCAAGGATGATGCITCTTGAAAATTGCTTAGTTGAAAAATGGAOAGATCAGCTTAGTAAAAGA 
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TCCATACAGTGGGAAGAGAGGCTGCAGGAACAGCGGAGAACAGTTCAGGACAAGAAGAAAACAG 

CCGGGCGCACCAGTCCGTAGTAATCCCCCCAAACCAAAGGGAAAOCCTCCTGCTCCCAA^ 

CAGTCCCAAGAAAGAACATTAAAACTAGGAGTGCOCAGAAGAGAACAAACXXIGAAAAGAGTGTG 

AGCTAACTAGTTTCCAAAGCGGAGACTTCCGACTTTCCTITACAGGATGAOGCTTGOGCATTGCCT 

TGGGACAGCCCTATGTAAAGOCCATGTOCCCTTGCCCTNAAAAACTCl^GGCAGTGCr 

NACACATTTTTNCAACAl-ri-JUCIUAAGGCTTTGCTTCAN 1 1 NTITGTANTC 

SEQ ID NO; 1371 ACGCGGGAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAGGACAGTATTC 
CAGTTACraAACTTTCAGCAAGTGGAaTrTTTGAAAGTCATGATCnTCTTra 
TGTGAAAAATGAACTTTTGCCTAGTCATCCCCrrc 

AAGATAAAATGAATTTTTCCACACTQAGAAACATTCAGaGTCTATTTGCTCCGCTAAAATTACAGA 

TG GAATI CAAGGCAGTGCAGCAGGTTCAGCGTCTTCCATrrCTTTCAAGCTC^ 

ATGTTTTGAGGGGTAATGATGAGACrATOGATTTGAGGATArrCTTAATGATCCATCACAAAGCO 

AAGTCATGGGAGAGCCACACTIXjATGGTGGAATATAAACTrGGTTTACTGTAATAGTGTGCTGTTC 

ATGGAAACCGAGGGCTGCATCTTGTTTATAGTCATCnTGTACC 

SEQ ID NO: 1372 CGAGGTACAGTTTGCAGAATATArrCAGAAAAACGTGCAAaTTATAAGATG 
CGAAATGGATATGAATTGTCTCCCACGGCAGCAGCTAACTTCACACGCCGAAACCTGGCTGACTG 
TCTTCGGAGTCGGACCCCOTATTCATGTGAACCTCCTCCTGGCTGGCTATGGTGGAGCATGAAOOG 
CCANCGCTGTATTACmGGACTACCTGGCAGCCTKKjCCAAGGCCCNTITIT^ 
TTGGTGGCCTTCCTGACT 

SEQ ID NO: 1373 ACGCGGGAGATGACCnTrGAGTCAGTGGGCTGGGAAAGGCAGACCCACCCT 
TAATCTGGGTGGQCACCATCCCCTCAGCTGCCAGTGCAGCTAGAAAAAGCAGGCAGAAGAAGGTG 
GAATGAGCAGAGGTGCX;AAGTCTrCTGGCCTrCATXnTTCTCCCATGCIX}AATGCrrc 
AATATCAGGCrCCATOrrCTTCGGCTTTTGQACTCrrGOACTrACACCAGTGGTITGCCAGAGGCTC 
TCGGGCCCTCAGCCCGAGACTGAAGGCTGCACrGTCAGaTCTCTACr^^ 
GGACTGATCCACTACTGGOTCCTTGCTCXn'CAACTTOCAGACGGCCTATCX)!^^ 
GTGATCGCGTGTGTCAATT<XCCTTAATAAACTCCCrrrCATATATACATGTATCCTATTA^^ 
TCCCTCTGGAGAACCCTGACTAATACAGCAGGTATGGAGAGAAGrn'ATTGGGGACTGACrTATQ 
GAGAGACAGAAAAAGTCACCTCACCTTTCTGGCTGAACTCACGAATCACAAAATrTTm 
TGATGAAAACnXjGGAATTCTmTrcCTXKK;ATATACACAAAGGATCT^ 
CATAAGTTAGGGGAACCnrrrcAGAACANAAAAACATCANAACnTITCCTTTATTGA^ 

SEQ ID NO: 1374 acgtoaggaaaggattccactcaagtataatcagcaatactctattgaatga 

AACCATAACTCCACT rCACT CCCTATGGTAAATACTAACATTCACCACAGGCTTGAAATTCCTGTA 

GCATCAGTAGAACTArmTGAAAATATTACAAACATGCITATGATGCATAGGAGGATGAAAAAA 

TTTAGACTGTAACCGCACACCATATGTAATCCCAGCTTTTCTAAm 

CAATATCCAAAGAAAGNATirrrAAAATATATTTGCTCACTGGNAAAGGOANTTimAC^ 

AAATAGGAAAAAhrnGCATTTTAATGTCACAArrGAA'nTCAAAACCT 

SEQ ID NO: 1375 GGTAUl U 1 11 1 1 n 1 1 1 1'l'ri-J-l i rn'J AGTANANACATGGTTTCGCCATGTTG 

gctgggctggtcrcgaactcctgacctcaagtgatctotcxn'agccrcccaaagt^^ 

aggcqaaagccaacgctcccggcgagggaacaactttagaatgaaggaaatatgcaaaagaaca 

tcacatcaaggatcaarraarraccatctattaartactatatgngggtaattatoactattrccc 

aaqcattctacgttgactacttgagaanatgtttgtcctgcatggtgganagtggagaagggcx:a 

gg atrct taggtrgatctatcrgtgggttatgactrcccacaatagccacccccggcccccaccag 

tcctmattggctctggatggaaaatccctacccatgtoatgqtcccrggtctctcctatagm 

ACCAACAGTTGACCCCAAAAGGTTATGGNCrrCACCGTmAATTATATCCACGACTAAGATACTG 

GAGNCCTGTA^TCTTNAAAAGTGTGGGGOTGNCTATr^^>^CCAAGAACCAAAATGGNCOT 

CTTAAAAAAAAAGTTTGCITACTTAGGGAAAATraXTGCCCNACCn^ 

TAAKGGAAAAAATNTAGAAAAGCITGANAAAAGCNTGGGTGCCCTrTGNAAAAAAAAAAGGO 

SEQ ID NO: 1376 GGTACTAAACTAGAGCCTGACTQAATAAGAGQAAAGAATQTCCCATAGTGTC 
ATTTOGGGGAATTGAAGTGTCrcATATGATGCrTGAACTTCAGCTACTrAGAATAGG 
GCTTTAGACCAGTGCAmCAAACmAATGCACATGCAAATCACCCAGGGATCTTGTTAAAACAC 
AAGTCTTACACAAAAATTAACrCAAGATGOATTAAAGACTAAGACCTAAAACCATAAAAACCCrA 
GAAGAAAACCTAGGCAATACCATTCAGGACATAGGCATGGGCAAGGACTTCATGTCTAAAACACC 
AAAAGCAATGGCAACAAAAGCCAAAATTGACAAATGGCATCrAATTAAATGAAAGAGCTrCT 
CAGCAAAAGAAACTATCATCAGAATGAGCAGGCAACCTACAGAATGGGAGAAAATnTGCAATCT 
ATCCATCTGACAAAGGGCTAATATCCAGAATCTACAAAGAACTTrAAACAAATTTACAAOAAAAA 
ACAAACCCCATCAAAAAGTGGGCAAAGGGATCCNAACAOACACmCTCAAAAGAAGACATTTAT 
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All^UUCCCAAAAACATTCCGAAQAAAAGCTCATCATCACrCKjOTCATCAGAAGAAGTGCC^ 

AAAACCCCAATrGAGATrACCATClTCATGCCAGGTTAGAAANGGTANTCATTAAAGANOGAAAC 

NANCNNATTCT 

SEQ ID NO: 1 377 ACCTATGTTTCACCTCCTGGAAATGAAGAGGAAGAATCAAAAATCTTCACCA 
CTCTTGACCCTGCTTCTCTGOCrrGGCTGACTGAGGAGGAGCCAGAACCAGCAGAOGTCACAAGC 
ACCTCCCAGAGCCCTCACrCTCCAGATTCCAGTCAGAOCrCCCTGGCTCAGGAGGAAGAGGAGGA 
AGACCAAGGGAGAACCAGGAAACGGAAACAGAGTGGTCATTCCCACCCCGGCTGOAAAGCACCG 
CATGAAGGAGAAAAGAACACGAGAATGAAAGGNAAAGTOQCCCACXrrANCCTGAANAGAATGAA 
CGG>n'CNACC ACGAA ATTNAGCGCCTNACCCCNGAAATATAAGGCGACTCTCNAGCrCTNAmTN 
CCCATTGNGAATTTrCCX:CNAACATTNNAAKmK}GGGCATTA}mCCCCAC^ 
CCCACCTTTTCCANAA 

SEQ ID NO: 1378 GGTACGTGTGCGTAACACCCGAACCAGGAATCTNAGCTATGACCTTTTCACTT 
AGCTACGCTAAATGTCAGTCCAAGATAAAAGAGGAGATTAAAGATAAAACTOAAGAITAAAGAG 
ACTGTGAGTAGT GACAC ATTCAAGTOAGGCTGTTAATCTAGGTAAGTGACACTAAGAACCTGAAG 
AOACCCTATGAACr^ITrcCCTAGGAGGCCGTTTCCAACTGGCCT^^NGACAAACAOCTGATCATC^ 
CCGTTTATAATGATGGGNTCACTCCGGAGCTrCGAAOTATTCATCCTACATTrAATrGATCACCAC 
GAATGG OTNGGCAAGACCGTGTCANACCTTCAAGGGAACTCTCCTrrCCCACAANCCCCCAANCr 
TTirnNCAAAAGTCCCTCTGNTTTAAAGNCAAAAAAG 

SEQ ID NO: 1379 AATTCG CCCTrAGCOTGGTCGCGGCCGAGGTACCAAGAATTTCATAAATTTGT 
TTrcAGTOAACTGCTTrrrGCTATGGTAGGTCATTAAACACAGCACITACTCTTAAAAATGAAAAT 
TTCTGATCATCrAGGATATTGACACAmCAATTTGCAGl^CnTITNOACTGGATATATTAACAGT 
TCCTCTGAATGGCATrGATAGATGGTTCANAAGAGAAACNCAATGAAATAAAGAGAATATmATT 
CATGGCGATTAATTAAATTATTrGCCTAACTTAANAAAACNACTGNOCGTAACTCTCAGTOTG 
CTTAACTCCAITTNACATGAGGT 

SEQ ID NO: 1380 ACTGCAGCTAAACCAGCGGCITCAATAACAAGTAAGCCTGCTACACrTACAA 
CAACTAGTGCAACCAGTAAGTTGATCCATCCAGATGAGGATATATCCCTGGAAGAOAGAAGGGCA 
CAGTTACCrAAGTATCAACGTAATCnTCCTCGQCCAGGACAGGCCCCCATCGGTAATCCACCAGTT 
GQACCAATTGGAGGTATGATGCCACCACAGCCAGGCATCCCACAGCAACAAGGAATGAGACCCC 
CAATGCCACCTCATGGTCAGTATGGTGGTCATCATCAAGGCATGCCAGGATACCITCCTGGTGCTA 
TGCCCCCOTATGGGCAGGGAtXGCCAATGGTGCCCCCTTACCAGGGTGGGCCTCCTCGACCTCCG 
ATGGGAATGAGACCrCCTGTAATGTCGCAAGGTGGCCGTTACTGATCTTACTTCATCCAGTCTAAT 
AGCTmGGA GATTA AACOTriXn'CAACITGTGCTGTTTATATA^ 

TTTCATTGTGACTTTAACAAACATTATCT'fNCCACATACCAAGGAACTATTGGGACATrrATnTAC 

ATTGGGAAAAATTATTTGGAATTAATAAAAGCANGGAACTTTITCCTGGAAGTTTGC^ 

CTGGGATGGGGTrC>nTITTCATGGrrCATCTAGGGriTITAGAAATGAAATITANNAAAAT^ 

SEQ ID NO: 1 381 ACGCGGGCCTCTGGAAGCATGGAGACTGTGGTGATTGTTGCCATAGGTGTGC 
TGGCCACCATCTTTCTGGCTTCGTITGCAGCCTTGGTGCTGGTTrGCAGGCAGCGCTACTGCCGGCC 
GCGAGACCTGCTGCAGCGCTATGATTCTAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCA 
GTCTGAGCCCTCTGAGTTAGAACTGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCr 
GGAGAATGAAGACTGGATCGAAGATGCCTCGGGTCTCATGTCCCACrGCATTGCCATCTTGAAGA 
TTTGTCACACTCrGACAGAGAAGCTTGrrGCCATNACAATGGGCTCTGGGGCCAAGATGAAGACTT 
CAGCCAm'GTCANCGACATCATTGTGGTGQCCAAAGCCGGATCAGNCCCCAGGGlXSGATNATGTT 
GTTGAAANTC 

SEQ ID NO: 1382 AATTCGCCCTTAGCGTGGTCGCGOCCGAGGTACGCGGGGTCTACATGAACCA 

gcaaggcagatggaataatgtgaagccaattcgtcttaatggaaccaaagattctatgtttggca 
ttgcagtaaaaaatattgg anata ttaatcaagatggctacccatatattgcatnttggagctccn 
tctgatgacttgngaaaggntrmrrctatcatggatntgcnaatggaatnaaataccatacccaa 

CACAGQTrmrTAiiGGGGrmATCUCCTVATrTmGmTmtiCAATTGKTGG 

SE Q ID NO: 13 83 AATTC GCCC nTCG AGCGGCCGCCCGGGCAGGTAUi 1 i n ITrri 'i'l- l - l 'l' rn - i ' 
TTTTTTTITATAAACTTTTATGCATTITATTCAAACCAATTn^^ 

CATCAAAACCACACTGATACAAANATTCCAllTrCCTCTTGCTTGGGGGAAAAAGGGCTTGGTTGA 
NT^ ^^CAC CAGGTTTGCCAGGGACAACATOATATNCTGCT^GGCGATATTAACATACAAATCTANCT 
GCCCTITGTCCTGNGGGGANATGTCCANTCGNCCACTGAAANGGGGANATGGNTACTGTCCGGCC 
NGACNGGCNNAGATGGNAAATAGCGTTGGTGAGNCCT 
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SEQ ID NO: 1384 CKjTACrCCAOTCCTCCAGGACCTGCCACTATGATCCCGGATGGAACTTTGG 
GATTGATCCCACCAACAACTGAACGCrrTGGTCAGGCTGCTACAATGGAAGGAATnXW 
GGTCGAACTCCrCCTGCATTCAACCGTGCAGCTCCTGGAGCTGAATTTGCCCCAAACAAACGTCGC 
CGATACTAATAAGTTOCAGTGTCTAOTTTCTCAAAACCCTTAAAAQAAOGACCCriTr^ 
(XAGAATTCTACCCTGGAAAAGTGTTAGGGATTCCTTCCAATAGTTAGATCTACCXn'GCCT 
ACTCTAGGGAGTATGCTGGAGGCAGAGGGCAAGGGAGGGGTGGTATTAAACAAGTCAATTCTGTO 
TGGTATATTGTTTAATCAAGTTCTGTGTGGTOCATTCCTGAAGTCnXrrAATGTCACT 
GCCTGGGGAAACCATGGCAAAGTGGATCCAGrrAGAGCCCATTTAATCTTGATCAT TCCNG Trr^ 
TTTTTTTTTGGCCATCTTGNTITCATTTGGCTTGCCCCGCCCCCaAAACGGAGT^ 
OCAGNCTTGGAAGTGTANATGGCNATGATCTC>IGGTTNACT^GCAAT^^Tm 
TA^^ACT^TGTNCAAGT^GN^^KmAAACTTCCrn^ACCCT^G^ 
AAA 

SEQ ID NO: 1 385 GGTAcn-iTn'rri-i'rnTM'n i i 1 1 1 rn iaaaaggncagci'I'i i iattgaac 

ATGTTATAAAANAGGTTTAGTCAAAAAGA(XAAAGCCCATGTCATCATCANACTCCTCGGATTCTT 

CUU - IUirJ GCITCCACTTTCTTCTCCTCAGCTGGAGCAGCAGCAGTGGAGGGGGCAGGACCTCCrG 

CrGGTGCAGCACCAGCTGCTGGAGCAGGTCCACCGGCCCCTACATTGCAAATGAGOCTCCCAATG 

TTGACGTTGGCCAGGGCCTTTGCAAACAAGCrAGGCCAAAAGGCTCAACATTrACACCGGCTGCT 

rrAATGAGGGCATTCATCITATCCTCCGGGACTGTCACCTCATCGTCGTGCAAAATGAGGGCCGA^ 

TAGATOCAGGCGAGCTCNOANACAOAGGCCATGGCNCCGGOCCAhrraTAGGGCT^ 

GGACCCCGGTGCTTATTCCGCCGGATNAAGTGAGGOCCTCACCCCAACCNCAACOT 

GAAGGACCCACCCCTTTGGCNGGAAANCTTGAGGAAAAAGGGCCCCCCGCrTTNCT^ 

NGGCNGTT 

SEQ ID NO: 1386 GGTACTGGATGAAGCTGACGAAATGrTAAGCCGTGGATrCAAGGACCAGATC 
TATGACATATTCCAAAAGCTCAACAGCAACACCCAGGTAOTTITGCTGTCAGCCACAATGCCTTCT 
GATGTGCTTGAGGTGACCAAGAAGTrCATGAGGGACCCCATTCGOATTCTTGTCAAGAAGGAAGA 
GTTGACCCTGGAGGGTATCCGCCAGTTCTACATCAACATGGAACGAGGGOAGTGGAAGCTGGACA 
CACTATOTGACTTOTATGAAACCCTGACCATCACCCAGGCAGTCATCTTNATCAACACCCGGAGG 
AAGGTGGACTGGCTCACCGAGAAGATGCATGCTCGAGATITCACrTGTATCCXiCCNTGCATGGAG 
ATATGGACCAANAAGGAACNAGACGTQATTTATGAAGGGGAGTTTCTmCTNGCTTCTAGCANA 
AGT^TTTGAT^NCCA>m"GAACCTNGCTGGNCANAAKCNA^ITGATGTTGCCAGCAAGGT^^^ 
TAATTCCATCAACCTNTTGACC 

SEQ ID NO: 1387 GGTACAAAGAAAOTTTTAAGTCAAGGCCTCACCAATTCCTACAGTATTAGTA 
nGTOTCTCAATTCTCAAAACTAACTTrrAAAAAGCTTAAACTTAACCTAAN^ 
AATATAAACTAGAATGAACAAACATGAGAAATAmCTTTGAATCAGGGAGCTAGCACCTTTGAG 
TmCCAAAAAGCACGTCTCCCCAGTGTGTTCACTGTGATGTGGTGTAAAAGATCCACA 
TACTTAAACTACTTAAACTTAGATAACATCACrrCTGAAGTATACTACCAAAATGTTAATTGAGA^ 
AGCrGAAAATAGTTTTAGmACTCATTATCACATGCTAGAAGAAAATTrTGCATGAGAAAACACT 
GAAGAGGTAATTTTTTAATCXIAGATTTTTTCCAAACTCAT^ 
TCTATTATAACTGCTCTTAATTGCTTGTTGGCTGCCTGGGAAAAATGATTTGAACT 
GTTNCNTAACAAAACTTNGTCAATAGCCCCCCCCGTACCTGCCCCNGGO^GGGCXrNOT 
GGCGAAmCCAANCACCAOTGGCGGGCCGTTNCCTAGTIX}GAATTCCNNACCrrCGGTACC^ 
NCCTTGGCXiTT^^ATTCATGGGGCAATNANCCTG^TITCCCTGGNG^^VAAAAATTG^r^^ 
CNNCAA 

SEQ ID NO: 1388 ACTACCCAGGCAGGTATGGAGGGGCTCCCGGAGGGTCTGCGTTTCCCGGACA 
AACTCAGGATCCGCTGTATGOTTACTTTGCTGCTOTAGCTGGACAOGATOGGCAGATAGATGCTOA 
TGAATTGCAGAGATGTCTGACACAGTCTGGCArrGCTGGAGGATACAAACCITITAACCTGGAGA 
CTTGCCGGCTTATGGT TTCAA T GCTGGATAG AOATATGTCTGGCACAATGGGTTTCAATGAATTrA 
/U^GAACTCTGGGCTGCCTTTGGri ri J"l 1 1 i NTCCTT 

SEQ ID NO: 1389 ggtacgcgggattcgagtagcggctcttccaagctcaaagaagcagaggccg 
ctottcgtttccttraggtcittccactaaagtcggagtatcritcttccaaaam 
ggccgttccaaggagcgcxjaggtcgggatggatcttgaaggggaccgcaatggaggagcaaaga 
agaagaacrrrmaaactgaacaataaaagtgaaaaagataanaaggaaaagaaaccaactgt 
cagtgtattttcaatgtttcgctarrcaaattggcttgacaagttgtatatggtggtggga^ 
gctgccatcatccatggggctggactrcctctcatgatgcrggtgtttggagaaatgacagatatc 
rn'gcaaatgcaggaaatttagaagatctgattgtcaaacatcnctaatagaagtgatatcaatg 

ATACAGGGTrCTTCATGAATCrGGGAGGAAGACATTGACCAGGTATNCCTATTATTACAGTGGGA 
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ATTGGTGCTGGNGGTGCXrrGGnTGCTTGC 

SEQDDNO: 1390 GGTACATTAACTTCAAAAGTCTTITCAATCTGAGGATCTTGTGTAGCAAACAA 
ATCTOATGTATAGACTACACCAQCATTATTrACTAAAATACrrAACATCTCCAATTTCTGOCTTCACC 
TTCTTTGCAGAGCTGTAAATATCTTCTCGGTTGCTGCAGTCTACCACAAAGGTATGA^ 
CCCAGTCCCTTGCATTTGGCAGCTGTTTCCTCCAGTrcATGCTTATTTATATCCCAGAGAACCAGCT 
TGCTTITAAGTTTAGCAAATTCATAGGCAGTCAGTCTCCCAATTCCATGCCCAGCTCCTGT^ 
GCACCGATTTCGCCGGTGACTGATmCrCCTCTTAGGAATAAAAAGCTTCACGAAG GACTCT ANG 
GAGCAGACGATCAGTAACCGGGAGAAGCAGGAGGATaTCCAGAAGAAATTTCATCCCTTTTGTGG 
CTGCNAGCGTTTTGGGTGTGTTTTITITITrTmAC^ 
GATCTAA 

SEQ ID NO: 1 39 1 GGTACC rAGGGATGATTGGGAAGAAAAAAGGCACTTTAGAAGAGATAGTTTT 
GATGATCGTGGTCCTAGTCTCAACCCAGTGCTTGATTATGACCATGGAAGTCGTTCTCAAGAATCT 
GGTTATTATGACAGAATGGATTATGAAGATGACAGATrAAGAGATOGAGAAAGGTQTAGGGATG 
ATT Cl - lil - ll ' I GGTGAGACCTCGCATAACrATCATAAA'rn'GACAGTGAGTATGANAGAATGGGAC 
GTGGTCCTGGCCOCTTACAAGAGAGATCTCTNTTTGAGAAAAAGAGAGGCGCTCCNCCAAGTAGC 
AATATTOAAGACTrCCATGGACTCrrACCGAAGGGTTATCCCCATCTGTGCTCTATATGTNATTTGC 
CAGTTCATTCTAATAAGGAGTGGAGTCAACATATCAATGNGAGCANGTCACAAGTCGTCNATGCC 
CCTTTCrrCTTGAAATCTACCCANAATQGAArrCTGAC 

SEQ ID NO: 1392 GGTACTTTTCAATCCAAGAAAAAAAAATAAACTGAGACATGGTCATGAGTTC 
AGGATTATATATATTACAATTTGCCrTGTTATAATACATTTGTGGCTITATGATAAAAATAACTCAG 
GGACATATGGAATTCAAGCTGATTTGCGTAACTGTCACAAGAAAAAAAGCATTAAATGCATTTCT 
GAAATAAGTATTTTCATTAATTTCAGAATCTCAAAACAGCATTAGACCTTGCCnTGTT^ 
ATTTAGTTCAACACrAAGCTAGCTAAATAACCTTCTTGAAAGGTTTAAACACAATTGAT^ 
ATGCTTTCCCTCTAATCTCAACTTTACATAAATGGAACAGGTGAGGAAGAAAAGGTAGACAATT^ 
TTTAGCAGCATATATGGCTAAATCCATCAACCACCTTAAACTAGAATGTCCATTATTG ACCCC ATT 
AAAAAOAACTGGGGAAATATCAAAAATTACTOATTTCATA AACCA AAANGACATCAATTTTAAGG 
GTTACAGrrAGTTAAAATATTTTTGAGACACrrrAAAAAACl'i'ri'rAACCCATAAACCI'ATAAAGA 
ATTCATCTTTTTCATTTNGTAAGGAAAGGAAANGCCCTANCCACGAAAAGANNTT^ 
AA(^^SrGATTTTTTAAATTCCTCTT^mGAACNAANCCNAAATGGAAAAAAm 
AA 

SEQ ID NO: 1 393 GGTACGAGGACTGGATGGAAAGGTGATTTGTGGCTCCCGAGTGAGGGTTGAA 
CTATCGACAGGCATGCCTCGGAGATCACGTTTTGATAGACCACCTGCCCGACGTCCCrTTGATCCA 
AATGATAGATGCTATGAGTGTGGCGAAAAGGGACATTATGCTTATGATTGTCATCGTTACAGCCG 
GCGAAGAAGAAGCAGGTCACGGTCTAGATCACATTCTCGATCCAGAGGAAGGCGATACTCTCGCT 
CACGCAGCAGGAGCAGGGGACGAAGGTCAAGGTCAGCATCTCXn'CGACGATCAAGATCrATCTCT 
CTrCOTAGATCAAGATCAGCTTCACTCAGAAOATCTAGOTCTGGTrCTATAAAAGGATCGAGGTAT 
TTCCAATCCCCGTCGAGGTCAAGATCAAGATCCAGGTCTATTTCACGACCAANAAGCAGCCGATC 
AAAGTCCANATCTCCATCTCCAAAAAGAAAGTCGTTCCCCCATCANGAAGTNCTCGCAGAANNTG 
CAAGTCCTGAAAAGAATGGACTGGAAGCTTCTCAAATTTCACCCCTTTATGGGAAAAAGGT^ 
TTGGTITNCCAlTrATTAJn'AAGGGGATTTNTGm'GTCNTGTAAANGTGh^^ 
AATATTTCNACCCNTTTrAAlX;ANAAAATGOGArrC>m^GAATrrACCTOT 
OWGTAAN 

SEQ ID NO: 1 394 GGTACGCGGGGGTCCTTGTCCAGTGAAACACCCTCGGCTGGGAAGTCAGTTC 
GTTCTCTCCTCTCClCrCTTCTTGTITGAACATGGTGCGGACTAAAGCAGACAGTG^^ 
TACAGAAAAGTGGTGGCTGCrCGAGCCCCCAGAAAGGTGCrrGGTTCTTCCACCTCTGCCACTAAT 
TCGACATCAGTTTCATCGAGGAAAGCTGAAAATAAATATGCAGGAGGGAACCCCGTTTGCGTGCG 
CCCAACTCCCAAGTGGCAAAAA.GGAATTGGAGAATTCTTTAGGTTGTCCCCTAAAGATTCTGA^ 
AAGAGAATCAGArrCCTGAAGAGGCAGGAAGCAGTGGCTTANGAAAAGCAAAGA GAAAA GCATG 
TCCTTTGCAACCTGATCACACAAAATGATGAAAAAGAATAGAAaTrCTCATTCATCT^ 
ACGGCTCCTTGTTACCCTGGTATTCTAGAATGTAAATTTACATANATGTNGTTGGTTCCAANTrTAG 
CTITNGTIT^AACAGGCATrTAATOAAAAAATmAGGGTTTAAAATTrANOATGT^ 
TGmTGTGNAANATrrGGNAAATrTGTAGGAACTAAATTGATGGNAAACCTTNACCTTANNTATGT 
CAAATTTAATGNCArrGTrGTGGNNTTCTNNITCCCCAAAATTAAGTGTCT^ 1 IGNTT 

SEQ ID NO: 1395 GOTACl- 1 -lUl'riUiU'l'i'l 11 lU^ ' l - llU 1 - lllN GGCTTTTAGTAANAGGTGCCTAT 
GAAAGAGGNCAATTGGAGATCTAGAAAAATACAAGTGATGGTTGATTCCCCATTGTTCCAGTrAT 
TAATArrAGTTrrACCAACATACAANAAAATCTTTITrcaTCAACACT^ 
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AAGATTCCAAAATTTTCATAACATAAATTAATTACCAAGAATrGTATTACATGAGATTTOT 
GCAACTAAGTTCTAhmTCTATTTTGTCAATGTGATACNTAATAAAATCT 

ATTCTCCrCGCGTACACAAAGGATGCTGGACCAGACrCTGCTTGACCTGAATGAGATGTAGAATG 
a:CCAGTC(X:ACCCTGCTGCrnHn"CCTTCCTTroACCCTGACTrCG NCT^ 
AGCTGACCTrTACCTGAGGGCTTGATCTrTCAATTGGAAAOTWOCTTim 
TCCirCCCTGCCTTmThnT^ANCAAAACrrGTNTCT 

TAGGAANGCTNAAAAACCCTTTGCAAANAACCbnTrTNAAGGGGAATGGGGANNCCC>^ 

^rrAATGGGTNTTTTAAAAA^^NCATNG^r^GGTGGAAAAAAAAAAAfACCA^^ 

A 

SEQ ID NO: 1396 GGACACAAGCnTGAGGAAGTGCAAAGGACTGACCTCTAOOCCAGAACAAG 
ATGGAAAACTACCAGGCCCATCAGGCCTATAACCCAGACACCAGCATGGACAAAACTCAGTTATA 
CTGAATTCAGAGACAAAATTCAGTGACACTmCTACrACTTATTrAGGGTTCTACAGC^m 
TGAGCAGACTTAGTTrTTTGTTTTTGTrrTACAAACCT^ 

AACTAGGACTACGATGTTAAGACAACCACTAGCAGACAGCn'GCGGAC AGTTA CT GGGT CTGAAGG 

TGAOGCTTCCCAACTCAAACAGAGAAGTCATTGGGATAAACTCTGCCTCTTrCAT^^ 

CATATCCTTAAAACAGGACAAGATGGGATGACCACACCAGGTAACATCACCGAGGTCTCAAAGTA 

GCAGCTAAAATGTGCCAGAACTCrrCACCCTGTCTAGGACCTATGTTCCACrCTCTCAGCAACCAA 

AAGGGATCTATAACATGTTTTCTINAATTAAGGCTTTTAG^^ 

CCCTTCACTGCTCANAAACCAATTCTAOCTCITrTGGClU'l'C"^ CVIC I i C NTTCTG GGGGG 

GNATCACCTGNCCa^ATT^mCCAGGC^m:GaNCCCCCN^^ITNT^CANAAA 

TT 

SEQ ID NO: 1397 ACCCTAAAACXrrAGTATCTTmCTCTTCTATGGAAAATCCGAAGGTCTAAAC 
TTQACTTIT^TGAOQTCrITCTCAACTTGACTACAGTTGTGCTCL^TAA^^ 

AAlTATTTTAAGGAACAAATGAAAACrCTGGGCrGGGTGGAGTGGCTCATACCTCTAATCCCAGC 
ACTTTGGGAGGCTAOjGTGGGCAGATCATCTGAGGCCAGNAGTTNGNGACCTGCATGGCCAANAT 
GGAACANCCCNGTTTTTANAANANTTTTAAAAATNNNCCTGOT 

SEQ ID NO: 1 398 acaaagacagcacaagaaaaaaacagatttcataaaaatagtgattctggtt 
cttcaaagacatttccaacaaggaaagttgctaaagaaggtggacctaaagtcacatctaggaac 
tttgagaaaagtatcacaaaacttgggaaaaagggtgtaaagcagttcaagaataagcagcatg 
gggacaaatcaccaaagaaCaaattccagccggcaaataanttcaacaagaagagaaaattcca 
gccagatggtagaagcgatgaatcagcagccaaoaagcccaaatogoatgacttcaaaaaoaag 
aaagaaagaactgaagcaaancagactacitcagtgttaaaaccnacitn'gacatitn'gg'™ 
oggcaanccagatgtgggganattttaanaagaaaanagtgtgnctaanaaaatngagtaahftt 
tnacgagngntrrttncacangrrottcnngggaaatttataacntatagcattg^ 

NTT 

SEQ ID NO: 1399 ACTOOTATTTGAGCTTCAAAGTAAAAATAAAGCAGAGGACAAACGCCrrGTCA 
CAGCGTGGCAGCCCOGTTCATTTGGTAACAGAACAAGTCTCACACGTCGGCTGAGCCCACCGGAG 
AGAAGCACGCAGGCCAGCTTCOTCTGAACACAGCCAGGCACCrGTTCAAAAGCCTCTCTGGTGG 
TGGCGGTGGACCCCAAAACCCACGGGriXK3AAGAGCCTTTGGGAAAGCAAArrGATACGCT0ATT 
GCAGTCCTTGAAGAAATCACTrAGGGTTAGTrTAATAAACGAACATrAAAAAATCT TCCAA AAATr 
GCATTCAAATGTATmACACrn'AGGCAQTTCrrAAGAAAGGGA GAAG TQCAGTITCrTTAAACT 
ITACATTTAAAAGGTrTArrGGCCACATTAATCnTCCTCACAAAAGTnTAAGTrATNTATAAGTTC 
TTAGANTGTCAAAGAATAGAAAAAAAACTGACTAACTTTAANGGGCGGGGGGAAGCAGCAAAAA 
AACTTGGCATTAA 

SEQ ID NO: 1400 ACTGGTAAAAGAAAAAAGAAAAAACCCACAGAATCACAGAATTCATGTTCTC 
ACAGTAGCCAGAAGCTAAAATAATGTGCTTCTITATTGCACGTCAAACCAAGAAAGGTATAAAAG 
CCAATGATrCTCAATCCAAATGTGAAAGTCAGGTrCACCTCAACTGGATrCGGAATGTGCr 
CCATTGGACTCAAGTTGATCTCACTTATATTCTGAGTGCCGGGAGGTGAATGCATCAAGGATAGTG 
ATGAAGGTGTGAGACTTTCGACAATAAACTTGTTTAAAAGTTTCTGTCC 

SEQ ID NO: 1401 ACTTTGCAAATrATCTAGTrAATTTGCATCATGAAT CAAATT CTTAGCTTCTA 
AAGGGCAGACACAGTCTTTAATTGCTTrrAGATAGGCACTTAAGAAATGCTTTTGATGAGGAGAA 
AACCCCAAAGATTGTGAATCACTCTATTAACTCCAGGGAAGAATOAGACTCTCAACAAAGTAGCT 
TTCCAGAACTACCCrrcCAGTGCCAAGAAAGTAAAACTATTTGCAAGClCrTTCTGGTAATAAGAA 
TOAGGGAACTCAAAACAAAAAATGCTCTATTATGCAATTAAAAAGTA AACTT AGGTAAATCAAGT 
TTCrrmATCAATCTCTGTCAATTIXjAATTAACCGGTATTTTTC^TACCTTrTCAAGAATAC^ 
ATCCTNNAACTAGACAATGGTTGCATTT^mTC^^'ATNTTACCT^^TC^m'ATCCACC^ 
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CNGGTCXrrTTrrAAAAAAAGGGGAGCGNTGTGGGGTGAGAAAAATCrCATAAACCCTCAGC^^ 
GTTAATATTNTGGTTAATT 

SEQ ID NO: 1402 ACOCGGGCTAOATACACrGTCAOATCCTTTGGCATCCGGAGAAATGAAAAGA 
TTGCTGTCCACTGCACAGTTCGAGGGGCCAAGGCAGAAGAAATCTTGGAGAAGGGTCTAAAGGTG 
CGOGAGTATGAGrrAAGAAAAAACAACTrCTCAGATACTGGAAACTTTGGTTTTGGGATCCAGGA 
ACACATCGATCTGGGTATCAAATATGACCX:AAGCATrGGTATCTACGGCCTGGACTTCrATGTGGT 
GCTGGGTAGGCCAGGTTTCAGCATCGCAGACAAGAAGCGCAGGACATGCTGCATTGGGGCCAACA 
CAGAATCATCAAA 

SEQ ID NO: 1403 ACTGTCTCrmGGAAAAGTTCTTGATCCCCAATGCTCCACAAGCAGAGAGCA 
AAGTCTrCTATTrGAAAATGAAAGGAGATTACTACCGTTACTTGGC TGAGG TTGCCGCTGGTGATG 
ACAAGAAAGGGATTGTCGATCAGTCACAACAAGCATACCAAGAAQCnTrrOAAATCAGCA^ 
GGAAATGCAACCAACACATCCTATCAGACTGGGTCTGGCOCnTAACTTCTCTGTGrrCTArrATGA 
GATTCTGAACTCCCCAGAGAAAGCCTGCTCTCTTGCAAAGACAGCTmGATGAAGCCATTGCT^ 
ACn-GATACATTAAGTGAAGAGTCANCNAAAGA 

SEQ ID NO: 1404 AACGCGGGCAGCACCTCOTGATTCTCAGTTTTGCTGGAGGCCGCAACCAGG 
CCCGCQCCGCCACCATGTTIXX}AAATCAGTATGACAATGATGTCACTGTTTGOAGCCCCCAGGGCA 
GGATTCATCAAATTGAATATGCAATGGAAGCTGTTAAACAAGGTTCAGCCACAGTTGGTCTGAAA 
TCAAAAACTCATGCAGTrm3GTTGCATTGAAAAGGGCGCAATCAGAGCrTGCAGCrCATCAGAA 
AAAAATTCrCCATGTTGACAArcATATTGGTATNTCAATTGCXjGGGCTTACTTGCT 
TGNTATGTAATTTTATGC^^>iANGANNGmTGGGA^TCCAAA^^r^GGAATa^ATANGACC^^ 
CCTGGGTTTNGTCATGGATTCTTTAAITNGGAA 

SEQ ID NO: 1405 ACATAAAGTAAAGGAGAACAGCTATATTGTCCTTTCTrATAAGCTTGTCACTGC 
AAAAAGTTGCCTTTroCTrGTCAGGTTrGGATAGAATATAATTG ATTGGT T TC 
AAGGCAQTAGATTTOCCTQTOTGGAATTTTTTrcCTAl-l'rriCriCi 1 11 IGcrnm iGCACTTAT 
CAGAAATArrTGATGCGTGCATTGITGAAAAATA(nXK}ATACATGTCTrGT CAGAA TTGTCGTATT 
AAAGTAATGCCTTTCCCrmTGCri"! CTT'l GGCAAATGTIWATOCACAAGNGTnTGGACTTACTTG 
AANAATTTACTGAAGCCCTNTGGGGACITAATTAANACCAAATmCNACTTCCC^ 
GAACCATTTCCATTTANCAAACCAGGAAAANACNATTGC^m'CCCATTrANGT^^ 
TTCCTTAAACTT 

SEQ ID NO: 1406 acaatnaaggaatggggaag gggg aaatgaaagaatagagaaaactatacg 

GTAGTAGTCAGGATGTGGTGGAACCAAATTGCAGrmCTAATTGAGAATGTAATOTGGTCn^ 

AAGAACAGAGTTCTOGAGTAAAGAAGCAGOrrCCCrmCAGTANACACCnX:CCGTCTGCTGm 

GAACACATCAATTGTATCTrCATCCTCCATTTCCXACTGTGCCAGGTGTGTCTGrrrCAT^^ 

TTGCCCGTCAAATCNGAATCTOATCTGCCTCATrGACAATCCCTGTCGGTCACAATAGGCTIT^ 

ANmACTTAAGGGCGAATGCCTTrrAATCTTAAACCGGACCCACATAAACNTOm'GTC^ 

CCTTTNAATATAANTAT 

SEQ ED NO: 1407 A Cl ' i ' n - n - l ' ri ' l ' i 11 i 1 U 1 1 1 1 I GGGAGGG GrTG AATAATCnTAATATTACAC 
ATAAACCACACTAAAATGCCTTTCAATAAGTAAAANAAACCATnTAAATACAGGGAATTATAAT 
TAGATTGGCATANTTAAGGCCAAAACTATAGACATNGCTACCTTATTTATCTTCAACCCTO 
AAGAGGCAAATGAACATGAACNCAAAACACAGGTGAATCTTGCTTGGTTCTAANACAT^ 
AA^T^CCCCAGT^^T^AAATNTATTCGCATNACCTGT^^^mAAAOT 
ANTTAmrnTAAAATGG 

SEQ ID NO: 1408 ACrrrGAArrrGAGAAGTGGGTGATCCCCTCTAGGCTTCCTGGAGGTCACATT 
TAAGCTAaACCrrGACAAATTGGTAGGATTTGGTCAGGCACTAGGAGTGGAGCATGAGCTCTGGG 
GACAGACAGTTATGGQTTCTGGTCCCACTrrrrATCACTrACTANTTGNTrGACCTTGQG 
ATTTCACCrTCTGTGCCTCAGTrrCCTCATCTGTAAAATGGGGCTAACAATATrACCT^ 
GATTTAATGATGTCAAOCTCCTCACTGGNGGCCITATTCCTTCGTGGAGCCCCCrAGGTGCCGACC 
CCT 

SEQ ID NO: 1409 ACrnTAAACTAGTAATAGAATnTCTGAAGAATATCCAAATAAACCACCAA 
CTQTTAQGTTmATCCAAAATGTTrcATCCAAATGTGTATGCTGATGGTAGCATATGrrTAGATAT 
CCTTCAGAATCGATGGAGTCCAACATATGATGTATCrrCTATCrrAAC ATCAAT TCAGTC^ 
GATGAACCGAATCCTAACAGTCCAGCCAATANCCATGGCAGCNCAGmTTTTNANGAAAACAAAC 
CGAGAATATGAGAAAAGAGTITCGGGGC:ATTGTTGAACAAAGCTGGAATO>mX:ATTATTANACC 
ACCGGGa^GTTAATCCTITrTCNATCATTGGCGGGG>rmAAm'ACCCC^^ 
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AANAAATTTNAAGTGGCCCCAGGTTITAAGGNTTTTG^OT 
ATTTANAAACNNAAAAANCrrGGGGTATTATGNTTAATGA 

SEQ ID NO: 1410 ACGCGGGATTGACATGAATGATATCAAAGCATTCTATCAGAAGATGTATGGT 
ATCTCCCTTTGCCAAGCCATCCTGGATGAAACCAAAGGAGATTATGAGAAAATCCTGG^ 
TGTGGAGGAAACTAAACATTCCCTTGATGGTCTCAAGCTNTGATCAGAAGA(^^ 
TTCATCCTATAAGCTTAAATAGGAAAGTTTCTTCAACAGGATTACAAGTGTAG 
GAAAAATATAGCCTTTAAATCATTTTTATATTATAACCTCTGTATAAT^^ 
TTAATAATANGTTTNNCC 

SEQ ID NO: 1 4 11 ACNCGGGGCGCGTCTTGTTCTTGCCTGGTGTCGGTGGTTAGTTTCTGCGACTT 
GTGrrGGGACTGCTGATAGGAAGATGTCTTCAGGAAATGCTAAAATTGGGCACCCTGCCCCCAAC 
TTCNAAGCCACAGCTGTTNTGCCAGATGGTCAGTTTAAAGATATCAGCCTGTCTGACT^ 

aaatatgttgtgttctttttttaccctcttgac^ 

tcagtgatngggcanaagaattttaagaaactcaacctggccaaaggaattggg 
ggattcitcnctttctt 

SEQ ID NO: 1412 actcctcataatcctgataggtatcagaaaaactcagacgtatttccctatga 
cagtcataccaatcatcttcctgggcttcagtctgcaatctgaaggaagaccgcctgggtaggttt 
ctccggctctgcttagctctctggctagccccctctttaagtcttgtcrrc^^ 
gattctcgatatttctctggattgggggtgtcacttccatttggtggttg™ 
catccacaggcgttnattcagagtctttcagctgattatcagaagaccttccaaggatgnctt 
ttccntngctntgggaataaggttcttggnctgnaaaattant gncn^ 

TGGCNTNCANCTTrCCAAACCNGGGGACNTTTTGTTCAATAACl'lir 

SEQ ID NO: 1413 acgcggggggagtcagtcccaaccaggacacagcacggacatgagggtccc 
tgctcagctcctggggctcctgctgctctggctctcaggtgccagatgtgacatccagatgaccca 
gtctccatcctccctgtcttcatctgtgggagacagagtcaccatcacttgccaggcgagtcagga 

CGTCGGGGACGATCTAAATTGGTATCAACAGCGGCCAGGGAAAGCCCCTAAACTCCTGATCTTCG 

ATGCTTCCAATTTGGAAACAGGGGTCCCTTTCGAGGTTCGCTGGAAGAAGGGm 

NANCGTTTCATNATTANCAANGC 

SEQ ID NO: 1414 ACATTGGTGATCGGAGTATAGTTGGAGCGCTTTGTCATGATTrCCAGGTTGGC 
TTTGTCCACAGCTATGTTGGCCAATGCACCTTGAGCCTCAAAGCTGGCAAATCGTCC 
AAGCCGCCAGACCGTCTCC]TCTTTGCCATATCCACATGGAAAATCTCATCACCATC 
CATAAACTCGCCTGATTGGTCAGGATTCAAATAGAACTCGGCCTGGATGATCACATGTTCTT 
GATAGCCCATGATTCCTGAGCGCTCATCAGCACAGCTATGATGAAAAATTCTAGCACAGGGACTC 
CCTTATGGNCATTTTTTTTTGGGGCGCTCT^ 
ACCNGACNTNTTTNNATGANAA 

SEQ ID NO: 141 5 acaacagttgatgatgaagaacacgatgataaggaagaagaggaggaggaa 

GATGATGACTGCTGGCTTGAAGACTTGTTGTTGTrmGCrC^ 

CTGATGAACTGGCATTCTGTGTTAATGTTGTCATAAGTGCCGAAGATGATGAATAGCCAACTGm 

CCCTGGCCATTGAAAATTCrmCCCAACTGAAAGTCATTATTCTTA^ 

TAAACTTGATGTTNTTCGTCCCTCCTTCATCTGNCTNTATCCTGAACTGm 

GTTATTNCCCCCNCCCANTTTCCTOAAG^^ITACCNAACCKATNGT^ 

GNGGAAATNCCTTTNTAGGCNTTTNGTAA 

SEQ ID NO: 1 4 1 6 ACTTGATTTTGAACACAGCACGAAAACATTTTGG AGCT GGTGGAAATCAGCG 
GATTCGCTTCACACTGCCACCTrrGGTATTTGCAGCTTACCAGCTGGCIT^ 
TTCTAAAGTGGATGACAAATGGGAAAAGAAATGCCAGAAGATTTTT^ 
TCAGTGCTTTGATCAAAGCAGAGCTGGCAGAATTGCCCTTAAGACTTm 
CTGCTGGGGAAATTGGTmGAAAATCATGAGACAGTCGCATATGAATTCATGTCC^^ 
CTCTGTNTGAAGATGAAATCAGCGATTCCAAAAGCACAGNTNGCOTGCCATCAACCTGGT^ 

SEQ ID NO: 1417 ACAGCCAACGGTTTCCCTTGGGGGCITrGAAATAACACCACCAGTGGTC^ 
AGGTTGAAGTGTGGTTCANGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCANAGTCATAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGGAAANCGGNCTG 
CCCTGGAGGTGGTATCAANGGTTCCACAGAAAAAANTAAAACTTNNCTGCTGATGAAGATGA 
CGATGATGATNANGANGATGANTATTGAAANATGATGATTGATGA 
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SEQ ID NO: 1418 ACAGCTTAAACCACAATGGTATAAATCTTCATTTTGTAATTAATAATTTOT 
CATAACAATGTTTGATATTTGCAAACAAACAACATTTTTGGAAGCATTAGATT^ 
CTGTGACAAAATTAACTACAGTCAGTCTGTGCAATGAAATTGATGTTGTTGGAGTO 
GGCATTTCATGTTGAAAACAGATGGTAGTGCTCCTAGAAATATTTCTTCT^^ 
GGAACTACACATGTATAACCAATGACTGACTCTGAAATATCAAGCACTGTGGGGTGGCTGGAAGG 
TAAAGGNCTAAGCTTTGTGAAACNCTATCNTATANAATC 

SEQ ID NO: 1419 ACATGTTTCTGCTAAAGATAAAGGCNCAGGACGTGAGCANCAGATTGTAATC 
CANTCTTCTGGTGGATTAAGCAAAGATGATATTGAAAATATGGTTAAAAATGCAGAGA^ 
TGAAGAAGACCGGCGAAAGAAGGAACGAGTTGAAGCAGTTAATATGGCTGAAGGAATCATTCAC 
GACACAGAAACCANGATGGAAGAATTCANGGACCNATTACCTGCTGATGAGTGCANCAAGCTGA 
AAGAAGAGATTTCCAAAATGAGGGAGCTCCTGGCTTGAAAAGACAGCGAANCAGGNGAAAATNT 
TNTCANGCANCTTTCNTTCTTOWCANGGCTTAT^ 
TTGGCTNCTTACCAAGANGGCTCTGGAANTITITGNCCNTGGGGNAC^ 
GGGGAANAACCCGTNTTTATTTGCNGANNTTTTGAANCCCATANAGA^^ 
GGTGAAGNACTNNCTGA 

SEQ ID NO: 1420 ACCGTTTTCAAGATAAAATAACAAAAAAGAAATACATATTm 
TACCTATACAAAATTTTAAAATTTACACCATCTGAGCTGCCAAAAAAGG^ 
TTTCCCCATACATAAAACTATCTCTCATCAGTAGGTCCAAGGGAACm'GGGGGATTCA^^ 
CAGCACTTTACGGAAGGGAGGAGGATAAAAATTCCCCTTCCTGCTTTCCTTTACC^^ 
CAAAAACAAAGGTCGTATGTTTACAGTAGTGTATTCTACTTATTACAAACAACTGANATTAA^ 
AAAAAAAATNCACCCCGNGNNCCCNTTNGNGCCTTNACTNGNT^ 
CTTGGGAAA 

SEQ ID NO: 142 1 ACAGCAGCAGCAGACACGCATCGCAGAGCTGGAGAAGACGTCAGCTGAACA 
CAAACACCAGCTGGCGGAGCAGAAGCGAGACATCCAGCTGCTAAAGGCATACATGCGTGCAATC 
CGCAGTGTCAACCCCAACCrrCAGAACCTGGAGGAGACAATTGAATACAACGAGATCCTAGAGTG 
GGTCAACTCCCTTCAACCAGCAAGAGTGACCCGCTGGGGAGGGATGATCTCGACTCCTGATGCTG 
TGCTCCAGGCTGTAATCAAGCGCTCCCTGGTGGAGAGTGGCTGTCCTGCTTCTATTGTCAACG/^ 
CTGATTGAAAATGCCCACGANNCGTAGCTGGCCCATGGTCTTGGCCCACTATAGAACTAAACANA 
TGACCCGCGCTCTTTTAAGACCTACTTTGGCCAANCGCrrCCT^ 
ATGGCCTGTGATAACC 

SEQ ID NO: 1422 ACrrGCCCCTTCCCCAGAAAAGCGGGACTTGCTGCTAAGGGTGAAGGACCAA 
GGCAGTTGTCCCTGCGTGGTCTGACACCCTTGAAACGTGGGTGTATAATCAGAGAGGCATCCCTGC 
AATGATTAAACACCAAGGGAAGGCrGCCTTCCCAGTCTGTGACCAGCGCCGGAGTTI^ 
CGGATAAAACGTGTCITITITGTCTCTACCAGAAAATGAAAGG 
AGATTGNAGTGTANTGCCAANATTGAANGGANAAAGTGGTTGTGGGAA 

SEQ ID NO: 1423 ACGGCCAACGCCAAGTAGGGGATTGCGTTCCCTCCAGTCGCAGAC CCTATC A 
GATTTGGATATGTCCTTCATATTTGATTGGATTTACAGTGGTTTCAGCAGTGTGCT^ 
GATrATATAAGAAAACTGGTAAACTGGTATTTCTTGGATTGGATAATGCAGGAAAAAC^^ 
CTACACATGCTAAAAGATGACAGACTTGGACAACATGTCCCAACATTACATCCCACTTCCGAAGA 
ACTGACCATTGCTGGCATGACGTTTACAACTTrrGATCTGGGTGGACATGTTCAAG^ 
TGTGGAAAAACTACCTTCCTGCTATTCAANNGCATTTGTTTT^ 
AAAGGCTNTTANAGTCAAAAANANAANTTTGGTTCCNTATGANAGAT^ 
NNGCC^OTACTGATNTTTGGAATAAANC^WACAGACTTGAAA 
AGAGATGTTTTGTTTNTNTNGNTCAGAACAACAGGAAAAGGGGATTT^ 

SEQ ID NO: 1424 TAATTNCGCCCTTACGCGCTNGTCNNNGGCCNCGCACGCGGGGGC^^ 

GANAGACNAGGCTACCATGAAGGAGCCNANCGCANACCCTGANTCCGTCACCCATGGATCGCAG 

CGCGGAGTTCAGGAAATGGAAGGCGCANTGTTTGAGCAAAGCGGNCCTCA ACCG GAAGGGCAGT 

GNTGACAAGGATGTGGTATAGCTTGTGCNGNTTCTGAACATGCGAGATCAKirm^^^ 

TCCTGCGCTGGCCGNATNCTACTCCTTGNCCGGGGTATAAATGGTTTTGAGGNTCAT 

AGTNGCTGGCTACTGGTTACACACAAACmGTGTAANANCATGATGTNGATTGTANCT 

AAACNAANTGGGGGNGCAACTNTNAAATTTGAAACNTm 

CAGGATGCCCATNTCCTCATTCCTNGCAATAGATTNTNGTTTCANGAACTCT^ 

AAGAGAGGAAAAATITT 

SEQ ID NO: 1 425 ACAGAAAAGCCCAGATTTAAATACATTTAATATGTCGTTTTAAAAATGAT^ 
AATAATTCATTTCTTAAAACACTGAATGAATTTTGAAGCTTAATGTT^ 
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TTTGACATCTAATTTACCATCAAGTTGTAAAATTATTTGGAAAAATACAGAACT^ 

TACTTATATGGAATCTGCGTGTGAGGTGTTTGAGGGCATATGTTTGAAAGAGGGAGCATCACCAC 

AGGAATCCTTTCTGTGAGGTGGAAACAGTGGTCCTGAATCATTGTGCTCACACCTAACTNGAAATC 

TGGTCTTACTTTCATNGCTGGTATTGATTCCCTGGTGGNNNCAGNGTTTA^ 

AGTTGGGTAGGGCNAATGGTNTTTAAATTGAAACTATTTGAAAAATGCNCTCnTO 

NTAATNTCNTTTCirCCCTTTNTTGGCrACACCAGNGATCAANNAGT^^ 

GTT 

SEQ ID NO: 1426 ACAAGTAAGAAGCTGGCGAAACATAGAGCTGCAGAGGCTGCCATAAACATTT 
TGAAAGCCAATGCAAGTATTTGCTTTGCAGTTCCTGACCCCTTAATGCCTGACCCTTCCAA^^ 
CAAAGAACCAGCTTAATCCTATTGGTTCATTACAGGAATTGGCTATTCATCATGGCTGGAGACT^ 
CTGAATATACCCTTTCCCAGGAGGGAGGACCTGCTCATAAGAGAGAATATACTACAATTTGCAGG 
CTAGAGTCATTTATGGAAACTGGAAAGGGGGCATNAAAAAAAGCAAGCCAAAAGGAATGCTGCT 
GAGAAATTTCTTGCCAAATTTANTAATATTTCTNCAGAGAACCACATTTCriTACAAATGGA^ 
GGAAATTCTTTAGGATGTACCTCGGNCGCGACCACGCTAANGGCG 

SEQ ID NO: 1427 ACCCCTTAACCCCTTCTCCTTCACCCTTAGCAGCAAGTCCCACTTTTCTAGGG 
GGCAAGAAACCCCAAACCCCnTCCCTCCGTGTCTTTACGCTCTCTTTTCTCTGGGT^ 
ACTATGGGCAACCTTCCATCCTCCATTCCTCCTTCTCCCTTAGCCTGTGTGCTCAAGAAOT 
CTCTTCAACTCACACCTGACCTAAAACCTAAATGCCTCATTTTCTTCTGCAACACCGCT^ 
ATACAAACTTGACAATGGCTCTAAATGGCAGAAAATGGCACTTTCGATTTCTCCATCCTACAAGAA 
CTAAATAATTTTTGTCAAAAAAATTGGGCAAATGGTCTGAGNGC^^ 
ACACATCCGTACCTTCCTAGTCTCTGTGCCCAGNCAACTTCGTCCCAAATCTTCCr^^ 
ACTGGTNCCTTAANTCCNAACCCCAGCGNTGCTGAGTGNNCTAATNTTCC^^ 
TTGAACTNTCO^TTCTTCAACGCCAAGTAGGNCCAATTTTTCTO^ 

SEQ ID NO: 1428 ACTTCTACACATCTGCCTAACTTGGGAATGAATGTGGGAGAAAATCGCTGCT 
GCTGAGATGGACTCCAGAAGAAGAAACTGTTTCTCCAGGCGACTTTGAACCCATTT^ 
TTCATATTATTAAACTAGTCAAAAATGCTAAAATAATTTGGGAGAAAATATTT^ 
ATAGTTTCATGTTTATCTTTTATTATGTTTTGTGAAGTTGTGTCTm 

CCAATATTTCCTTATATCTATCCATAACATTTATACTACATTTGTAAGAGAATATGCATGTC 
TAACACTTTATAAGGTAAAAATGAGGTTTCCAAGATTTAATAATCTGAT^ 

CAAATAGAATGGGACrrGGTCTGNTAAGGGCTNAGGAGAAGAGGAAGATAANGTTAAAAGTTGT 

TAATTGACCAAACATTCTAAAAGAAATGCAAAAAAAAAAGTTTATTTTCA^ 

TAANGGAAAGCAGAATCATTTTTCCTAANTGCCNTATCATTTTGGGAGA^ 

SEQ ID NO: 1429 ATATCCGC>rmAATTCGCCCTTTNGNGCGGCCNNCCGGGCNGGTNCCACCTA 
CACCCAACNAGTCANTGAGGGACTTCTTTTTAATTTGGTAGGATTTTG^^ 

GGTCTATTATTAGAGTCACCTATGACAAAAAAATAGGGGTTACCTAGATAATGCCNAAGTCAGCA 

TTTGTCCTGGGTTCCCTTGTGTGATCTGTTTGGACTATGTTTTCTm 

NCTTGGGCTTCCATTCTAGTTCITTTACCAANATTm 

CCCTCTTTCAATTTCCTTGTGAAAACACCCTTAACTTTCTCm 

AGCTTCTGGTGATATCTTTTCATGATTTTATATCTCTTAAAATGGTGATGGATO^ 

AAGTGAGCTTTGAACTGTAGATAACTCTTAAAAGAAAATGTCATTTTAAACAAT^ 

GCTCAACTGCTTTGGNCNANANTTNAAGGaSfACAATCTCAAATO 

AGGGTT 

SEQ ID NO: 1430 ACACAGAGGGCTATAATCAGAGATCGAGCAGCTTTAGAGAAACAAGAAAAA 
CAGCTGGAATTAGAAATTAAGAAAATGGCCAAGATTGGTAATAAGGAAGCTTGCAAAGTm 
CAAACAACTTGTGCATCTACGGAAACAGAAGACGAGAACTTTTGCTGTAAGTTCAAAAGTTACT^ 
CTATGTCTACACAAACAAAAGTGATGAATTCCCAAATGAAGATGGCTGGAGCAATGTCTACCACA 
GCAAAAACAATGCAGGCAGTTAACAAGAAGATGGATCCACAAAAGACATTACAAACAATGCAGA 
ATTTCCAGAAGGAAAACATGAAAATGGAAATGACTGAAGAAATGATCAATGATACACTTGA 
ATCTTTGACGGTTCTGATGACGAAGAAGAAAGCCAGGATATTGTGAATCAAGTTCTTGATGAAATT 
GGAATTGAAATITCTGGAAAGATGGCCAAAGCTNCATCAGCCGCTCGAAGCTTACCATCTG 
CTTCAAANGCTACAATCTCAGATGAAGAGATTGACGGCACTCAAAGGCTTTAGGAG 

SEQ ID NO: 143 1 ACCAGGTGGGGAGAAGTGTAGCAAATCTCAGTGCCAATTTGAGGGGAAGCC 
AGTCATTCCAGGAGAAGAGCTGAGGGGAAAGAGCTGTTGACTTTCATAATGCAGTCTT^ 
CAGTCACCCTCCTGCCACATGGCAGAAGCCAGGTGGCAGTGATGGTGGTGGGGGAAACAAAACA 
CACAGTCTCTGGCAAGCCCCACCGGGAAAGGAGGGCTCAGAAGGCGTAGCGGGTCCGGATATCCT 
CGAGTTTCTTGGACACTTCGGGTGGGGTTCGGTCCAGTTCTTCAGAGACGTTCTTTTGT^ 
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TrCCATGGAATrGAACTGCTCACAGAGGTCTCCATCAATCACATTCTTCACAGGGAAOTAGTAGGA 
GCGAAAGCTGAGGTGGTCCCCCCACAGAGAGGGGGATGTTCAAACCGCANGTGCATTTTCACATG 
CTGGAAGAAGTCATGGTCCTCATGGGACGTGAATGGCCAAGGATGCCCATTCCTTCCAACAAAGG 
TGGTNTANACAAATGATTCTTGANCCTCCAGGGATCANCGNTOGNCTT 

SEO ED NO- 1432 TTAATTCCNCCTTANCGNGGTCGCGGNCNANGTACAAANATGACTATAAACA 
NGATGCATCCCTCGGNTrCCATGAACAGCACACTATTACAGTAAACCAlWriTATNTTCCACCATC 
AANTGTGGCTCrCCCATGACTTOICTTTGTGATGGATCATrAAGAATATCCTCAAATCCAATAGTC 
TCATCATTACCCCrCAAAACATCCANTGAAAGATTTGAGCTTGAAAGAAATGGAAGACGCTGAAC 
CTGCTGCACrGCCTTGAATTCCATCTGTAATTTTAGCGGAGCAAATAGACCCTGAATGm 
TGTGGAAAAATTCATTTTATm-GGTTGAGCTGGAAATTTmTCTGAT^ 
AGGCAAAAGTTCATTTTTCACACAAGAAAAACCTTrCCGAAGAAGATCATGACTTTC/^^ 
CACTTGCTGAAAGTrCAGTAACTGGAATACTGTCCmAACTCANATCCAAGTCCTCTGGCATrCA 

TCTCCCCGCGT 

SEO ID NO- 1433 ACTATCACTTTGCCITGCCTCAGTCTCTCCTTCCTCCGGGGCCACAGAAGGAG 
AAAGCCCAAAGCATACCCATCCTGCCCTTTAAAAAAAAAAGAAAGAAAGATTCGTCTCTCACrrT 
GGAAAGCTCACCTACTCrCCTAGGTCCTTCTGGAGGGGCATGGAGGACAATATTCCATGTTGCCAT 
TTGATGCCACTAAGGATCCCCAAGGCAAGAAGACACTTGAATTATGCCAGAGTAGGCCAGCAAAG 
GGGAGQGAAGGOACAGGGAAGCAGCCCCATGGCAGGGACTATAGGACCCCTGCCCAAACCTGAA 
TCTAGAGGGAATGCCCCACCCTGGCTCAGAGCAGGAGAGOAAAAGGCACTGTGCTGACCGGGGT 
GGGTCCirCCCCAACCCCTAGGGAGGGCAGAGAACCTGCTGATTCTTTCCCTGGGTTTCCCAAGGT 
GAGCCAAirrcrANGGAAAGGGA-ITTTAaCACTAGGAAAAGGAAAATACTGCTGCTGTTTTCTGA 
AGTTCTCANCTGAGATTTTAAAAACCATATCAGGCCirrTNCAGGAA 

SEO ID NO- 1434 ACNGCGGGATAATAATTATCnTGAAGTANAACANTTCTGTTAACTGGAAAA 
NCACAGGA-TOTATCCATCATATrTTTCAGGACAGATAGTTmACTGTGGGGCAAATAG 
TTACACmTGTTAGTTGC>mTANGTTTTAANGCAAAGAATCTGTNGAGAAATCTATGC^ 
AGTTrGTCCAGATTAGCirrCAmGGGGAATGAAGTTCTGAAATA-IXn'AAAGCAG'ITNCT^ 
ATTGAAAAGTCCTCCAAAAAGAGAACTATTGGGAAACCATQGTGTGGTOGTGGAAAATAAAAGCT 
CCCrCANNNNTTTGGAGGGAATAACTTAAAAAAATACTTAAATGGCTAAGTT^ 
AANAATTAAACITGNCAATITrAACA'mTNCTGTTACATCrrQAAATAAACTrGTN^^^ 

TCNGCNAAAGGA 

SEO ID NO- 1435 ACGCGGGTGAGCAAAACrACTCTGGATAAATCTCTTCTAGATATAATATCAG 
ACCCTGATGCAGGAACrCCAGAAGATAAAATGAGGTTGTrTCTTATCTA-TTATATAAGCACACAGC 
AAGCACCnrCTGAGGCTGATITGGAGCAATATAAAAAAGCTTTAACrrGATGCAGGATGCAACCT^ 
AATCCmACAATATATCAAACAGTGGAAGGCTTTTACCAAGATGGCCTCAGCTCCGGCCAGCTAT 
GGCAGCACTACCACTAAACCAATGGGTCTTTTATCACGAGTCATGAATACAGGATCACAGTTTGTG 
ATGGAAGGAGTGAAGAACCTGGTTTrGAAACAGCAAAATCTACCTGTTACTCGTATrTTGGACAAT 
CTTATGGAGATGAAGTCAAACCCCGAAACTGATGACTATAGATATTTTGATCCCA/^ 
GGGCAATGACAGCTCAGTTCCCAGAAATAAAAATCCATTCCAAGAGGCCATTGTmTGTGGTGG 
GAGGAGGCACTACCTTGAATATCAGAATCTTGTTGACTCATAAANGGG 

SEO ID NO- 1436 acaaactttattgaaacgcacacgcgcacacacacaaacacccctgtggata 

GGGAA^^LGCACCrGGCCACAGGGTCCACTGAAACGGGGAGGGGA'TGGCAGCTrGTAATGTGGCTT 
•ITGCCACAACCCCCrrcrGACAGGGAAQGCCTTATATTGAGGCCCCACCTCCCATGGTGATGGGGA 
GCTCANAATGGGGTCCAGGGAGAA-nTGGTTAGGGAGAGGTGCTA-TNGGAGGCCTGANCAGAGG 
GCACCCTCCGAGTGGGGTCCCGANGGCTGCAGAGTCTTCAAT 

SEO DD NO- 1437 ACTCCAGAGGAGTGTGAGGAGACGAGTGAAAAACCCAAAANGAANAAAAAG 
CNAAAGCCCCAGGAGGTTCCTCAGGAGAATGGAATGGAAGACCCATCTATCTCTTTCTCCAAACC 
CAAGAAAAAGAAATCTTTITCCAAQOAGGAG-ITGATGAGTACCGATCnTGAAGAGACCGCTGGCA 
GCACCAGTATTCCCAAGAGGAAGAAGTCTACACCCAAGGAGGAAACAGTTAATGACCCTGAGGA 
GGCAGGCCACAGAAGTGGCrCCAAGAAAAAGAGGAAATTCTCCAAAGAGGAGCCGGTCANCATT 
GGGCCTGAANAGGCGGCTGGCAAGANCAANCTCCAATAAGAANANAAANTTCCATAAANCATCC 
CAGGGAAGATTAGANTGCGGCTGGACATTCTCTGGGAGGTGGGGCATACCA-TNNCCCAAGGTGAC 
ATTTNCCACCCTGTGCCCGTGTTNCCCATTANAAACAAATTTNCACGATGANAACGTTATGGGGCC 

AATT 

SEO ID NO- 1438 ACTGTGCTATGGACCACGCACATACAGCCATGCTGTTTCAGAAGACTTGAAA 
TGCCA-rrGATAGTrrAAAAACTCTACACTCGATGGAGAATCGAGGAAGACAATTTAATGTTTCATG 
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TGAATCCAGAGGTGCATCAAATTAAATGACAGCTCCACITGGCAAATAATAGCTGTTAOT 

TATCCAAGAAGAAATGGTTGGTGATGGATAAATTCAGAAATG<m'CCCCAAAGGTGGGTGGTT^ 

AAAAAGTTTCAGGTCACAACCCTTGCAGAAAACACTGATGCCCAACACACTGATTC^ 

GAAACACGGGTCTTCCAAGTTCCAAAGGGGCTGGGGTTCCCCAACGATCAAGTTCCTGTGCTGTA^ 

CAAGANGGTCCTTTGGACTGGATAGGGAGCACrrGGGAGCTGTACCTTGCCGGGCCGGCCGATCG 

A 

SEQ ID NO: 1439 ACGGGGGATAATAGATTAAAGAATTTACCCTTTAGCTTTACAAAGCTAC^^ 
AATTGACAGCTATGTGGCTCTCAGATAATCAGTCCAAACCCCTGATACCTCTTCAAAAAGAAACT^ 
ATTCAGAGACCCAGAAuAATGGTGCTTACCAACTACATGTTCCCTCAACAGCCAAGGACTGAGGAT 
GTTATGTTTATATCAGATAATGAAAGTTTTAACCCTTCATTGTGGGAGGAACAGAGGAAACAGOT 
GGCTCAAGTTGCATTTGAATGTGATGAAGACAAAGATGAAAGGGAGGCACCTCCCAGGGAGGGA 
AATTTAAAAAGATATCCAACACCATACCCAGATGAGCTTAAGAATATGGTCAAAAC^^ 
CATTGTACGAGCGAGCCCGCGTGCTGGGCACCCGAGCGCTCCAGATTGCGATGTGTGCCCCTGTG 
ATGGTGGAGCTGGAGGGGGAGACAAGATCCTCTGCTCATTGCCATGAAGGAACTCAAAGGCCCGA 
AAGATCCCCATCATCATTCGCCGNACCTGNCAGATNGTNGCTATGAAGATTGGG 

SEQ ID NO: 1440 TTATCATCCTAATGTAGACAAGTTGGGAAGAATATGTTTAGATATTTTGAA^ 
ATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATCGATCCAGGCCTTGTTAAGTGCTC 
CCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTGGAAGACCGACGAAGCCCAAGC 
CATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCATGAATAATATTTAAATTGATACGATCA 
TCAAGTGTGCATCACTTCTCCTGTTCTGCCAAGACTTCCTCCTCTTTGTTTGCAT^ 
GTCTTAGAAACATTACAGAATAAAAAAGCCCAGACATCTTCAGTCCTTTGGTGATTAAATGCACAT 
TAGCAAATCTATGTCTTGTCCTGATTCACTGTCATAAAGCATGAGCAGAGGCTAGAAGTATCATCT 
GGATTGTTGTGAAACGTTTAAAAGCAGTGGGCCCCTCCCTGCTTT^ 
TTAAGTATAAAAGCACTGTGAATGAAGGTAG 

SEQ ID NO: 1441 ACTAAACGAGCAGGTGAAGGAGGCTGAAGGATCGTCTGCTGAATACAAGAA 
AGAAATTGAGGAACTAAAGGAACTGCTACCCGAAATTAGAGAGAAGATAGAAGATGCAAAGGAG 
TCTCAGCGTANTGGGAATGTAGCTGAACTGGCTCTGAAAGCTACTCTGGTGGAGAGTTCTAOT 
GGTTTCACTCCTGGTGGAGGAGGCTCTTCAGTCTCCATGATTGCCAGTAGAAAGCCAACAGACGGT 
GCTTCCTCATCAAATTGTGTGACTGATATTTCCCACCnTGTCAGAAAGAAGAGGAA^ 
AGAGAGTCCCCGGAAAGATGATGCAAAGAAAGCCAAACAAGAGCCGGAGGTGAACGGAGGCAGT 
GGGGATGCTGTCCCCAGTGGAAATGAAGTTTrCGGAAAACATGGAGGAGGAGGCTTGAGAATCAG 
GCTGAAAGCCCGGGCAACAAGTGGAGGGGGACAAGTGGANGCTGGAGCTACAAGTTTAAAGCAC 
TGCATTGTTAAGANGGGGCACCAGCCCTCCTTCCAAAGGGAAAGT 

SEQ ID NO: 1442 ACTGAATTCAGTGCTTAGAACTGAAGTTATTGAGAGGACAGCTTTAAAAGAT 
GAATGAACTCAAAAGTTCGAGTTGTGCTCTTCACGTTGGTTCGATAATGGCCTTTATTT 
TTTTAATTTTTCTrTACAGTAAATATTCCATTCTGAmCATi^ 

GTGTTGACACTGTAGCTCATACTGGAAAAGTCGATCAATGTTTTGCAGTITATTGAAAGTAGTTCT 
ATATATAACAATGTTATAAGCATTTCTTTAGAAATGGTTGAAAATGCTT 
ACCATGGTATGCATGATCGTTGTAATTGTTGACATTCCTTTTAGAAGTTGTO 
TGCTTATGTAGACACAATCTTCTGTCTCAGTCC 

SEQ ID NO: 1443 ACAAAGCAGCAACTGCAATACTCAAGGNTAAAACATTAGAAAAGCATTTGTG 
TGACAGGTATATTACAGTATTATCAAAATATTACATTrrCAGACTTACTTAGCAGAT^^ 
CCAGAGCTTAAATCTTTAAATTATTTCCATAGNCTTAAAAAATATGTAAT^^ 
AAAGAATGTAAAAGGAAACCTAAAATACAAATGGAATAATGTAACAAATAAATATTTGAm 
TAACTGTTAATAATCAGCTCAACACCACCATTCTCTCTAAACTCAATTTAATTC^ 
GAACTGTCAAATGCCATGGCATAATTATTTATTTCCAAGCTATCATCAATGATTAGAACT 
ATTTTGGCATAAAAAAATCACAATTCAGCATAAATAANGCTATITI^ 
CATCTCTAAGAATTGTTGAAATAAGT 

SEQ ID NO: 1444 ACTTGTGCCTAGTTTTTCAAGGTATTGGCTGTTCTATAGATGCAGTGATrGTC 
CCAGCTAGCTCTGTTACCAGCCTTTTGGTGTGTCTTTATGTTCAT^ 
CAGGTGATGTAGCACTTCTGTTTTTAATAATTATTGCTTAAAATACC^ 

TTAAAGGGACTTGAGGAAGCTACCCAGGATTACAGAAGAGTGTCCACCTAACAAGATGGCCTGGC 

AGTTTCCTAGTTTTGTATCTGGTTCAATAGAAATATGTGAAAGTGGTAATGTCATCAm 

GAGTCCGGGTTTCTCTATAATAAATCCCTTTGCCAAATGCATGAGTTGCAGACTT 

GAGTGAAGCAAGTGGGTGAGTAAAACTAlrmGACGTGGGAGCGTTTTCAGATAGGAGTT^ 

TTGACGAAAGTGTCCGTGCANGAATTGGACTCCGAGGAGGGTTACAGTATCTTNCTGACGGGACC 
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TGCCACTCGCATNTGGGCAATGTTGACATTTGAG 

SEQ ID NO: 1445 AC i " ri lll"i"m T ilTITITITITTTTATCAAAGT^ 

AGCACAACTGACCAAGGTCCAAGATGTGAANATACCATGTTCAANAAACTGGGGGGAAATCACTC 

TACAAACTAACTATNTACTAGGNGATAANAATGCATACTTTATAATATAAAACCTGTCTACACAT^ 

GTAGTTAGCTGCAAAAAGCCATTCGATCTTNTNTTGGCTTGGAAAAATGCCAGATTCC^^ 

TTATTAGCAACCTTCAACIWITTGCGTGGCAGGTATCTTGTTTTC^ 

AGCCTGTGTCATGAGGTCCGCGATATTCITITCCAGATCATCAATGCGACTACTCATAT^ 

CTCCCAATGATCTGGTCANACATGGTCTGAAATTTATCTTGCATCTGCTGCAGGA^^ 

ACCGAGGTGAGGTCCTGCACGGTCTTGGGGTCANTCTCGGCCATCTTCX:CGATGCCCAGNTTGGCG 

GNGATGTCTTANACCGTAACCTACACTTNCGTTT 

SEQ ID NO: 1446 ACAATAGCGTCTTCTTCCAGCAGTCAACTCAAGCACAAAAGCCAGACTGACT 
CACCTGATGGTAGCAGTGGGCTGGGAATTTCATCCCCTAAAGAQTTCAGTGCAGGAGAAAGCTCT 
ACTTCTCTCGATGCTAATCACACAGGGGCAGTCGTTGAGCCTTTGAGAACTTCTGTTCCAAGACT^ 
CCATCAGAGAGTAAGAAGGAAGACTCCTCTGGCGCTACCCAAGTCCCCCAAGCAAGTCTCAAAGC 
CAGTGATCTCTCTGACTTTCAATCAGTTTCCAAGCTAAACCAGGGCAAGCCATGCAC^^ 
CAAGGAATGCCAGTGTAAGAGATGGCATGATATGGAAGTGTATTCCrmCAGGCCTGCAGAGTG 
TCCCTCCCITGGCTCCAGAACGAAGATCCACACTTGAGGACTACTCTCAGTCGCTGCAC 
CTCTGTCTGGCTCTCCCCGATCCTGTTCTGAGCAAGCTCGAGTCTTCGTGGATGATGTGACCA^^ 
GGGACCTGTCAGGCTACATGGAGTATTACTTGTATA 

SEQ ID NO: 1447 AcmrmTTTiTTT^^ 

ATAANACATATTGGCTNTATTAAAAACTCAGGTAATAAAGCNCTAANCTTGATT^ 

CAGTCTNTTTNTTNTTAAGGGGAAAAAAATCTCCCCAANAATAGGATGCTACCTGAG^ 

CGAATAAANAAAAGGAATGGATGGTCGGCAGTGAAATTTTNTTCGGGCATCAACATGCAAA^ 

TGCNATGCCTGCTGTGGCAGCTGCCGCCIOTGTTCCCTOTTCAT^^ 

GACAATTTTTGATATAAAAATATCTCTGGCTCCTGACATGCCAAACANATCA 

AAAAGATCCTGCACACCTAGGCGGGCGAGGTCGGAGTTGAAAGTGTAACTCTCITCCAGm 

CCTGGGCAAGCTGACATTAACnTCAATGAAATCGAAGATTCTCAGGTTTAGTCCACT^ 

TTTCCAAAGTCAACTGTTCCTCAATCrmTTCAGG 

SEQ ID NO: 1448 ACTGATGGGGAAGTGCCGGCGCTTCTTGGATGAACTAGATGTGGTTCAGATG 
GACTGAGCITGGATGCTTCTGAGGCAAGCTGAAGCTTTGGGTTCTGACTGACCCACCX^^ 
TGCTGAACAGAGAGCCCAGTGTGACTAGGGATCCTGAGTTTTCTGGGACAATTCCAGC^ 
ATACATTTTGTTAAATGTGCCATAAAATGAGACTTTTTACGCC^^ 

AAACTCACCCCANCAAAAAAAAAAAAAAAAAAAAAAAAGTACCTNGGCCGNAANCACGCT 
GGCG 

SEQ ID NO: 1449 ACACCTCTTGCATTCGCTTTATGTGCCCCAATGGAAGAGGTGTCCTCTGGAGC 
GTCTGTGCTAAAAACTCATTCAATTGATGAGAAGAGAGATCCTTCAGGTCTGTGGCATTGAATGAA 
TTTAAATCATCTTCTTTGGCAGTAATCCATCTTTGACTTAAGGCAATACA^ 

TATCATAATTGGGCTTTATGGGAGGCAGTCCAGGAGAGTAGAGCCAGGCATTCCAATCAACTTGA 

TTGAGAACATCAACCnTATCTTTAAAATAGGAATACAGGAAATCOT 

CTCTTATAGGAAAACTTCTCAACATAAG(nTITAAGAATCCT 

AGTTGTTCAAGGTAAAAAAGTAAAGCAAAGCCCTTCTCATAGGGAACTGAAGAAT^ 

AGGGTCTATATCTGTCAGATCAACCACAAAGTTTGGGTGAAAGGATGTGTCTTCCCAAATGTCm 

ACCGAATTCTGTAGTTCTCCCCATCCTCCCANANCATTAAAAATGTCTGAAC 

SEQ ID NO: 1450 ACCATTTTACGAATTTCTGTCTTCATAATATAAGTGAAAATACT^ 
ATTTrCTGCTTTAAATTGTrmAATAAGCATTCCAAAGTC^ 

AGTCATTCAGTTGATAGACAAAGTTAGCGATGCTTTATGCTAGGAAACTTGTTGACAGTAAC^^ 

GCGACTTTATGTAGAAGACAAATGCTAGTAATTATTATGCACAGAGGAAAAATCAT^ 

GTGGTAAAGCAGCTTCATCTTTCAAAATTGATTTGCTCTGGTTT^ 

AATGTCCTTTTACTGGGAATTTAGTTATGTATTAAGATAACCTGT^ 

GACATTATTTATATTGAACCACCTTATTTTAAAATTm 

GTGTCATGTCTTGGGTTTGATGTCGTTGGACAGAAAAGTGATCAArTATTTTAAATGA^ 
CCTGTTTGAGGCnTAGTCTGNAAATGNGTTGCTGNAACAGAAAAAT 

SEQ ID NO: 145 1 ACTTTTTTTTTTTTTm^ 

TATTAGGTCCGCTTGGTGCANAGCTGAGTTCATTCCAATCAATAGAAAAAGAGGGAATCCT^ 
ACTCATTTTATGAGGCCAGCATGArrCTGATACCAAAGCTGGGCAGAGACACAACCAAAAA^ 
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AATTTTAGACCAATATCCTTGATGAACATTGATGCAAAAATCCTCAATAAAATACTGGC 

AATCCAGCAGCACATCAAAAAGCTTATTCACCATGATCAAGTTGGATTCATCCCTC 

GCTGGTTCAACATACGCAAATGAATAAACATAATCCAGCATATAAACAGAACCAAAGACAAAAA 

CCACATGATTATCTCAATAGATGCAGAAAAGGCCTTTGACAAAATTCAACAACCTACATGCT 

AACTCTCAATAAATTAAGTATTGATGGGATGTATOSrCAAAATAATAGGAGCTATCT^^ 

CCACAGCCAGTATCATACTGAATGGGGCAAAAACTGGAAGCArrTNCCTTTG 

SEQ ID NO: 1452 ACGCGGGGAGCACCTCCTTGATTCTCAGTTTTGCTGGAGGCCGCAACCAGGC 
CCGCGCCGCCACCATGTTTCGAAATCAGCATGACAATGATGTCACTGTTTGGAGCCCCCAGGGCA 
GGATTCATCAAATTGAATATGCAATGGAAGCTGTTAAACAAGGTTCAGCCACAGTTGGTCTGAAA 
TCAAAAACTCATGCAGTTTTGGTTGCATTGAAAAGGGCGCAATCAGAGCT^ 
AAAAATTCrCCATGTTGACAACCATATTGGTATCTCAATTGCGGGGCITACTG^ 
GTTATGTAATTTTATGCGTCAGGAGTGTTTGGATTCCAGATTTGTATTCGAT^^ 
TCTCGTCTTGTATCTCTAATTGGAAGCAAGACCCAGATACCAACACAACGATATGGCCGGAGACC 
ATATGGTGTTGGTCTCCTTATTGCTGGTTATGATGATATGGGCCCTCACATTTrCCAAAC 
TCTGCTAACTATTTTGACTGNAANAGCCATGTCCATTTGGAACCCCGTT 

SEQ ID NO: 1453 ACAAACAAATTATGACCGTTGGAAACATCCCTTCTTCCTTGATGATCGCAGAA 
CGCCTGCAAAGATGTGTCTGAACCGCACCAGCCAAGAGAATATCTCATTTGAAACCATGTATGAT 
GTCCrGTCAACAAAACCTGTCCTCAACAAGCTGACCGTATACACAACCTTGATAGATG^^ 
GGTCAATTCGAAACirACCTGCGGGACTGCCCTGACCCTTGTATAGGTTGGTGAGCACACGTCTGG 
CCTACAGAATGCGGCCTCTGAGACATGAAGACACCATCTCCATGTGACCGAACACTGCAGCTGTC 
TGACCTTCCAAAGACTAAGACTCGCGGCAGGTTCTCTTTGAGTCAATAGCITGTC 
GTTGACAAATGACAGATCITTTTTTTTCCCCCTATCAGT^^ 
TAGGGGAAGTAAAACAAGTCATCTAGAATTCACTGAGTTTTGTTTCACT^ 
GGTGGGCAGTCNAACCATGGTGAACTCCACCTCC 

SEQ ID NO: 1454 ACTCTAGGAACCCAGGGTCACCCAGATGTCCCTTTGATGGCCGTTGTTGAAG 
GCCATTGGGACCAATAATCTATATTAGATTGAATACTTAAGTTAGATGTGGTTTCCCCCAT^^ 
CAGGGAGCTAGCGTNTTANCCTTGTGGGCAACATGATGCATGGGAAATGAAANATTTTTGTAi^ 
AGTCAGT>mTNTTTCCAGGAAAAGCCNGNCNTlWTTTTT^ 
GGGTTGGGGGTCANNAGGGGTTChrmCAATTTGGGANNGAAGGGGAANTAAAC^ 
GCCTAAAACAANANCTTITCATCATTAAAAATTTTCCCAGNGTTCTGAm 
NTGAGTNNTAAACAAATATAANAAAGCTGTCAATGAGTTIT^ 
NAATCTAAAACCATGGGa>ITANAATrGNAAAACTGGGCTCATCAAAATCGGGACT 
CATAANACTGGGAAAAAAAAATGATGGGGGACCTTTGGTGGAA 

SEQ ID NO: 1455 GGACAAAGATGACTATAAACAAGATGCAGCCCTCGGTTTCCATGAACAGCAC 
ACTATTACAGTAAACCAAGTTTATATTCCACCATCAAGTGTGGCTCTCCCATGACTTCGCm 
TGGATCATTAAGAATATCCTCAAATCCAATAGTCTCATCATTACCCCTCAAAACATCCAGTGAA^ 
ATTTGAGCTTGAAAGAAATGGAAGACGCTGAACCTGCTGCACTGCCTTGAATTCCATCT^ 
TAGCGGAGCAAATAGACCCTGAATGTTTCTCAGTGTGGAAAAATTCATTTrATCl^^ 
GAAATTTTTTTTCTGATAATTCAAGGGGATGACTAGGCAA^ 

CTTTCCGAAGAAGATCATGACTTTCAAAAGGTCCACTTGCTGGAAAGTTCAAGTAACT^ 

GTNCNTTTAGCTCANATCCAAGGTCCTNTGGCTTTCATCTITCCGC^^ 

TGGNCCCGTTACCGNTANTNGACCCCGCCGCTCACTTTTCGTTTTCCCCC 

SEQ ID NO: 1 456 ACACATAAGAAAAGGATTTAGTAACACTTGGGCAAGTAATAAACTGTAGAAC 
TTTAAAAGTAGTAAAGGCATATACCAAGCATACGTGACTCCACACATTGTCAGAAAGGCAGTGGA 
CTGGCTAACGAGTTrCTGCCAAGTTTCAGAAGCAAAGAATGCACTAATGAAAAGGGTAAGGC^ 
CAAGCAGAGTGTCTGAATTGATGGACCACTTTTGTCTAAAACTAATTTCA^ 
AAGCAGGCATTTGTTTTAAGGTGAAACACTTTATAAGAGAAAAGAAAACT^ 
AGATCAGAAAAAATTAATACTGAATTTTTATAATCAATGCATGGAGCAATGATA^ 
TAACAATTTTGTGAGAACAAGTATAAAACTATCCATTACTACCTTAAAGAGTO 
TCTAAAGTATTCAAAGTGGAATAAATAACAAGGTAGGAAATGCAATACCTATATATTTTAAAAA 
CCCACATCTAGTTAATGTTCTCITAGTTTGGTTTAAACCAGTTGTTCATC^ 

SEQ ID NO: 1457 ACATTTGGAGGATCATCTTCCCAGGGTCTTTTAAGACTTTCTGAAAAAGCC^ 
TTCTACOTAAATGTCAAGACATCTCCATGAACAATTCTCAGTTTCCCAGGTGCTGCAT^ 
CATCTGTAATCCAGGAATAAATCGAGTGTCCTTTTCAACCACCAGAAGTTCAGCGACGTC 
AAGAATAGATCnTGTGATTCCCCCTGGCCCAGGGCCCACTTCGTAAACATAAGCATTTGTCAGATT 
GCCAGCnTTCCTTACAATCTTATCTGTCAGCCTCAAGTCCAGGAGGAAATTCTGTGATAGCT^ 
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CGCTGCTTGCAGTCTTAACAACITAATGATrrCrCGAATCGTGGGCAACGGAGGGAGACG^ 
TGCTGAGTTTTCCGGAGGCAGCCATGATACGCGGCAAGGCACCATCCACCCCTACCTCACCCAGG 
CCCCGCGTACAAAAGGAATGTTTCCTTTATAAATCACAGAAGAAAATGACAATATCTGNTGGAT^ 
TTTGATATAATTTAATGGTGTTATAAAACCmAAGANGATTCATGGG 

SEQ ID NO: 1458 ACAAGCCTCACCAAGGGCAACCCCAGAAAAGTGAATGAGTTTGTCTTCTCCA 
ATCATGACTTCCTCGATAAGTTTGCAACTTCCAAGCTTCACCAGTTCTGGGTGATCAAAGGTAGAG 
GCAATTTCACCACCTGTGACAAGAGCTAGGCGTTCCACACCTGCAAAATCTGCATGCTCAATAGCC 
ATGACACCAGCAGCACCAAAGAGCTGTTCAGGATAATTATAAATTAATTGCCTGTTAATAA^ 
ATTTATTCCATGCTTAAGAATACGTTCAACTTTCTCCTTCAm 
CTGCAACCTTTGCTGTAGAGTCAACTCTTACCCGGGAACCAAATATCTTTATTTT^^ 
ACCAGTATTTGCAATAAGAATTTTAGCATTTTCAATTCGTTTTGGTTGAT^ 
CCAACAGGAAGCCITCATCTAAATAGGAATCTGCCAAACTTCCTCCTAGC^ 
TTGCCTCCAGGTTGCCAGAGCCnTCAGTCTGAGAACTG 

SEQ ID NO: 1 459 acaagatgctgtgtaactgttttaatacagcaaatagtaactctccaaatcct 

GTTGCrmATGTTAAATAAGATAACAAGAATTGGAGCATGCAAAGAATGGGACTTGGATAATC^ 

CTTAAGCTTTATATGTAAAGAATTTTAGAAGATCrrGGTGCTGCTATTCCT^ 

AGATGGCTGTTTCAGTTAAGCTATTAGTAATAAAAGTGAACATTGCTACTATCTGAGCCT 

ATAACTTGTGTGATTTCAAATTAAACITGCATTATGTGTTAATT^ 

AATTCCTACrCACACAGCTCAGCAACAACCATTrrGATGGTAACAGTTAATTTC^^ 

TTAAATTCAGGGTTCTGGATATTAAATTAAAATGGCATTCTTAAAGATT^ 

CCTAAATGAAAGTGTGTAAATTATAAGAAGCTGGCGATCTTTTGATATGCTGNTTCACAGGA 

CCACTGGAGGGCAGCTGCTTGTGCATTACTTGGTTTCCNGCA 

SEQ ID NO: 1460 ACACTTGAAACCAAATTTCTAAAACTTGTTTTTCTTAA^ 
ACATTAAACCATAACCTAATCAGTGTGTTCACTATNCTTCCACACTANCCACT 
TNTGGTTTCAAGTCTCAAGGCCTGACAGACANAAGGGCITGGAGATTTT^^ 
TCTTCANCAACTTGAGAG(mTCTTCATGTTGTCAAGCANCANANCNGTNTCTGN^ 
CATANAGACGATTNGAATATCTTCCANAGATATCGGCrCTAACTGTCAGAGATGGGTCAAC>^ 
ATAATCCTGGGGACATANTGGNCNTCNTGAhnSfNAGGNGTCTGCCNNOT 
GAAGAAGGGCANCTGCKmGCAAANTTCTNGATTTGNGGTATTTTCAGNC^ 
AAAGCTTNGNCTGTTGGNGGGCACTTCNTCCACGGGG 

SEQ ID NO: 1461 ggtactttnttttttttttt^^ 

TATGGCTANAAANACACTGTTOTAGCCAAAATCGGCAATGACNCTAAANATATGCA^ 

NTTCCAAAANTTGCCCTGGNGNGACTTCAAAAGTTCATGTTAACTTN^ 

NTTAGTTGTNGNATTCTTGAAAANCCTGGGCCATGAAAAGC^GCCTAAGT^ 

CCTTGATGTTCTGGCAGTAAGTGTTTATNTGGCCTGCAATGAGCGGNGAGTCCATCCTGGCAGG^ 

GCTGTGGNGGTTTGAAAAGTTTGGACAGGTCCTCCTCAGGGAGCGGGGGTTNTCCTCG^ 

NNCTGCATNTTNTCCTGCTGGCNACCCTGCTGAAACTGATGTTTCNGCTGCT 

TTCCTCATGNATGNGTGGGATTTAACTATATCTGGGGCTCATTTCATIWACT 

CrGTANATTCTTCCCCAAATNATTTGCTTGCTGGGCAAGG 

SEQ ID NO; 1462 ACGCGGGGGGGCGCAGAGGCCTGCGGGAAGCCAAGATGGCGCATAGGGGTT 

ctccaggctgcagttggcgccttatcagtatctaagcggagtgttttggaaggagttaaggggc^^ 

tggcaaacgccctctccgccgtcatggcccggcatcggaatgttcgaggctataactacgatgaa 

gattttgaagatgatgatctctacggccagtctgtagaggatgattattgtatttcgc 

GCTGCTCAGTITATTTATTCACGGCGTGACAAACCTTCCGTTGAGCCTGTGGAAGAATATGA^^ 
GAAGATCTGAAAGAATCTTCCAATTCTGTTTCAAACCATCAGCTCAGTGGATT^ 

ctttattcatgccttgatcacatgagagaggtacc 

SEQ ID NO: 1463 actgggatggccctgaggatcacagtaattctttgtagtcatgaaaatcacc 
gccaatccaatttctgcttcaggaataggtctaagaccttgcctttggagcatatccat^^ 
tgaatccatttcttctgacagtcaggaattaaggtctgtgagmgttattcctgtatc/^ 
acgttcccggagccaaaaagaaattatgttcttcitcattctcritctgt^^ 
acctccaaagacaactgcggaactcaatttcttatgctgtttggcattcaai^ 
ccctitgaagatttgttttctrrccrgccgtcctgacagatcaa^^ 
tcatcaagaggtgggtaaaaacmcaatttgtggaggctgcttc^ 
aattcagtaaaatattctggctttacaattggacgtccacaaatgagtgcacatat^^ 
gtaactttcactgataccatgacaagggngagtgcattcttctg 
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SEQ ID NO: 1464 ACAACAATTTCAAAAGGCATATATATGGGTAATTAGTGTTCACATATACCAA 
ACCAGGAAACTAAACAAATCTCAAGGATGAGAAACATGAAGAAAACTTCACTAAAGCGTATC^^ 
AATTAAATTGTCCAAAACCATTGATAAAGAGAAAATCTAAAAGCAGCCAGAGACAAAAAAACAT 
ATTATTTACAGAGAACAAAATAAGAGTGTCTGCAGATTTCTCACCAGAAATAATGCAAGTGA;^ 
GCCAGTGGTGCAACATTmAAAAGTATTAAAATAACACACACACACACACACACACACACACAC 
ACACACACACACACATGAACCTATAATCCTATACCCNAAAAAAAAAAAAAAAAAAAAAA^ 
TNCCTCGGCCGCGACCACGCTAAGGGCG 

SEQ ID NO: 1465 ACTGGCTGGCCACCAAAGCACACGGAGATTCTGTCAGGCGCTGAGACACCAC 
AGCCTrrTCAATCTTGTCCTTAAGGGCnTTATCTTTCAT^ 
TCAACTGCTTCACGACTCTCCTTAGTTTTCTCACTTTCATCGAACTTC^^ 

CTGGAACCTCrrCCCATCAAATTCGGGAAGGGCCTGAATACAGTATTCATCCACAGGTTCTGTGAG 

GTAAATAACTTCATAGCCCTTTITCAGAAGTCGCTCAACAAATGGAGAAGATTC^^ 

GCTGGACCCAGCCATGAAGTAGATTTTGTCTTGTTTTTCCTTCAT^ 

CTAGTAATGTCAGTTGGATGATGAGAAGACTGGAACCTAAGAAGTTTAGCAAGACGTGTTCNATT 
CGAGTGGTCTTCAATCACACCAAGCTTGATGTTGGTACCTTGCCCGGGCGGGCCGCTTCGAA 

SEQ ID NO: 1466 ACGCGGGGTTCCGGGGCAGGGCCGTGCTGATTGAGAATGTGGCTTCGCTCTG 
AGGCACAACCACCCGGGACTTCACCCAGCTCAACGAGCTGCAATGCCGCTTTCCCAGGCGCCTGG 
TGGTCCTTGGCTTCCCTTGCAACCAATTTGGACATCAGGAGAACTGTCAGAATGAGGAGATCCT^ 
ACAGTCTCAAGTATGTCCGTCCTGGGGGTGGATACCAGCCCACCTTCACCCTTGTCCAAAAATGTG 
AGGTGAATGGGCAGAACGAGCATCCTGTCTTCGCCTACCTGAAGGACAAGCTCCCCTACCCTTAT 
GATGACCCATTTTCCCTCATGACCGATCCCAAGCTCATCATTTGGAGCCCTGTGCGCCG^^^ 
GTGGCCTGGAACTTTTGAGAAGTTCTCATAGGGCCGGAGGGAGAGCCCTTTCGACGCTA 
CACCTTTCCAACCATTAACATTGAGCCTGACATTAAAGCGCCCTCTTAAAAGTTGCATATA^ 
GAACTGCTCAACACACAANATCTCTTACTCCATCCAGTCCTGAAGGAG 

SEQ ID NO: 1467 ACAGTAACACAACATCAAAAGCAACACAGCTGTATACAGAAACGTAGGTCAT 
TCTTTTCAGCCCTAATGGAGATGTAATTAACAGTATCGAGCACTCTGGAAAATCACTCTGCAGGTT 
TATATGGACTACATGGAGATCATATCCTGTAGTGTAGTGAAAGCTAAGTCCTCAAGAGCCATATGT 
ATAGATACACAATGTTTTTTAATAATCTTTAAAACAGAGATCAAAGTTCAm 
ATTAACAAAAATAAAAATGAAATAAAAATGGAACCAAATGATCATCTAAAGirrAAA^ 
ATTGTCCAATTTATACAACTGTGGGAGACrrATTCAAGGTTTTTGAA^ 

SEQ ID NO: 1468 acatttatattcactgataatacaagcttctgtggtgtgtggaccagacacag 
gagccaagggtccctctgtgtctgtccaatagtattttacaggagggtagtgacatagcttataat 
taagttacattaccatgggggtctcagtgctactaacattatgtaatcaaggagatacgagcatta 
gcaaaacatgaacaagaagtcatataactctttgcagcacatttctgcccaggcm 
agggatttttatgaaatgatattacacatatcacattgtggcaggttgm 
gcttttaaaattcattagcaaatgacarrctcatgcacacgagtgtttcaaac^^ 
ccatttataggtagtccaagcaatatttgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtg 
tgtgtgtnaatactgcanactgaatgoaacattganacttttgtgcca 

SEQ ID NO: 1469 ACGCGGGGACTTAACGGTGGTGGCTGGTTCTGCGCCGGATCCGGGAGAGGGG 

cgggcgccattgtgcttcgctgccgactgcamcctcagtcacgggcctagaactccaaggagaa 

aggcggcggaattcatgatttgttcctgtagctgaaacatacattgaatgctttctgccct 

gaacttacatggtcagagctggtitttcctgccctcaaatatatttntct^ 

TAGAAAAATCTTTAAGAATGGAGTCTAAACCTTCAAGGATTCCAAGAAGAAT^^ 

CCAGCTCCrrAAGTGCTAGGATGATGTCTGGAAGCACAGGAAGTAGTTTAAATGATNCCT^^ 

CAAGAGACTCTTCATTTATATTGGGATTCTGAATATCAGTCTACATCAGCATCAGCATCTGCGT^^ 

CCATTTCAATCTGCATTGGTATAAGTGAATCTGAGATAACTCAAGGGAGNCACNCT 

ANAAACCAGCATCTGGNNTCNTGATTNAAAAAAGACCCTAAACT 

SEQ ID NO: 1470 ACTTCTGTTTCTCAGTTTATCTGGATGTTATCAGATCACAGACCATGGTCTCA 
GGGTTTTGACTCTGGGAGGAGGGCTGCCTTATTTGGAGCACCTTAATCTCTCT^ 
AACTGGTGCAGGCCTGCAGGATTTGGTTTCAGCATGTCCTTCTCTGAATGATGAAT^ 
CTGTGACAACATTAACGGTCCTCATGCTGATACCGCCAGTGGATGCCAGAATTTGCAGTGTGGTTT 
TCGAGCCTGCTGCCGCTCTGGCGAATGACCCTTGAOTCTGATCTTTGTCTAOT 
CAGGCTTTCTTTCATGCACTTTACTCATAGCACATTTCTTGTGTTAACC 
CrrGTTTTGGCCCCATTTCnTACAACTTCAGAAATCTTAATTTA 
TCTTGCAAATTATACTTTTGGTTTAGAAAGGGATTANGTCTTTTCA^ 
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CATTTTTCTTTTAAATGAAATGCTTTAAAGAAA 

SEQ ID NO: 1471 acgcgggggttaccgcgattctgagaggtgggcttttagtccctccagacctc 

GGCTITAGTGCTGTCTCCGCTTTTCTTTCACCTTCACAGAGGTTCGTG^ 

TATTGGGAGGTAAAGGTCAATGCGTAGGGGTAGAGTAAGATGTCTTATGGTGAAATTGAAGGTAA 

ATTCTTGGGACCTAGAGAAGAAGTAACGAGTGAGCCACGCTGTAAAAAATTGAAGTCAACCACAG 

AGTCGTATGTTTTTCACAATCATAGTAATGCTGATTTTCACAGAATCCAAGAG 

ATTGGGTCCCTGTGACCATCATTGATGTCAGAGGACATAGTTATITGCAGGAGAACAAAATCAAA 

ACTACAGATTTGCATAGACCTTTGCATGATGAGATGCCTGGTAATAAACCAGATGTTATTGAATC^ 

ATTGATTCACAGGTTTTACAGGAAGCACGTCCTCATTAGTATTCGCAGACGATGAGATAT^^ 

CAAGTNAAGCATTTATAGGACCCATTTACAAACCC 

SEO ID NO: 1472 ACGCGGGGGAGAAGAAACGGCGGAGACCTGAGACCCGGAGGCTGAGGCTGT 
AGGTGGGCTCCGCTGGGTAAAAGrrGCCGCAGCAGCTGTCCCTTGGCGCCATCGCGAm 
CCCCCTTGCTTTCCGGGTCCCGGGATCCCAAGTTTTTTAACTAACGGGAGCGAATCC^^ 
CAAAATGTTTGCGAGTTTCAGGCGCCCTTAGTTGAAAGGCTGTAATTAACAAGTCCGCTGm 
AGCCAGGCGCCGTTGCAGGCGCTTTCTGTGGATTGTCATTrATn'CTCACAGCAACCCTAGGA 
TGTTATCCTTGACATCTGCAGCAGCCCTTCCAAGCTGTGGAGACCAGGTCATCTGGAATGCCCATT 
TATGTCAATGGAAGAAAGAAAAAGGGGTCTCCTCCATCCTCACTACTGCATTCTO 
GCTCCTGTCCCACTTCCACAACriTANTTTGTTGACAATTTAACAGTGGGm 
GCCTCCITGATGGTCTAACTAACTACCAGAGTTATTCTTAACANCAG 

SEQ ID NO: 1473 acgaaaagcggcagaactagctctgaaaactctgagcaaggtctgtgtgaaa 

ATGTGTGACCCTGCCAAAGGAGCAGCTGGCCAGAGAACCATCGCTGCCCTTCTGCCTTGCCTTCTG 

gacaaaggaatgatgagcaccgtgacggaagttcgagccctcagcattaacacccttgtgaagat 

CAGNAAAAGTGCANGAGCCATGTTGAAACCGCATGCACCAAAACTCATTCCAGCTCTGCTANAGT 
NCTTAANTGTTTGGAGCCCCAAGTTNTCAATTATTTNANCCNTTCGGGCNACAN^ 
GCTGCNATGGATANGTGCTCGGCTTATTGNANCCAATATTTTCNNTGATGGNAAC™ 
CCTGTANNCCTTATNTTTC 

SEQ ID NO: 1474 ACACTGTTGGTGTTATATGGGGATGGGGTTCTCGGTAATTTTGTTTATTAm 
TGTTTATTATTATGTTTTATCATTAATTATTCAATAAATT^ 

AATCTTCTGTGGGGGTGGGAGGGACAAAAGATTACAAACCAAAACTCAGGAGATGGTAACACTG 

GAATTGATAAAATCACCTGGGATTAGTTGTATAACTCTGAACCACCAAACCTCTGCTATCAANCCT 

TGCTACAGTCATGGCTGTCCAGAAANATTTACAGTTATTTTTCTGAGAAAG 

AANAACTTCACNAACTTTAAGAACTTNAGAAGTTCTTAANTT^^ 

AATTGNArrAAAAAAAAGAATCCCTGNANTCAANGCTTTAANAGCCAA™ 

TGNTCATACCTGNNGAGANCANNTAAATTCTGTATATNCNATANAACAAATTGCAGATTA 

SEQ ID NO: 1475 ACNGATACCGGAAAGGCTGGATACCTNGGTTATTAGAGGATTTTGGAGATG^ 
AGGTGCTTITNCANAGATCCATGTGGCCCANTATCCACTGGATATGGGACNAAAGAAAAA^ 
NGAATGCNCTNGCCATTCAAGTGGATTCTGAAGGAAAAATTAAATATGATGCAATTGCT^ 
GGACAGCCAAAAGACAAGGTCATTTATAGCAAATACACTGACCTGGTTCCAAAGGAGGTTATGAA 

tgcagatgatccatacctgcaaaggcccgatgaanaagctntitnaganataacatgaanatac 

AGAGTAGCCTTATAAAAATCTGTATNACANAANGTCGNCACANCATGCCAGNTCGAGCAGGTGAC 

AAATTGCTNCrrGCTCAATATATTCGANNCACACCATCTCTGCCAAGGANTGGGAm 

GAGCTAAACANAGGGTTmrCGATTGGTANANATGCACAAAGATCCAATGGNCCNTO 

GA^^AATAANAAAAATTCCCGNGGACN^OTCTNCTTCTGNCCTGNTT 

NACTGTAAAGGACAACAAGAGTGGG 

SEQ ID NO: 1476 ACGCGGGGGCTAAAGTGTTGTCAATTITGlTrAACTm 
TTTGTTTCATTGATCTTrrGTATTGTTTTCmAT^ 
ATTTCTTTTCTTCTACTAATTTTGCATTTGGT^ 
GTGTTTTTTTTTTTTm^ 

GTATCTTGTGTTTCCATTATCATTTGTTrCAAGGAAATTT^ 

CCTACTGGCACATTCAAGAGAATGNNGGTTCi"l"l"n"iTl"l 1 Ml 1 1 1 1 1 1 CAGGGCCCAAGACATCT 
GAAACATTG^^ITAATTTCTGGGGATTTGGATANNGCCAAAAC^ATTC TO^ 

TATCCATTGGGGGCAGAAAATNTGCTTGNATTATTCAATTTTT^^ 1 1 NGAACAAAAT 

CTTGCTTTNCACCCANGCTAAAGNGCAGGGGCGCGAATTTGGTTAN 

SEQ ID NO: 1477 ACATGTTTGAAGAAGTGCCGATTGTAATTAAAAATTCACATCTGATCAATGTC 
CTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGAATTGCTCAGCCTTGCCAG 
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CAGCAATCATTTGGGGAAGAATCTACAGTTGCTGATGGACAGAGTGGATGAAATGAGCCAAGATA 

TAGTTAAATACAACACATACATGAGGAATACTAGTAAACAACAGCAGCNNAAACATCAGTATCAT 

CAGCGTCGCCAGCAGGAGAATATGCTGCGCCAGAGCCCGAGAAGAACCCCCGCTCCCTGAGGAG 

GACCTGTNCAAACTCTTTAACCACCACAGCCXGCCTGCCAGGATGGACTCGCTGNT^^ 

CATATAAACCACTTACTGNCAGAACATCANGGAGTTTACTGCCCAAAACTTAGGCAAGCTOT 

GCCAGGCTCnTCATGAATACAACCACTAAGAAhn^GGAAGTTCCANAAAAGANGTAAC^^ 

TTGAAGTCCACCAGGGNAACrrnGGAGAAATATTITmATNT^^ 

TAGTGNCTTGCCGATTTTGGCG 

SEQ ID NO: 1478 ACATTGTITGTTTTCAATAGGAGTTCATGGAATTTGAAATCAGAAGAA;^ 
CTCATTACAAAACAACAGCATTTTAATTGAATTTTCTAAGTC^ 

ctgttcaaatcaaagttcatcattttgtttggaacaacttttgnt^^ 
aatgacaaaacaaataancatataatgacagaaaagtaagtggcaaaataactacaaact^ 
ccactggctactgcagtagaataaacaaaatgacagtranccananacacctttatm 
aaagaagaagctcttttcttctttttccaggataattatgat^ 

GGNCGTGACCACNCTA 

SEQ ID NO: 1479 ACTTTITTTTTCTTTTTT^^ 

ATNTGGGACTTrATrAAATAAGGNGAATTTGGGACAAATGAAAGGNGANATGAAGGCAA^ 

GTCAAGGGATGATCTGAGCCTGAACAACTCAGTGAATGNGAANAGAAAACAANATTACATGNGA 

ATATANATGTTAACTGGAAAAGCNAGGANAAAAAAAGGGAGCNCAAGGAAAAAAAAAAA^^ 

AATTTGTGAGCCATNTCAAGCCATCAAAAAAACTTCATTNTATTGTAGGAGGG 

ATGGCANAGTAATTTTGTGTTAANAATTAAANTACCTCGGNCGCGACCAC 

SEQ ID NO: 1480 AACTCATTAATAATATTAATAGGCGCTTGACCCCACAGGCTGTCAAAATTCG 
AGCAGATATTGAAGTGGCTTGTTATGGTTATGAAGGCATTGATGCTGTAAAAGAAGCCCTAATAG 
CAGGTTTGAATTGTTCTACAGAAAACATGCCCATTAAGATTAATCTAATAGCTCCTCCTCG 
TAATGACTACGACAACCCTGGAGAGAACAGAAGGCCTTTCTGTCCTCAGCAAGCTATGGCT 
CAAAGAGAAGATTGAGGAATAGAGGGGTGTCGTTCAATGTNCAAATGGAGCCCAAAGTGGTCNC 
AGATAOvfAGATTAAACTTGTNCTTGCNAGGCM^ATTGGATAGGCTTTGAA^ 
AGAGGATTGCGGANThmTCANnWTAATANAANGGATTCAAANCTGAAGAT^ 

SEQ ID NO: 148 1 ACGTGCCGCGGAAATGCTCCGCTAGCAATCGCATCATCGGTGCCAAGGACCA 
CGCATCCATCCAGATGAACGTGGCCGAGGTTGACAAGGTCACAGGCAGGTTTAATGGCCAGTTTA 
AAACTTATGCTATCTGCGGGGCCATTCNTAGGATGGGTGAGTCAGATGATTCCATTCTCCGATTGG 
CCAAANGCCGATGGCATCGTACTCAAAGAACTTTTGACTGGAGAGAATt^CA^ 
TTGTCATAAATGAANAATGAAAAACCTT 

SEQ ID NO: 1482 ACATGTTrGAAGAAGTGCCGATTGTAATTAAAAATTCACATCTGATCAATGTC 
CTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGAATTGCrCAGCOT 
CAGCAATCATTTGGGGAAGAATCTACAGTTGCTGATGGACAGAGTGGATGAAATGAGCCAAGATA 
TAGTTAAATACAACACATACATGAGGAATACTAGTAAACAACAGCAGCAGAAACATCAGTATCAG 
CAGCGTCGCCAGCAGGAGAATATGCAGCGCCAGAGCCGAGGAGAACCCCCGCTCCCTGAGGAGG 
ACCTGTCCAAACTCTTAAACCACCACAGCCGCTGCCAGGATGGCTCGCTNCTCATTGCAGGCCAG 
ATAAACCTTACTGCCNAACATCAGGGAGTTNCrrGCCAAAACTTAGGCAAC^ 
GCTNTTCAAGAATACANANNCTANAAAAAGGAAGTTNCCAAAAANAATGTT^ 
AAGTCACACCAGGNNAACirrGGAAAAANTTTATTTGCNTAT^ 
GT. 

SEQ ID NO: 1483 ACAGCCAACGGTTTCCCrTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCCGGAAAGCGGTCT 
GCCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTTGCTGCTGATGAAGATGATGA 
CGATGATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTTTGATGATGAGGAAACTG 
AAATANAAAGCGCCTTGAANAAATCTATACGAGNTCTCCANCCAAATTT 
AANATTGGAAAAGACTCATAAeCATOSriTCNNCACCAGGATa^ATGGA 
NACATGAAAAAACTCCTAANNCACCAANTGGACCTATTCTGTNGANGACANAAATTAT^^ 
TTAACTCANTTTTTTAGTAAAGC 

SEQ ID NO: 1484 ACTITCTTATTCAACTGTGAATCTGCTCGAGAAAGACAGAAGGTTACAGAAA 
GGACTGTTTCTTTATGGTCACTGATAAACAGTAATAAAGAAAAATTCAAAAAC^ 
AAGAAATCAATCGAGTTTTATATCCAGTTGCCAGTATGCGTCACTTGGAACTCTGGGTGAA^^ 
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ACATTAGATGGAACCCCAGGATCAAGCAACAACAGCCQAATCCAGTGGAGCAGCGTTACATGGA 

GCTCTTAGCCnTACGCGACOAATACATAAAGCGGCTTGAGGAACTGCAGCTCGCAACTCTGCCAA 

GCmCTGATCCCCCAACTTCACCTTCCAGTCCITCGCAAATGATGCCCCATGTGCAAACTCACTTC 

TGAGGGGGGACCCTGCACCGCATTAGAGCTCGAAATAAAGGCGATAGCTGACTTCATTTGGGGCA 

Tn-GTAAAAAGTAGATTAAAATATTTGCCTCCATGTAGAACTTGACTAACATAATCTT/^ 

GAATATCTGCCTTCTAGAATAATATTACAAGAAAACTCAGGGCCCACCGGCAATCAAAAAAAANG 

AGCTGAGATOAGGnTGG 

SEO ID NO- 1485 ACTACACGCGCCTQGGCNACGACTTCCACACGAACAAGCGCGTGTGCNAGGA 
GATCGCCATTATCCCCAGCAAAAAGCTCCGCAACAAGATAGCAGGTTATGTCACGCATCTGAGGA 
ANCCOAATTCACAGAGGCCCAGGAAGAGGTATCTCCATCAAGCTGCAGGAGGAGGAGAGAGAAT 
GGTGANACAATrTTGTTCCCTGAGGNCTTAGNCTTGGATCAAGAGATTATTGNNNGTAGANTOT 
ACCCTAANGAAAATGCT 

SEO ID NO- 1486 ACT ll - iTl ' l - l 11 11 1 1 1 1 1 1 1 1 1 1 1 1 i I CCACANAAATCTAANCNCACTTGTCTT 
ATATTCnrrcrGCAACCCGNATTTTTCCTCrACTGGTTCCAGCATAAAACT 
GCAATThrrGAACCCCTTAAGTCCAGNGGGTITCCACANANAAACATmTSrm 
TATTGTTTACTCAAACGAAGTCTCATGAACTGATCCCAAAGGCAACATGGGAAAGCACGNGATAA 
AAACTTTACATAAACCCATCGCTCTTTAAAAANAAACATTCACITrGGGAGGCTG 
ATCITGAGGTCAAGAGATGGAAACCATCCTGGCCAAANTGGTGAAACCCCGNOTCTACTAAAAAT 
ACAAAAATTAACTGGGCATGGGGGTGCATGCCTGTATTCCCAGCTCTraGQGAGGCTNAGGCAGG 
ANAATCAOTGAACCTGNGAGGCGGAGGTTGCAGTGAGCCCAAATTGCCCATTGCCTCCAGCCTG 
NGTAACAAACCAGACTNCTTTAAAAAAATAAAAATTNANTAATAANCTTCAAGGCATm 

TTTTGTATT 

SEO ID NO: 1487 acatccaaaaccataaggaaatattctgatgcccagatgatgaagactgggg 

TGAATTAAGTCCACACATTTATrrCAAGTTGTTAAAGAGTTTGTGGGCCACGCAATGGTCCCTCGC 

atgcaagaagtcaaagagctcctccgtgcaatcctcttctgtatgtgatcgagaggatacacgctc 

ATCACAGAGCTCTAGCCGCTCCCGGGCCTTTACACATTTCrCCAACTGCTCGCATTGCTC^ 
GTTGTTAGGGGATCCACTAATTCCTCCTCTTCCTCITCCTCCTCCTCAGGATCTC^^ 
AAGCATCTTTTGCTCGGCCTCCAAGTCCCATGTCTGGCTACGGGTCTAGATTNAACACGAGCAGCA 
ACAGNGGCCXTAACCCATTCAAGGATCAANANAGACnTGTAAGCCCCGCNTACCTC 

SEO ID NO: 1488 acagttactcccggaccggcggcgtgaaagtcgtgatatcatcgttgaacta 

TTAGCTTTGAAGmAAATCCAATGGAGAAGACTCAAAAAACAGTCCAAAGAATTClTCTAGAAC 

CCTATAAATACTTACnTCAGTTACCAGGTAAACAAGTGAGAACCAAACTrrCACAOGCATTTAATC 

ATTGGCTGAAAGTfCCAGAGGACAAGCTACAGATTATTATTGAAGTGACAGAAATGTTGCATAAT 

GCCAGTTTACrCATCGATGATATTGAAGACAACTCAAAACTCCGACGTGGCTTTCCAGTGGCCCAC 

AGCATCTATGGAATCCCATCTGTCATCAATTCTGCCAATTACCGTGTATTTCCTTGGCTTGGAGAA 

AGTCITACCCTrQATCACCCAGATGCAGTGAAGCTTmACCCGCCAGCTTTTGGACTCCAT^^ 

GACAAGGCCTAGATATITACTCGAGGGATAArrACACrTGTCCCTGAAGAAGATATAAAGCTATG 

GTGCTGAGAAAACAGGTGGACTOTTGGATAGCAGTANGGCINTGCAGTTGTTCTCTGATTACAAA 

AANATTTAAACCGTACT 

SEO ED NO- 1489 ACCTAGAAGCCGTGACCCAGGGCCATGGCGCTTACCTGATGAGTCAGGATGC 
TCCGGACGTTirrACTGTAAGTGTOGAAACTTACCCCCTAAGGCTAAGGTTCITATAA;^ 

ctacatcacagaactcagcatcctgggcactgttggtgtctttttcatgcccgccaccgtagcacc 

CTGGCAACAGGACAAGGCTTrGAATGAAAACCTTCAGGATACAGTAGAGAAGATTTGTATAAAAG 

AAATAGGAACAAAGCAAAGCITCTCnTGACTATGTCTATTGAGATGCCGTATGTGATTGAATTCA 

rrrrCAGTGATACACATGAACTGAAACAAAAGCGCACAGACTGCAAAGCTGTCATTAGCACCATG 

GAAGGTAGCTCCTTAGACAGCAOTGGGATmCTCTCCACATCGGTTTGTCrGCTGQCTATCTNCC 

AAGAATGTGGGTTGAAAAACATCCAGAAAAAGAAAGCGAGGCTTGCATGCTTGTCTTTCAACCCG 

ATOTCGATGTNNATCmCCTGACCTAGCCAGTGAGAGTAANTGTT 

SEO ID NO- 1490 ACTCTXjGTTCNATANACTITITITATTTTANGGTTGANGCNGACC^^ 

AATAIWGAATAAAAGGAATGCTTATAGGAAACAATTNTGTATGGAATGCTATATGGCCAAGCCTN 
AGCCTITCGTCCAGTGCAACCCTTGCCTCGCTTGTCAACNGNGAAAAATTAGTTTGGTrAGANG^ 

ccatctggaaacncaccagcttctgctaccttcatgctcattgttaaaaaaagattaaccagtgtg 

AACATTCTGATCTGNTAATTCCAGGGNCTGANTTCTTATCAATGNACrGTrTGTTGGTAAAAATAA 
CCCCCAAAATGCTCAAAGCTTAAATGCWTA^rrcATTCC^AGT^WGCGAGTTCCTTANAAATGGAT^ 

GGCGGCNTGGNTAA 
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seq id no: 1491 accagagactctcctatctcacggttgaggcagacccaggatagaatagaga 
ataaaaggaatgcttataggaaacaatmgtatggaatgctagatggccaagcctcagcc^ 
gtccagtgcaacccttgccrcgcttgtcaacagtgaaaaattagtttggttag^ 
aaacacaccagcttctgctaccttcatgctcattgttaaaaaaagattaaccagtgtg^ 
gatctgttaattccagggactgttttctttccaatggactgtttgttggtagaat;^ 
gctcaaagctaaaatgcatcatcagtcctagtcggcagttccttaanaatggactggcggcgtgg 
ttgagctgatatggaaaagctgcccttcctgcagangatcaactgacctgctatcccaccccaaat 
ttcagcctgaggtatatttnagtnaggcaggtnctgtgcttntnanagcatagaacca^^ 
agcaaaaaggnngaggaatctagnaagaccgtttgattanatttatccatggggngaagggagg 
gcaangaccatgg 

seq id no: 1492 acctgcatcagcattagtgatcaacctgttaatccaaggnctttagaaaaac 
tnganatnattcctgcnagccaattttgtccacctgtttgaganc^ 

SEQ ID NO: 1493 acaagaatcgacctcactgttcacatggaaactctgaccaggncttctctgc 

ACCATGATTGGCTGGGAAACTGAATGATCrGATGTTCCCCANGCACTGTCrCGCCATT^^^ 
AGGAATATTCCTTTTGGTCCATCTTTGCTGGATTCATNTGT^ 

GCTGCTCTCCCATCAGCTCTTNCCTCCAGAGCAAGTGCTTGAAAATCATNATTCAGT^^ 
GAACGCACCTGATGTTCATGCANGAATGTTATCCCCAGTTCGGCCATCCGATGTTTGCm 
AGGACCGNNTNCTGCACTTCCTCCTNCCCCAAGCNGNTCACNANNCTGCCNCTGAA^ 
G 

SEQ ID NO: 1494 ACTTTTTTTTTTTT^^ 

TTTCATTGTATACTTTCCAGGCATTACATCCATGCTGNGACATTAGTTCCANATTCT^ 

TGCTTGATGCTNTAACTGGGCCATANAATTGNTTACACATTCTTGCCATGCAGNAATGTCAT^^ 

TTGACCANAGGAGGGGGCTGGAAGCTCATNTCGTTNCArACTGAGCAAATCAATTGGTTGTCW 

CACCAGTCCTTTNAAATTNATTCTCATTATNGTCAGNTT^ 

GTNGCNCANGNAGTTCNTAATTAGGTCGGAAATT 

SEQ ID NO: 1495 ATGCTGATTCTGAAGAGATCAACAGACAAGTTACATATTTCATAACAGGAGG 
GGATCCTTTAGGACAGTTTGCTGTTGAAACTATACAGAATGAATGGAAGGTATA^^^ 
CTCTAGACAGGGAAAAAAGGGACAATTACCTTOTACTATCACGGCNACTGATGGCACC^ 
CAAAAGCGATAGTTGAAGTGAAAGTTCTGGATGCANATGACAACAGTCCAGTTTGNGAAAAAGAC 
TTTATATTCAGACNCTATTCCTGAAACGTCCnTCTGGAAAAATTANCATGCAGATCT 
ACGCAGACATNCNGTCTAACGCCTGAAATTACTTNCACGTTATTGGGrrCAGGCGCTNAAA^ 
AAACTAGATCCACA(>JCACGNGAACTGAANNCGTCAA(rCCCTTGATCGTGAGGAGCAAGCT^ 
TATCATCTTCTCGTCAGGCCACAGATGGANGANGAANATCTTGCCAANCAGNTTGTGCTCACNCTN 
GAANCGTGACNATAATGCCCC>n^ATTTTTNGCGACCTTATNCC^ 

SEQ ID NO: 1496 ACGAAGAAAGCATTTCCCAAGCAATGAGTCTCTTAATGGAAAAAATAAAAGA 
GCAATGTAATTAAACTTTCCTTCAAAGAAAGGAAAGAAACCTAAATCTGTTATGGCA^ 
AAGATAAATATTTCCTAGTCCCATAATATGTGTTTGTTTTACAAGATATGC^ 
CTATGAAGCAGGGATCTGTGGCAATCCAAATTCATCCACCAGAACTCCATCCrrGTTTm 
AGTGGGAACACCTTCTGGAATTGCAGGTGCAGATGCTGCCTCATCCAAATAAGAACTGTOT 
AGCCAGAAGCTCATCACCTAGTGCATCCAACTCTGCTTCTAAATCATCTTTATO^ 
GCCATAAACTGCGNACTNAANGGCrmTTNGAATTTCATTGCATCT^^ 
CTNGTAAATCCTCAATCTTGGNCGGATCTTNAOTGCTTGNATGCCT^ 
TCNTACATAACCGGGNCTTNGNGTCCTCAAAGACTGGANGGNTAATTGGCT 

SEQ ED NO: 1497 ACAACCAGGGCGAGAAGAAGAACGCCCTGGCCCAATATCAGGAGATGGAGA 
AGAAAGTCAGCCTACTCAAGGACAATAGCTCTCTGGAATTTGACTCTGAGATGGTGGAGA 
CAGAAGTTGOGAGCTGCTCTCCAGGrrGGGGAGGCACTGGTCTGGACCAAACCAGTTAAAGATCC 
CAAATCAAAGCACCAGACCACTTCAACCAGCAAACCTGCCAGmCCAGCAGCCTCTGGGCTCTA 
ATCAAGCTCTAGGACAGGCAATGTCTTCAGCAGCTGCATACAGGACGCTCC(XTCAGGTOCTGGA 
GGAACATCCCAGTTCACAAAGCCCCCATCTCTTCCTCTGGAGCCAGAGCCTGCGGTGGAATCAAG 
TCCAACTGAACATCAGAACAAATAAGAGAGAAATAAGAATAGAATGAATGACCCCAAAATAGGG 
TTTTCTTGGGCGAGGATGTGCTGGATTANGAAAGGTGACATGACACAGGCAGAGCAGAGTGGCAC 
CCACCCAGAATACAGTGTGTGTTATTACGAGGAGCCAGCANTTGACCTAAGGCCITCTNCTACCT^ 
GTATNOCTTTTGAGNCGGAACCCTCTCTGCCCANAA 

SEQ ID NO: 1498 ACTTTTTITnTITTTTT^^ 

TGGCTTGGCTCACAATTTAAGTGGTTATAAATATOTATACATNTGATATAG 
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AATAAATATACATCATTTTTTGCTACACmGAACACCAACATGT^ 
AAAACGATATGAATNACNNAAANTCTATmrOCNGGANCACATT^ 

SEQ ID NO: 1499 ACTGGAGATGTAmGATAACCAAGGTTTTAGGTAAATTTTCACCAGTATT^^ 
TTCTATTTGCAAACTGAAAAATGTTGTAGGCTTAATGTAAAATAACCACATTAGT^ 
CTCTTAGAAGAAAGGCCATATTTTGCTCCTGCTTCTGTAA^ 

AATGGTAGTGTGACCTTTCACTTAATTCCTACTCCCTTAATGTGAGAGAGACAAAATGAGCTGA^ 

AAGGAAAATTCTGGAGTTCACTCCCAACCTTGAACATACTGACCGGACATCTm 

GATTTCTTCATGCCCCCATNGCTAATTGCCTTGTGGATNACGGGACAACCCTNm 

CAGCATCATTCGATGGTATCTTTGCANCAAACACCTGCAlSfNGATAANGACANN^^ 

CTGGGG>nTNGCCTTCATTACACCAATAGNGhrrATTGACTTGAATATCNNTAh^^ 

CCAlSnSITGGTATTTNNACTrGTGCIOTGTGACCC^ 

SEQ ID NO: 1 500 ACAAATTCCAGTGTGCAGACCACAACCTCAAAACAAAAAAGATCTATTTCTA 
CTGGATCTGCAGGGAGACAGGTGCCTTrrCCTGGTTCAACAACCTGTTGAOT 
GATGGAGGAATTAGGCAAAGTGGGTTrrCTAAACTACCGTCTCTTCCTCACCGGATGGGACAGCA 
ATATTGTTGGTCATGCAGCATTAAACTTTGACAAGGCCACTGACATCGTGACAGGTCTGAA^ 
AAAACCTCCTTTGGGAGACCAATGTGGGACAATGAGTTTTCTACAATAGCTACCTCCCACCCCA^ 
TCTGTAGTGGGAGTTTTCTTATGTGGCCTCGGANTITGGCAAAGANCCrGCGC/^ 
GATATTCCANCTGGANCCTACAAAAGTTNAmTmACTTCAACAA/^^ 
ATAAGGACGGTAATCCGCATTTGGTCTTNGATATTCAGNAATTTACITG^ 
GTTCTTAGNNTAAGAATGNGCT^^^AAGCCTTGAAANCTGG^^^^ 
T 

SEQ ID NO: 1 50 1 ACCTCAAAGTGACGGGGAGAATGGAACCAAGTTGGAAAACACTGCAGGATA 
TTATCCAGGAGAACTTCCCCAACCTAGCAAGGCAGGCCAACATTCAAATATAGGAAACACACAGA 
TGCCACAAAGATACTCCTCGAGAAGAGCAACCCTGAGACACATAATTCTCACATTGACCAAGGTT 
GAAATGAAGGAAAAAAAAAT(m'AAAGAGCAGCCAGAGAGAAAGGTCAGGTTACCATAAAGGGA 
AGCACATCAGACTAACAGCAGATCTOTCAGCAGAAACTNTACATGCCANAAGAGAGTGGGTGCCA 
ATATTCAACATTCTTAAAGAAAAAGAATTTTCANCCCAGTATTT 

SEQ ID NO: 1 502 ACAGGAAAACGCTGCCGCGGTCCACAGTGTCGATTCTGGATGACCACATTAG 
CCACTATCCACAGCTAGAGGATCCCATGGGCTTCTTGAATGCATATATGGGCTTCATCAACTCCTT 
CTGAGCTGGAAAGAGTAGCrrCCCTGTATTACCTCCCCTACTCCCTTATCTGTTGTGTATTCC^ 
AGGAAGAAATGCCCAAAAGAGGTCCTGGCCATCAAACATAATTCTCTCACAAAGTCCACTT^ 
CAAATTGGTGAACAGTGTATAGGAAGAAGCCAGCAGGAGCTCTGACTAA GGTT GACATAATAGTC 
CACCTCCCATTACTTTGATNTCTGATCAAATGTATAGACTTGGGTTTGGT^^ 
AAATNTTGNNTNAGCANTrACTmraNlCrrGTTGC;^ 
ACANTTTGGGACTTCmGAATAATTANAAGNGCTAATTTCTGGNCCA 
TTAAGGAGGAGGAGAANGGNGGNTCCTTCCTTTCraGATGACmTNTG 

SEQ ID NO: 1503 ACGCGGGTGATCCATGGGAmCATTCTAATGACCATGTGAAGATGTTTGAGT 
CCTCCTTTGCCTTTCCTCAGAAAGAATCCTTCTAAGGCACAAATCCCTTAGATG 
TATrGTTAACrCACTCATATTGAGATCATTTTTAGAGATACCAGGTTT^ 
GGTTCCACCCTCATGGGATAAAACTGCTTACAAGTATTTTGAAAGAAAAACTGACCAAAA 
AATTGTTACTAAGGCAATCACGCACAGGTGACCGTATGTCTTATCTGATTTGTTTTAACT^ 
GCCCAAACTCAGAATGGGAATITCACTGNCANNAATGANCATNCCCTGNNAAAGAAAAGNTA 

SEQ ID NO: 1504 ACTCCCCAATGGTGGATTTATTACTATTAAAGAAACCAGGGAAAATATTAAT 
riTAATATTATAACAACCTGAAAATAATGGAAAAGAGGTTTTTGAATT^^ 
CCTTCTTAAGTGCATGAGATGGTTTGATGGTTTGCTGCATTAAAGGTATT^ 
GAGGGCAAGTGACTGCAGTTTTGAGAATCAGTTITGACCTTGATGATTTm 
AATAAATGTTTGTAAANNAGTGTAATAAAAATCCCTTTGCATTC^ 
AGGAAAAGGCTCGTGACCATTTGTTTTTTTGNGGGTATAGTTGCTATT^ 
G>nTTACTNAGGAGGNTNATNTACATATANCCATAATTTCTCTNGGACC^ 

SEQ ID NO: 1505 acagtgggcatgcagcgcctcgggacgacacccagcgtttatgggggtgctg 
gaggccggggcatccgcatctccaactccagacacacggtgaactatgggagcgatctcacaggc 
ggcggggacctgtttgttggcaatgagaaaatggccatgcanaacctaaatgaccgnctagcgag 
ctacctanaaaaggtgcggaccctggagcagtccaactncaaacttgaagtgcaaato 

GGGACTTTNATTTTTTrirn^^ 
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SEQ ID NO: 1506 ACCCAACTAACCACAAAGCCAGGCCTGCTGGGATGAGGATAAAAATCAAAT 
AAACAATCCTCTCAACGTCCAGGAAATGGCTGCCAGGCCTTCTGACTTCATGAAACGCCGGGG 
NNCTGTGCTCITCrATGCTGGGTCCTCTTCGTCCATTGCNACATATGNTGN^ 
CTCNGNCNATTGGCTCTGCATGGCATAATTGCCTGGNGACCG 

SEQ ID NO: 1 507 acccatgatttggacactttgtcgggcccatttacttatactgcaaagcgtcc 

TTCANATATCAAATTCAAGCCTCTAAATAAGACCAAGGAGTATACAGCCTGTGAACTGATGA^ 

TATACAAAGACTGACAATCACCTGAAACATTATTTACATATCATTGAAAACAAACCCCTGTATC^ 

GTTATCTATGATAGCAATGGTGTCGTCCTTTCAATGCCTCCCATCATCAATGGGGATCATTCCAGA 

ATAACAGTNNATACTAGAAATATTTTTATTTGAATGCACGGGAACT^ 

GT^WTTGAGATTATTGTCACCATGTTCAGGGATATTGNGGAGATT 

SEQ ID NO: 1508 ACAGTTTTrATACTGAAGCTAGTATTGAGCTGCACTTGAATTCA 

CAAAATAATTGCCTGAGCACACACACATTCCACACGCATCATTAAAGGATAGCCATTTATTCTO 

TCirCATCCTCTTCCTCTTCATCTTCATCTTCrrCTTCCTCCTC^ 

TTCTTCTTTGAGCCTGTTGGCCTGCCAGGGCCCTTCTTTCCTGCTTCACT^ 

TGCAGCAATATCCTTTTCATATlTCTCCTTTAGCTrANCTGCTTT 

TGGCTGACTGCTCAAACCACATTTACCCAATrTTTTGCAGATCCCC/^^ 

TNNCCTTTTGNCTTNTGGGCGATGTCAAAGCAAAACNNGAAGAAGGCAG>^ 

NCATTGGGGNCCTTTTTTTCCCCTTCTTK^ 

SEQ ID NO: 1509 ACTATTCCTTACAACTGGAGTGGGTAGAAGCCTTATGAAAATTATACTGACA 
ACCTGATCTCGTTTACTCCATGTTAATCACATTCCTACCAACCTAAATTTCT^ 
TTTGGAGGAAGCCCATTACTCCATAACTGACTGGATGGTCCAGTGTCATTTTGATC^ 
AATOGAAATTTATAATATAAATATATGTTTTAATACCACAAACACTGTC^ 
TANTGGTGTATATGGTCCACTGGTTTCAGGCCAAATCTAGAATTTAGTGATACTGGCTCAG^ 
TTTAAGTTCTATTCANn^TTTCCTGGGTGNGGAGAACCTANAm 
TTTGGTTCACNATNTCIWirmCTOTGN^ 
TCTCCAGGAAAACATGGTGT(>ITTTTCTNCGNTCGAATA 

SEQ ID NO: 1 5 10 ACTTGGTGAATATGAGGAGTATATTACTAAACTTTTCAACTACCACAAAGTTC 
TTCCTATGAATACAGGAGTGGAGGCTGGAGAGACTGCCTGTAAACTAGCTCGTAAGTGGGGCTAT 
ACCGTGAAGGGCATTCAGAAATACAAAGCAAAGATTGTTTTTGCAGCTGGGAACTTCTG^ 

gacgttgtctgctatctccagttccacagacccaaccagttacgatggttttggaccatttatg 
gggattcgacatcattccctataatgatctgcccgcactggancgtgctcttnaggatccaaatgt 

GGGCTGCGTTCATGGTNGAACCAATNCANGGTGAAG CANG CNGTTGTTGNTCCCGATCCAGGrm 
OsITATNGGAira^CNANANCTCTGTCCAGGCACCANNTriTCT^ 

SEQ ID NO: 1511 acagacaaaactacagacttagtctggtggactggactaattacttgaagga 
tttagatagagtatttgcactgctgaagagtcactatgagcaaaataaaacaaataagactc^ 
ctgctcaaagtgacgggttcttggttgtctctgctgagcacgctgtgtcaatggagatggcct 
ctgactcagatgaagacccanggcntaaggttgggaaaacacctcatttgaccttgncagctgac 
cttctaaccctgcattttgaaccgaccaa(>ittaagtccagagagtaaacrrga^ 
gacnttccanaagttaatcttttgaamrctho^iacactggngaana^ 
gcntggaananncttanttttttggaaaccggntttct^ 
gggcccagtcccttgntg 

SEQ ID NO: 1512 ACCTAGAATTCITCAGCATAAATGCCCAAAGAAATGCATTAGCAATTGCAGC 
TAArrGCTGCCAGAGTATCACGCCAGATGAATTTCATTTTGTGGCAGATTCACTCCCAT^^ 
CCAAAGACTAACACATCAGGATAAAAAGTCAGTAGAAAGCACTTGCCTTTGTTTTGCACGC^ 
TGGACAACTTCCAGCATGAGGAGAATTTACTCCAGCAGGTTGCTTCCAAAGATCTGCTTA 
TTCAACAGCTGTTGGTAGTGACTCCACCCATTTTAAGTTCTGGGATGm 
GTTTTCTCTGATGTGTTCCAACTGTCAACTTTNACrGGTCAAOT 
ACNCTTNACTTTCTCCTGTNTGGGGCCTCCAATNGA^ 
CGAANCCrrAANAGTTGTTNAACTGACATCTOTGATrrGNGA^ 
AGGChrrTTTTNAATTTATNCCTGGTNGAAGAAGGTAATG 

SEQ ID NO: 1513 ACAGTTTTTATACTGAAGCTAGTATTGAGCTGCACTTGAATTCACATTOT 
CAAAATAATTGCCTGAGCACACACACACATTCCACACGCATCATTAAAGGATAGCCATTTATTOT 
CATCTTCATCCTCTTCCTCCTCATCTTCATCnTCTNCrTCCTCCTCCTC 
TCTTCTTTGAGCCTGTTGGCCTGCCAGGGCCCTTCTTTCCTGCTTC^ 
GCAGCAATATCCTTTTCATATTATCTCCTTTANCTTANCTGCTTTCTGT^^ 
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TGGCTGACTGCraANACCACAATITCACCCAATNTNTTTGCWlAATCNCCAAAGGANAGG 
GNNGTNNAACTTTTGTraTTGGGCQATGTNCANANa^AANNAGGAATAAGNCAGAATGGNG^^^ 

TTTTAGGACATNGGGG 

SEO ID NO- 1 5 14 ACAATGCCTGCATCCTGGAGTATAAACAGAGTGGCGAGAGCATTGACATCAT 
TACGCGAGCCCATGGCAATGTCCAGGACCGCATTGGCCGCCCCTCATAGACCGGCATTATTGGCA 
TCmTGACCCTGAGTGCCGGATGATTCGCCTGCGTCTCrATGACGGCCrTTTCAAGGN^^ 
CTAGATCNCGATAATAAANAACTCNAOGCCTTCANCATCCGNCTGGAGGAGCTGCATGTCATTAG 

ATGTCAAAGGNCTATTTTGGGT 

SEO ID NO: 1 5 1 5 ACAGACAAAGTGGGAGGTTTTATTTCrTGGtCTCITCCTCCTrGGATAAAGAC 
TTGATGATCTCCTCClTCTrGGCCTGGAGGCGCTCTTCACGGCGCITGCGTGCTTCCTrGGTC^ 
ACCTGCGGGCCTCANCCTGGTCAGCCAGGANCTTCTTGCQGGCCTTGTCTGCCTTCAGCTTGTGGA 
TGTGTTCCATGANAATCCGCTTGTTTTTGAACACATTCCCCTTCACCTACAGGNACTGTAT 

SEO ID NO- 1516 ACGCGGGGGTTCCTTCAGTCCGCTGGTCCCGAGCACGAGCTGTGAGGGGATT 
CACTTGTGTGCGGAACrCCTCGGAACCATGGCGTCCCmCCCTTGCACCTGTTAACATCm 
CAGGAOCTGATGAAGAOAGAGCAGAGACAGCTCGTCTGACrrCTTTTATTGGTGCCATCGCCATTG 
GAGACTTGGTAAAGAGCACCTTGGGACCCAAAGGCATGGACAAAATTCTTCTAAGCAGTGGACGA 
OATGCCTCTCITATGGTAACCAATGATGGTGCCACTATTNTAAAAAACATTGGTGTTGACAATCC^ 
GCAGCTAAAGTTTTAGTTGATATTGTCAAGGGTTCAAAGATGATGAAATTGGTGATNGCACCTACC 
mGTAAlWGTTTNNANGCATAATmTAANGG>mAGCANAATNTTrNh^ 
CCAGACCAT^m'ANCGGNGTGGAGNGAAGCCCNAANGTTGNAAGAGAGGCmGTTGATTNTNAG 
TTG^rrCATGGTT^mGATNAGGTOAATTCGTCNNGATTTAATGATATTGGTGGNCA 

SEO ID NO: 1 5 17 acttcaagttaaagtgaataaccacttaaaaaatgtccatgatggaatattc 

CrCTATCTCTAGAATTTTAAGTGCTTTGTAATGGGAACTGCCTCTTTCCTGTT^ 

tgtcagaaaccagttatgtgaatgatctctctgaatcctaaggqctggtctctgctgaaggttgta 

AGTOGTCGCITACTTTCAGTGATCCTCCAACTTCAriTGATGCTAAATAGGAGATACCAGGTTG^ 

AGACCITCTCCAAATGAGATCTAAGC(mTCCATAAGGAATGTAGCTGGTTrCCTCATTCCTGAAA 

GAAACAGTTAACTTTCAGAANAGATGGGCTTGTTITCTTGCCAATGAGGTCTGAAATGGAGGGTCC 

TCTGhmKjANTAAAAGGAGGGTTNAACTGTTGNTTGQ^GGAATAAGGCmTANTATGTrACCT^ 

GTGGCATTTATGAAAAGAGGGGACCAGAAGCCAAAGACITAGTATAl-lllll"ll'CCnTGTCCCrTC 

q«icx:atanccn<xnttnagtctttgtttttttct^^ 

SEO ID NO- 1 5 1 8 acgcgggggctgtqgtctgagctagagggtgaagctggcggagcaggagga 
tcggcgagcagtctgaatgccagaatggataaccgttttgctacagcatttgtaattgcttgtgtg 
citancctcatttccaccatctacatggcancctccattggcacagacttctggtatgaatatcga 
agtccagitcaagaaaattccagtgattt gaata anagcatctgggatgaattcattagtgatga 
ggcatatgaaaagacttataatgattcactttttcnataca 

SEO ID NO- 1 5 19 acatgctccatcttccaggaggaccactctctgtggcaccctggactacctgc 

CCCCTGAAATGATTGAAGGTCGGATGCATGATGAGAAGGTGGATCTCTGGAGCCITGGAGTrCTTT 
GCTATGAATTTTTAGTTGGGAAGCCTCCrmGAGGCAAACACATACCAAGAGACCTACAAAAGA 
ATATCACGGGTTGAATTCACATTCCCTGACTTTGTAACAGAGGGAGCCAGGGACCTCATTTCAAGA 
CTGTTGAANCATAATCCCAGCCAGAGGCCAATGCTCAGAGAAOTACCTNGGCCGCGACCACOCTA 

AAGGCO 

SEQ ID NO: 1 520 ACGCGGGGATTATTGTAAATATTTGATCTTGAATCACTTGACAGTGTTTGTTT 
GAATTGTGTTTGTTTmCCmGATGGGCTTAAAAGAAATTATCCAAAGGGAGAAAGAGCAGTAT 
GCCACTTCITAAAACAQAACAAAACAAAAAAAGAAAATTGTGCTCTTTTCTAATCCAAAGGGTAT 
AmGCAGCATGCTTGACTITACCAATTCTGATGACATCTTrACGGACACTATTATCACTAAGACCT 
TGmTGGTGAAGTCmAGTCTITITCATGTATmCCTCATGATTTmcr(^ 
CTATGCCTTACCTTTGTAAATAirmGCTrGTGTTGCCGCAAANGGGATAATOT'GGGAAA^ 
CCAAATNATTNGGCTCACTTTATAAAANGAAAGNATTTAAAAACCTTCAGCTOGGCTANACAGTA 
TATTACCTTTGGNATNAAATTCTTCATGGAGTGTCACCrCAAATGNATACTTTGGGTNGGGTACTT 

TTCTTATNAAAATTTNNATAAACTGAATT 

SEO ID NO- 1521 ACACAGGACCAATGCTGCCCATCCACATGGAAnTACAAACATTCTACAGCG 
CAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGG 
AGACGGGGGAGCTGGAGAATACTTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAG 
TTTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCCri 1 1 1 1 lATTCGTG 
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GTCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGrrCTCAACATTGACTTGGCCCCCACG 

ATCCrGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACT^ 

GACCCAGAAAAGCCAGGTTACAAGGTTTCGAACAAACAAGAANGCCAAAATTTGNCGTGATACAT 

TCCTAGTNGNAAGAGACAAATTTTTACNTAAGANGGAAGAATCCAGCANG^^ 

AATCACTTGCCCAAATNTGAACGGGTCAAAGAC 

SEQ ID NO: 1522 ACTTTTTCTTTCCITAGACTTTGGCTTACTGGA^ 

GGAGAAGTAAATTTGCTGTAATAATTTTGCTGTAAATAAAACAAAGAGm 

GAATGTGAAGTAAGCATGAAGAGACAGGCTTTGGGAGAAATACCAGAAAGGGAriri'i'CAAAGA 

TGGCATTGTTTAATCTCCGTGTGGCCCTCGGTTGTGCAATCA CAGA TGAGCCAGAAGAGGGCCAGC 

CCCCTACrrGTTTGGGCTCCGAAACTCTTACCAAACATCAATTTTTA 

GTATGTGCTATCTCTAATACGCTACTTCGATATTTATTAAAGAAGTATTm 

GGCTCATTTCATTGAAAACAACTGACTATGATGATAGACAGCTCCTGATTGGCAAAAGTO 

TATATTCAGAATTAAATTTTGCCTGCNCCCTAAACACTGACACATTTAAC^ 

GAGAANAGTGGTAAAAGCTGTATTAGCCAAAATTGGCATCC 

SEQ ID NO: 1523 GCCCTTNNCNGNGGCCGNNCNGNCGGGNCGCGGGGAGGGGCANGTGTNGCC 
TCTGTGCCTCGTTGTCCCCTGGCGCTACCCGGACATN'nTCAGGGTGCCGGCACCATGAAGATCTG 
GACTTCGGAGCACGTCTTTGATTCACATCGGCAGCCACCCGTGGGAAACTGTTACAACAGCTGCA 
ATGCAGAAATACCCAAACCCTATGAACCCAAGTGTGGTTGGAGTTGATGTGTTGGACAGACATAT 
AGATCCCTCTGGAAAGTTGCACAGCCACAGACrrCTCAGCACAGAGTGGGGACTGCCrrCCATTGT 
GAAGTCTCTTATTGGTGCAGCAAGAACGAAAACATATGTGCAAGAACATTCTGTANTTGATCCT^ 
AGAGAAAACAATGGAACTTAAATCTACTAATATTTCATTTACAAAC^^ 
AGACTTATATACAAACCACATCCTCAGGATCCCAGAAAAAACTTGTTTTTG 
AATTACCGTGAAAGGAGTTACCTCAACANGTANCTITGAANGACTGATGGGNi^ 
NCGACCA 

SEQ ID NO: 1524 ACTTTTTTTTTTTTTT^ 

TTTAANATCTGAAATACAATTCCTAAAATATCAACTTTTCCANA^ 

CATTGCCTNTATCATGTTANAACGTGCATTANACTCAAATACAAAAACCNTGAAACAAA 

TCCTTCAACAATITGAGCAAAGATAGAATGCCTAANAACAACATAGATGGACTTGCANAGGATC 

GCrGTTTTACTTCAAGCCCCATAAAAAAAAAAAAAGAGCACAAATGC^^ 

ACATTAAAGTTGAACCTTTGGCACTAGGAATCAGGGCGTTTTGTCNCATAG 

AAAAATTGTGTATGTGTCAAAGGGATNGGAACCACCACCATTCAAGCAATGTTGTCAAC^ 

AANAAAATGTTCTACTGGNATGGTTCTmTTTGGG CTAAT TACCTGNA 

TTGAAAATNAANAAAAGGAGCCTACNCTTCTTTTATT^^ 

SEQ ID NO: 1 525 ACCTACATCAGATCTAACCTTGATCCCAGCAATGTGGATTCCCTCTTCTACGC 
TGCCCAGGCCAGCCAGGCCCTCTCAGGATGTGAGATCTCTATTTCAAATGAGACCAAAGATCTGCT 
TCTGGCAGCTGTCAGTGAGGACTCATCTGTTACCCAGATCTACCATGCAGTTGCAGCTCTAAGTGG 
CTTTGGCCTTCCCTTGGCATCCCAAGAAGCACTCAGTGCCCTTACTGCTCGTCTCAGCA^ 
GACTGTGCTGGCAACAGTCCAGGCTCTGCAGACAGCATCCCACCTGTCCCAGCAGGCTGACCrGA 
GGAGCATCGTGGAGGAGATTGAGGACCTTGTTGCTCGCCTGGATGAACTCGGGGGCGTGTATCTC 
CAGmGAAGAAGGACTGGAAACAACAGCGTTArrTGTGGCTGCCACCTACAAGCTCATGGAT^^ 
TGTGGGGACTGAGCCATCCATTAAGGAGGATCAGGTCATCCANCTGATGAACGCGATCTTCAGCA 
AGAAGAACTTTGAGTCCTNTTCCGAACCTTTANCGTGGCCTCTGCAACTG 

SEQ ID NO: 1 526 ACGAAAAACCTGAAATCACATGCCTATGTAAGGAAAGTGCTATTCACCCAGT 
AAACCCAAAAAAAGCAAATGGATAATGCTGGCCATTTTGCCTTTCTGACAm 
CAAGAACCTCCCCTTTCCCTTCCCCCAATAAGACCATITAAGTGTGTGTTAAACAACTAC^ 
CTAAATAAAAAGTTTGGCCAAAACCAACCATGAAGCTGCAAAGGTGCTTGCrCTTACTG 
TTTTTGCAACTCTGTAGTGTCTCACTTTTAAAGGAACAGC^ 
GCAATGAAGTTATCTCCAACTTCCTAAAGGCTTATGACTTCTAAAAAGTGAATCT^ 
ACATCAGATTTAAAGCATCAAATGCCTGTGAAACAGCAAAGATGGTAAGCAAAGCAAACTAGm 
TTCCGTCTAAAGTCGAAATTGAACACTTACCTTCCTCATAGTACC 

SEQ ID NO: 1 527 ACAGACTTGTTriTGAGTGTTGAGTGGCAGGGACAAAATAAGGGAATGTTAT 
TITITAAGAAAATTCATTTTCATTGTTGTCTCCTTCCT^ 
TGTATATTTTATATTAAATCACTTACTATTGATTTTTGTTGTGA 
ATAAAATCTTGGCTATTGCCCAAAACATAGTAAAGGGTCACGTGTGACTTTTTA 
AAATTCTGCCTTTGTGAGTGCACATGTCCACATTTCATCCCTCCrrCCCTCA^ 
GGCATTAAAGAATTGTTGATGTATATGCAATGTCTGTTAAGCATGCACTATGTATTTCATCC^^ 
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TATTGGGTCTGGGACTGAAGTTTTTANCCAGCATGGACCTAACCTACTT^ 

TGTTTTGTTACAGGCAAAATCTGGTATGGCGTGAATGCCATGGGTCATTCTGAATAT^ 

GGAATTTTATCATTACCCCGANGGTTGGAATACCGTGCCTTT 

SEQ ID NO: 1 528 ACACTTTGAGACGCAGGAAGCAGCTGAAAGAGCTATTGAAAAAATGAATGG 
AATGCTCCTAAATGATCGCAAAGTAmGTTGGACGATTTAAGTCTCGTAAAGAACGAGAAGCT^ 
AACTTGGAGCTAGGGCAAAAGAATTCACCAATGTTTACATCAAGAATTTTGGAGA^ 
GATGAGCGCCTTAAGGATCTCTTTGGCAAGTTTGGGCCTGCCTTAAGT^^ 
GAAAGTGGAAAATCCAAAGGATTTGGATTTGTAAGCTTTGAAAGGCATGAAGATGCACAGAAA 
TGTGGATGAGATGAACGGAAAGGAGCTCAATGGAAAACAAATTTATGTTGGTCGAGCrCANAAAA 
AGGTGGAACGGCAGACGGAACTTAAGCGCAAATTTGAACAGATGAAACAAGATAGGATCACCAG 
ATACCAGGGTGTTAATCTTTATGTGAAAAATCTTGATGATGGTATTGATGATGAACGTC™ 
AAGAGTTTTNTCCNTTGGTACC 

SEQ ID NO: 1529 ACGCGGGGAGTCAGCCTGGCTCCTGTTGAATAGTCTACCCCCCTTGCACTCTA 
CCTGACACAGCTGCAGCCTGCAATTCACTCCCACTGCCTGGGATTGCACTGGATCCGTGTGCTCAG 
AACAAGGTGAACGCCCAGCTGCAGCCATGAAGATCTGTAGCCTCACCCTGCTCrCOT 
TGGCTGCTCAGGTGCTCCTGGTGGAGGGGAAAAAAAAAGTGAAGAATGGACTTCACAGCAAAGT 
GGTCTCAGAACAAAAGGACACTCTGGGCAACACCCAGATTAAGCAGAAAAGCAGGCCCGGGAAC 
AAAGGCAAGTTTGTCACCAAAGACCAAGCCAACTGCAGATGGGCTGCTACTGAGCAGGAGGAGG 
GCATCTCTCTCAAGGTTGAGTGCACTCAATTGGACCATGAATTTTCCTGTGTCT^ 
AACCTCATGCCTAAAGCTCAAGGATGAAAAGAGTCTATTGGAAACAAGTTGCCCGGAATCTO 
TCACAGAAAGACATTTTGTAGATTTTCCAAGACAGOTGTGAAAACCA 

SEQ ID NO: 1530 ACGCGGGACAGAGAGCATCTCCGTGGCCAAAATCAGTGTCTATGGGACACTC 
TGAGCTGTGCCACTGCCACAGGGGTATTCTGCCTTCAGGACTCTGCCTTCAGGAACACGGGTCTGT 
AGAGGGTCTGCTGGAGACGCCTGAAAGACAGTTCCATCTTCCmAGACTCCAGCCTT^ 
CACCTTCCCmACCAGGGAAATCAOTCCmAGGACTGA/^ 

AAGTCCTGCCCTCATCTGAGAATACTGTCTTTCCATATGGCTAAGTGTGGCCCCACCACCCT^ 

GCCCTCCCGGGACATTGATTGGTCCTGTCTTGGGCAGGTCTAGTGAGCTGTANAATTGAATCAAT^ 

TGAACTCAGGGAACTGGGGAAGGCTGACCTCCTCTTTGGTGTTGCGGTAAGATAACCGACAGGGC 

TGGTGAAAAGTCCCAGATGGCAGGATATTTGGTTTCAGAGTAAGGACTAGGTGCACCACCATGAT 

TGACTATCGATCAAAATGGTTGGAACTTAAAAATTTTTA^^ 

SEQ ID NO: 1 53 1 ACAGAGTATGTAGTGGGCATCTGTTGAATGAATGCTTTTCCCAGTAGCAGTGT 
ATTCATACAATATTAATATAATTGTCCCCTGGCrTACAGATAAAAATGAAAGC^^^ 
TGAGTGAGACCCAGGTGTTCTTCCTCCACCCCTAGTGGTCCCCTGGGCAGGTCTTTT^^ 
ACACTCACCATTCTGTTCTGTAGTCAATCATTGATTGACTTGTCTGTGAACrrGCAGGA^ 
ATAGTTTCATTAGCACAGAGTAAACATGmGCCATGCAAGGTTATTTTGCATCTGCAm 
ATAATGTTGAATCAATGAAAAGTTTTGATTAAGCAGTAGTTGTAGATATGCT^ 
ACTAATATCAAGTGGAGATGTTTTACTTITAAGGGTATTGCOTTT 
TTCCTTmTGTNATGTAAATTAATTTGCTNGCAACI^^ 
ATTGCCTTTACAAGTNCCTTCCCGGGCGGGCNGTTCAAAAGGGC 

SEQ ID NO: 1532 ACTTGTATTTTCACAGATGGATTATCTGGGGTAATTTTCTTCA^ 

GTTATACACAGTGAAAATGTATTATAGAGTAGAATAGTAAAGCrCTAGGGGTTTCAGAAAGC^ 

GATGAACAGATGACAAACATCTGAAACCCCCTCCACACTGTTACCCAGTGTGTATATAATGAOT 

TTATAGCTCAGTGTGCCCTTGAATCCATACAGTTTCITAAAAGACAATAAAATOT 

TTAATGTAACTTCTAAGTTCTAGAAAATGCTGATTCTGTC^^^ 

TGArrTGTTGCTTGGATTTCCTGAGAATTTCrCTATTTGTAGGAGGGGTT^^ 

TTGATGACAATTACTTTATGGGTGTGATGCACCGATGGTAGCCAAGGAATOTGTTGGGGAAGN^ 

GGAAAGAAACCTTTCTTTTCTTTTATTTCAGTTAAAGTAAACCTTTA 

CATTAACAAGTTATTTTATNGGNGTTCAGAGAATTAAACNTGACT 

SEQ ID NO: 1533 ACATrTCTTTCGTGTTCAAACCACGGAGTTCACAACACAGCAGCACACACAG 
CTGGGCGCTITGTGGTCTCGGCACCCTCGGCTTCCCCTTCACGGGGCCGCTTTCGACTAGTAGA^ 
GCTGGGCTAGAACCTGGGGTAGAGCTGTGGATTCATTTTCCTCAGA.CAGAAGATCTTGAAAC^ 
TCITCATGTCTTCATCCTGAAGCCACGGGTGTCTTAAGGCTTCTTCTGTCGTAAT^ 
ATCCACTACCAACAACTTCTTGACAAGGTCCAGAGCmCTCTGAGACTTCTGCCCAGAOT 
AATGAAGTTGTATTTTCCACTGGTGATCTGATCCTTCAGTGACACTTGAGTCCTATGCT^ 
AGGTGGATACCCACTAAGGCAGATAAAAAGAATAACTCCTAAACTCCAGCAGTCCACAGCACGG 
NTATACCCAACAGTCCCAACAGAAACAAGAACTTCAGGCGCCAAGTAGGGGGGGGTCCACATAA 
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GGGTCTCATGAGAGAAGTCTCTCCAAAAATCTTGGAGTGNCCCAAAATC 

SEQ E) NO: 1534 ACTTCTATTTATCTTTGATTTCAGTCTTGGCAATTGTTTAAAAAA^ 
GATTTGTTTTATTAGGrrCAGAGTATGTGGGGAATTATAGAATCCCTCTTTCATC^ 
TCTTTTGTTAACATATTTGTTATGCCTTATTCTAAAATTGAGCCTCAAA 
CAGATGCTTCTATAGAGGTTCTTTGACCTAAATAGTTCAGCATTTGTAT^^ 
TCAGATTCCTAATCATAGCCCGTAAG^VGGAATGTTACmAAT^ 
GTGTCCGCATTTTTTTTTTrcnTAAAATCATAGCC^^ 

TTTTATTGATGGGCATGCAGGTGGGTGTTACrTGGAAATGGCCAATTTTTAT^ 
AAGAAAAAAAN 

SEQ E) NO: 1 535 ACGCGGGGCCTTTCTAACTCCGCTGCCGCCATGGCTCCTGTGAAAAAGOT 
GGTGAAGGGGGGCAAAAAAAAGAAGCAAGTTCTGAAGTTCACTCTTGATTGCACCCACCCTGT^ 
AAGATGGAATCATGGATGCTGCCAATTTTGAGCAGTTTTTGCAAGAAAGGATCA/^ 
AAAGCTGGGAACCTTGGTGGAGGGGTGGTGACCATCGAAAGGAGCAAGAGCAAGATCACCGTGA 
CATCCGAGGTGCCTTTCTCCAAAAGGTATTTGAAATATCTCACCAAAAAATAm 
ATCTACGTGACTGGTTGCGCGTAGTTGCTAACAGCAAAGAGAGTTACGAATTACCGTTNCT^^ 
TTAACCAGGACGAAGAANAGGAGGAAGACGAGGATTAAATTTCATTTATCTGGA^^ 
TGAGTTCTTGAATNAAACITGGGAACCAAAAATGGGGGGTTTATCCTTCGT^^ 
TTGAANCANAAAATTGGATATNNTTATNCAAAAGGCTTCCCTTNGGTOCCC 

SEQ ID NO: 1 536 ACAGATGTCTGCGTGTTTGCAGCCCAAGAAGATCTAGAGACCATGCAAGCAT 
TTGCTCAGGTTTTTAACAAGTTAATCAGGCGCTACAAATACCTGGAGAAAGGTm 
GTAAAAAAGCTGCTGCTGTTCTTGAAGGGTTTTTCAGAGTCGGAGAGGAACAAGCT^ 
GACTGGTGTTCTTCTGGCTAATGGAACACTTAATGCATCCATTCrrAATAGCC^ 
TTTGGTTAAAGAAGGAGmCAGCAGCTTTTGCrGTGAAGCrC^ 

AGGTATCAATGCAGTAGCTGCAAGTCnTCGGAAAGTCAGCATGGATAACAGACTGATGGAACTCT 
TTCCTGCCAATAAGCAAAGTGTT(JAACACTTCACAAAATATTTTACT^ 

CTTTCAGAATATGTTCGGAATCAGCAACCATCGGAGCTCGTAAGGAGCTCCAGAAAGACTTCAAG 
AACX^ATGTCCCGNGGGNGATCCATTTANGGTATTAATTTTATATGTCAGG 

SEQ JD NO: 1537 ACGGGGGCTTTTTCCTCTCTTCAGCGTGGGGCGCCCACAATTTGCGCGCTCT^ 

trrctgctgctccccagctctcggatacagccgacaccatgggtttcggagacctgaaaag 

ccggcctccaggtgctcaacgattacctggcggacaagagctacatcgaggggtatgtgccatca 

caagcagatgtggcagtatttgaagccgtgtccagcccaccgcctgccgacttgtgtcatgcccta 

cgttggtataatcacatcaagtcttacgaaaaggaaaaggccagcctgccaggagtgaagaaagc 

tttgggcaaatatggtcctgccgatgtggaagacactacaggaagtggagctacagatagtaaag 

atgatgatgacattgaa:tctrrggatctgatgatgaggaggaaagtgaagaagcaaagagctaa 

gggaagaacgtctttgcacaatatgaatcaaagaaagccaaaaaaacctgcot 

tttcatcitactagatgtgaaaccttgggatgatgagacagatttggcga 

seq id no: 1538 acgcgggtgaggggattgatttgacgcacaatcctgagttcaccacctgtga 
gttctacatggcctatgcagactatcacgatctcatggaaatcacggagaagatggtttcaggga 
tggtgaagcatattacaggcagttacaaggtcaccraccacccagatggcccagagggccaagcc 
tacgatgttgacttcaccccacccttccggcgaatcaacatggtagaagagcitgagaaagcot 
ggggatgaagctgccagaaacgaacctctttgaaacrgaagaaactcgcaaaat^ 
tctgtgtggcaaaaagctgttgaatgccctccacctcggaccacagccaggctccttgacaagc^ 
gtrggggaagttcctggaagtnactngcattaatcctacattcntctgtgatcccccagat^ 
ncccctttggotaatggncccgctctaaanaggggtctgacttggncgctt^ 
tgaaaaaaagaagatatgcaatgcgntattnctgancttaatggatcccntng 

SEQ ID NO: 1 539 ACAAACACTGAACGCCCTGATACACCTACAAACACGCCAAATGCACCTGGAA 
GGAAAAGTTGGGGAAAGGGAAAATGGAAGTCAAAGAAATGCAAATATTCTTTCAAATGTGTAAA 
TAGTCTCAAGGAAGATCATAACCAACCATTGTTTGGAGTTCAGTTTAACTGGCACAGTAAAG^ 
GAGATCCATTAGTGrrrGCAACTGTAGGAAGCAACAGAGTTACCTTGTATGAATGTCATTCACy^ 
GAGAAATCCGGITGTTGCAATCTTACGTGGATGCTGATGCTGATGAAAACTTTTA 
GGACCTATGATAGCAATACGAGCCATCCTCTGCTGGCTGTAGCTGGATCTAGAGGCATAATTAGG 
ATAATAAATCCTATAACAATGCAGTGTATAAAGCACTATGTTGGCCATGGAAATGCTATCAATGA 
GCTGAAATTCCATCCAAGAGATCCAAATCTTCTCTTGTCAGTAAGTAAAGATCATGCm 
ATGGAATATCCAGACNGACACTCTGGTGGCAATATTTGGAGGCGTANAAAG 
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SEQ ID NO: 1540 ACATATTTTAAAATAGAATAAATTGTTAAACATAAAATTTTAAA^ 
ATGTGCGTAAGAAAACmGTAAAATAGTTATGAGTCCTACCCAGTAGCAACTTCTGG 
CAGGATTCCACTATGTAAATATCTGTAATGCATTTATAATAAGTTGTGTAGTTTGTCCTGC^^ 
ACTACACTATTTGCTAAAGTCTCAGTGCCATCTCCTAATGAGACTGACATm 
GAATATCCTTGATAATTCAAGGAAATATCCCTCCTGCCTAAGTTCCAAACTGGGAAACAT^^ 
TATATAAATGACATTTCAGGACTTTAAGTATGAAGATAATGGGAATTrrAT^^ 
AATGAGAGCATTTTTATrrGATAATTTTTTTT/^ 
TAATCAGTITITCAAATCITGATTTTTGATATCATTATO 
AATITGAATTCTGCAAGAAGGCTATTTTTTTATTGGGAGTTT^^ 

SEQ ID NO: 1 54 1 ACTTTNTTTTTTTTTTTT^^ 

CAGGCrGGAGTGCAGTTATTCACAGGNGCAACCACAGGGCACTGCAGCTTTAAACTCCT^ 
AAGCGATCCTCCTGCCTCAGCCTCCCAAATAGTTGGGACTAGATGCACGCACCACCACGCCTGACT 
CAGGACATTATT>nTAAAGGTATTATCCAGGAAACAGATAAGGTCArrCATAAAACACACGGOT 
TTTITCTTTAGCTCAGTGTTAACAATGAAAGTAGATTCCACT^ 

TAACATAGTGAACATATTGCTGTAGGAAAGGGGGTTCAGTGTGGTGTGTTATATGAGCACTTGAA 
CITITCAAGGNGTCATAAGCCAGTTATCTGCCCAAAGAAm 

AAAGAAAACACTGNGGACATTTGGGGNCATGAACTTTTAAGTGGCAACAGCCCAAACAGGGCAA 
GNTGTCTTTAAGTCCCCAACAACGGNCAGCATTACTGNGAATTACCATT 

SEQ ID NO: 1 542 ACTTTTTTTITTTT^^ 

AAGGGATACTTTTATNAATATTCTTGCTGTAAAACAATGTAAAAATAAC^ 

TAAAAATAAAAATATTCTACCTGAGNGNGTTAAATCAAGTGATTTGTAAAACAAAAC^ 

GNGNGGGCTTTNTACATGTAACTTGCCAGGCTGAAGGCTTACACCCTCmGTTCT^ 

CACTAATGATGATAACATGAGTTAAATTGGGATTCTTGCCCITCTGTGTGGCTm 

TCTATGACCAAACTATTGACATATTTGAAGTCTGTATGCAGNCATTGTGTGATAAATCTACm 

AGCnTTGCTTCTACCTGCAGOTACATGATAACCATGCTGGGAAGTGCTACATATGCTTCATA 

TNTGTTGCCCCATCTGATAATAAAAATACAAAGGNGCTCTTTAANCT/^ 

AAAATTCTAGTTAAGTGGNTTGTTTTTCCNCATCNNTGTAAATCCCT^ 

SEQ ID NO: 1 543 ACTATTTTTGCAGGGTTTGCACAGGCCCGTAACTGTCTACTACm 
TCTGTATACATCCTGTATGCTGAGCTGGTAAAATACATTGTAAATTACATATTAAATATm 
CITTTACAAGCGCAAGGTGCAAAAATATATACAATAGTCTCATTGATC^ 
ATTTGGTGATTATGCCTAGCTTTTGACTAATATAAAGATCATAGCTCCCCTTCAOT 
TTGAACATGGCGTGTTTAAATTTTCATACATACTTTAOT 

GCCTCTGTGAGTTCATCTGATGATTGAGCAGTAGCATTTGCCriTTGGGT^^ 
AGAAGAGATGACTTCTGCTGATTTTGOTrANAATGGTTACCTTAG/^ 
TCGAATTTCACTTNTGCAATAACTTTCATTTTCTCATAGC^^ 
TGAGCNGAAGAAAGANATCCCCA 

SEQ ID NO: 1 544 ACACACCTAGTTCATAATCCTCATAATTTATCAACAAACACAAAAAAGTGTCT 
TAOTGAGAGTGAGTGTGTGTGTGTGCGTGTGCACGTGCACACATGTGCACGTTTGTATGTATGGA 
AATAAACTTATAAATGGGGACGTATTGGAGAAGGAAATACATAGACCTACAACTTTGAGCAAATA 
GCAGTGATGTT:TrAGGAACTGAAATGTCACACTTAAAGTCTTCAGCCCAGCT^ 
TGGGGAGAAGAGGGCCTGATTAGAACTGTTCTGGTTGTGTTTGGCGGGAGGGGAATAATT^ 
CAGTCCTTCTTAGTGACCAAACTTTAATTTTTAAGAATAAm^ 

CTGAGTTGAAAGGAGCTCCAGAGGAGTGGAGTTCTGTGTTGCTCACATGTTAAAATCT^ 

TTCAGAGCAGAGGGAATCCTATCTTCANATATCCGNCCATTTTCATCTCTTAA^^ 

GTATTGACTTGAGAAGTGTGCTCTGGTATTCTGGGGTCTGAAGCT 

SEQ ID NO: 1 545 ACATCACTGTGTCCCTAAAATAACCCACCCCAGTGAAAAACTCCAGCCCAGT 
GAACTGGAGGGTGTTCTTITrGGCCACGGTAAACTGAGGGATTATGATGAAGCTGA^ 
TGAATGAGAAAATGTTGAACTTCAAAAGCCATCTCAGAAAGTTGAAATAGGAGAGGACGCTGGTT 
CCAAACTrrGCCTCCAATGATCTTCAGCCGTCTTCTGCCACAGGCTGATGGAATT^ 
CAGGCTGTTCTTGGATCTCCGATAAGCCCGGGAAATGGAGTTCAGACACTGAATACA GCAAT TGA 
GCTGAAGGATACGATGGGTCTGTTTGCTTTTTTCTTTGTCAACTATT^ 
TCCATGGTCCTTGGCTGGTTCCTGATGGCTTTAATTCTGTCTCTGGATGTCATC^ 
CGATTAACTTCTGTCCTTGGGGTGACTTGTGCCAATCACATTCAGACAATGGTC 
CATGGACAGGGACCACTTTCACCTACATAAGCCAGGGTTCACATA 

SEQ ID NO: 1 546 ACACTGAGCCCAAAAAGGCCCGTAGCCAACACrATGACTTGGTTTTAAATGG 
CAATGAAATAGGAGGTGGTTCAATTCGAATTCACAATGCAGAGCTGCAGCGTTATATCCrGGCAA 
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COTACTAAAGGAGGATGTGAAAATGCTCTCCCATCTGCTCCAGGCTTTAGATTATGGGGCAC 

CTCATGGAGGAATTGCCTTAGGGTTAGACAGACTGATATGCCTTGTCACTGGATCTCCAAG 

GAGATGTCATAGCCTTCCCAAAGTCCTTCCGGGGACATGACCTCATGAGCAATACCCCAGATTCTG 

TCCCTCCTGAGGAACTGAAGCCCTATCATATCCAAGTCTCCAAGCCAACAGACTCCAAAGCAGAN 

AGAGCTCATTGAATCATGCATACCATGCAGAAAGTTGAGCTTTTAGGTTTTGCCT 

AAGGCTAAAGTCAGATCTAGAGrrCTGCCNCANGTCTAACAATCAAGTCTTTTO 

TCCAGGCAACATTTTTTNCCCCACGAAANAAAC^^GATT^ 

SEQ ID NO: 1547 ACAGmGCTGCAAATGATAATTTAATTTGGATAATGCTrrAAATCATTGAT^ 
TCTCATGAAATTACTTTTGAGTGAAATTTGACAAGAATTTTCm 

GTATTAGAAGTCCCCATAGTCCAAAATAAAATGTAACCATAGAATGGGGAGATAACmTAGTCAA 

AGACCAAGTAAATATTACtm-CTTTGCAGAACTCCAGGTTGTAGTCACGTAAAAGTGTGGCCAC^ 

GTTGCTTAACTTTTTGGACAATGGGAAAAAAAAAACAATAAAAGCT^^^ 

GAAACTCrrCTCAAAAACAGAAGAAAGTCATGACCATTGTTGGGAAAGTGTTTT^^ 

AAATTCTATTCCATTGATCTGTTTGGATGATTGCITCAACCTATTTGGC^ 

ATTGATTCAGTCATGTAAAGAACCCTTTTATCTTAAGCTTAGAATTTATA^ 

GAAAGGTGGTTTTTCATTCACTCTAATAATTGNGCAGGGATAAGC 

SEQ ID NO: 1 548 ACAAAATACAACTATAAAAGCAAGGAACAATTAATGCCATGCAAGATTTTAA 
ACTAAAAATGGCATAGAAATGCAATTrAAAACAGCAAACTCAAATTCACATAACACTCAA^^^ 
AA^GGTCAGCAGTAAGTTCTACTAACGTTGCTTAACAAGGATTTAGAAAAGGAAATAC^ 
TGCTGGTATAATGGTATAAATCAGATCTTCAAATCTATGGGAACGAGTCATCTTCTCT^ 
CTCAAGAAATTAGCTCTTTTACAGTTGTTTTTATGAAATCTT^ 
ATAATTTTCATAGACCTTTTTGCAAATCTGTTTCATTG^^^ 
AATTTCTTTATTGTTTCCTTTAACTCTTCATCTGTAGGGG 
ATCATCTGAACTATCCTCAGACTCACmCTTTTTTG 
CTGCTATCTGCITITCTTAACATTTGGCACTm 

SEQ ID NO: 1549 GOACAAAGTTGTATGACAGGGCATATTCTTTGCTTCCAAGATTTGGGTTGGGG 
GCACTAGGGGTTCAGAGCCTGGCAGAATTGTCAGCTTTAGTCTGACATAATCTAAGGGTATGGGG 
CAAGGATCACATCTAATGCTTGTGTTCCTTATACTCTATTATATAGTGTTATTC^^ 
TCTTAACAAAATTCGTAGCAGTGGAACCTTGAAATGCATGTGGCTAGATTTATGCTAA^ 
TCAAGTTAGCATmAGTAACACTTCAAAGGTTTITmGm 
AGGATTAATTAGAAGAAGCAATCTAGTTAAATTTCCCATTTGTATTTT^ 
TCATANGTTATTTGTTTAAAAAGATTTAAAAATCATTGCACTTTGGTCAGAAA^ 
TCTTATAAATGNTTGATTCCCTTCCTTGCTATTmATTCAGTAATm 
CACCCGAAAAAATAAANGATTTTTTAAAAGGCNTATANAGNCCA 

SEQ ID NO: 1 550 ACCAATAAATGATGTGTTAGCTGAAGATAAGATTTTGGAAGGAACAGAAACA 
ACCAAATATGTGTTTACTGATATATCATATAGCATACCACACCGGGAGCGTTTTATTGTCGTCAGA 
GAACCAAGTGGCACACTACGCAAAGCCTCTTGGGAAGAACGGGACCGAATGATACAAGTTTATrr 
CCCAAAAGAAGGTCGTAAAATTTTGACACCAATAATTTTCAAGGAAGAAAATOT 
ATAGCCAGGACAGGCATGITGATGTCCTCAATCTCTGCTTTGCCCAGTTTGAGCCAGATTCCACAG 
AGTATATCAAGGTTCATCACAAGACCTATGAAGATATAGATAAACGTGGAAAATATGACCTTTTA 
CGTTCAACAAGATACTTTGGTGGAATGGTGTGGTATTTTGGAAATAAT>^^ 
TNCTGATTTGCCAGATTTCAGAGANATTTAATCGOTGATGCCACCAACTTTG^ 
ACGTGCTTCCATCCANNATGGCCANTCCGGCTCAAGGGGCCAAGGA 

SEQ ID NO: 1 55 1 accttcgcagtgtaggagatggagagactgtggagtttgatgttgttgaagg 
agaaaagggtgcggaggcagcaaatgttacaggtcctggtggtgttccagttcaaggcagtaaat 
atgcagcagaccgtaaccattatagacgctatccacgtcgtaggggtcctccacgcaattaccag 
caaaattaccagaatagtgagagtggggaaaagaacgagggatcggagagtgctcccgaaggcc 
aggcccaacaacgccggccctaccgcaggcgaaggttcccaccttactacatgcggagaccctat 
gggccgtcgaccacagtattccaaccctcctgtgcagggagaagtgatggagggtgttgacaacc 
agggtgcangagaacaangtagaccagtgaggcagaatatgtatcggggatataaaccacgatt 
ccgcaggggccctcctcgccaaagacagcctanagaaggacggcaattaagaaagataaagaaa 
attcaagggagatgaaaccccaanggtcgggcagccaccttaaacgtcgggtcctn 

SEQ ID NO: 1552 actcaaagacgaatcatgaaaaagaaaaaaactttatttcaaacaggttcag 
tgatatatgtgtgtgctacagcaaaggctggttgtggcaaagtttcatttcaaactgtatc^ 
ggctgggcaaggtggctcacgcctgtaatcccagcactttgtgaggccgaggtgggctgatcacc 
ctgaggtcaggagagaccggcctagccaacatgctgaaaccccgtctctactaataatacaaaaa 
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TTAGCCAAGTGTGGTGGCGCGCACCTGTAATCTCAGCTACTCGGGAGGCTGAGGCAGGAGAATCG 

CTTGAACCCGGGGGGCAGAGGTTGCAGATGACGCCACTGCACTCCAGCCTGGGTGACAGAGCCAG 

ACTCAAAAACAAAACAAAATAAAACAAACAAAAAAACAGAACTGCATGATGTATAATTTTGAC^ 

TTATGTGGGAATGTTTAACrrCTGCCAAAATGTAGATTCAATCCCACATTATGCCCAT^^ 

TAATTTTAGTCCCTAAGTTTTCATTACCCAAAAAAAAAAAA^^ 

SEQ ED NO: 1 553 ACTTTTTTTTTTTTT^^ 

CAAAATCAGTGTCAGACACGTTATATITGATTGGGTTCAAATTTGGCTGATGTCCAAA 

AAAANAACGGCGGGGCGAAGGGACACAGTGTTGCTGACAAGGTGACACTGAACACAACAGTTTT 

CCnTTAATTGTAAAAGCGGGCATCGCACAGCTGGTGTGAGTCAATTAACCAAGGCAGGGAGGGGA 

CTCAATGTTTTACAAGCAGAGGGAAAACCAAAGTAGGCAGAGAACTTTCAAGAGAGGAGAGGG 

CAGAACACTGAGAGAAAAAGCTGCAGCANAGGCCTGCTGGANAACAGTGGTCGGATCTGCTGGC 

TCACTTGGGGAACACAGTGACCACTTCATAACCCTCANGTGGTGTGACTCGCTCCCGGGCATGCT^ 

NTACAATAGATTTGATCCTNCACAAANAAATGGCCCTTCTGTTTCANGTTGGNGCCCC^ 

ACACCTAACACTCAGGGTGGNGGGGGAACGGCCCCGNAGNTTACAAACA 

SEQ ID NO: 1 554 ACACCTTGAAGGCGAGGTTAATTAAATCCTGTTGTGGAGTTTGAGGGCCGGA 
ATTTAATTTTTGGAGTTTTATTTAATATCGGGAGCAGATTGGGTAATA 
AGACGGCCTTTTGACCTTTTAGGGTCTAGGGCTGTAAAGCGTCTCAGGGTTGC^ 
ATGAACTGGGCTGGGTTTTTATATTTGATGAAAAAGAGCCTAAACGOT 
AAAAGGAGCATTAACCTTGACTATGTCCTTAGCTCCAGCCACCnTmAAGAGTAAA^ 
AGGTGGGGGAGGGCTAGTCACGGAACGAAACTGTAAGCCGGACCAGGTGTGAGGAGGGGAGGTG 
ATAAAAAGATTACAGGGTGGAGGAGTGGAGCCTGAGGAAGAATTGGGACCTAACTTGGCNTGGA 
AAGGAAGGGGAAANGGCANATGGGTTTGTANAAAANGAAGATTANACACACTCANCAACCCCT^ 
GGGTTTNGGACTGAAGGGACAGGTGGGGAAGGGAAAAAAAGGAAAATTTGG 

SEQ ID NO: 1555 ACGCGGGGGCGTCTTGTTCrrGCCTGGTGTCGGTGGTTAGTTTCTGCGACTTG 
TGTTGGGACTGCTGATAGGAAGATGTCTTCAGGAAATGCTAAAATTGGGCACCCTGCCCCCAACT^ 
CAAAGCCACAGCTGTTATGCCAGATGGTCAGTTTAAAGATATCAGCCTGTCTGACTACAAAGGAA 
AATATGTTGTGTTCTTCITrrACCCTCITGACTrCACCm 

AGTGATAGGGCAGAAGAATTTAAGAAACTCAACTGCCAAGTGATTGGTGCTTNTGTGGATTCT^ 

CTTCTGTCATCTGGCATGGGTCAATACACCTAANNAAACAAGGAGGACTGNGACCCATGAACAT^ 

CCTTTGGTATCANACCCNAAGCCGCACCATTGCTCAAGATTATGGGGGTCTTNAAAGC^ 

NGCATTTCGTTCAGGTGCCTTTTTATCATTGATNATAAGGGNATTCm 

GAACTNCCTNTTGGCCNGTTCTNTNGGATGATACT 

SEQ ID NO: 1556 ACTTTTTAAAAGTAAAAAATCAGATGATTCmTT^ 
AGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCm 

ATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGATGTCCCGGAGCCGTATCAATGATGC^ 

CGTCTGAATGACAACAGCCTAGAGTTTCTGGGGATACAGCCAACACITGGACCTCCT/^ 

CCCTGTTTCCATATGGCTGATTGTITrTGGAGTTGTGATGGGAGTGATAGTGGTTGGCAT^^ 

CTGATCTTCACTGGGATCAGAGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAA 

ATGCCTCCATCGATATTAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAA 

ACCTCCTTTTAGAAAAATCTATGTTTTCCTCTTGAGGNGAAm 

TGGGTTAGAAAATTTAGATNGATAAAGATTCATTAAATGTCCAAACTA 

SEQ ID NO: 1557 ACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAG 
TTCCAGATTTCAACTGCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGT 
TTGATCTCCACCTTGGTCCCTCCGCCGAAAOTGCCTTTCTGGTGACAATAATACACTGCAAA^ 
TCAGGCTCCAGTCTGCTGATGGTGAGAGTGAAGTCTGTCCCAGACCCACTGCCACTGAACCTGTCT 
GGGACGCCAGTGGCCCTGCTGGATGCACCATAGATGAGGAGCCTGGGAGACTGGCCAGGTTTCTG 
CTGATACCAGGCTAAGTAGCTGCTGGCAACACTCTGACTGGCCCTGCAGGAGAGGGTGGCTCTTTC 
CCCTGGAGACAAAGACAGGGTGCCTGGAGACTGCGTCAACACAATCTCTCCGGTGGTATCTGGGA 
GCCAGAGTACAGGANGAAGAAAAACTGCGCTGGGGmCCATGGTTCCCTCTGGGTCCTAACTGA 
GCANCTCTTTCCCGCGTACCT 

SEQ ID NO: 1 558 ACGCGGGAGTTGATGATTTTCITITAAAGAAAT^^ 

TAGCCTGGGTTGGGAGTTAGTGCATATCATCAGCCCTCCATCTAATGTGATTGCTAAGATGTGCAG 
GATAGGACAGCTTAGAATGAGTGCCTTGACACCTACATATGGCTGCATTCCATACCTCAGTGAAG 
AAGCCGAAGGATGCTGACACACAGTGATGTGGAGGGCATTCCNGAGAAACTTTTGCAACAAGTGT 
ATTAATGNGATTGGTTTTCTTTTrGTTTTAAGCTAAACA 

ATACAACTCAGATTTAGTATTATTGNTTATCATAAAACATATTATGTTCCTAGTT^ 
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CCNAGTTGATTTAATATAATITATTGGCATTTmGCNGAANAGGN^ 
TTTAAAAAAAGCCAANGAAGGOSfCTOGNGTCNAAATTTAAA 

SEQ ID NO: 1 559 ACGCGGGATCACCTACCACTGCAAGAACAGCATTGCATACATGGATGAGGAG 
ACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTTGAACnTGTT^ 
AACAGCAGGTTCACTTACACTGTTCITGTAGATGGCTGCTCTAAAAAGACA^ 
GACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCCTTCCTTGATATTGCACC^ 
CATCGGTGGTGCTGACCAGGAATTCTrrGTGGACATTGGCCCAGTCTGTTTCAAATAAATGAACT^ 
AATCTAAATTAAAAAAGAAAGAAATTTOAAAAAACTTTCTCm 
AACTGAAAGCTGAATCCTTCCATTTCTTCTGCACATCTACTTGOT 
AAAGAAGGATTGATCANAACATTGNGCAATACAAGTTTCATTAACTTCTTCCCCGCT^ 
TrrGAA'l"ill"ll" r ilCAACACTCTTACCCTGGTATTGGAAAATGTCAACC 

SEQ ID NO: 1 560 ACCCACCACCTTCTTCTCCTACATATCCCTTCCAGATGGNCATCCAGACTCAG 
AGCTCTCTCTACAGAGAGGAAATTCTCCACTGTGCACACCCACCTTTGGAAAGCTCTGACCACTTG 
AGGCCTGATCrGCCCATCGTGAAGAAGCCTGTAACACTCCTCTGCGTCTATCCrGTGTAGCATACT 
GGCTTCACCATCAATCCTGATTCCTCTCTAAGTGGGCATTGCCATGTGGAAGGCANGCCAGGCTCA 
CrCACAGAOTCAAGGCCTGCTCCCTGTANGGTCCAACCAGACCTGNAANAACAGGCCNCTCCATT 
NGCTCTTNANATGCCACTTCTAANAAAAGCCTAATCACAGTTTTTC^^^ 
NTTGAATNCTTCCATTTCACACAGAATGCAACCAAGTCACACGCTTTTGA^ 
GTGTTGCATTTCNAAAGTGCAhnSrCAGGGACCATACCGGGTCTTGATNCAGTAANAATGGChT^ 
TTNGTGCCATCCTGNANCTNATAATGNGGANTGTTTGCCT 

SEQ ID NO: 1 56 1 ACTl"ini"rril'l"i"rnil"iUU"lUUlTGGGGTCCAAAAATGGNGATTATTTATGGC 
TTTGGAGTTGGANATTTTTGGTCCAAATACTTCATGGAATCCTGCACCCACTTNT^ 
GCACAAATATCCTTGGCCAGTTTGGTCTTGAANATCACAGCTTTCTGGGGACAm 
ATTCTCCTGTAGCTCTCTANTCGCTGAAGGGGTATCTTCCTATTGGCCAGGTTAAAGCAGCAGGTG 
GTTGGGACANAAGCTGGCCCAGCGAGCCCCTGGGGGCTGAAGGCAGCTGCTATNAGCAGCAGCC 
ACAGAAGTGCTGCGGAGACCTTCATGTTGGAGGCTGAAGGTGTGAGCTTTGGCGTGAGANGTGGT 
GGTITCTGGGTTGGTCTCAACCTNTCTTCCCGNGTACCTNGGNCGCGACCACGC 

SEQ ID NO: 1 562 ACACAGATrGGCTTCATATCAATTAAAGAATGAGATACTTGACTGGATTm 
TATAACTGCrrCCTGCCTCCTTCCAAACTGACTGCAAGAGAGAAATTTAGCTGm 
AACCAAATGGATTACAATGGATAATTCATCTTTTGGGTATATTTTTACTATTATTC 
GATTTTCATTTAATTGTAATAATAACTGACAAAAATCAGTATGTTGT^^ 
GAGAATTATTCTTAAAGTTTGTTCTCCCTGTTTATTACACAGATCAGGAATAGAm 
GTATTTATTGGATACCCTCTATTGGTCAGGCATTGTGTTAAGCATATGTGAATCAAAATG ^^ 
ACTTTITCCTTTGAGTCTGATACAGTGAAGGAGATAAACACTTCTACAACTTAAATTT^ 
AGCAGTAGAAAGAGAACATAAGGGAATAGAGGTTAATTTTACCCANAAGCAGGGATAGAGAAAA 
TATTTACGGAGAAAATCACATATCCATGGGGCTCGA 

SEQ ID NO: 1563 ACAAGCCTCACCAAGGGCAACCCCAGAAAAGTGAATGAGTTTGTCTTCTCCA 
ATCATGACITCCTCGATAAGTTTGCAACTTCCAAGCTTCACCAGTTCTGGGTC 
GCAATTTCACCACCTGTGACAAGAGCTAGGCGTTCCACACCTGCAAAATCTGCATGCTCAATAGCC 
ATGACACCAGCAGCACCAAAGAGCTGTTCAGGATAATTATAAATTAATTGCCTGTTAATAAAGCA 
ATTTATTCCATGCTTAAGAATACGTTCAACTTTCTCCTTCATT^^ 
CTGCAACCTTTGCTGTAGAGTCAACTCTTACCCGGGAACCAAATATCTTTATT^ 
ACCAGTATTTGCAATAAGAATTTTAGCATTTTCAATTCGTm 

CCAACAGGAAGCCTTATCTAAATAGGAATCTGCCAAACTTCCTCCTANCTTCTTGATAATATGA^^ 
TGNCTNCAGGGTTGCCAGAACCTTTCAGTCTGAGAACTGCT 

SEQ ID NO: 1 564 ACCTATTATATAGAGGGATAGCTGAATAAAGTCTGTCTCAAAACCAGTGTTA 
AATCACTCTCAGGGTTGAGAAGAAAAAAGGGGAGTCTAAAATCACAACAAGTAAAGACATATCT 
AGGACCCTTGTCCTTCTGGATCCACGCTTCCITCAGGGTCTTCATCATTATAAATGTTCTCTGCCAT 
TTGCCACACTTGCATGATATTGTCTTCTGATACANGAACAAATCACCCAANGTTCATTGGGGTTCC 
AGGAGAAATCAGATATCTTGGCAGTATGACCACCATGAATAAACAACAACTCTGGTGGCCCGTCT 
TNTGCATCTTCTGGGGATrGTTCCTCTCCAATTTTACTTAAATCCCA^ 

TACCTTCGNCCGACKITCGCCGGGGACCACGCTANGNGCGAATmTNANTCNACACTTGGGCN^ 
CCGTTNCTTGTTGGATACCGA 

SEQ ID NO: 1565 ACTGCAGAGGTATGTGCAGAAACACCCACCCATTTAAGCTTTCAATAAATAC 
AAGGCATCATTTTAAACCATTTCAGTAGAATAAATTAAAAATATTGCATTTCTA^^ 



233 



wo 02/29086 



PCT/USOl/30732 



GCITCTTGAGCTGGCCTGGAGTGCTGTTTGTGACACAGAAGACACA GTAGTTA ATT 
CCAAGCTCmAGACTTCAGCACTTCTCGCTCTTCAAGCCTCAGCACC^^ 

TATTCTTCATTAGCATTGCCACTGTCATGGGGCCAACACCTCCAGGAACTGGAGTGATATACCCAG 
CTTTTTGTCTGACTCCTTCAAAATCCACATCTCC^ 

AACTCTATTTATTCCCACATCAATGACTGCTGCTCCTTCCTTGATCATATCTGGCTG^ 

TTGGGAATACCTGCAGCAGATATTTACAATATCTGCAAGAATTGTATGTTTCTTCCAACT^ 

GGGAGATATCGATGAGATATTGTAACAAGTGGGATTACCNTC 

SEQ E) NO: 1 566 ACTCGTCAATGGGCTCGGTCATATATACCACCTCGAAGCCCCGTTTCCGCACT 
CGCTCCACAAAAGCTGAGTTGGCCACCTGCTCTTTGCTCTCACCAGTGATGTAAT^ 
TGTGTCTCCTTCATGCGAGAAACATACTCTGACAGAGATGTCATCTCATCTCCAGACTGGG 
TGATAGCGCAGCAGCTCAGACAGGCGGCGGCGGTTAGTGGAGTCTTCGTGGATTCCAAGCTTGAG 
ATTTTTAGAGAATGCCTCATAGAATTTCTTGTAATTCT 
TCAAGGCACTTCTTAACAATGTTTTTGCGAATGACmCAAGAT^ 

GGGAGATGTTCAGGGGCAGATCCTCAGAGTCAACCACACCACGGATAAAATTGAGATAC TCTG GT 

ATCAACTCATCACAGCTGTCCATGATGAACACACGGNGGACATANAGTTTGATGTTGGTOT^ 

TTCTTGGTCTCAAAAAGGGCAAAGGGNANCCGACCAAGGAATAAA 

SEQ ID NO: 1 567 ACAATGACCAAGATGCGACCAGGATCAGAGGTTCCTTGGGGAAGACCCACCC 
TACGAAGTTGGAATGAGACCATCAGATGTGATAAGAAACTCTTCTAGATGTCAACATAACCAACC 
rrATAAAGACTAAAATTCATGAGTAGAACAGGAAAATCATCCTGACTCATGTGTrGTGTTC^ 
TTTTAATTTTCAAAGAGGCTCTTGTATAGCAGTT^ 

SEQ ID NO: 1568 ACCGGATTCTCTCTITAACCCrCCCCTTCGTGTTTCCCCCAATGT^ 
TTGGATGGTTTGTTGTTCTGCCTGGAGACAAGGTGCTAACATAGATTTAAGTGAATACA^ 
TGCTAAAAATGAAAATTCTAACCCAAGACATGACATTCTTAGCTGTAAOTi^ 
TTCCACACGCATTAATAGTCCCATTTTTCTCTTGCCATTTGTAGCTTTGCCCA 
ATGGGTGGACACGGATCTGCTGGGCTCTGCCITAAACACACATTGCAGCITC^^ 
GTGTrCTGTTTGAAACTAATAOTACCGAGTCAGACTTTGTGT^^ 

GCCTGTGGGCTTCX:CCAAGTGGCCTGNAGGTGGGCAAAAGGGAAATNACAGACACCACNATGTTG 

TCAAGGATGGTTTTTGGGGACTANGANGGCTCANm'GGGTGGGAGAANATCCCT^ 

ACCAACCNAGAACCGTNGGTTTNCCT 

SEQ ID NO: 1569 ACAATCATATTCTCAGGTTTCATGACAGAACTAAAAACTCTAGGAAACTCAT 
ATAGTATCTCCAAAACCTTGGTATCAGTTTCAATTCTGCCTTCCAAATC^ 
CCCATCAGGTTCATGCCTACTGAAGTCACAGAATTCCACATTTTGGAATGGm 
AGCCTCGCTGATTTCTAAACAGGCCTGTGAAAGAAGTATCCTCTCATGTCCTGTC^ 
TTCAACTTCCAAACACTCCACCTTCTTTGGTrTAAAATGTCTCTGC 
CCAGCATrTGGGGCTCCCTGCTGTTGTCCTTGGGATAATACTTCTTT GGAT TCAG 
TAGAACAAACTCTATTTGTTTTTTGAAGGCATGTGTCCATTCT^ 
TTCTCrACAATTATGCTCCTTTCTATCANGGATTGATATTTAGTCTTCG 
TAAAAGACTrrCTTTTTTGGGANAAGA 

SEQ ID NO: 1 570 ACCGTTGTGTGTTTTCTGCAATGTGGAGTTGACTTAGCTTGGCATm 
GTTAAAACTATTTTITCCATAAATACTTTGAAACATATTTA^^^ 
GCCATGTGTATGCAAATATTATTTrCTTCATACATTCATTTTCTT^ 

GGGGACTCAGGAGGACCTGTGAAGCATGTAGTTATCTANATCTGGGTA ATTTCA TGTTTATTAAA 

TCGAACTTTGGCTAGTTAAACTCATATTGAAACnTCATCTAGTCT 

CAAGTCATTTGTTTTAAGTCTCTAAAAAAGAAGATTGCAGTCATCCATTC^^ 

ATCGCAAATNCACTAAATGTGGAGTGTATGAACCAAAATGGAAACCTGCTGTATGGAAACTACC^ 

NTCACTTATGGOTCATTGGGNTTTTGTACCTTGCCCGGGCGG>n>JCGCTCTG;^ 

SEQ ID NO: 1 57 1 ACCTACATCAGATCTAACCTTGATCCCAGCAATGTGGATTCCCTCTTCTACGC 
TGCCCAGGCCAGCCAGGCCCTCTCAGGATGTGAGATCTCTATTTCAAATGAGACCAAAGATCTGCT 
TCTGGCAGCTGTCAGTGAGGACTCATCTGTTACCCANATCTACCATGCAGTTGCAGCTCTAAGTGG 
CTTTGGCCTTCCCTTGGCATCCCAAGAAGCACTCAGTGCCCTTACTGCTCGTCTCAACAAGGAGGA 
GACTGTGCTGGCAACAGTCCAGGCTCTTCAGACAGCATCCCACCTGTCCCANCAGGCTGACCTGA 
GGAGCATCGNGGAGGAGAATGAGGACCTTGTTGCTCGCCTTGGATGAACTCCGGGGCGTGTATCT 
TCCAGITTGAAGAAAGACTGGAAACAACAGCGTTTATTTGTGGGGTGCCACCTACAAGCT^ 
ATCATGTGGGGGACTGAGCCATTTCATTAAGGAGGNANCANGNCATCCCAGCTGATGAACCGCG 
NmTNANNCAATAAAAAACCTTGANTTCCTTTC 
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SEO ID NO' 1 572 ACACmTGTrACAGTTACATATATGAATAGTTAGCAGAGGAGAAACTOT 
GTrGTTCCTACACCAAGGGAAAAAATGTAAGGACTGTATAGCTCATGAATAATATTAACATT^ 
TATCACAAAATATTGATAAAATTGGAATTGTATCAATTCATGTTATGGAGCTTAACAT^ 
AAATATTAAGATAATATAAATAAGGCTTTTAGCTTCATATCAAGACCTATITGAACAOT 
AAGAACATTAAAAGTTTITrcCTCATTrATGGm 

GTGTCTTAAGTATAATGCATAATTATTAGAATATAArrATAAGGCArrACAGAATTTT 
ACTTGCAAAGATATTTCAAGTCCAAAACTTCAAATrACACACCCAAATATGTTTG^ 
TATACTTGGACAAAGGCAACATmAATTriTGCTAAGTCTrAA(^ 
ATGGTAAAATTTTCATCCCTCAT(mTGCATTGGAAAAAAATACCT^ 

SEO ID NO- 1 573 CGCCCTTAGCGTGGTCGGGCCGAGGTACTGGCAACAGCTGATGCGTATAAAA 
CCCTTAAAGTCTGGATCAGTTATGTATATCTGGATCTTAGAATATCCATGGCACTT 
CAAATGGCATCTGGCACTGGAGGGAACrrGCATrCTTCAACAAATTTCTCTAACAGCATCT^ 
TTTTGTGGCTGAGACTCTTCAATAATCACATTACTCGTGGGCCAAGTTAAC ACTCCAGG A^^ 
TAAATCAAGGTTCTTGCTTTCTCAAAGTGATTGAGAGCTTCTAGAAATCTGll"lU'l"i 
CTTTTCCAATTCCACAGTAGGCCAAGCAATCAAGGCCCTCACTGGGGTAGTGTTC^ 
TAAACTGGTTTTCGGCTTCAGATAATTCCTCANGCTGTCCTATTCCAAGGAGAGAAAT^^ 
CCATANACGACCAAAACATAAGTTNATCATTGGCCAGGTTNAATTGCrrAT^^ 
AACCGNTCANNACCTCTNTANAGGCCTNNTGCAANAGCTGNNGNAAACGCTGNTNCAATAN 

SEO ID NO' 1574 acccagtaaaaaccagaatgacccattgccaggacgcatcaaagttgacttt 

GTGATCCCTAAAGAACTTCCCmGGAGACAAAGATACGAAATCCAAGGTGACCCTGCT^ 
TGACCATGTTAGGTTTAATATTTCAACAGACCGACGTGACAAATTAGAGCGAGCAACCAATATA^ 

aagttctgtcaaatacatttcagttcactaatgaagcccgagaaatgggtgtgattgctgccat^^ 
gagatggttttggtttcatcaaagtgtgtggatcgtgatgttcgtatgttcttccac^ 

TTCTGGATGGGAACCAGCTCCATATTGCAGATGAAGTAGAGTTTACTGNGGTTCCTGATATGCTCT 
CTNCTCAAAGAAAATCATGCTTTTAGGATTAAAAAACT^ 

ccttcagatnancgttrrotgggcncggttnaaaaaaaaaagncac^^ 
cactagccccaaatnaagggccaaanagaangggaggctgggggatnggc 

SEO ID NO- 1575 ACTTTATTrn"ll"iri-i'i 1 1 1 1 1 1 1 1 iggcagtttttacatttatttaaacagaa 
/uVCGTGCACATGANCTGCCTACTCATTTTCTTCACTGCGCAGCCTGGCATO 
GATGGCCAGTTGGGCANCTNTTTCCACGATGGCTTTGCGGTrCTTGGAGGAAACATTGTG^ 
CTCGGCACANTAANATITGTTGCACATCAGCANCACTTNCANCTCCTTGACGTTGTGGACCAGG^ 

CTTC<XjGAAGCCNCTGGGCANCATGTGCTTTGTTTTTT^ 

AAAATCTGGCCCTTGAAT^mNTACNAACCCTGTTGTCAATGCCTCTGGG TT^ 

TTAATTTTGACATATCGGTCTGACTGGTGCCGGATGAACTTCTTGGTTCTCiiiriiGACNATCTTG 

GGCrrCACAAGGGGTCTGAGGGGCGGCCATNATNCCNAGAAANGAAATNGNTTGCCCACCTCCCG 

TAAGGCANCGCCNAAGAAAAAAACCCCCNGGTCTCNGGCGG 

SEO ID NO: 1 576 ACATrCATAGGGTITmCCTAGAGTGGGTCCTTTCATGTATACGAACATACT 
GGGGACACGTGAATGCITTTCCACATrCCTTACATTGATAAGCOT 

ATGTCTACGATAGCATCTGGGACTATGAAATGCmCCCACATTGCTGACATrCATAAGGCrm 

CCAGTGTGAGTTCTTTCGTGTATTCGAAGGGTAGCAGAATAACTAAAGGATTTACCACATTGm 

CATTCATATGGTTTCTCTCCAGTGTGAATTCTTTCATGGATAAGATATAAACTGAGACAATGGAA^ 

GCmCCCACAAAATTrACATrrATAAGGTCCATCCCCACTGTGCATTACCACGTGTC^ 

TTGAATGGGAAATAAAAGTITTrCCACATrCTTTACAAGCATAGGGm 

rrGTGGNGTTCTAAANGAGGGGTGATATCTGAAGGCTTTTTTAAG™ 

TCGGTCCATATTCCTGATACTCATAANGCCTTGNGTCCAAT 

SEO ID NO: 1 577 ACATAAAGTAACTGGTATATGTGCACAAGCATATTGCATITrm 

CTAAACAGCCAATGGTATGTmGATTGACATCAAGTGGAGACGGGATGGGGAAAAATACTGAT^ 
CTGTGAAAATACCCCCTTTCTCCATTAGTGGCATGCTCATTCAGCTCTTATCTrrATATO 
GTTATTrrGCTCTCACTGTTTTAACAAAAAAAAAACAACAACATA^ 
CAATTGGAGAArmAATGTTmCATTTATCATTGTAAAACCAAGGACAAT^ 

GACTTTCTTTTTTTITTTTT^^ 

NATCTANGCNCATTGCAACCACCNCCTNCCANGTTCAAGNNATTTT^ 
TAGCTGGGTATTANAAAGCATGCACCCCCCATGTCCAGNTATATTTGTATTmAGNAy^^ 

NCNTTTrNCCCATNTNGGGCCAN 

SEO ID NO- 1 578 ACTTGATTGGTCATTTGAAAACACTGCAACAGTGAACTmGCATCTCAAGAA 
AACArrGAAAAATTCTATGAATTGTTGTAGCCGGTGAArrGAGTCGTATTCT^ 

235 



wo 02/29086 



PCT/USOl/30732 



TGAAGAAAACTTGGCTGTCGAAACATTTTrCTCTCTGACTGCTGOT 
TCTTATGTATGGGTTTTTTTTTAATGTGATCCC^ 

ATAAAATATTTTGGACAATGCCGATAAATGTATGAAGTTAGTATCCACATCATAAATO 

TGTTTAGCAGTAAATCAATATTTrGAAGTGATACACAGATGTCTTTCCTCCCCACA^^ 

ACAAAAAACAAGACCTCTmCTTTAGATGGTGCCCCTATGCCCACCACAACAGAGAT^ 

GAAACCGGGCTCAGTGAGAACTGATTTCCTGCCCAATATTTGTCnTrGGGCTGTCTCT^ 

ATTATTAAGGAATCTAACTGGTTATACAGTTNAAGGCTTCT^^^ 

SEO ID NO- 1579 ACATCTTGTCTGTTACTTTrrGATAATTCTCAAAACATTrCAA G^^ 
ATTATATCTGTTATGGTAATCTATGATCAGAGATCTTTGATGTTACTATTGTAATTGT^ 
CATGAACTACACCTATATAAGATGGTAAACTTGATCAATAAATGTTGTATATGTm 
ACTGCTGACCACTCCCAGTCTCTCTGCCTOTCTCAGGCCTTCTTATTTCCTGAGACAG^ 
TGAAAGTAGGCCAATTAATAACTCTACAATAACCTCTAANTGTTCAAGTGAAAGGAAGA 
' CATCTCTCTTTTAAATCCAAAGCTAGGAATGATTAAGCTTAGTGAGGAAGGCATGTTG 
AGATAGGCTCAAAGCTTTGGCCTCTTGTGCCACATGGTTAGCTAAGTCATGAATGCAAAGG/^^ 
AGTCTTGAAGGAAATTTAAAAAGTGATACCTCCAAGCCAAGGCGTNGNGGCrC^^ 
CCCTACTNTTTNGGAAGTNAAGGCGGGCANA 

SEO ID NO- 1 580 TTTCAGGTTCrrCCTAGCTCGGGGCTTTTAAATTTTGAi^ 
CCCACCATCCTITrTGACTGTTGACCrTGGT^ 
TTTTTTCCTTTTTGAATTCTATCmATCT^ 

GTGAGAGACATTACTGAGCACOTGGTGAGCAAGCCTGGCTTTAAAGATTGGAGAAGAGCTTCT^ 

GCACCAGAACCCTGTCTTCCTCCAGTTCTCAACATGGTGTTGCTCTTCAGTCATACCGGAATCTGA 

ATCAAAAAAGTATTmAAATATCCATGATTrCTCCCTGTArrGAGGCTAGCCCTGATC^^ 

TGTGCCTGTCACCAGGTCTCCCAAGTGCACTCATCCAGGTCAAGTGCTCAGATGTGTTTAANGAGA 

CCCTATArrCAGGGAAGTTGCGTGAACACrGCAGTGGGGGAGAATTGAGAATAGTCAGGCCTATC 

AATCTCACAGAATCACCCCTCTAACCTTTGAT 

SEO ID NO* 1581 ACGCGGGGAGTGGCTTGAGGTATCCGCAGGAGCGGCCGGGTGGCGGGAGGA 
ACCGTTACGGGAACrGAAGTTTCGGATTAAGCCTGATCAAGATGACAACCTCCCA^ 
GACrrCGTGGCAGAGCCCATGGGGGAGAAGCCANTGGGGAGCCTGGCTGGGATTGGTGAANTCCT 
GGGCAANAAGCTGGAGGAAAGGGGTTTTGACAAGGCCTATNTTGTNCTTGGCCAGm 
TAAAGAAAGATGAAGACCTCTTCCGGGAATGGCTGAAAGACAOTGTGGCNGCCTACGCCAAGC^ 
NTCCCGGGACTG>mTrGATGCCTTCCANAGTGGTGCNACGCCTTCTTOTGATO 
CTCTNAATTCCCAANCCTCNTACAAAAGTTT^n^ACCGAATAAGGACTC 
CAAAANGAAAAGAATTGT^^S^TTGTCCTCCCT^n^CCCNGGNCG^^^ 
TACATCNTCAACTGGG 

SEO ID NO: 1 582 ACTTTTTTITmT^^ 

GTAGCrrArrAAACATAACATGCAAATAATCAAAGAGAAACATACATGACTTANAGTGAA/^ 

ArrCTANAAAAGTTTCACTAGGTAAGTATGCAAATTCTTATTCTAAAAATAC™ 

GAANCnTCATGTATTrrGCAATATTCTTGGCCTCAATATNTACCACCTAT^^ 

TTGAATGTOTCAAGATAATTNGGNGCAAGAGAGTAACATCCATATGTATTTA^^ 

GAACATTAAGATTTAAGGATTATAAAACTTGGCTGATTTCCATGCANCCACGTAAAAGG 

CATCNm-GACAGTNGAAATANAAAAACACTNAATTTNCAAATAAAGCATTGAG 

TATrrCGGGTATATGNTGNGNGTTCTTGNGANGGAAANAAGGCCTGCCCTT^ 

AAAAAAAAAAAAATGTTTACAAAAACAT 

SEO ID NO- 1 583 acagccaacggtitcccitgggggctttgaaataacaccaccagtggtctta 

AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 

tgcagagtcagaagatgaagaggaggaggatgtgaaactcttaagtatatctggaaagcggtctg 

CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAA CTTG CTGCTGATGAAGATGATGA 

gatgatgatgaagaggatgatgatgaagatgatgatgatgattttgatgatgaggaagctgaana 

AAAAGCGCCAGTGAAGAAATCTATNO^AGATACTCCAGCCAANAATGCACAAAAGTC^^^ 

GAATTGGAAAAGACTCAAAACCATCATCAACACCAIWATCNATAGGACAATAATCC^^ 

NACATGANAAAACTTCTAAAACACCAAAANGACCTAATTTTNTGT 

SEO ID NO- 1584 acgcggggggggtggtgccagtttggagctcctggaaggtaaagtccttcct 

GGGGTGGATGCTCTCAGGCAAATGAGAArrAATCATAGGGGAAACTACCGACAACTTCT^ 

gatgctgactagttacaggctagctaaagtagagggagaagaaagccctgctgaaccanctgcca 

CAGCTACTTmrCGAACAGTGATGCTGGAAACCCAGTGACAATGCAGGAAAGCCAT^^ 
GAAAGTGGTCTTGCTGAATTAAACAGCTCTAATTGAANATGCANGGACAAAGATGAGTGGTGA^ 
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AAATATGACTTTCCTTTTTGGTAATATTTTTGTGAT^ 
GTATGCATTTATATAAOTGTTTTGTTATTTTGAATCTT GGANA ACTA^ 
AGCCTTTGTTTTTTAAAAAAGGC(>nTrGCCATACAC(^^ 
TTNNATTGTTACCITCCCNNGGCGGCCCGTTOTAA^ 

SEQ ID NO: 1585 ACAAGACCAGCAAAGCCCAGCITCrCGGCTGTGAGCTCTGAGAACCTGG^ 
CGATCCGCCCTCCTGTTGCGATGGCAATCAGCTCAATTTCAGGTCCTCCTACCCAGCGA^ 
GCAAGTTGTTCTGAAGAAGTAAGTGATrrGCTTCATCATCAAAGCCCCACTGACAAATTGCT^ 
TAGCACCAGTCTCTTTAATTTGTTGAATCATCTCTTCAAAm 
TTATAATCTTCGACAGAGGTCACATCCAGCTTATGCTTTGTTm 
ATGTGAGAATTGCAATCTTCGCATCTTCCACTTTTm 
CACAATCACCGCCCTTAATCAGTTTAGTGTNCTCCAGCCTGCCGCCCACT^ 
TAAAGCTCAAAGTCAACGTCThrrCCGCTNCATATCTGCTACAGTNAGGACGGGNA™ 
TTCTCAACCATCTTGTCGGTGNCAACTGTTGACCAACTTTGGAA 

SEQ ID NO: 1586 ACCACCAAGAATTCTTCAAATAAGCCAGCTGTCACCACCAAGTCACCTGCAG 
TGAAGCCAGCTGCAGCCCCCAAGCAACCTGTGGGCGGTGGCCAGAAGCITCTGACGAGAi^ 
GACAGCAGCTCCAGTGAGGAAGAGAGCAGCTCCAGTGAGGAGGAGAAGACAAAGAAGATGGTGG 
CCACCACTAAGCCCAAGGCGACTGCCAAAGCAGCTCTATCTCTGCCTGCCAAGCAGGCTCCT^ 
GGTAGTAGGGACAGCAGCTCTGATTCAGACAGCTCCAGCAGTGAGGAGGAGGAAGAGAAGACAT 
CTAAGTCTGCAGTTAAGAAGAAGCCACAGAAGGTAGCAGGAGGTGCAGCCCCrrCAAGCCAGCCT 
CTGCAAAGAAAGGGAAGGCTGAGAGCAGCAACAGTTCTTCTTCTGATGACTCCAGTGAGGAAG^ 
GAAGAAAAACTCAAGGGCAAGGGCTCTTCAAGACCACAAGCCCCCAANGCCAAT^ 
CACTGACTGCCCAGAATGGAAAAGCACTTANAACAGTGAGGGGGAGGAAGAAAAAA 

SEQ ID NO: 1587 ACTlUl'"rilTl Ul 'r ri4 ^14 I TnTrri'l'lGGAANAAGGTCCAAATCAATAGGTCT 
TTTATTGCATCATTTAAATATCACAAGTAGGTCITAAGTGTCATCTGG 
GGTAACTCTTAAATCTTATTCATCAGCCTGCTGAACAGTTCCTTm 
AAAAATTTCCTGATATCCTTGTTITTAACTGTTGNGGCTTGCT^ 
ACAAGCTCAATGTCATITCCTTCAAGGATTAATTCATCTTTCTGGGCT^ 
ACACCTGGTCTCATCCGAACCCTGCGGATGTATTTTTCACCCAAGAAATTTCGGAm 
GACCCATTCTCCTGGATAACAACGTTGATGGGGAAGTGAGCATNCACAGACCTCATCTTGTNACG 
GAAGCCCAGTGTAACACCCTTNATCATGTTCTGTCCTTGGCCGNGACCACGC 

SEQ ID NO: 1 588 ACTTTT]" ll Ulll " rTT r i" ll 'ri- ll TAAAATCTGAGGAATAAATGCi^^ 
AAGTGGCAAGGACATGTAAACAGAAATTGCTTCTATATATTTTAATACAAAATCC^^ 
TAGCACCAATGATTTAACTTACAAATATATTCTGACTCCCTGGTm 

AAATACACAGTTCATCT>m'CCAAAGTCAAAAACAATAAAATAAAATAACACCGACAAGCAAT^^ 

TAGCAATGGCAATTTGAAAACATATACTTTAAGTTAATGGGACTTCAATAGCTC^^ 

GCATNTCATGGTGACGTGACTTAAAAGGTGGTGCAAAGGTAGGTGCACCCCTGGGGGTCATGAAA 

AGTTTCCTGAAGTCTTCATTGGTGAGTTTTGATTGGTGGAAGGAGTGAGGATC^^ 

CCATCGGGGGGCCAAAGGGTTGGGAGAACGGGOTACTATTCTCGCTCCGGCATTTTG^^ 

TTTTCNCCTGNCACCAACCAA 

SEQ ID NO: 1589 acagtttcccagcccacagtcattgcttcattccttgtctgatcagatggtag 
ttagaaaagaagctctcctacatccatcttctataccaggaaagaggaagagtggcaaaagcaga 
gtcttgtgccttcaaagattcattctggttctgttgtgggctaaggcatcctgto 

ATTTTGAGATCAGATAGAGTTGCTCTTCCAGGAGTCAGCACCCACGCTCATGTCCGTCrn 
AAATCTCCTTGGTTTAGTAACTCTTGTGGATTTACTGCAGTTAAGAC^^ 

CrrCCCTGACTGCATGGTCACCAGGTGATGTGTCACATCAGGCACTGAAGACAGGGAACGTGGGGA 
TCCTTCAGCAAGCACCTCGTCATCAACTCCCAGGATAGAATTGGTATTTGGTAGGTTCAAGGAC^^ 
TCCACCAAGGCTGGGCTTGGGAATTCATAGACACCAGGTTTTTACTGGGTCCNCGGGAT^ 

agaagtaagagtaatgtagtattttatacaaatatttataaaaaatat 

SEQ ID NO: 1590 ACTACGACAmCTGCCAAAAGTAACTACAACTTTGAAAAGCCCTTCCTCT^ 
CrrGCTAGGAAGCTCATTGGAGACCCTAACTTGGAATTTGTTGCCATGCCTGCT 
GAAGTTGTCATGGACCCAGCTTTGGCAGCACAGTATGAGCACGACITAGAGGTTGCTCAGACAA^ 
TGCTCTCCCGGATGAGGATGATGACCTGTGAGAATGAAGCTGGAGCCCAGCGTCAGAAGTCTAGT 
TTTATAGGCAGCTGTCCTGTGATGTCAGCGGTGCAGCNGTGTGTGCCACCTCATTATTATCTAGCT 
AAGCGGAACATGTGCTTCATCTGTGGGATGCTGAAGGAGATGAGTGGGCTTCGGAGTGAATGTGG 
CAGTTTAAAAAATAACTTCATTGTTTGGACCTGCATATTTAGCTGTm 
TGAGmCATATATAANACTGGGTGCNGTCACATNACAAATATTTCAAGTGGTGAAAATOT 
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GTTACTGGNCATTTCCCATTTCCrrnTCGTTTAGA 

SEQ ID NO: 1 59 1 ACAGTTCACTCTGCAAAAAATACTCCTTCTCAGCATTCACATTCCATTCAGCA 
TAGTCCTGAAAGGTCTGGGTCTGGTTCTGTTGGAAATGGATCTAGTCGATACAGTCCTTCTCAG/^ 
TAGTCCAATTCATCACATCCCTTCACGAAGAAGTCCTGCAAAGACAATCGCACCACAGAATGCTC 
CAAGAGATGAGTCTAGGGGCCGTTCCTCGTTITATCCTGATGGTGGAGATCAGGAAACTGCAAAG 
ACntjGGAAGTTCTTAAAAAGGTTCACAGATGAAGAGTCTAGAGTATTCCTGOT 
TACCAGGGATAAAGAGGCTTCAAAAGAGAAAGGATCAGAGAAAGGGAGGGCAGAGGGAGAATG 
GGAAGATCAGGAAGCTTTAGATTACITCAGTGATAAAGAGTCTGGAAAAACAAAAAGm 
TTCANAAGGGGATGACACCAGAGGAGACAGANGArrATAGACAGTTCAGGAAGTCAGTCCTCGC 
AGATCAGGGTAAAAAGTTTTTGCTCCTGNTCTCACCGGAAT 

SEQ ID NO: 1592 ACGCTTGATCAGATTTCTGATGCAGATAATATCCCAGGACTTTTGGTTCTCAA 
AAGCTTGGCCTATCGGAACAAAGGTTCATTTGATGAAGCTGCAAAGATTATGGAAGACCTTCTCT^ 
rrCTTACCCTGACCTAGCTGAAGTTCATGCCCTTGAGGCTrrGATTCAm 

CTACAAGCAGAAAAATGTTTTCAGAGAGCTCTTGAGAAAGATACCGAAGTTGCAGAATATCATTA 

CCAACTTGGATTAACATACTGGTTCATGGGTGAAGAGACAAGAAAAGATNAAACAAAGG 

CCCACnTTCTGAAGGCTGCAAGACTGGATACATATATGGGCAAAGTTTT^^ 

ATTATAGAGACGTAGTGGGAGATAAAAACAGAGCTCGNGGATGTTATAGGAAAGCCCTTTAAT^^ 

NATGACACTGATGCTGAATCTGGAGCTGNAGCCNNTTGANCCCTAGTGNGGGAGCTTGGA^^^ 

TGGGAAATGGmTTNAGCTITCCTTAACAACANTTACTCNAAAAAGG 

SEQ ID NO: 1 593 ACGCGGGGTCCTGCTTTTGGTTCTTACAGTAGTCGGCGTAGGCCTTAGGTGGG 
rrCGTGCGCCTTCTACCTCGCTGTTTCGGTTTTCCTGGCTCCTCGGCCCT^^ 
TGGGAGCGGACGAGGCGCGAAGCTGGGATTTTTTACTGTCTCCTGAAGAATTTAAC^ 
GATATCAGACCAAATCATACAATTTATATCAACAATATGAATGACAAAATTAAAAAGGAAG^ 
GAAGAGATCCCTATATGCCCTGTTTTCTCAGTTTGGTCATGTGGTGGACATTGTGGC^ 
CATGAANATGAGGGGGCAGGCCTTTGTCATATITAAGGAACTGGGCTCATCCACAAATGCC^ 
AGACAGCTACAAGGATTTTCCATTTTATGGTAAAACCAATGCGAATACN^^ 
TCNGGATATANATATTCAAAAAOTGCGGTGGAACTTTTGCTTGACNAAAAGAA^ 
AA>n^GAAAAATAGGCCAAAACrrm'GGAAAANACTGNAACAACCNAC^^ 

SEQ ID NO: 1 594 accatctgtggtggctctctgcaagttitaaaactgcctctgctgagctct^ 

TCATTTTGGTGGTTTCTGTGTTAGATCTCGTTAGTCTGCATTCCACAGCITCTCAGT^ 
TTCCCAACTTGTCCGGAAGTGTrrCCAGAATACTGATCACTTTTITr^ 

CACAAAGTCTCAGACTAGAAATAATTACCCAGTATGATCATGGCATCCAAGACCAGAGTCTCAGA 

ACTCATTAAGAAACAGTTTACTTGGAATGGAGAATACCCATCTGTAATACAGGTCCTGTCATTT 

TTCATCTCAAATTATTTTTGAATrCTTCCCAAATGGCTGCTGGAm 

GGCCATAAATCTGAAGCCrrGANAACCTTGGGTCTGGAGACCATGAAGAGGGAAGGAAAAANAG 
GGGCAAGTTCCTGAACCTAACCAATGACCTGATGGGATTGCmGACCAAGACACAAGAAGNGAA 
NGCTGGTGTCTGTTGCCCTTCCCCANNAGACTGGAGTNTTTTGGGG 

SEQ ID NO: 1 595 ACTTGCCCCTTCCCCAGAAAAGCGGGACTTGCTGCTAAGGGTGAAGGACCAA 
GGCAGTTGTCCCTGCGTGGTCTGACACCCrrGAAACGTGGGTGTATAATCAGAGAGGCATCCCTGC 
AATGATTAAACACCAAGGGAAGGCTGCCTTCCCAGTCTGTGACCAGCGCCGGAGTTTTGGGTCCA 
CGGATAAAACGTGTCTCTTTTGTCTCTACCAGAAAATGAAAGGAATTGAAATT 
AGATTGAAGTGTAGTGCCAAGATTGAAAGGAGAAAGTGGTTGAGGGATAGTGAGGGAAGTTGGA 
GAAGAGAGTAAAAAGAGGCTGCTTACCAGATITGAAATTGGTGAGATGTTTCTTGGGCT^ 
TCTGAGGACCTGAGGTCCGTAAGTGGATCTTTCTCAGGGAGCAAAGAGCANGGAGGACGGAGGAT 
rrGATCTCCCAAGGGGAGGTCCCCCGATCCGAGTCATGGCACCAAATTTATTGTGCCGTCCATGTG 
AAAANAACCACAAACAGGCTTTTGTGTGAGCAANATGGCTGTTTATT^ 

SEQ ID NO: 1 596 ACGCGGGGAAACCGGACCCGCAACCACCATGAACAGCAAAGGTCAATATCC 
AACACAGCCAACCTACCCTGTGCAGCCTCCTGGGAATCCAGTATACCCTCAGACCTTGCATCTTCC 
TCAGGCTCCACCCTATACCGATGCTCCACCTGCCTACTCAGAGCTCTATCGTCCGAGCTTTGTGCA 
CCCAGGGGCTGCCACAGTCCCCACCATGTCAGCCGCATTTCCTGGAGCCTCTCTGTATCTTCCCAT 
GGCCCAGTCTGTGGCTGTTGGGCCTTTAGGTTCCACAATCCCCATGGCTTATTATCCAGTCGGTCCC 
ATCTATCCACCTGGCTCCACAGTGCTGGTGGAAGGAGGGTATGATGCAGGTGCCAGATTTGGAGC 
TGGGGCTACTGCTGGCAACATTCCTCCTCCACCTCCTGGATGCCCTCCCAATGCTGCTCAGCTTGC 
AGTCATGCAGGGAGCCAACGTCCTCGTAACTCANCGGAAGGGGAACTTCTTCATGGGTGGTrCA^ 
ATGGNGGCTACACCATCTGGTGAGGAACCAAGGCC 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 1 597 TGTACATGTTGTGGGTGCCGCTCCGGGAGTCATAGCGCAGCCAGATCCCGAA . 
GTTCTTCACCCGCAGGGGGGACTTCTCAAACACCTGCCCACAGTAGACAATCTCCCCTGAAGAOT 
CTTCATCTTCTTTAACTGAGATACAAAGTACCTTGTTCACTGTGGCATAATAGAACCGT^^ 
CCTCTTAACTGCAAAAGATACCAAGATTATTCTGGTTATCCTGGATGCCATTTCAAATATC^ 
GCTGCTGAGAAACTAGGTGAAACTGAGAAACTTAGTATAATGATTGAAGAATGTGGAGGOT 
CAAAATTGAAGCTCTACAAAACCATGAAAATGAGTCTTGTGTATAANGCTTCGT^ 
AGAAGTATTTCTCTGTAGAGGAAGAGGAAGATCAAAACCGTTGT 

SEQ ID NO: 1 598 ACACCTGGCTTGAGGCTGTCATCTTCCTCATCGGTATCATCGTAGCCAATGTG 

ccggaaggrrtgctggccactgtcacggtctgtctgacacttactgccaaacgcatggcaaggaa 

aaactgcttagtgaagaacttagaagctgtggagaccttggggtccacgtccaccatctgctctga 

taaaactggaactctgactcagaaccggatgacagtggcccacatgtggtttgacaatcaaatcc 

ATGAAGCTGATACGACAGAGAATCAGAGTGGTGTCTCTTTTGACAAGACTTCAGCTACCT^ 

CTCTGTCCAGAATTGCAGGTCTTTGTAACAGGGCAGTGTTTCAGGCTAACCAGG/^^ 

TTCTTAAGCGGGCAGTTGCAGGAGATGCCTCTGAGTCAGCACTCTTAAAGTGCATAGAGCTGTG^ 

gtggttccgtgaaggagatganagaaagataccccaaaatcgtcgagatacccttcaactccacc 
aacaaagtacc 

SEQ ID NO: 1599 acaagcctttaaagaagcaagacaaaatgttgctgaagttgagtcatcaaag 

AATGCTTCAGAGGACAATCATTCTGAGAATACTTTGTATTCAAATGATAATGGAAGTAAm 

CGTGAAGCAACTGTCATCAGTGAGCAAAAAGTCAAAGAAACCAAAATATTGGCGAAGAAACCA^ 

TACATAATTCAAAGGAAAAAATAGCAAAGATGGAACATGGACCTAAAGCAGTGACTATTGCAAAT 

TCTCCATCAAAGCCTTCAGAAAAGGATTCTGTAGTTTCCCTTGAGTCCCAGAAGACACCTGCT^ 

CCAAAACTGAAAACTCTAAGTCAAACCAAAAAAAACAAAGGATCTGATAGCTCACTCTCT^ 

CAGTGATGGCGGANAAGAATTTTGTGAAGAGGAGAAGGAATATTTTGATGATAGCACAGAA^ 

aggttttacnagcagtctttcatgtotgaagatagtgatagcggtgacnacttot 
agtcagacggacacnaaagaaagaaagtaggtgtcattctt 

SEQ E) NO: 1600 ACTITITITTlU l " lT nTrri"i'll"ri"lGGGCAGAGGC^^ 

aaaaaattgaacaaaganaccctnttgcganaggtganatgaggccctgccatgcaaaggagtc 
ccagcagaggaggaanaattccatcctggagttcaagtttctgtgcananacaggacctggggac 

ANANAACGGTCCTCCACCCAATITCAGCTGGTCTGTCCTCATCANCTTTGGGCTG/^^ 

atganaggnggagtctcccctggatggngaccggacitotggccaaacitgc 
ctcatgacnatgatgatgcccatggngcacanaaccccagcgcaaatgagcccgccaacctggag 
gctgtgccagtcatagtagaaaggactgtttttatcttotaggncattc^ 
agcctgccaanaacacaagcaggcccagggtcaccttntgcatgtcaaaaccnctggccttgtgt 

GGTTTNCCAAGasrrGAGCAAAAGTTCCNGGAGAAAATCC 

SEQ ID NO: 1 601 ACAATTTAAAAATAAGTCTATGTTTTCACATTGATrrTAA.^^ 
TTTGAATTACAAATGATTAAGCAAACTCTATTACTTCATAGCTGACCATCTTCCAGAAA^ 
CTTAATTGAATACTTAGAAAAAAATGGCCAGTGGCCGATrGAAAGGTATATTAAAAT^ 
GTTTTAATTCTGAAGACAAATATCTTCATGGAAATCTATTTGTAAGCTTCTGAGAT^^ 
AGTCTACAGTCTGTGAATATACCAATTCCCCTTTACAACTGATGCAGATCATTATGAAATACTG^^ 
AGGCATACCCTACAATTTAGGAATTGGTGTGGCTGCCACTGCTATGCTCTCAATTGCACACTCATC 
AGrrCCTCTGCGGATCCNGGAAGTNGCCNTTCTCACCCCATCCGGNGCCCCANCTGllll"l"rOT 
ATCCNAGTANTNCATCCCNTAGGGCTGANTCAGTGCCATNGCCCACATGCNGAACAGCATTN^ 
TTOTNANCCTNAAAGGGGNTGAAAGGGTNNCTTNANACO^ 

SEQ ID NO: 1 602 ACAGATGGGGTCTTGCTATGTTGCCCAAGCTGGTCTTAAACTCCTGGCCTCAA 
GCAATCCTrCTGCCTTGGCCCCCCAAAGTGCTGGGATTGTGGGCATGAGCTGCTGTGCCCAGCCTC 
CATGTTTTAATATCAACTCTCACTCCTGAATTCAGTTGCTTTGCCCAAGATAGG^ 
AGAAATTATTGGGCTCTTTTAGGGTAAGAAGTTTGTGTCTTTGT^ 
TGTCTACTCTGAAGACCTTTAATGGCTTCCCTCTTTCATCTCCTGAGTATGTAAC™ 
GCTAT<XAGTGACTTGTTCTGAGTAAGTGTGTTCATTAATGTTTAm 

ATATACTCCAGGACTTAAAATAGTGCCTmAGTGCTGCAGCCAAAGACAGANCGGAACTATG 

AGTGGGCTTGGAGATGGCAGGAAAAGCITGTCATTGACCCTGGCAAATTTAACAAACT^ 

GAGGATGATTGAGGGTGGGGTCCTACCC 

SEQ ID NO: 1 603 ACAAAACCCTTGCTCAGAAGCTGTATCAGCATGAAATCAACTTATTCAAAAG 
TAAGACGAATAGTCAAAAGGGAGCCTCTTCTACCTGGATGAAGGCAATTGTGTCATCGGGGACAC 
TAGGTGACAGGATGGCAGCCATGATTCTTCTTArrCAGGATGATGCCGTTCACACACTTCAGm 
TAGAAACTCnTGTGAACCTTGTTAAAAAGAAGGGCAGCAAACAGCAGTGCCTTATGGCCTTGGAT 
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ACmCAAAGAGTTGCTrATCACAGACCTTTTGCCAGACAATCGGAAGCTGAGG^ 

CGTCCTTTTGACAAACTGGAACAGTTGTCCAGTGGCAACAAGGACTCAAGAGATAGAAGACT^ 

ATTATGGTATTITGAACACCAGCTGAAACACITAGTGGCTGAATTTGTGCA 

AAGTCATGATACATTAGTAACCACTAAAACTCGAGCCCTTACCGGTGGCTCATGAGCTGCm 

CAAAGCCTGANGGAAGAAAAGCTCTTCITGGTGCAANTGGTATATAAACTGG 

SEQ ID NO: 1604 ACTTTN'ri-rriiTrri I'riiTi 1 1 1 1 1 1 1 1 1 1 aaaatatttatttttcttaacgac 

CTATTACGAATATTATTTACTTAATATAATCACAGTCTAATTTCTCAGTTAm 

AACTTCCCAAGCTGATGAAAAAGTTTNTANAATAAGCCCCATTrCACANA/^ 

TAGGGAAGTTAAAAAACTTGCTCAAAGTCACACAGNGAATGGTGCAGCAAAGATTCAAACTAGAC 

ACGACTCACTCCGAGTGATTAGTTITATCTGGATGAAGGAATTGNGCCCAGTCAAACCAACTA^^ 

GGTAAGTAAACAGCCTGCGTGCTGGCAGGTGCCATCCCGCGTACCTGCCCGGGCGGCCGCTCNAA 

AGGGCG 

SEQ ID NO: 1 605 ACTGAGCAGGATTACCATGGCAACAACACATCATCAGTAGGGTAAAACTAAC 

ctgtctcacgacggtctaaacccagctcacgttccctattagtgggtgaacaatccaacgcttggt 
gaattctgcttcacaatgataggaagagccgacatcgaaggatcaaaaagcgacgtcgctatgaa 
cgcitggccgccacaagccagttatccctgtggtaacttttctgacacctcctgi^ 
aaggtcangaaggatcgtgaggccccgctttcacggtctgtattcgt 

SEQ ID NO: 1606 acgcggggcttttttcgaggtaggagtcgactcctgtgaggtatggtgctgg 
gtgcaaatgcagtgtggctctggatagcaccttatggacagttgtgtccccaaggaaggatgaga 
atagctactgaagtcctaaagagcaagcctaactcaagccattggcacacaggcattagacagaa 

AGCTGGAAGTTGAAATGGTGGAGTCCAACrrGCCTGGACCAGCTTAATGGTTCTGCTCCTGGT^ 

gtttttatccatggatgacttgcttgggtatggagagtcggcttgactacactgtgtggagcaagt 
tttaaagaagcaaaggactcagaattcatgattgaagaaatgcaggcagacctgttatccta^ 

TAGGGTTTITAATGACCACAACAAGCAAGCATGCAGCTTACTGCTTGAAAGGGTCTTGCCT^^ 

AAGCTANAGTGCAGTGGCCCTTTNAAGCTTACTACACCTCAAACTTOT 

CACCTCCCANTGGGTCTTTTGTAAACTGG 

SEQ ID NO: 1607 ACGCGGGGACGGTTCGTTTTTCCTTTAGTCAGGAAGGACGTTGGTGTTGAGGT 
TGGCATACGTATCAAGGACAGTAACTACCATGGCTCCCGAAGTTTTGCCAAAACCTCGGATGCGT 
GGCCTTCTGGCCAGGCGTCTGCGAAATCATATGGCTGTAGCATTCGTGCTATCCCTGGGGGTTGCA 
GCTTTGTATAAGTTTCGTGTGGCTGATCAAAGAAAGAAGGCATACGCAGATTTCTACAGAAACT 
GATGTCATGAAAGATTTTGAGGAGATGAGGAAGGCTGGTATCTTTCAGAGTGTAAAGTAATOT 
GAATATAAAGAATTTCTTCAGGTTGAATTACCTANAAGTTTGTCACTGAC^ . 

tgacacatgaatatgtgggctaagaaatagttcctcttgataaataaacaattaacaaatacm 
gacagtnaaaaaa 

SEQ ID NO: 1608 acgcggggctcttcctgctctccatcatggcgcaggatcaaggtgaaaagga 
gaaccccatgcgggaacttcgcatccgcaaactctgtctcaacatctgtgttgggga gagt ggag 
acagactgacgcgagcagccaaggtgttggagcagctcacagggcagacccctgtgttttccaaa 
gctagatacactgtcagatccrrtggcatccggagaaatgaaaagattgctgtccactgcacagtt 
cgaggggccaaggcagaagaaatcttggagaagggtctaaaggtgcgggagtatgagttaagaa 
aaaacaacttctcagatactggaaactttggttttgggatccaggaacaca 
aatatgacccaagcatttggtatctacggcctggacttctatgtggtgctgggtaggccacgtm 
caacatcgcagacaaaaagcccaggacaggccggattggngccnaaccccanaatnatcaaaan 
aaggagg(xcttgccgcttggtttccagcaagaaagtttgatngggg 

SEQ ID NO: 1609 ACTi l llllU'll"14'i''l l T TTTTCCACACCTGCCCTITATTGGTCTCT^ 

AGTGGCTCCAGGCCCTTCACGCCIOTCANACACCACCCATGAGGGTTTAGGAAGGTGCCATCA^^ 

TGTGAAGGCCCAAAGCTTACCCAAGTCTTGGAGCCCAAGTTGAATCACCAACCANAGGGTTGGGA 

GAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGATATCAAG 

GGAATGCTGAGGTCCAGCAGTGTCTCCTGAAGGCATGCTGCATCCTAAGGCTCCTCAGGACTGGA 

TGGAGTAGGAGATCTGTGTGTTGAGCAGTTCACATNTATATGGCAACTITAAGGAGGCCCTTGATG 

TCAGGCTCAATGTTGATGGTTGGGAAGGTGCGGCTGTAGCGTCNGAAGGGCTCTCCTCCGGCCCTA 

TGAAGAACTTTCAAAAGTTCCAGCCACATCTGAGTGGCNCACANGGCTCCAAAOTGATGAGC^ 

GGATCGGCATGANGGGAAAATGGGTC 

SEQ ID NO: 16 1 0 ACAACAGTGAACTAGAACAAAAGGTAAATGAATTAACAGGAGGACTAGAGG 
AGA<mTAAAAGAAAAGGATCAAAATGACCAAAAACTAGAAAAACTTATGGTTC^ 
CTCTCTGAAGACAAAGAAGTArrGTCAGCTGAAGTGAAGTCTCTTTATGAGGAAAACAATAAACT 



240 



wo 02/29086 



PCT/USOl/30732 



CAGTTCAGAAAAAAAACAGTTGAGTAGGGATTTGGAGGTTTTT^ 

TCCTTAAAGAACATATTACTCAATTAGAAAAGAAACTTCAGTTAATGGTTG 

TTAAATAAACTGCTTGAAAATGAGCAAGTTCAGAAGTTATITGTTAAAACT^ 

CTTAAAGAAATGGGATCAGAAGTITCAGAAGACAGTGAAGAAGAAAGATGTTGTTAATGTCCTAC 

AGGCAGTCGGGGGAATCCITGGCAAAAATAAATGAGGAAAAAATGCACCTGGCT^ 

ATOAAAAAAGTATTAGAGTTAGGAAAAAAGAGATTAAGTGCCTTCNANA 

SEQ ID NO: 1 6 1 1 ACCTGGTAGAAATTGTGTCTTGGAATGACCCTTTCGAGTTATTGACATGGCTC 
TGATGAATAGAACATGAGCCCCAAAACTAAATCCAAAAGGAATTTTCTATCm 
GTGGCAAGACAAGTTGGCCCTTTCTTACCCAGAGGTCriTTTGTGTGACTGCATC^ 
CTCCATTGNGTGCTTTCCATTTTGTCTTTAGTGCCTATACTGT^^ 
CAAATTTAAGCCATTGCTGCTCATTAGCCTTGTATTTTGTGTGCATATCAT^^ 
GTTCGCTTTAAGCATTCITATATCACACTGCTCCTCATCTACCATATG 
TTTGTCTGATCAGGGAAAAGCATGGGCACACATCTTCCTCCTC 

SEQ ID NO: 1612 ACAGAGGGGTCTGTTTCTAAGTCTGGAACCTCAACACGAGGCTGGGATGCTT 
CACAACGTGCTCTCTCCCACTGTCCAGCCATCTTTGGTGGTCTTCCCACAGTGOT^ 
TCACACTCCTGTGGAGGGACCATGGGACGGGCCAGGAGGAAACCTGGGATCACTCTGACAGGAA 
ACGGACAAGCACAGCTGCCAGGAGCCAGTTGCTCTGGCCTCAGTCrCCAATTGTTAGTCTCAGGAT 
CAACAGANAACTGGAAAGCAGCAGAATCTTGGAGGAGCAGGAAGAGAGCCCAGAGGGAGTGITG 
TGCTGAGTGACGGTTAACATATGAAACAGAATTTCATGGAGATTCTTTGrrACAAGGA 
TCAGTCTCAACTCCCAAAACGTGAAAGTTGCTGTTTAAGAGACTATGTTTTC 

SEQ ID NO: 1 6 1 3 ACTTTTTTTr r rnTiii4'iiiuni'iiiU4' N ^ 

AGTTACTACAAGTCAATAAATATTGATCCCCAAANAANAGCTCGGTTATTTATCAAATTACT 

CATAAAGAGCTGAATGAAACTGAACCAAGCGATTGCTGGGATGACTGAAGNCACTCCTGC™ 

GGGAAATGAAGCCACAGCCNGCTCATNTOTGGCATATGCTGNTCCACCAACTGNAANAATCAT^ 

NGGCniSITATCAGGAGGACATCAGCTACCCCACCCTTTAATACAGGGGAATTTC^^^ 

AACAGTTTTTGCITCTCCGGAACmATTm 

CCAATOTGACGAAAACCANCAAATTCC 

SEQ ID NO: 16 14 ACATAGGTAACCAAAGTATATAGCnTATTTGGTGAATCTTCATCCTCATTACT 
GTTTTCTGGACAGCCGCACACGGATTCGGTATGGCACATTCCTTATTCCT^ 
NTTTGTTGAGCCTGGTGTCAATGCGCACATCTGGGANTTCCCANTCC^^TCANGGGCAAAT^ 
ATTCTCTraGAGGGGCCCGAGGTGCNCGCTTOTTNAAAGCCANNTCCATGGG 
ATGTTGATGGGGTNTNTTCGGGTTACCACTmGNTGATGGCANAACGGGNCCTTT^^ 
CTTTNTTTGCGGGACCNTTCTGNANNGCCCAAGTGAAAAGGACCCNGGT 

SEQ ID NO: 1615 acaggataatatactcagatatttttaaaataaactacttaataata^ 

TTAGCCATACCACATTGTTCCATTTGCTACAAGAACAAATTGGCAATGAAGACTATTTAA/^ 

TGCTCAGCTCTACAGAGGGTGGTGGCAGGCAACACTTITCCATTACAGAATAACCTCTAT^ 

ATGATACATATTCCTGTGGAAAAACTTTGCAGGGCCCAGGGATGAAAAATAGAGCTTTG 

TTAGCTAACTGTAGGTTCACTTAACATCTTTGGGAAGGACCCAAAAAATCTGG 

AAACATCTGCAAGCTGCAGAATTCCTTANCCTCAGCTATAGTTTCTGCTAGAT^^ 

ACAGTTCCCTGTGACTCCTCCTCAGNTATGGGGGTGGG 

SEQ ID NO: 1616 ACTTTTITr i 'ini" 1 4nT r i"l'l U - il U-riTrilTi'GNANATT^ 

TAATGGCTGATCTATGTAATCACAGAGGCCAGTATGTACANACAAAGGGGGAGGTTTTATTTOT 

GTCNCTTCCTCCTTGGATAAAGTCTTGATGANCTCCTCCTTNTTGGCCT 

NGCTTGCGTGCTrCCTTGGTCTTAAACCTGCGGGCCTCAGCCTGGTC ANCCAG NAGCrri^ 

GCCTTGTCTGCCITCAGCTTGNGGATGTGTTCCATGAAAATCCGCTTGT^^ 

NACCTTCAGGGACCNTGCCCGGGCGGCCGNTCNA 

SEQ ID NO: 1617 ACCTCGGGGACCTGCTGGAGACCTGCCATTTCCAGGCCTTCTGGCAAGCCCT 
GGATGAAAACATGGACCTCTTGGAAGGTATAACTGGCTTTGAAGACTCTGTCCGAAAGTTTAT^ 
CCATGTTGTGGGTATCACnTACCAGCACATTGACCGCTGGCTGCTGGCCGAGATACTCGGGGATCT 
GTCGGACAGCCAGCTAAAGGTGTGGATGAGCAAATACGGCTGGAGTGCCGACGAGTCGGGGCAG 
ATCTTCATCTGTAGCCAAGAAGAGAGCATTAAACCCAAGAACATTGTGGAGAAGATTGACm 
CAGTGTGTCCAGCATCATGGCCTCCTCCCAGTAACTTCAGGTGTTTAATAAAGATGTGTTGACCCA 
AAAAAAAAAAAAAAAAAAAAAAAAAAANTCCTCGCCGCGACCCCCTAAGGCG 
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SEQ ID NO: 1 6 1 8 ACCAGCTGGCACAGGAGCAGGGGGCATGGCACCTCTGTTGTTTATGCCCATA 
GCACCTCCCATAGCCATCTGACCCATCCGAATCTCCTGCTCTCTCGCATCAGGGAAGGTTCCCTTG 
AATCCTTCCTGCTGTCGCCGCATCATTTOTCTTGCTGCCGCCGCATCT^ 
GCTCTTCCTCCTGCCTGAGCTCCAGTTGCTTTCGTTTT^ 

CTCCGAAGTTCTTCTTGGCGCCTCATCAAATCCTGTCTCATTAGCATGACCTGGTGCTCATGGCGTG 

CAGCTTCATCTCCATCTCCAGCTTCTCACGAGCCTCITGATGTO 

TCTCCATCTCAATGAGTGCCTTCAGCGCATGGCATATTATACTCAAGGAG 

SEQ ID NO: 1 6 1 9 ACmTTATTCr ri - rr il'r'i i ' r i l ^ T TTTTTGAGACAGAGTCTGTCTCT^ 

GGCTGGAGTGCAGTGGCCCAAGCTACGCTCACTGCAAGCTCCACCTCCTGGGTTCACACCATTCT^ 

CTG(nTCAGTCTCCCGAGTAGCTGGAATTACAAGCACCCGCCACCACGCCCAACTAATATTTTGT^ 

TTTTTAGTANAGACGGGGGTTTCACCGTGTTAGCCAGNATGGTCTCGATCTCCTGAOT 

CTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGNGAGCCACTGCACCCAGCCTTT^^ 

TTTTTTGAGANNNAGTTTCNTTCTTGTTGCCCAGGCTAGAGTGC^ 

ACCTNCGNNTCTAGGTTCANGCGATTCTCCTGCCAGCCTTTGAGTAGNTGG 

SEQ ID NO: 1620 ACTTTTiiu"r iu u"i4"rriU" n 'iiiTi Tri ini4TCTNGi"rrri'i'^ 

TNATTCAACTTTATTAAAAATTAAAACTACANAAACCAAACCGA^ 

GAAACAAACAGGGANCAANGGCAGCTGATAACTGNGGCNAGNGGATAATCCGNATGACTCANAT 

CCCACCCCTGCNACTGAGGAGCCCAAAACNCTGCGCTGGCACTGTCGNGCCCTNrrrCCACC 

GGGCNCAAAAACCTGCNGGANACTGAAAACGATTACAAACNTNTTCAACCAGCT^^ 

GGCAGNAAAAGCAGCCTGAATACNCAACTCACNCCAANAGGGCAGCAGCTCTCCTGACATCCATG 

TAAGANGGNTAACACCTAAACCACACGCAGGCNTCTGAACTCA 

SEQ ID NO: 1 62 1 ACACACTTTATTTACTTCGTmGGTTAAGTTGGCTTCT^ 

GTTTCCTAAAAGTTCATAACAGTGCCATTGTCrTTATATGAACATAGACTAGAGA^ 

TTTTCCATCATAATTCTAATCTAACAATGGAAGATTTGCCCATTTACACr^ 

GATGIAAATAACCCCATTCmGCTTGAACACAGTATTTTCCCAATAGCAC^^ 

TTTCTTTGGTGCCTTTCCTGTTCAGCATTCTTAGCCTGTGGCAATAAAGAGAAAC^ 

GACGACAAAGCTGCTAAATCTCCTATTTTTTTAAAATCACTAAC^ 

AAAAAAGTCTCTATTTAAATTCTITTTAATTTTOT 

SEQ ID NO: 1622 ACTTCTCCGTTGACAAAGAAATTCTAGGTGAAATTAAGAGTCATGATCTGAA 
ACCTAATGGTGGCAATATTCTTGTAACAGAAGAAAATAAAGAGGAATACATCAGAATGGTAGCTG 
AGTGGAGGTTGTCrCGAGGTGTTGAAGAACAGACACAAGCirrCTTTGAAGGCT 
TTCCCCAGCAATATTTGCAATACTTTGATGCAAAGGAATTAGAGGTCCTT^ 
AGATTGATTTGAATGACTGGCAAAGACATGCCATCTACCGTCATTATGCAAGGACCAGCAAACAA 
ATCATGTGGTTTTGGCAGTTTGTTAAAGAAATTGATAATGAGAAGAGAATGAGACnT^^ 
GTACnXjGACCTGCCGATTGCCAGTANGAGGATTTGCTGATCTCATGGGGAGCAATGGCCCANAT 

SEQ ID NO: 1623 ACTTAAATATATATTTATTCATTTCTACATATATAGAACTTGTAGGTAAAGTA 
GAAAAAGTTCCCACTAGGAAGGTAATTAAAGGTTGTTAATGTTCnTm 
AGACATGCrrArrTCTGCTCTCCAGAAGCAATGTTAGCTACTAGTTTCTGA^ 
TATCTTTCAGGAAGGTTTACGTGAGATATTTATTCAGTCTTm 

ATAGGCATTATACTGAGAGTCTGTTCCAAAAATTCGATCCCACCATGTAAATGTTGAAGCATA^ 

TCCAATGAAGTTCATGTGGTGGAAATCATrGATGCCGANAACCAGCATAGANAGGGATCAGArrr 

AGGGGTGAGAGGAATATCTAACCCTTTGGACATCAATAGTTCTAATAAACGANTG 

SEQ ID NO: 1624 ACTGTTAAAATGTGGATGGCACCCTCCCAAGGATTGAAAACACACAGCTGGA 
tcatgcatctggaatattttcttttttatccaggaaagtgttttct 
atcaccttacacgtggtgcccgagagccccgccatcgccatccccgccct gcatct gcatctgcat 
ctgcgcctgcatccttgcaatcatctcrrgcatgcggcggagctcancttcm^ 
tggtcmattcatgtcctcatrctccactttcctgccgcctctcttgagt^ 
tttcataatgaaggtcctgggtcacctcctggagatcctgcatgtgggtgatgagcatggttct^ 

CTTCAGAAAGTCATTGTCTCTGGGTTCTCCCTTCACAACACCCCANGGGTA 

SEQ ID NO: 1625 ACCATTTATTTGTCTGCCGCTTTTAAAAAATACCCATTGGCTATGCCAOT^ 
AAACAATTTGAGAAGTTTTTTTGAAGTTm 

GTTGTGTAGACTTACTTTAAGTTTGCACCCTTGAAATGTGTCATATCAAT^ 
CAAGATTAGCAAAGGATAAATGCCGAAGGTCACTTCATTCTGGACACAGTTGGATCAATACTGAT 
TAAGTAGAAAATCCAAGCTTTGCTTGAGAACTTTTGTAACGTGGAGAGTAAAAAGT^^ 
TCmGCTGATGCCTTCTGCTTGAAATAACAGTCACCATACAGCTAAAGGAGAGGACr™ 
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TTCTAAGTAGGCANAAATGGATCATTATGTTGCCGTCTCAATCTCCCAGAGNTCGTCT 

SEQ ID NO: 1626 ACCTACAGACACTTTTACAGAGTTAATACTAAAATTACAAATTGATG^ 
TCGAGGCAAAGCAGGGTAACTGTTCCTTCAGTGTTGCCATCACTGAATTGACm 
CATACTGTTTGGCTGAACACTGACATTCCATAGTCTACAACTATAAAGATACCTACCTATGTGAGT 
TTAAAAAGTGATCTGCACAAAAGAAATAAAAAAACTTTATATACAACCAATTG 
ACAAAAATAACATCTGTCACrmGTCATGCTGACAATTTGACATATATACACATGCAGGAGAAAA 
GTTCCAATTCAGTGATCTGGACAACTGTAGCrrGTATATTTTTAATATGATGGACACAC^^ 
AATTTTCACACTGACAGAGATGTGTTTCAAAATATTCATATACTAAATCTA 

SEQ ID NO: 1627 ACGCGGGGGCGTCTTGTTCTTGCCTGGTGTCGGTGGTTAGTTTCTGCGACTTG 
TGTTGGGACTGCTGATAGGAAGATGTCTTCAGGAAATGCrAAAATTGGGCACCCTGCCCCCA^ 
CAAAGCCACAGCTGTTATGCCANATGGTCAGTTTAAAGATATCAGCCTGTCTGACTACAAAGG^ 
AATATGTTGTGTTCTTCTTTNACCCTCTTGACTTCACCm 
CAGTTGATAGGGCANANGAATTTAAAAAACTCAACTGNCAAGTGATTGGNGOT 
ACTTCTAANATCNAGCATGGGTCAATACACCTAAAAAAC>ITNGAGGACTGGGACCCATGAACAT^ 
CNTTTGNTATCATACCNAANCGCACCATNCTCATGATTATNGGGGCTTAAAGCTAGATAAN^ 

SEQ ID NO: 1628 ACGCGGGGAGTGACGGTGGCGTTTCCTTGAGGAAGAGTGAGGGTTCCAACTT 
TTCTGCTTATCTGGGAGGTGTTGGGCGCGGACAGTCGAGATGTCAGAGAAAAAGCAGCCGGTAGA 
CTTAGGTCTGTTAGAGGAAGACGACGAGTTTGAAGAGTTCCCTGCCGAAGACTGGGCTGGCTTA 
ATGAAGATGAAGATGCACATGTCTGGGAGGATAATTGGGATGATGACAATGTAGAGGATGACTTC 
TOTAATCAGTTCGAGCTGAACTAGAGAAACATGGTTATAAGATGGAGACTTCATANCATC^^ 
GAAGTGTTGAAGTAACCTAAACrrGACCTGCTTAATACATTCTAGGGCAGAGAACCCANGAT^^ 
ACACTAAAAAAATGTGTTTATTCATTATCTGCTNGGATrrAm 

SEQ ID NO: 1 629 ACGCGGGTGGACTGTTGCAGCACCCTCCCTTGGTCTCCCAGTCTGAAGTCTCT 
CCTCTTGCCAATCCACCCTTCATCTTCCTGCCAGGTTAATTTCOT 
TGTGCCAAAGACTTTCATTGGCTTCCGTTTGCTTAAATTATCAAGAACAGAl^^ 
TCAAGGTCTCCATCCCGTCTTTTCAGGTTTATTTCTCACAGCTGCCCTTCA^^^ 
GCTGTGCACCTCGCCACCCGAAGGAAGTGGCACTAGGAGCGTTTACATGCTATTTCCTGTCTCCAC 
TTTAAAAAGCTCAGAATAATOTCTCACCGGGCGCAGTGGCTCACGCCTGTAATCCCAGCAOT 
GGAAGCCAAACAGGAGAATAGCTTAGCCTGGAGTTCGAGACAGCTGGACAACACACAAG 

SEQ ID NO: 1630 ACTCCATCCCACAAAAGAATTCTTGTTCTGGTCTTTCAGCCAGGTAAATAAGG 
GCTCAAAGTAGTTGAGCAGTGGCCITACATTCATGTTCTTTGCTCCTACAACAT^ 
GGTCCAGGGTTCTGATTTTCCAAGCCTCAGCATATTGAACAGTTTCTGTCCAGCTTCT^ 

gagatgtcacatttgtgcagagggccttcatgtttagctgcttgacaaagtgcttot 

aattggtaaagggccttgtgtaatatcgaatgaatgagtaatcattagaaacatggaacagagat 

gcggggtcacagtatgtttcatcatggggcacaggttccccaccccaactatctctcgcttcat^ 

CCACCACTTTTCATCCACTGGTCTTTGGGAATTCCCCTTA^ 

SEQ ID NO: 1 63 1 ACGTGGGCTATCAGTAGATCTACCATTCTGGGGTCTGGAGGATGGTTGCCCTC 
TTCTCACAGCTCCAGCGGGGATTCTGTGCGGGGCCTCCTTCCCCACATTTTCCTTCCTC^ 
AGCAGAAGTTCTCTATAAGTGCCTTGCCTCTACAGCAAACnTCTGCCTGGGCATCC^ 
ATACATTTTCTGAAATTTAGGCAAAGGTTTCCAAACCCCAATTCTTGACT^ 
CTCAATGCCATGTGGAAGCrGCAAGGCTTGTGGCTTGCACCCTCTCAAGCCATGGCCrGA^ 
CATTGNTCCTTTAAGCCACAGCCAGAGCANCTGGGATACANGGCACCANGTCCCTAGCTGCA^ 
CACGGGGACCCNGGGCCTGNTCCACAGAGCCATTTTTTCTAGCTTCTGGGCCTGTGATG^ 
GG 

SEQ ID NO: 1 632 TACGCGGGGACGTTGCCAGCCGAGGTTTGGACATACCTCATGTAGATGTGGT 
TGTCAACTTTGACATTCCTACCCATTCCAAGGATTACATCCATCGAGTAGGTCGAACAGCTAGAGC 
TGGGCGCTCCGGAAAGGCTATTACTTTTGTCACACAGTATGATGTGGAACTCTTC 
ACACTTAATTGGGAAGAAACTACCAGGTTTTCCAACACAGGATGATGAGGTT^^^ 
AACGCGTCGCTGAACCCAAAGGTTTGCCCGAATGGAGTTAAGGGAGCATGGAGAAAAGAAGAAA 
CGCTCGCGAGAGGATGCTGGAGATAATGATGACACAGAGGGTGCTATTGGTNCAGGAACAAGGT 
GGCTGGAGGAAAAATGAAGAAGCGGAAAGGCCGTAATCACTTTTATGAAGCTCGATTCTGCT 
TGTAAAAA 

SEQ ID NO: 1 633 ACGCGGGGGGTTCCGGCGTGGCCATTTTCGTTGGTGGTGTTCAGTTGTGGCGG 
TTGCTGGTCAGTAACAGCCAANATGCTGCGGAATCTGCTGGCTC^CGTCANATTGGGCAGAGGA 
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CGATANGCACTGNTTCCCGCAGGCATTTTAAAAATAAAGTTCCGGANAAGCAAANACT^ 

AGGATGATGAAATTCCCTGTATCTAANAGGGTGGGGTAGCTGOTGCCCnNCTGANANAT^^ 

TGATTCTTACAGTTGGNGGACAGTNTATGCCATmTGANCTGGCTNCNGGNT^^ 

A 

SEQ ID NO: 1634 ACATGGAGAGGAGTATGGTGAGCTATTTCCTTTTTAAAGGATGAGACCTTCAT 
AAATTGGCCCCTCGGATTCTGGTGATTCCCGCCGCAAGCGCAAATGCTCCAGTGTGTTATGAAAAT 
GTTTGTTAATCTGCTCTGATTCTTCACTGGATTCAAGATTCGGGAGGTCTTCTCGAATC^^ 
AAGCTGGTTTAAAACCTGAATTGTTACCGCATCATTTTCCIT^ 

AGAATTTCTATAAAAAGCTGCACTTGTAGAGAGGGGTCCATGCACTGATTTGCTATT^ 
TTTTTTTAGGCACTCCATTACCCTCrTGCTTCGTGAAGCT^ 

AATAAAATrATTCACTTGGTAAGTGTTCAAGCTITCTGATCACCCCAAGTAGCATGACT^ 

SEQ ID NO: 1 635 ACAGAAGTnTCATCTATGAACATGGCCTCATCATCACCTGCAGCCTTGGCCT 
TGGCCTGTTCITCAAAAAGCTGCCGCTGCCGCATGGGATCATrCAGCTCAGTATACGCATTGC^^ 
TCTCTTTCTTCATGACAAACAGCTCAAAGCGCTCAGTCAGACCCTCTTT^^ 
CCAAAGGGCTCATTATCTGTGGGTGATCACAGATGAATGTAGGATTGATGCAAGTCACrrCCAGG 
AACTCCCCAACAAGCrrGTCAAGGAGCCTGGCTGTGGTCCGAGGTGGAGGGCATTCAACAGCr^ 
TGCCCACANATATCATCAAGAATTTTGCGAGTNTCTTCAGTTTCAAAGAGGNCGNm 
CATCCCCAGGGCTTTCTCAAGCTOTCTACCATGTTGATTCCCNGAAGGGTGGGGAGA^^ 
TNGA 

SEQ ID NO: 1 636 ACmATTAGTTTAAAAAGAAATTGAGGTTGTTCAAAGTTTAA(^ 

CTCTCTGAACACACATTGCTATTCCCATCCCACCCCCAATGCACAGGGCTGCAACACCACGACTO 

TGCCCATTCTCTCCAGTGTGTGTAACAGGGTCACAAGAATTCGACAGCCGGATGCTCCAAGAGGG 

TGGCCAAGGCTATAGCCCCTCCTTCAATATTGACCTTCTCTGGGTTTAATCCAAGTTCm 

TGCAGCAGAGACAGCTGCAAAGGCTTCATTGATTTCAAATATGTCAACATCTTCCAGTGACCAACC 

TGCTTITGAACAGCriTGCTTTATGGCTGGAATTGGTCCTATTCCCATAATGG;^ 

ACTTGGGACCAGGAAACTATCCGTGCTAAAGGTGTAAGCCCACGTTATCACrrCTGCTTOT 

A 

SEQ ID NO: 1637 ACCCACCAAATCCATGGAGAGAAATTTCTGGTGAAGCAATTGATCTGATAAA 
CAATCTGCTTCAAGTGAAGATGAGAAAACGTTACAGTGTTGACAAATCTCTTAGTCATCCCTGGCT 
ACAGGACTATCAGACTAGGCTTGACCTTAGAGAATTTGAAACTCGCATTGGAGAACGTTACAT^^ 
CACATGAAAGTGATGATGCTCGCTGGGAAATACATGCATACACACATAACCITGTATACCCAA^ 
CACTTCATTATGGCTCCTAATCCAGATGATATGGAAGAAGATCCTTAATCACTGAGCTAACCTA^ 
TAAGGAAGGATTTCATTTTATGGACTGATATTTTGCTGGTAACTTG 
AGTGCTGCAAAGATATGAAGAAATATGATACGAATAAGTGACACCAGTACC 

SEQ ID NO: 1 638 ACATGACCTAATTTTTACATCATAGTAAAACAGGCCCTATGGAGAGAGGACA 
TGGGTTTCTCTGCTNGAACAGCCATTATTTATACTCGTTCCAAGGCTTNTA^ 
NCCTCGTATTACCACCATTCCAATATrGATCTGNTGTaWCTAGACGCCATCTNC^^ 
ATCACANGGTTCATAAAGGGATCANATTCCNGCA^^^ATT^mTTGGACATGTCTGCC^ 
TTCAATGATNACTTCTTGTCCATNAATTTTGTCAACNTTGGGAGGGTGAGCTTO 
ACTNCTGGGGCT 

SEQ ID NO: 1 639 ACGCGGGAATGAAGGACTTGGCAGATGAACTTGCTCTTGTTGATGTCATCGA 
AGACAAATTGAAGGGAGAGATGATGGATCTCCAACATGGCAGCCTTTTCCTTAGAACACCAAAGA 
TTGTCTCTGGCAAAGACTATAATGTAACTGCAAACTCCAAGCTGGTCATTATCACGGCTGGGGCAC 
GTCAGCAAGAGGGAGAAAGCCGTCTTAATTTGGCCAGCGTAACGTGAACATCTTTAAATTCATCA 
TTCCTAATGTTGTAAAATACAGCCCGAACTGCAAGTTGCrrArrGTTTCAAATC^ 
GACCTACGTGGCTTGGAAGATAAGTGGTTTTCCCAAAAACCGTGTTATTGGAAGTGGTTGCTATCT 
GGATTCAGCCCGATTCCGTACCTGATGGGGGAAAGGCTGGGAGTCACCCATTAAGCTGCATGGGT 
GGGCCG 

SEQ ID NO: 1640 CGCGGGGACCGAGGCCCATGCGAAGCTTTCCACTATGGCTTCCAGCACTGTC 
CCGGTGAGCGCTGCTGGCTCGGCTAATGAAACTCCCGAAATACCGGACAACGTGGGAGATTGGCT 
TCGGGGCGTCTACCGCTTTGCCACTGATAGGAATGACTTCCGGAGGAACTTGATACTAAATT^ 
ACTCTTTGCTGCGGGAGTrrGGCTGGCCAGGAACTTGAGTGACATTGACCTCATGGNACCTCAGCC 
AGGGGTGTAGCCAAATA>rrTCTAATGCCACCTGTCCNCTTATCATCTGATTGCAGACAN 
NCTGTGCTGAACCCGATCTTNTCAANAACANCTACATCTGTGACCANCACANGATGTNCCCTGTGG 
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cnnctgaatttgtntatntggcacttttctcccttccagntagta 

seq id no: 1 641 acaacttatagaaaaggtaaaggaaaccccaacatgcatgcactgccttggt 
gaccagggaagtcaccccaccgctatggggaaattagcccgaggcttagcmcattatc^ 
tcccagggtgtgcttgtcaaagagatattccgccnagccanattcgggcgctccx;atcttgcgc^ 
gttggtcacgtggtcncccaarrctttgatggctttcacctgctcattca 
a agtcaca caaatgggggtcatttttgtcagtgnccantttgtgcagtccagtag 
catttttitncaaatgtaatgcncactcnattgcattcagcccgctc^ 
gt>™tngatntcctgaangaagntnggccactngtgtttgan(^ 

seq id no; 1 642 acttttttttttttttt^^ 

gaaggatttgngaaactotcacatcatggngagagtttgtatgattaataanaagc^^ 
atgaaatgcttggaggtgaacganttctcagcctgtganatccgaccatcccattaacm 

TTCTCTTGATTAATANAAGAAAAAAGGGGAGGGTGAAAAAAAGGAGGAACATGCTAAA^ 
ATGACAATCATCCAAATGTGAGGAAAGAACAACCGATTCACCAACTCCACTTTTTC^^ 
CTTTCTACATCTCACTCTTGATTTTGGCITCCTGGCTGA^ 
AAAGAGCCCTGGTTCTCAAAAGACAGAGGAGGAGAAGCCCTGCANGATGC 

SEQ ID NO: 1 643 ACTGTAGGAGAGAATTAAATAAAATAAAATAGCTGTAGATAATTAAAAGCTA 
ATTAGATAAATCAAGTTACAGTATCATCCTTCAGATTAAAGTGCrCTGATATAACCAATGCCACGG 
CAAACGAAATCCTGGAAAGAGATTGCACTGCCAATGATTTTGTCTOT 
ACACACATAAAAGAGATTCCTGATTTAATAACTGCTATCATCTTCACCCCCA^ 
CTGTGATATTCTTCCCCTAGGAGAGATGTCAAGATTAGATGAGATTCTATCATAGCCAGGGAAAA 
AAATGAGAAATACCAGAAACATTCTAATTCCAGTCrrAAGTGAGTAACATCAAAATTCCTGAA^ 
ATCTCAGCTGGTGCTTTAGCTAAGAAGGAAGCAAGTGCCCTAGTAAAAAAAAAAAAAGCT^ 
TCTAGTTTAA 

SEQ ID NO: 1 644 ACTGGGATTACAGGCGTGAACCACCGTGCCCGGCCTTCCCCAGATATCTTCA 
AAGCAACTGC TAGTC CTGCTTTTGCACATTACACTCTACATTCTCArrGCCCCACTTAACT^ 
CCCTCATGTCCTTTTCTAAAGATGTAGAAGACmGAAATAGCATAACAm 
CACCTGGCTTGAAAGTGACTAATAATAGGAAAATCAGACCTCAGGACTGAGTTCAATTTCAAGAT 
TCTCCCACTCCACirrGCTAAGAGGGAAACAGAAAAACATTACTCCAGCCACACA^ 
TGTGCAAGTATCCCACTCTTACTCATTCAAATCCTACCCATTTTTTCCCATCGTGGAAC^^ 
GCAAOTATACCTCTCTCCAATAACCCATAAAGATGGTGATTTGAATTTGGCCAAGA 
TTATGAAA 

SEQ ID NO: 1645 ACTTTTTGCTrATGAAAAGAAACCAGTGAGTCCTCTAAGGATAG 

AGTATTTTACAGTTAAGCACATGATTTGGAGTGAAGAATTCCGATGTTGCTCTGCTGGTCCAGACT 

GTTAATACTCTTCITGCTCCTCCTGTGGGCCCCCTTCATCAGGTATCACAAAGCOT 

ATACAGAATGTCTACAATCCTCTGCAATACAGGGTCNGTTTTCCCTCTCGTTCTCCTGGCAAArc^ 

ATTCAATGTTCCGTNCTTTCCNAAGTAGAAATCCCTCTCTTTCTCCAAGTCTT 

AATACGTNGACCTGCTTGCATCACTCNGCTGCTCGTNACTCCNATGCCAACCAAGGTAC^^ 

CACCCATGGCCAGCCTATNAGCCG 

SEQ ID NO: 1646 ACACCTTGACACCACGTCGTCATCAAATTTACACCATTTGCCATCCCCTTTGG 
GGTTTAGATAAACCACATAATGTCCACCATGATTATCTCCACTATGAACCAGGACTGCATGAAGA 
ATATAATTTGCAGGGTCCTTAGGATCTGTTTTTTGCAAAAATTCATCAAGTGGTAA 
AATTCAAACCTATCATTGATCTTGATATTITGGTCCGTCTGAGGGTCATACATAAATC^ 
GTAGATGTAACACTGGTGGCAATGTTAGGAATTTCACACCTTTCTCTGCTTCCTGT^ 
CCCANCGTCGTATTTATTGCCCCATCGAGCTGTTCTACTGCACATAATCCACAAATGAT^ 
TATTTTTCTTTCCTTTGATACITAGCTGGATATCATAATAATC^ 
AN 

SEQ ID NO: 1647 ACCTGAATGTAGAAACAATGAGGATGGACCTGGTTTAATAATGGAAGAACAG 
CACAAGTGTTCTTCGAAGAGCCTTGAACATAAAACACAGACACCTCCTGTGGAGGAGAATGTAAC 
TCAGAAAATTAGTGACCTGGAAATTTGTGCTGATGAGTTTCCTGGATCCTCAGCCACCTACCGA^ 
ACTGGAGGTTGGCTGTGGTGTGGGAAACACAGTCTTTCCAATTTTACAAACGAACAATG 
GACTCTTTGTTTATTGCTGTGATTTTTCTTCCACAGCTATAGAACTGGTC 
TGATCCTTCTCGGTGTTTTGCCmGTTCACGACCTGTGTGATGAAGAGAAGAGT^^ 
CAAGGGCAGTCTTGATATTATCATTCTCATATTNGTCTTCAGCAATTGTCCAGACAAGATGCAGAN 
GGTT 
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SEQ ID NO: 1 648 ACTGCATTTGTTTTTATATCTTTAAmCTGGCAAGGGTAATGGG^^ 
TGAGCCCTTCTAAAAAGGTGTCCATGTCCTGAATCTGGTCCCTGTGAATGTGACGTTAm 
TAGGGTTmGCAGAGGAAATGAAGGATCTTGAGTTAAGACCACCGCGATCAGGGCGGGCCCTAA 
ATCCAATGACTOCTGTCCTTTCAAGAGAAAGGAGAGGGGGATTTGACACAGACACGGTGAANAAG 
GTGCTGTGGGGAGAGGCANGGACTGGAGTGATTCACTCATGGGGCGAGGAACGCCAAGTGCCGG 
AGCCACCGGAGCTGGGAGAGAGGCATGAGGCAGCTTCTCCCTCTAAGCCTCCAGGAGGAGCAGN 
CCTGCTGGCACCTTGATITANACTGCCTGCAGAACTGTGGGAGAGTAAATATCTCT^ 
CTG 

SEQ ID NO: 1649 ACTTGAACTGAGTTACTGTGAAGAAAGCATGTCATCCTTGACAATGGATTAA 
ATGAAAGTGAGCTAATGCATGCCATTCTCAATCCTTGCAGCTACACCAGTTCCCAGGCA CTGTCCT 
TACAAGTGATCCATCCATATTCCATACAANATCCAACACTTGCrCCAATGATATATCCTNGTT^ 
AACTTCAGAAATTCAACATACATTTACATTTNGArrATCCCTGAACAGCCTGTO^ 
CACTTTCCCTAATATACTCTAATACAAAATCCrrATTAAAGAAACAGANCAGAAGTCAATO^ 
AGACTGAAATANATCTGAAGCAATGTCAATGAAGATGCTCTCTCCTTGGCTGTTGTOT 
TTTTTCTGACA 

SEQ ID NO: 1 650 ACACTTGACTGTTTCCAAAGGGAGAGAGGTGATGTAGTCTTCATTTCAGGGG 
AGAAGAATGGACTGAATTACTGTCTGCTTmAATGTTTCAACTAACC^^ 
GGCCGGTGCCTTTGGTTGCTGACGTTTTGAATATCTGCCATTTTCGGTCCTTCAAG^ 
AAGTGAATTTGCCATCTCTGAGGAAGTCATGGCCTGTTCCATGTCCTGTTTATTTGCA^ 
TAAAATGGCTTTTCTCAGCTCTTCTTCCTCCAACATGGCAACTAACTCTGAT^ 
CGGTCTCGGTCACAACTGTCTACTACATAAATGACTGCATCTGTGTTTGAATAGTAACATCT 
ATGGCCTGATACTTGTCTGCCTCCTAAATCCCAGACTTGGAATTTAAG 

SEQ ID NO: 1 65 1 ACAAGGAGACCAGCCTACACAGTCCATCAAATACATCTGCCCCTCATAGCCA 
AGGAGGTATTCCACCTCCTACCGGAATATAATTAAAGGGAGAAATACACTGTATGAAGTATATGT 
TGATACTATGACATGTNGCCAACACCrrGAGAAGCATTATTTGTTTTNTAATA^ 
NTGTTAATATATTGGTGGNTATAA 

SEQ ID NO: 1 652 ACGCGGGGGCNGTCTTGTTCTTGCCTGGTGTCGGTGGTTAGTTTCTGCGACTT 
GTGTTGGGACTGCTGATAGGAAGATGTCTTCAGGAAATGCTAAAATTGGGCACCCTGCCCCCAAC 
TTCAAAGCCACAGCTGTTATGCCAGATGGTCAGTTTAAAGATATCAGCCTGTCTGACTACAA AGGA 
AAATATGTTGTGTTCTTCTTTrACCCTCTTGAOTCACCTTTGT^^ 

CAGTGATAGGGCAAAAGAATTTAAGAAACTCAACTGCCAAGTGATTGNAGCTTCTGTGGATTCT^ 
AOTCTGTCATCTAGCATGGGTCAATACACCTAAGAAACAAGGAGGACTGGGACCCATGAACAT^ 
CCTTTGGATCAGACCCAAANCGCACCATTGNTCANGNTAATGGGGCTAAANGCTGAT 

SEQ ID NO: 1 653 acatttttgitacagacagaaggctgattttggaaagaaagaaac^ 

TGTATAGGGGCTTTCTATCAGCAGACTAGTATGTTTAAAAATAGTCTCATCAAGGGTTCTGAA^ 

GAAATATAAATGTTGCCAGGCAGTCCCAAACTCACATTTGATATTAACTGCAGACTCATTTAA^ 

TGAAAACTGCTCCAGCCTCTCTCAATCmACTAAGGACTGGGAGATTATCAAACCCTT^ 

TAATAATGCTTTTGAATTAAGATTArrGAAAAAGGAATCTCTGTTACAGTGCAAAGi^ 

AGAAATTCATGCTACAAGGAGTTAATACAGATGTCCTTCATTAGCAGATCTATCTTGCTCTAA^ 

NANAATATCAAACNTTGAAATAATNn^GGTCAAAAATATGGNAACANATTCTACGGGCCCT^^ 

AAGCAATCTCAAGTTGThTITGANCrCTCCAGNGAGGANTTNTTTTGAAGAOT 

TTNTATCCTTACTTTGGGGNGAGOTCmGGATANTCCT^^ 

GCGC 

SEQ ID NO: 1654 ACAGCAGTTGCTCAAGAACAAAGTTATTCTCCTTTTCATCTCGAATACTCAGA 
ATATGTGATmGGATTCAGmCTGGCATTTAGTATGAACTTCATCCTGTG^^ 
TATTCTTTGTTATTATGAAATTGTAGCAACAGTTCTGAAATGGTATCCACGGAGTAm 
GAGATGGACATTTAACACTGTCAACTGGTTTGACCTCTITrTCAGTCTCAm 
AATAGCACCTGTTGATTGNCATTGCAATCAACTGTTTTCCAGAATCATCAGNGNCTA^ 
ANNTTCGAGTTTGCCATTAGTTCAGCCCAGCGACTAAAArrGAAGACGTTCCATNTGACAA 
AGTTGANTCATATTCrrGCTGNAGAGTCCGACCTAAGGAAAGTTGNNAGNGCCCTGCCACTGAGA 
ATGCTGNNGGAAGGGC(>JGGNGC>ICACAACTGCATNTCTTTANANCTCr^ 

SEQ ID NO: 1 655 ACCAGTGTGGAGAAATGGGCTGGTTAACTGTGTGGGCCCAGACAGTCATTTG 
TCTATATTCCTAGTGATGAGAAAtTATTCCCCCTAACCnTCCAATGAAACAAGTGTCCACA 
CTCATTTTrCCCTATTGrrGAAACATTAGTGAGAGGGACCCATGCATGTTATTAAAAAT^ 
AAAACACTTAAAATAATTCTGTATTAGAAACTrGACTCTATCATAAGTCTCTT 
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ACTGGGCTGGTTAAATGCTCCAATTCATTGTrATTAACATTTAGGAAGCAGAAACT^ 

GTTCAAACTAGAGGTTANITACTACAACACTGATTCCTGGTGGTTGTGTGGNCT^ 

TCCTOTTNCTCTGCAGATTTGCTCTGGATTTGTGGTCATTCm 

TTTCATTACTTCATTGATNTTTAGGNATGCNCATGATAANAACTATTGGNT^^ 

CCTGAAl^ACGNTNTTNTTTCNNGNCGCCTOh^ 

TTTACCAATTTGCNTTGCAAGACCTTATTGNATA 

SEQ IDNO: 1656 acatagacaagtttcttgtaagacagaaaacagagaaatccacagtaactct 
aacacatcccttaaggaataagcatgtatttgtaggaagcaaacaaagctttcc^^ 
actttcacaggatgattaggtggacctgcaatgaagaaaatacatttcaaaagatgggttcagac 
ttacaccaagttttcactgaaatacttaaaaaaaaaaaagaccc^ 
aaaaaatacatcacggataaaataaatctcaggaaaggtccaagtcctactcagagacatacatt 
tgcaaattaatataaattttaaagtttgacacaaaaatactam 
ctaagttcgggtctgctttagtgcattcattgaaataatctcattct^^ 
cttatcccacggntggtcactgattggcatagacttotaatgtcaagaat^^ 
ccaaaatcttaaantaaacaggccaa 

SEQ ID NO: 1657 accagtagtttttatcggtaataagaaaaaggatggcttaatagtgccaata 
acacaactgtctgcccattcattaggtaactgaatgtaggctctctgcccacatatccagtatagt 
ccagcgagagccatccagtcctgatgagatcctggacaagcccccattatatcaggagctggatc 
agtgggcaccaactctcaggcttcccaaggccatcggtctctgatagtggttccccc gcata tata 
acaagaagtaacattaagtaagaaaattacattttctgctaatttggan^^ 
ttcagaagttctgctgctggcagattcagctccmataaaaggtttgj^ 
cacttgtggacctccctctaataaaatggcacitgagggttaaccctgcctato 
cacgtttnctttttccacggtgacctaggggatggtaatattantctagtot 
acagangggtgcrrcccctntnananaaacgggcttttgtnttt^ 
ncattttacaacctgctngaaaatttttt 

seq id no: 1658 acaaaaaaaatcatttcaaataactcaggaggatgataatggctggacrm 
gtaattcacctcaaagactgtgggagagccaactcaactcactgtatagtctgtgcatatggt^ 
ttgtagcatgtaggttttttccaaaagaaggaaatataaaatgm 
acagggtgcctataaaaggtggcttacrccitattgttattatactatccaa^^ 
ttaaaaaataacactgagtcttgttattacaaggcagcaaatgtttctcctcat^ 
gactgcaatgctttcctgaacatttagaaaagaggcagtaagagtcctcggccgcgacccgct^ 

seq id no: 1659 actgggataaatgaagaagaaggcataaggacaataaacatggaactccac 
tgcaaatggattttatgcagctgaggaaagtttgggcttattagtatrrc^ 
agttttctccattgcggacaacgtaactaccagctccttggctcagtggttcgcctccact^ 
gttcccagtaggttctgtcattattgttggcacataggccctgaatacaggtgatatagggccccc 

ATGAGCGCTCCTCCATTGTGAAACCAAATATAGTATCATTCATTTTCTGGCm 

aggaagacagaaccattancacagtgacattggtgaaatatgtttcattgattctcacagagtaa 

ttgacggagatatatgattgtgagtcaggaggtgtcacagtatagctcatcagcggaatgttgag 

ttcctgaacagaacgcaagaagagcntgtaatatcaanagnctttccatcaggcagtaaacctg^ 

ctgcacgttggattntgatgctnctgaaaattncggggcctgttnta;^ 

rmatam^gnnanaaaaggctgnggtttctngtaangtgtncaagaaccm^ 

seq id no: 1660 acaacttatagaaaaggtaaaggaaaccccaacatgcatgcactgccttggt 

GACCAGGGAAGTCACCCCACGGCTATGGGGAAATTAGCCCGAGGCTTAGCrn'CATTATCACT^ 

TCCCAGGGTGTGCTTGTCAAANAGATATTCCGCCAAGCCAGATTCGGGCGCTCCCATCTTGCGCAA 

GTTGGTCACGTGGTCACCCAATTCTTTGATGGCTTTCACCTGCTCATTCAGGTAA^ 

AAAGTCACCCAAATGGGGGNCATTmGGCAANTGGCCAGTTTGTGCAGTCCCANTAGTACT^ 

CANATTTTTTTTTCAAATNGAATGCCACTCCAm 

NTTCTTGATNTNCTGAAGAAAATTCNGCCCCCITGGTGGT^ 

SEQ ID NO: 1661 A CTr iTr rnTi Tr i " i i" ] 'i 1 1 1 1 1 1 1 1 1 1 i i i 1 1 attnggaaaanactgctttatt 

GGNGGCAGTTACTACAAGTCAATAAATATTGATCCCCAAANAANAGCTCGGTTATTTATC 
AO^GGCCCATANAGAGCTGAATGAAACTGMCCAAGCGATTGCTGGGATGACTGAAGTCACTCCT 

gcttcttgggaaatgaanccacagccagctcatatntggcatatnctggtccnccaac^ 
aacnatggnggctctatacaggngggcttaac otcccccn ccot 

NAATCCTCaSfGGAACANTTrrGNTTrrCCGGANCTTT^^ 
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SEQ ID NO: 1 662 AACTCTCCATCTTAAATGAGCAGCGCATrCGGGGCATTTTATGCGATGTCACT 
ATCATTGTGGAAGATACCAAATTTAAAGCCCATAGCAATGTTCTGGCAGCITCAAGC^ 
AAAAATATCTTTTGGAGCCATACAATCTGTATTTCCAGCCACGTCCTGGAGCTGGACGATCT^ 
GCTGAAGTGTTTACTGAAATACrrAATTATATCTACAGTTCCACAGTCGTTGT^ 
IWNGTbrrCTGACTCNNAGCTGCAGNAAAAAGCTGGAATATCGTTC^^ 
GCTACTTNTCATATTTCCCCGNTCCCTNTGTOTCrGGNTTACTGAAAAG 
ANA>rrGAAATATGCATGNATAACC>mCCATACNNANTGGCCGGNGATCAC>m 
TCGTAACAGAA 

SEQ ID NO: 1 663 acatotgcctagatgtcgatgactgcaagtaataatacagtttataatgaa 

ACTATCTACAATTCTTGTTTTAGCACATCTGTTATCCGTAAAACACCTGTAACTAG 

TATTATTrGAATTTTAGGATAGCGAATCACTAATTTTTAGTTGCTGAG^^ 

TTAAGCACTTCTGTCAGTCTTTGAAAAAAGAACGTATTTTTTGTC 

TCTTTTATAATAGAATGGGCATGTATTGTAACAGTTTTATGTCAAATGATCT^ 

CATTAACCCTTGTCAAAAAAGAAATGGATAAACTTGCCrrrCTAAGTGGT^ 

ATAATATACTGATGTTTACATTTATTTAAATTAATCTCTTATGTTAGGGNGA 

CAGTGATGCGATGTTCTAGAACTCTTAAANGCCCATTGGCAGACCTTGNCGGACCCCTAGGCAATT 

CNCCCTGCGGCGTNTANGGTCCACCGTNCANTGNGTATANGGCAACTGTTCTGGGAATGNTCCTO 

ATTCCNNANNCAGNGACTTANTGAAGCTGGGGGCATAGGG 

SEQ ID NO: 1 664 ACACGGCCCCTTTCACITCCITITCAGCCACAGTCTGT^^ 

ACTGAAAGACCACAATGCGACCCTGCTTGGGCTCTGCCTCrrCAGGATACACCATTGCTGTGCCCA 

CAATGAAGTAAGTGTTGGGGTCmGCCCAGCTTGCAGGAAACCAGACTGAGGGCATATTCATTC^ 

GCAGAAACTGGTGGGCATGAAGCACITCAAAGGTGTGTTGGTCAATGATAAAGTAAGTTGTGCAC 

CTCCACCmTTCTTCAAAGGAGGGCITATGAAGAACAGTGCTGGTGGANAACAGCTTGCT^ 

TTACACTGNTGGACAAAANCTGGGTGCTANCCTTGGCCTTAAAGGTTGTCGGCCCCACTCG 

GGACnrCAATGCGGTGGAGAGGACCCGAACATNGGACCTTCTGGTACANATCm'CTTGGAACTA^ 

ANAGGGACTGNGCAATGNCACTTTGGATCTATNATGGGCCANNGNGAGGNNTNT^ 

GCTGCNGNACCTTTATT 

SEQ ID NO: 1665 ACTACAGCAGTCAAAGAGATCTCCACTAGAGATCAGAAAGAAGCACCACTAT 
TTTCTTCTATGACGTGTATGTGTTGGTCATGAGCATGCTAGTATGAATAAGGCAATGTGTT 
CTGGCATACAAATGCAGCTAAAGGTGCTGAAGGAAGGCAGTGGGGTGGTGCAGGCACACAGCAG 
GGAGCTCTTCCCCGTGACACAGTTAGTCATCrrCTCCACAGAGCANCAGAAGAGCmC^ 
CCCCAGATGTATTCTCCCTTATCATGGAATAAAAGAGAGGNGCAAAATTCTTCCTAACTCC^ 
GATGTTAAACAGATAAATCTCACTCCTTGGAACCATGACTCTGANAGGGTTGATCATN^ 
GGCGGCCGTCNAAGGCT 

SEQ ID NO: 1 666 ACAACTTATAGAAAAGGTAAAGGAAACCCCAACATGCATGCACTGCCTTGGT 

gaccagggaagtcaccccacggctatggggaaattagcccgaggcttagctttcattatcactgtc 

tcccagggtgtgcttgtcaaagagatattccgccaagccagattcggacgctcccatcttgcgcaa 

gttggtcacgtggtcacccaattcrrrgatggctttcacctgctcattcaggtaatgtgtct^ 

aaagtcacacaaatgggggccatrmgtcagtggccagtttgtgcagttccagtagtgac^^ 

cacattttttncaaatgtaatgcacactccattgcattcagcccgt^ 

cttgatatctgaaggaagattcggcactcgttggtttgcactncatcagttctnaaccccg 

tgccgggcggccnttcnaaggcn 

SEQ ID NO: 1667 Acrr n Trrri' i 'i' r n'i'i'i"! i i i i i n i i i t i cagcnaagtttcatttatttgngc 

AAATACAGGCATGANCAAAAATGTTCTAAACAANGTAACGATTTCCAGCATTGA TTAC ANAAm 

CCTCTGATCATTNGATTNGGTTATANATGAATTNAAACTTCAATTTi^ 

CCCCTNTGNTTCCTGATGAACCACCATAATrCCTAAAATTACACCTAANCAAGTC^ 

TTGGGGTTGCCTTAAAAACAmAAAATCTAmGGGCAAGGCGGTGGAACGAGGTTGGGATG 

ACATGATTTATGCTAATTCTGTTNGACCCTGAACNAAACATTGCATTTCAT^ 

TGGGANNATNTOTCTGAACATITrGGCCNCCTTATTGGAAAAGTTTC^^ 

GNTAAACTGGGACTNAAAATrAGGGGANAANNGNCCTTNTTTGNTO 

CCCNCNTAGAA>WCNNANAGGTTrANCCGGTTGTTTTTCCAC^^ 

TAAATTTT 

SEQ ID NO: 1 668 ACAGAGCACATAGACCAAGGATGGCCCAGTAGTAAGGCATGCTGTGCTCTCC 
AGTGGGGTCCTGAGGCAGTTCAAAATAAGGCCTCCTGGAAGAACGGCCCCTTCITCAAGGAGCTC 
GCCTCTCAGCACGCATGGGGTGTTCTGCGGAGGGAACTGCCCTTGGCTTTCTCCTGCAGGCT^ 
GGCTGTTATGGACCCGCATGGAGTTTAATTATGCTTAGCATATATTTTTGGCATACT^ 
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CCCCTAGGATTTGCCAAAAGAGGGGANGGATCAAANTTTGGCATGCTGGGGGAACAACTCNCCCC 

ACTGATTNNAGAAAAAAACAAACANATNGAAGTAAANACNANATAAT^^ 

ANACCAGATAATTTGNAACCCTTTNTTGGTTGTTAGATAAA 

SEQ ID NO: 1 669 ACGCGGGGGTCTCTGGTTTCTGGCCCCTTGTCTGCAGAGATGGCTCCCAATGC 
TTCCTGCCrCTGTGTGCATGTCCGTTCCGAGGAATGGGATTTAATGACCTTTGA 
GACAGCGTGAAAAAAATCAAAGAACATGTCCGGTCTAAGACCAAGGTTCCTGTGCAGGACCAGGT 
TCTTTTGCTGGGCTCCAAGATCTTAAAGCCACGGAGAAGCCTCTCATCT^ 
GAAGACCATCCACCTTACCCTGAAAGTGGTGAAGCCCAGTGATGAGGAGCTGCCTTGTTTOT 
GAGTCAGGTGATGANGCAAAGAAGCCCTCCTTCAGGTGCGAAGNCCAGCTNATGGCACAAGTGA 
AAGCAATGATCGAGACTAAACGGTATAATCCTGAGACCAAATTGTGACTTGCAATGGAAGANCTG 
GAAAATGGGAAATATGCAAATACGNTTAAAAGGCNATTACTTTCTGATTTNTGATGG^ 
ACCTGGGATGGNGTGGANGGTNAAACTTTTNTTAATTm^ 

SEQ ID NO: 1670 ACCCTTGCCAAGTTGTCTACAAATGCTTTGTCGATTTTTCT^^ 
AAACTTGTGATTTTTITCCCTTTCTTAACACAAATTTAAA 

AAATGACATATTTTCAATATGAGTTCTGTGGCAGTCTCACATGGAAGTCAGGAGTAACAGCCT^ 

TTCCTAAATAGCTGTTTACCACTGCITITCTGAGCATATTrAAGT^ 

AACAAAATATGAAGTTTATAAAAGATTTCAAACACITITCT 

GGTATTTCATTCCrmCTCAAACAATATGGAGATTTATAAATATGAGAAT^^ 

CCTTAATAAATGAAGNGTTCTTAAAAATmATATGGAAACCAGTAGCT^ 

GGATTATCCAGATCAGGACACTCCTGNGCANTTCCAAAATOAGGCTACAGGACCCAAATTGGAAA 

TAGGATATITAGCAGAGAAACTCATCNTGTGNACATAATNCAGTGATTCTCAATTCTAAAAACAA 

GAGACTITGACTTGCACCAGGNTTGCAAGCATCAACTTANGTG 

SEQ ID NO: 1 67 1 ACCAAGAACCGCmATCCAGATTAATATAAGTGAAAGCCTTTAAATGCAGG 
GACGAAAGGTATCCCTCTAGCCATTCAGTGGCACCAACCGATCCAAAGTCTCCAGCACTCCAACT 
GGCAAAGATAATGCTTCTGCTGGGCTGAAACCCATCTTTTAAGACCATATCTGAGA^ 
AAGTTTCAATAGGAGAGCTGTGCCTACACCGGATTTTGCAGCTCCAGGGCCCCATGCATCTCT 
GGCCCCAACTACAACATAGTGATCTGGTCTACAAAGCCTTTAATAACTCCAAAGATGTT 
TTTATCTCTTTCAGCACATTGTCACAGNGAGCTTCACATTCTTGm 
GTAGAGCTGTTTNCAGCAGAGGACAGCrCCrTCATATrCCCAACAGNTTTCTGCAGC^ 
ATGNCTGACAGTATATTAGCATCTGATGACGANAGGTGGAACTGATGTGATGAAGAAGGATCAGG 
TNTTANGGCACTGNCCAATAGCTGTCAANAAGAAGTTTGGNTACATGGGAATTANNTGNCCNGTT 
NTACCCCATGTTTAACTTNACTTTGACTTTraGAAG^ 

SEQ ID NO: 1 672 ACCCCCAAATAGAAAGAAGTGACTGGATGTTGGAAAACTTAAAAAATGATAC 
ATTTCCATTACATTGGTATTCTATTGTATAAACATGCCATAATTTATGTATTAATAACTAGCCA^ 
CTTGGTGAATATTCAGATTGTCATATTCAATITmACTAGCAAAAACCACT^ 
TATTTATGTCTTTGTGCATrmGTAAGTATATCTGTGAGATTAATTT 

CAGAGAGTATGTATATTTAACATTTTGATGTAGCGCCGGGCATGGTGGCTCATGCCTGTAAT TCA^ 

AACTGAGAGGCTGAGGTGGGCGGATCACTTGACTCATGATTCCAGACCAGCCTGGGCAACATTTT 

TTCATGGGCAAAACCCGGGGAAGAAAATTTGAAAAATTCCAAAAAAAAAAAAAAA^ 

TGGGCGAACCACCTAAGGCGATTCAC^mNCTGGCGCGT^^TOGNGNTCGGCTGGACCA^ 

ATATGGCTGTGTTCTGGGGAATNTTTOvfGTCNATTCCACAATTCGCCGGAA 

SEQ ID NO: 1 673 CCAGTTCTTTTACAATACAATATTTTGGTATTTAAATGACCTCTA^ 
ATGTGCAATGAATATCrmGTTCTGAAACTCAACCNTGAATTACGATOCCATC 
CTAAANTCATAAAGATGTTGGGAAGGTTTGAGTCTGCATTGNGAANNGGNTAGTACCACANACTC 
TTTNCTTTANAACCCAGGATGACCAATCCACTGAAAGTGCrmGAAGATTGCT^ 
CTGATCTTAAAATCNGThnSIATCANAOTCAGTONCTTT^^ 
AAAATCTNAAACNrraGCTTTANTATCTACTGAGGCANACTCAAATTGAT^^ 
ACmCTGTTATTTTAAATAGTGNGACAACTTATCCATGNATAAANACGGGTO 
CTGATCCTGArrCACAAANGAC>mGCTGTGATAKrACATTAACCOTTANGC^ 
ANAGCANTTATTTGTTATTNTAGCAANNGCNCCGTTTNCC™ 
TmTGTNGTTNAAAAANCCCAAAAAAACCri^^ 

SEQ ID NO: 1674 acgcggggcaactgatatatctttagggtgagttactgaggctgtttcagctg 

AGGCAGCATCACCCACAGAACACGATGTTGAAGATACTGGTTTCTCATTACTG AGAGC GTGTTGGT 
CATTAGTCTTTTTAAGTGGAAGGCAGGCAGCTTCTTTATCTGTCAOT 
AATAACAATATGTATCTCTGAAGCAGGAACTTCAAATTTCTTGTGCCm 
TAAGTCATCACATTCCAGCCAACTTCCATCAACATCTAAAATCCATGGTATAAAATGATTAm 
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TCGATACTGAATTACAAAAGTATCTTNATAAAGACAGCCITCAAAATGAA^ 

GCATCThfNGGGNAGCCTCTACAANNGCACATGAATATGGGAGATCTTTTC^^ 

TGGGATACTGGTGNAATNGTAATGACCAAATGGCACCATAAGTGGG 

SEQ ID NO: 1 675 ACAAATACGAAACAGGATTATCTGATAGTAGGCCTCTGTGGATGGCATCAAT 
TATTCCACCAGATATGATTCTTACTCTmGGAAGGGATTACAGCCATTATCCATTACTGm 

gatccaactacacagtatcaccaactmggtcagtgtagaccagaaacacttgttrgaagcacgc 

agtggaatcctctcaatccntcatatgatcatgtcctctgtgacactgctttggagcatact^ 

aagctgattcttcagaaaagatgactattgccgcatccgcatctcttaccactattaatct^ 

ctacaaagaacttgagacaacagattcn'gaattgtggccccatitcaatgaa^ 

ttatggctgcattgatrrgtgtggatgaaagaagacagaatnaaaccaccccnggncaagncatt 

cctgcagcngtgaaaacagttttattagggaatggtcgtcatcagggctgaaacanaacnm 

nactntaaaaagtttaagcgcncancntgccaggcaaaaaat™ 

ttctnttcaanatcaggccattatgg 

SEQ ID NO: 1 676 acacaattatgaaactatgccagtagcgacctgcatgtttgtctcatctcttt 

AGCCACTTAAGTTCTCAAAACACTGCCAAAGCTTTCCTrCTCTATTAAACT^ 
ATACTATTATTCAAGTTTAAGATGTGGCAAATCTCCCCAAGTGTATATGGCTGGTTCTCTTCT^^ 

aatgaatccataagacagtotcccctaatrcatctctattttctctaacct^ 

ttggaaaaaagtaattccggtatactaaaaccagagcccaggtttctcaacatot 

acttttaattccccaatcaaggtataacacacaaatgtanctgcgcttacagtatag^ 

tggtggtgctttggttagggaccattaccagcagtatctgnttgcaccccggtcttctttagc^ 

aaaa^fttcanaattgtatgccatggagatttttacanagcncctcnttc 

TGGTT 

SEQ ID NO: 1677 acacaaggaaagagcccagaaccaagctcaagatacctattcaaagatccct 

GCCAGCCAGGTGGGCAGTTGAGTGCAGATTCCTGCGACCCCCGCCATTTrrGCTGTCTACTTAGAA 

gggaagtcaatggcatacccaggattccttcctgatccctcaaagaaatggggaataaacacttt 

aaacaagttccaggccactcagcagtggtaggaaagggagggagggcatggaggaggatggggc 

acatagctgacattattaaaaacacatccgaagaagtagggggccctgtggggtgtagaanacaa 

ATAAGGCATCCTGTGAAACTACAGACATCAGGGGCCAGAGTCCGGTAGCACCTGATTAGGCAAAT 

gcagatgataggcggtgagccgccaggctccaccaggaatgacgaagtctggttgattctcttcc 

TTCGGCATAGCTGNTGTAATCACNTCTACTCNAGTCCTTCTTGTCTGCAAAGNGCCTGGm 

AANCAATGCAAAACCANATCCAGANATCGGTACAAGGNCGTTGCCNATNACCCCAOTCGTCAAA^ 

TCTGCCCAAATTGANCTGGGGCCTCCCTTCGCCTCNACT 

SEQ ID NO: 1678 acgcggggggatttgctgaattaatgactattgaatttaaaactaatta 

GTTGACAAATAAATAAAAGGTAGTGTTTATGTCTGAGCTTATTGTGTITGAGCTAACACCAGGW^ 

CTCAGTAACCATGACCTGCTCCTCCATTTCCATTTATTCTCAACATTAAATAGTm 

GCCAGAAATGCACTTGTGCCAGGTATTGTCCCTGCTGTATGAAAAGCTTC^ 

TAATAGTGCCCTACATTKTGGTTTITCTGGGNGGAATTGTTT^ 

CAATATANTTTTGITITATTTGTlSriTCCAA^ 

SEQ ID NO: 1 679 ACTAAACCCAGTAAAAATTGTTGAAAATGTTAAAGGTCAGCATGTTCTAAT^ 
GGGAATCTAGATATAGCTTAGATTrCCTATTGGCTTAGAGTATTTGCTATAACAAATGAAGTC 
TGACAATTATATATTCCTACTCGGTCATACTGGACTGGCTTCGTTCTCTTAATATACTCAGT^^ 
CTCAAGCCTCTGGCTATTAACATACCCTAGTTGCCGTTTTTTAATTGCCATGAGCCA^ 
GGTATACAATTGATCCATTTATTTTAATGGCTGCCnTIT 

CTATGTATGTAGCATTGGGGGGAAAATGTACCACATTTTTTATGGGAAGACm 

CATTTGAAGGTTTTACTGGGGAACTACCTGGATATGCCCCAAGACTGAGTGGAATCGCCC^ 

GGNGCCTTCTTATGACCCAAAGTTGCTTTAAAAGTCATTTGAGGGATAAATGTATT/^ 

AGGNGGTACCCNNAANANATNTACTGTTGTGGGANGGGGGAATTGTAAAAAGACT^ 

TGCCAGCAAAGNTCCTAATT 

SEQ ID NO: 1 680 ACGGTATATATATTTTAAATATTCTCACACACACATGCAATCCATAAAGCAAT 
TATTTTAATGTTAAGAGTGAAAGTAAAATGTAAGCTCCATGAGTTTGOTCAATGT^^ 
CAACTACAACAGGGCCTGATTCCTAGCAGAGACTCCAGGAATATTTGTGGAATGGTTCCTCATGOT 
AGNCTCAAGGCTATGAGGGTCTTATATGTGGAGGATGGGTNACGAACACTANANATGTGGNTGTG 
CTTTArmANCCCGAGAACAAATAANTTTA 

SEQ ID NO • 1 68 1 ACTAAAAAAGATTTTGAGGATTTATACACTCCTGTGAATGGATCTATAGTGAT 
TGTCAGAGCAGGGAAAATCACCTTTGCAGAAAAGGTTGCAAATGCTGAAAGCTTA/^ 
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GTGTGTTGATATACATGGACCAGGCTAAATTTCCCATTGTTAACGCAGAACmCAT^ 

ATGCTCATCTGGGGACAGGTGACCCITCACACCTGGATTCCCTTCCTTCAATCACACTCAGm 

CATTTTCGGCATNAGGATGGCTAATATCCCTGTCCAGACAATCTCCANNCTGNTGAAAA^ 

TGGGAATATGGANGAGACTGTCCTTGCTGGAAACANACTCTCTTGTAGGATGGNACCCTANAAAG 

CAAGAATGTGAANTNCTGTGANCATNGGCTGAAGAGTAAAATCTTACCTCrrGNGTTTAA^ 

GNNACCAATCCTTTTTNNANTNGGCCAAANAATCATGG 

SEQ ID NO: 1 682 ACGCGGGGATTAAATGTCCCAAGCAAGGATAGGGAAGGGGAATGGTTGAGT 
CTCTGGAGATCATTGTAACCAATCCTGCCAGACCTGTTTGGGGCAGTGGGGAGCAAACCTAGATA 
AGGACCCGTTTGGGGCAGCAGGGAGCAAAATCTCCTITAACAACCAAGCAGTTCCTCATO 
CAACAGAGCGAGGCTGTGATAACTTAGGAGGCAGCAATCCTAATAGTCCITCAGTTGCAT^ 
CTGTCTCCAACTGGACACCATTAGGCAGTGTCAGCCNGATATTCGGGGCAGTAGATAAATGTTCAT 
TTACTGATGCACTTTAGTTTTTGGCTGNTACCTGTTTCCAGAAT^^ 

SEQ ID NO: 1 683 ACnrnri 1 1 1 1 1 i 1 i 1 U 1 1 1 1 1 [TCGCANATTTGAGTAACTTTATTTGCATTT 
TATAGNGATTTCTTAAGGCCTATATCCAATGAAACCATTTTAAAAGCTCTATGAGG^^ 
TANATGTCTATTACACTNGTCrnTAAAANAAAAATGCTTAAATrr 

rrTGTAGGTANAGGCCCTGCTTCTTCATGATCTTCAGTTTTAGATCTAACAACCACAANANAA^ 

AGCTGGATATCTNNTNAGCTTTNGNTCATTCATAAACTCCAAGTCACCATAG^^^ 

AAAGGGAACATACrGNCCACAGNTGTANCAACAGTTGCNCACAAATCCAAAGCATNCCTCCCGAG 

AGGACAAGAGTTGAGTTAAAATCCCCCAGAGTAAGAGAGAGGCTTCTTAAAAGCTCCm 

CTNCAAAAAATCCTGNGCGCTTGNAATTCTCATTGCAGTAGGCATTTGCTANAGAGA^ 

GAGCTCTGTN 

SEQ ID NO: 1 684 accgccagctctctgctctccacagggctccccgccccacccggcctgataaa 

GCGCGCCGACTGGGCTACAAGGCCAAGCAAGGTTACGTTATATATAGGATTCGTGTTCGCCGTGG 

tggccgaaaacgcccagttcctaagggtgcaacttacggcaagcctgtccatcatggtgttaacc 

AGNTAAAGTTTGCTCGAAGCOTCANGTCCGTTGCAAAGGAGCAAGCTGN ACGCCAC TGTGGGGC 
TCTGAGAGTCCTGAANTTTTANTTGGGTTGGTGAAGATTTCCACATACAAAT^^ 

tatttgatccattcattaatgtctttagaaanaaatccttgtcacccagtgannacc 

acaagctaaaggganatnnnggntocatttgcaggctaaanatot 

ntnanacctottggngntntagngggacntagtaaatggnatactttccaa 

seq id no: 1 685 gcggtcgcggccgaggtacttgcagccctcggccaaacggccagacgccgac 

GTCGACCAGCAGGGACTGGTAAGAAGTTTGATAGCTGTAGGACTGGGTGTTGCAGCTCTTGCAm 

GCAGGTCGCTACGCATrrCGGATCTGGAAACCTCTACAACAAGTTATCACAGAA^ 

GAmCAACTCCTAGCrmCATCCTACTATAAAGGAGGATTTGAACAGA^ 

AAGCTGGTCTTATTTTAGGTGTAAGCCCATCTGCTGGCAAGGCTAAGATTAGA^ 

AGAGTCATGATTTTGATCACCCNATAAAGGTGGATTCTCTTACNGTACACCAAA^ 

ATAAGACTTGCTANAACAACCACCAAACATTGNTGTTAAGACCCACCAAAAAAAAAAAAAA/^ 

AAAAAAAANNTCOTGCCGGGCGGCCNTTCNAAGGCTAATTCACTCCTGCGGCCGT 

SEQ ID NO: 1686 actgtcggtttcagaaatgccitgcagtggggatgtctcataatgccatcagg 

TTTGGGCGGATGCCACAGGCCGAGAAGGAGAAGCTGTTGGCGGAGATCTCCAGTGATATCGACCA 

GCTGAATCCAGAGTCCGCTGACCTCCGGGCCCTGCAAAACATTTGTATGACTCATACATAAAGT^^ 

TTCCCGCTGACCAAAGCAAAGGCGAGGGCAATCTTGACAGGAAAAACAACAGACAAAT^^ 

TCGTTATCThrrGACATGANTTCCTTATGATGGGAGAAAATTAAATCANGTTT^^ 

rmAGNAACANACNAAAAGGGGGCCNTCNANTTTTAAGGNTTGCCAm 

GNAGGANACNCAGATTTNCNAANCTTCGGGTTNGAAATTTGGACTGAACN 

NAAATTTNGGGTCCACAAAATTTTACCANGTGCTNTTGANAAAAANAG 

NTTTTANNGGGNGTTTTANAACCTCNAAACNTT^ 

SEQ ID NO: 1 687 accgcgggtccgtcaagctggtgtcattgactaaccacatccacaaagcaca 

CCATTAATCCACTATGATCAAGTTGGGGGGAATCTGGTGAAGGGTTCTGAATATCTCCC^^ 
CCCTCCCGAAATCTGGAATACTTATTCTATTGAGCTATTACACCAGrmAACACCl^ 
ATGTTTAAAAAAATAAATTAATTTrAANAAAACCATTTTAAATAATC 
AACTTNANGGGGCNCTTATNTNTCAATTTAGGAACTITATTT 

AATCCCTrrGCCTGTGTAAGTGAAAATATAGACTGTATCTTGTGGCCCTATGAAATTCTGCCT 
TTATATACTCTACCTTCATTAATACTTCTGGCAAGATGTCTGCTTAGCACTCAGTTGCATTCm 
TmCTNCIXjTCATTAGCTTAATTCTGAGACATOTGAGGAAA^ 
TGTTTNGNAANCirrTAAAGGTGGGGCAATTTAAAGGT^^ 
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AAAAAAGTGGTTTTNAAGGNAATTTITCCAAGNAGGTm 

SEO ID NO 1 688 ACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAAra 
GGACTGCAGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGC 
CCCAGAAGGTGmGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCT^ 
ATGGTrrCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCrrGGTOT 
CTAAAATCTCCATCTACCTGAGAGACGAGAACTCCACAGGTCATTCCGTATCACAATCCTGCCrCA 
GAAATTGCAGGTGCTGCAGTGCTGAAATrrCCGGGCCrrTCTCAACAGAGGATGTAGCC^^ 
TGTGTCCCCGCTCAGTCTTGAGCGAGCCCATTTGNGGATGTGTCTATGCGCTCATGAGCGTCTGTG 
GACCATCTCITGTCTTATCGTCTGTGCTGTGCGTCCGGTGTCANCGTCGCCCGTGACCTGAGN 
AATGATGANNCTTCTGTCAANTCGTGGAATGANTNCCNGCCTACTNACANCOT 
GAAATCCTTTCNGGCA 

SEO ID NO- 1 689 ACTTT rnTn 'l 1 1 11 1 1 1 i n 1 1 i 1 INGCCAATCCGTTTTTTAATTCTTAAATTC 
TOACArrACAGAACTAAACTGAAAmATTAACATTCCACTCTTACAm 
AAATAAAATTCATGAGCCAAAAAACCCAAAACAAAACTAAAAACAGGGAAAAGCTTATAAAA 
AAATATGGATCCCAGCATTAACAGCTGAACAGANAATGNGATTTrrAAAT^ 
AAGTTGTGTAGAAACTGAACACTNACAAATTATTTAAAACCTGGAATCACTGCCANAAAT^^ 
ATITGGANCTTGGGANAACAGCANAANGGGGTTATTGAGGGACCTACACTOTCTACT^ 
CCCriTrAAAAGCCNGNGTACCrGCCGGGCGGCCGrrAAAAGGGCGAATNCANCACACTGNGCCG 
TACTANNGGATTCNACTCGNACCAACTrGGNGAANTTTGG>OTANCTGGTTNCTG 
CCGTCACATTCCNNACANACGACCNGAACANAAATGAAACCTGGGGCCNAAGG 

SEO ID NO- 1690 ACGCGGAAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAGGACAGTATTC 
CAGTTACTGAACTTTCAGCAAGTGGACCTmGAAAGTCATGATCTTCTT^^^ 
TGTGAAAAATGAACTTTTGCCTAGTCATCCCCTTGAATTATCAGAAAAAi^ 
AAGATAAAATGAATITITCNACACTTGAGAAAACATTCANGGGCTAm 
GATGGAATTCAAGGCAGTGCAGNAOGTCACCGCTTCCTTTCTTTCAAGCTCAAATCm 
GTmGAGGGTAATGATGAACTATTGGAATTTAGGATTTTOTATTO 

ATGGGAGANGNCGCTTGNTGGGGAATATAACTTGGTTACTGTATAGTGGCTGTCATGGAACCGAG 
GCTGATCTGTTATAGCATCTTGTNCTC 

SEO ID NO* 1691 ACTGGGATTACAGGCATGAGTCAATATGCCCGGCCCGCAGTCTATCTTCTAA 
AGCTGCATTAGGTGAGGTCrGACTTGCTGATGAGTTCCTCCCCAGCCTACCCTGAGCTGGCCTCTC 
TCTGCTCAGTTGTGCCATCTCGCCTCTGGCCTGAGGCCAACCAACATGTGGAGCTTCTTACCCAG 
GCCATTrTAAAATGACACTTCATAACTAATATTAATAATAGCTAAAC^^ 
TATAAGGCACTATTCTAAGTGrrGTATATATATTAACTGATTTATTTACTGCAACAC^ 
GTCTTCTGCATTTTACAAATGAGGAAACTGACCACAGGAGATTAACTGACTTGCCAAAG 
TCTGATGATGGACCCANGTCTATGTCACCAGAGTCTGTGCCTTAACACTACAGATAGCTGCTGAAG 

TAGTTGTGCCTGAGGACCAGTTANGAAAATTAT 

SEO ID NO- 1692 ACAAAGAAAGTTTTAAGTCAAGGCCTCACCAArrCCTACAGTATTAGTATTGT 
GTCTCAATTCTCAAAACTAACTTTTAAAAAGCTTAAACrr^ 

TAAACTAGAATGAACAAACATGAGAAATATTTCTTTGAATCAGGGAGCTAGCACCT^ 

CAAAAAAGCACGTCTCCCCAGTGTGTTCACTGTGATGTGGTGTAAAAGATCCACATTTAACATACT 

TAAACTACTTAAACTTAGATAACATCTICTCTGAAGTATACTACCAAAATGTTAATTGAGAA^^ 

GAAAATAGTTTTAGTrTACTCATrATCACATGCTAGAAGAAAATm 

AGGTAATTrmAATCCAGATTTTTCACAACTCATGGTGCAAA^ 

AACTGCTCTTAATTGCTTGTGCTGCTGTGAAATGTTGAAGTACTCAN 

SEO ID NO- 1693 ACATTACACTAAATTATTAGCATTTGTTTTAGCATTACCTAATm 
TCCATGCAGACTGTTAGCrmACCn'AAATGCTTATTTT/^^ 
CGAAGTGCCAGTATTCCCAGAGrmGGTTmGAACTAGCAATGCCTGTGA^^ 
ACCTAAGATrrCTGTCTTGGGGTTTTrGGTGCATGCAGTTGATTACITCTTA 
TGAATGTTGGTGTGAAACAAATTAATGAAGCrTTTGAATCATCC CTA^^ 
CATAAATGGATTAATTACTAATTTCAGTTGANACCTTCTAATTGGriT^ 
'AACACAAATTTATGGGCTTCCTGATGATGATrCTrrAGGCATCATGCCTATAGTrGCATCC^ 
AATGTAAGTACACTGTCCAAAGGTTTGCTTCrmCACTGG 

SEO ID NO- 1694 acagtggcatgatctcggctcactacaacctctgcctcccgggttcgaggga 

TTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTATAGGCTCCTGCCACCATGCCCAGCTAATGm 

tttgtatttttaatagagacagggtttcgccatgttggccaggctggtct^ 
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GTGATCCACCCACCTCTGTCTCCCAAAGTGCTGGGATTACAGGTGTGGGCCACTGCGCCT^ 

TCTTGCATTCTTCAGAGAATTCACATACTAAGGGGCAATTCGTGATCCTCAAATAAGA^ 

GTAAAACATCnTAGAGTrirrTATrrCCAGACTGCATTm 

CATAGCATGTTCTTTGTCTCTAAGAATTAGGGAAGAATmGAGGCTATTAGGACATO 
CTITATAATCACTCTATGTTTCCCACGTACTGCTrCATCTCTGA^^ 

SEO ID NO- 1695 ACTCCAACTCAAGTTTACAAGTTACACCTTTGCCACAGCCTTGGCTAAATOT 
GAACTAGTGCAGAATTCAGCTGTGGTAGAGTGCTGATCTTAGCATGCTTC^^ 
TCTTGACAGTCATGTGCTTTGTAAGTCCTTGATTTACCATGACTACATTCTTAGC^^^ 
AACTGGAAGAAGAGATTCTTCAGTATATGACAGGTAATGTTGTAGAGTTGGTGTCCATTCACCATT 
ATCCAGAATTTTCAGTGCTAAGCAAAAAGCTCCTGCTGCAATTTGAGAAGGAGGA^ 
TGTCATAGTCCAACATAGTTAGTTCCATCAGGTAriTGGCCAAAGTATGTTGCTCGACATCAAC^ 

ctccantcttagatgctctccgaaggaagtgcanaggtaqaggcgcccagaccaaagttaaagct 

CTTATAATCTCATTTCATCTGCTGATTGGTGCTTAGTATAAGTGTTGCAGT 

SEO ED NO- 1696 accotctaaaggggcacaaggatgccatcacacaagcattgtttctacgaga 

AAAGAACCTGCTAGTTACTAGTGGGAAAGATACCATGGTGAAATGGTGGGACCTTGATACT^ 

ACTGCTTTAAAACAATGGTTGGCCACCGGACTGAGGTATGGGGGTrGGTTCTGTO 

AAGCGACTCATCANTGGGGCCTCANACAGTGAACTGANGGTATGGGACATAGOTATCTGCANGA 

GATTGAANACCCGNANNAACCANACCCCAATNAAAATCATNGGATCrrCTCCTGNAATACAA^ 

ACTCTTGAGGCANAGGATGGTGCCTTTGAGACGGATGAACCCCTGANGATCGAATCCm 

AAAAGCrGGTCCTTATTGCGGGANGGAA 

SEO ID NO- 1697 acaaaggacggagcaccatcaacccgtccaaggccagcacaaacccagatc 

GAGTGCAGGGAGCAGGAGGCCAAAACATGAGGGACCGGGCCACCATCCGGCGCCTGAATATGTA 

TAGGCAAAAGGAGCGCAGGAACAGTCGTGGTAAAATAArrAAACCCCTGCAATATCAATCAACGG 

TGGCTTCTGGCACAGTGGCAAGAGTAGAGCCAAATATTAAATGGTrrGGAAACACACGTGTGAT^ 

AAGCAGTCATCATTACAAAAATTTCAAGAGGAAATGGATACAGTTATGAAGGATCCATACAAAGT 

TGTCATGAAGCAAAGCAAGTTACCAATGTCTCTTCTCCATGATCGAATCCGGCCTCATA^^ 

GGTGCACATTCTTGATACTGAAAGTTTTGAAACTACATTTGGCCCTAAGTCACAG^^^ 

AACTTATITGCAAGTGATATGCAGTCTCTTATCGAAAATGCTGAATGTCACTGAGACTAT^ 

SEO ID NO: 1 698 ACAAAGAAGCAGAAGTGTAATTITCCITrTCCCAGTATGACG 
GTTCTGCCATITGAGCAGCTTACTGGAAAGATCCAGCCTTACTTGTCTTA^ 
GACTCATTGCCCGGCAAACACTTTTACCCTCAGATGTTACTCATGATATTATAAAAT^^ 
GTGCTCAGGTTTGCATCATAAGTGAGCTATCCCTGAAGGGTTTTAATTACTTAm 
TATATrrGCAAACTTCTTTATAAAAGGTGAAAAAAGCACACAAAAGAGAGGGTGTOT 
AACOTCACAACCTTCATGATITCATAGGArrATTTTGGAAATATAGCAC^ 
ATCTGGCTAGGTATATTAGGGGTAGTGCAATAACCTGAAGAAGCTGGCATTGTTACAGAAACAGA 
TCAAGGGCTATAATTTATGCATTTITAGCAGCAGTATCTATTAATCATGCC^^ 

SEO ID NO- 1 699 acgcgggatccacgggctgcaccggcaccctggtggcagagaagcatgtcct 

CACAGCTGCCCACTGCATACACGATGGAAAAACCTATQTGAAAGGAACCCAGAAGCTO 

GCTTCCTAAAGCCCAAGTTTAAAGATGGTGGTCGAGGGGCCAACGACTCCACnTCA^ 

GAGCAGATGAAATrrCAGTGGATCCGGGTGAAACGCACCCATGTGCCCAAGGGTTGGATCAAGGG 

CAATGCCAATGACATCGGCATGGATTATGATTATGCCCTCCTGGAACTCAAAAAGCCCCACAAGA 

GAAAATTTATGAAGATTGGGGTGAGCCCTCCTGCTAAGCAGCTGCCAGGGGGCAGAATTCACTO 

TCTGNTATGACAATGACCGACCAGGCAATTTGGTGTATCGTTCTGTGACGTCAAAGACGAGAC^ 

TGACTTGCTCTACCAGCAATGCGATGCCACCAGGGGCCAGCGGNCTGGGTCTATGTGAG 

SEO ID NO- 1700 ACATGAAGTATATGCTGTGGATGTTCTCGTCAGCTCAGGAGAGGGCAAGGCC 
AAGGATGCAGGACAGAGAACCACTATTTACAAACGAGACCCCTCTAAACAGTATGGACTGAAAAT 
GAAAACTTCACGTGCCrrcrrCAGTGAGGTGGAAAGGCGTTTTGATGCCATGCCGm 
AGCArrTGAAGATGAGAAGAAGGCTCGGATGGGTGTGGTGGAGTGCGCCAAACATGAACTGCTGC 
AACCATTTAATGrrCTCTATGAGAAGGAGGGTGAATTTGTTGCCCAGTTTAAAm 
TCATGCCCAATGGCCCCATGCGGATAACCAGTGGTCCCTTCGAGCCTGACCTCTACAAGTCTGAGA 
TGGAGGTCCAGGATGCAGAGCTAAAGGCCCTCCTCCAGAGTCTGCANGCGAAAAACCCAGAAAA 
AGAATAAAAAGANTGCTCANANTGAKAGAATGCCACAGTGGGGANCATTA 

SEO ID NO- 1701 AcrrrmTrTTTTm^^ 

CATGGCAGTTTTATNGGGGGTNCTCCCTTCCCATATTTTACCTGGAGT^ 

AAAAGGGNGGGACAANAAAGNGAAACTANAATATAAATATCCGTAAACAGCATNTGACCAn^ 
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TNTTTAACAGGGAGGAGGAACCACTGGNGTNTNTTANTTTAGGAa^^CAGGANCATTATAGGG 

GGGCCAGCTGGNGTTAATGGCATGCAGGCATNTGTNATGCCAACCATGAGGGCTGGAANAACCAA 

AACCAAAGAAAAATATGACCGCATNTATTCTAAAGCTACTGGGGNGGTTTCTCATGC^^ 

CTCGNAAGTCTCCNGGTCCNGGGGCmAGGCCCGNGAAACACCATCTNATNTOT 

AGCTCCTTCNCCTTGGAT 

SEQ ID NO: 1 702 AC1" 1 r r t 'i'i il i ' l TTTril'l'iTr mT ITGGTTTANANT^ 

mATAANAACAAATATTTAAAATCGAAGGCCAATTATTAGGNCTCAirrAGTTGOT 

ACTTGTATTTACCnTrCCCTAGNGTCTGAGTAACmTCAANAAACAA^ 

AACATTCAACANATATTNTTATATATTTCTGTCTATGATGCAAANATATTT^ 

TGCAACAAATGTGTCATTGNGTCATAAACAGO^TGTmAAAATTCANATTO 

ANACAGAAKTTTGAANGACAAANTTAAGTCTCATTCAGGAGNGGCCATTATGTO 

ATNACACTGATTAACCAAACTCTGAAAGCCANAGCCCCAACTCCAGAGAAACTT 

GAAAAAGTAATTATTITGANACCACTTTGTAGATCATCACNA 

SEQ ID NO: 1703 ACTTTTT rri U l ^ l ' l 1 11 1 1 1 1 i 1 i " 1 1 n GAATCTGAAGTCTTGTGnTTACTAATG 
GAAAAAAAAATACAGAANAGGTTITGTTCTCATGGCTGCCCACCGCAGCCTGGCA 
CCAGCGCTCACTTCTGCTTGGAAAAATATTCTTTGCTCTTTTGGAC^^ 

GCCAGGmCCAGCCAGNTGGGCACACTTCCCCATGTTTGTCAGTGAACTGGAAGGCCTGAACT^ 

TCTCAAAGTCTNATCCACAGAGCGGCCAACAGGGAGGTCATTTACAGTGATCTGCCGAAGAATAC 

CCTTATCATCAATGATAAAAAGGCCCCTGAACGAGATGCCITCATCAGCCTTTAAGACCCCAT^ 

CCTGACAATGGTGCGCTTCGGGTCTGATCCAAAGGAATGTTCATGGGTCCCAGCCTCCTTGTTOT 

AGGTGTATTGCCCATGCTAGATGACANAAGTGAGATCCCAGAAGCCCA 

SEQ ID NO: 1704 ACTGGGAGATACAGCCATCCACCITCAGATGTGTCTACGTGCGCTCTGCCATT 
CAACTCGGAAACTATAAGTAATTCTCAAGAAAGCCCTCATim 

ATGTCATTGCTAAAAAATAAATAAAAGCTAGATACTGGAAACCTAACTGCAATGTGGATC 

CCCACATGACTTATTATGCATAAAGCCAAATTTCCAGmAAGTAATTGCCTACA^^ 

TTITGCCTGCATTTTCAGAATCATCTTTTGAAGCmCT^ 

TTCTTATTTCACTAAATGTAAAATTTGGAGTAAATATATATGTCAATATTTAGAAAGCT^ 
TTAATTTCCAGGAAAAAATAAAAAGAGTATGAGTCTTCTGTAATTCATTGAG^^ 

gagataaagcanatgccaacactactctgatatccccatcatactggtaaagcg 

SEQ ID NO: 1705 acttttatcaaaatccatcataaaagggaaagaagactacaaagttttgcct 
aaatataacaactagagctagatttgttgtgggggaaggggctctcagagcttatctgccgct 
tcctcccaccccccaccgacactgaccactggaatcttcaggttcttgaggagttccaaggct^ 
gtcagaacgggcatgggaggcagcagacacagcactgcctgctccccctccaatgttgctgot 
ctcaatgagcctccaacccaactgctctagaggagagtggatgggctgaaggagaagccaggga 
agttagagcaaacacattatcatagaaactgtacctcggcgcgaccacgctaagg 

SEQ ID NO: 1706 acgcgggccagttgcccagagtggtaactcgcttgataatgcaccttttattg 

GCTGCCrrCCTCTCTATTGTCCCCCTTCTCCATTCCTCTATCAGTC 

accatctgcactcaaattctgagttggcttctggaaaactggaaactaagaacagaaatccagtg 

CATAGACCAGAAGCGGTGGCTCACACCTATAATCCCAGCACTTTGGGAGGCTGAGGTGGGCAGAT 

cacctgaggtcaggagttacagaccagcctgaccaacatggtgaaaccccgtctctactagaaat 
acaaaaattagccggcatggcggcagtgcctgtaatcccacacntcgggaggctgangcgggcag 

ATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGACCAACATGGTGAAACCCCTCTCTACT^ 
TACAAAAATTACCGGTGTGGTGGCAGGTGCCTGTATCCCACTACTCANGAGGTGAGGAA 

SEQ ID NO: 1707 ACAAATTTCGATTGTTAGGAAACCAAATGTTCTGAACATTATmCATTAGi^ 
AAGGGAAGGTAATTTTATAGTTGTTTGAGGTGTTGTAATACATCTACTGATAGGCCCAGCT^ 
ACTAAACTGGCTGTCTGTTCCAAAATCCATGTGGGGCTTGGGTTTTCTGAATG^ 
GTTCTACAACATTCTACATATGAACAATCTATCCCACTATGGCTCTTGCTAC^ 
TCAGCATTATTTTCACAGAGTCTTTCATCnTTAGCTTCATC^ 

GACAAAGGTCCATATCCTCAATGGAGTCCAATAAATCTACAACATGGGACGGTGGAGTGATTGAT 

TTCAGATGTTTGAGCACATACTCTTCACACTGCCAACTGCTCCAGAAGATTGTTATm 

AAGTTGAATTATTATTATGAANGTGCCCAGAAGCAAGCCTCATGTTCTCGGA 

SEQ ID NO: 1708 ACAAATATTTAAGAGTGTTGATTGGGAGTAAGGGAATGTCAACTGCCAATAA 
AGTGGAAGATGAAAGAATAGGACTTTACACAGAGCATATTTAGTTATGGGTCTCTGTCTCCT^ 
ACACAGAAAAACTCCCAGACATTCATGACTTCATCCACCCTGCCTGGCAGATAGGCCATAm 
GACCCCCTGGCTCTTCACCTCAAAAGGTTTATCTrrCCACCACACTAGCAAAGACCCT^ 
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ACAACTCATACTGCAGCTAAGCATCTACCCCGAGGGACAAGGCAAGCACACACTAGGGCAGCTGG 
GCCATCCTGCCCTAATCCCTCCAGCGGTGTCCACACTGAGCATTGCAGCACTTGTAGAAGGTGGTC 
ATCGGCTCATCTGCAGAGCGGGTCTGAAGCTGCATGAAGTAAGCACGAGGATGTCGCATTTGGGA 
CACGACTCTGCATAGAGCAACATTCTCCAGGCAGTGTCACAAGCACATATCCATTCTT 

SEQ ID NO: 1 709 ACAAATGTTTmATTCAAAAATACAAAATAAATTATCTGTAGGC 
ATGACAGCAGTAAACCATTATATATTTTGTCAACTGAAACCAGTAACTGATGGTTAT^^ 
CAGCCAGCCTTTTTCTTCATTTTCCCCAACTGACTTCTCTGAA 

GGGCTTCCTGTCACAGTTCATTAATAAAGGTAAAGCACTAGTCTAGGAGTTAGAACATGCCACCTC 

CCATACCACCTCCCATTCCACCCATTGCACCCATTCCAGGGTCCTTCTCTTCmAGGA^ 

GACTACAACTTCTGCTGTANTTAACAGAGAGGCACACCAGCAGCATCCAATAAAGCAGTTCTCAC 

AACCTTTGTGGGTCAATGATTCCTTTrrCCCCATATTACAAAATCTC^ 

CTTCTGAGGAACTTTGCTAATTTCTCACTTCAAAGATCTTNACACCTGCT^ 

SEQ ID NO: 1710 ACACGGGGGGAAAGACAATCATTGAATACAAAACAAATAAGCCATCACGCC 
TGCCCTTCCTTGATATTGCACCmGGACATCGGTGGTGCTGACCAGGAATTCTn'GTGGACATTGG 
CCCAGTCrGTTTCAAATAAATGAACTCAATCTAAATTAAAAAAGAAAGAA^ 
CTCTTTGCCATTTCTTCTTOTCTTTTTTAACT^ 

TTGCTTAAATTGTGGGCAAAAGAGAAAAAGAAGGATTGATCAGAGCATTGTGCAATACAGm 
TTAACTCOTCCCCCGCTCCCCCAAAAATTTGAATTrrr^^ 
AAATGTCAACCTTTGTAAGAAAACCAAAATAAAAATTGAAAAATAAAAACCTA^ 
AAAAATACANTTTATGATNAGAT^^TmATAAGTNTNANGCCTCANAAAT^^ 

SEQ ID NO: 17 1 1 AC i 'rinTriTrriTn"i- i - n TTTTTTGCTiui"iuii"i-i"i 1 1 1 1 1 n m 

AATCTTTNAATCTTTTATTTAAANGCCATGANCCANGANGGAT^ 

NCATCCATGGACNGCACNTAGNCCTNAAAAGCAGNGATCTGCTCCTCCAGCATATCTGTTCCAAC 

TTTATNATNrrTCAACNAO^CACTGTNTTTGAAGTTT^ 

AGAAACCCCCAACTAACCGCTGTITGAATGNTTNTGACCCACTCCTOTAAm 

CATCATCCCANGGNTTCACATrrAOTAANATGGAAGACTTGGAACAAGGGCAGGTT^ 

TTTGATTCATATrGNGCAAAAAGTTTTCCCTTACCT^^ 

AAGAGGCAANGCA 

SEQ ID NO: 1712 ACCAGCGAAGCACCTCAGCCCCCTCGGAAGAAAAGGGCCCGGGCAGACCCC 
ACTGTTGAAAGTGAGGAGGCGTTTAAGAATAGAATGGAGGTTAAAGTGAAGATTCCTGA^ 
AAAACCATGGCTTGTTGAGGACTGGGACTTAGTTACCAGGCAGAAGCAGCTGTTTCAACT^ 
CAAGAAAAATGTAGATGCAATTCTGGAGGAGTATGCAAATTGCANGAAATCNCAGGGAAATGTTG 
ATAATAAGGAATATGCGGTTAATGAAGTTGTGGCAGGAATAAAAGAATATTTCAATGTGATGTTG 
GCACTCAGCTGCTCTACNOTTTGAGAGGCCCCAGTATGCTNAATCCTCTTGCTCACC^ 
CANTGCCCATGTTTATGGAGCACCACACCTACTGAATOTTTGTAAGAATTGGACATT^ 
CGCCCTTTGATGAGAAAGCTTGATTATTGTGGCTTTTCGKrGAm 

SEQ ID NO: 1713 ACGCGGGGGCTCACTCTGCGCTTCACCATGGCTTTCATTGCCAAGTCCTTCTA 
TGACCTCAGTGCCATCAGCCTGGATGGGGAGAAGGTAGATTTCAATACGTTCCGGGGCAGGGCCG 
TGCTGATTGAGAATGTGGCTTCGCTCTGAGGCACAACCACCCGGGACTTCACCCAGCTCAACGAG 
CTGCAATGCCGCTTTCCCANGCGCCTGGTGGCCrrGGCTTCCCTTGCAACCAATTTGGACATCAGG 
GAGAACTGTCAGAATGAGGAGATCCTGAACAGTCTCAAGTATGTCCGTCCTGGGGGTGGATCCAG 
CCCACCrrCACCCTTGTCAAAAATGTGAGGNGAATGGGCAGAACGAGCATCCTGCTTCGCTACCT 
GAATGACAAGCTCCCCTACCCTTATGATGACCCATTTCCCTCATGACCGATCCCAACTCATNAm 
GGACCCTGGCCCGTCANATGTGGCCTGAACTTTGAGAAGTCCTATAGGGCCGAGGGAA 

SEQ ID NO: 1714 ACTTTrriT n "lUl"i-iU'ri-lU- i n i U-lUl'r i TATGGAATGA^ 
AAAGTTTTTTTCAAGACTTCATTCTAAATACACAGAATAA^ 
CACCAACCANATTTTCCTTATACTGTCTCAAAATTTAAAGATCAAm 

GCATCATAAAATGGCCCrrTTTTGAGGATGGGANAGGAAGGGTTGGGCAGGA TGGA ATATTA^ 
TGTAACATGATAAACATGCAAGACTGrrATCCAATCTAGATAATTTATATACATTTTGATGAOT 
GGAAAACAAAGCAATCATITGTGACAAGCCTAAAAAGCTTGACATATTTAACATACT^ 
TTTTTGTGCGGTGGGAATTCTCTAATTGTATCATGTGGCCTTTTC 

CTGTTGAAGTTTGCTGTGAACATCACATTCCCCCTAANAAACCAAGGNGGATTGCTCGAG 

SEQ JD NO: 1715 AC l lU'rin"i'iTl'l U "iTri n ' i ' l ' lU Tr n U'il lTl - r CGGGANCCAAGGAAGm 
ACTCTACTGGGNGACAGGAGGGCANAGNGCTCCAAAGGANACCCANATNCNTCAACCAAGGACT 
TCCCTGAAATTTGGCTTTGCTCITCCAGGCCTGCACATGCTGCGTGATNAAATC 
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ACATCTGTGAGGGCCTCNAGGGCTGOTGCCTCAACTrrCTCCCTACTAAGTCCACCCGC^ 

ACAGCCAGGGCACTGCTCTGTGCTGAOTCCANTGCAGCCAAGGGTCAAAATGAAC^^ 

GGCCAGGACTCCTTGGCNTCGGACACAGTCAGGGGAAAAGCCCCCTGACTCTGCAGGACAGAGG 

GTCTAGGGCNTTNGCAGGANAACACTGGTGTCCAAGGGAANCACCATGATTCTGGAGTGGCT^ 

GCATGGCTGGAGTTAhfNAAACTGGAAGTTCCCCCAGTCTTATTANTCA 

SEQ ID NO: 1716 actctggtaagcttgttgttgtccaagtgaagctccctcagatgaggcgtgtt 

GGCCAGAGAGCCATTGTCAACAGCAGAGATGCTGTTGAAACTCAATCCCAACTTAGCCAAATTAT 
TCAGTCCTTTCAGGCTAGCTGCATCAACTCTGCTGATTTTGTTGCC^^ 

GGAAGGAGGAAGACCTTGAGGAATGCTGGTGATATTGGTATCAGCAATGCGGATGTAGGAGAGCT 

TOTCATTCCCrGGAAAGCCCCATTTTCAATTCCTGAGCTCTTCAGOT 

GACAATCATCrGGTrCAGCCATTGAAAGTAACITITCGCACm 

GCAGCTCCTGAAGAGTTTTGGCATTmCTGCAATTCCTTCANTGA^ 

TCCAACTTCCCAAAGGTGTAATGCTCCAGGACTAACTTTGTATTTATTGTGACA^ 

SEQ ID NO: 1717 ACAGCCAACGGTTTCCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGGAAAGCGGTCTG 
CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTT GCTG CTGATGAAGATGATGAC 
GATGATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTTTGATGATGAGGAAGCTGA 
AGAAAAAGCGCCAGTGAAGAAATCTATACGAGATACTCCAGCCAAAAATGCACAAAAGTCAAAT 
CAGAATGGAAAAGACTCAAAA(XATCATCAACACCAAGATCAAAAGGACAAGAATCCTTC^^ 
AACAGGAAAAACTCCTAAAACCCAAAAGGNCTATTCTGANAAGACTTAAAGCAA^ 
GTTAGAAAA 

SEQ ID NO: 1718 ACrn''i''rrn' l T m 'r rn -i'i n n r TTTNGATGCrnTCATTATCACANAACAC 
ACCACCTGTAATGNGTGCAAAAAGGAAATGAGGGGTGGAAGGANAGGAAGTCTAATTGGGAAAG 
GCTGGATGGTCACn'CATTTOTCTCTITCTTCTTATC^^ 

TTTTCTGAANACAGTAATGAAATCTAANAANAGATCAATGCAGNGCCAGATATAATCTTGATCTC 

CATGTTCGGCCTTTTCAATAATGAGTTGAGTATCAAAAAGGACGAAGCCCACATGACCACCAGTC 

CCACATACAGGTITGCCrGGAAAACCAAATGGATCCAAAGAAAACATTCCCCANGGAAGACAA^ 

AGCACAAGCTCAGGGCTGCATCAAGATCCTCCCAAAAGAGGTAGCTCGGCCCTGCATANAGTGrc 

TGAGGGTGAAGCANGTAAAGATCATTGCGGCCCTGAAACAGTGGAANGATGCTGGGGTTG 

SEQ ID NO- 1719 ACGTGAACCACAAAGTGTATACCAGGCATATGACATGGATCCCCCTGGGGAA 
CCAGGCTGATCTCTTTCCAGAGGGCACTATCCGACCAGTGCATGATGATATCCTCATCGCTCAGCT 
GCGGCCTGNCCAAGAAATTGACCTGCTCATGCACTGTGTCAAGGGCATTGGCAAAGATCATGCCA 
AGTTTTCACCAGTGGCAACAGCCAGTTCAGGCTCCTGCCANACACCACCCTGCTTGAGCCCGTGGA 
AGGGGAGGCAGCTGAGGAGTTGAGCAGGGTGCTTCTCACCTGGTGTTATTGAGGTGCAGGAAGTC 
CAAGGTAAAAAGGTGCCAGAGrrGCCAACCCX;CGGCTGGATACCTTCAGCAGAGAAATCrrTCCGG 
AATGAGAACTAAAGAAGGTTGTGAGGCTTGCCGGNTCGAGATCATTATATCTTCTCT^ 
GGGGTGTGCNCCNNATGTCTGGTAGTGATCCATCAAACNCCT 

SEQ ID NO: 1720 ACGCGGGGGGAGGGGGAAAAACGAAAATAAACGAAGCTTGCAGCACACTCT 
GCGTTCATCACTAGGTCACCTTGCTCTCCGACCTGCTTGCTCATAGCTCTGTGTATCAG 
GGTGCCCATCGTCTGTCCTAGAATCACGTAGACCCTGTCACAAGATTACAATTCCCCTTAACTCCA 
TAGATAACAACTTGAACATTATGAAACGTTTTCCCTTGGAGATATTCTTTC 

GCATAATTACTGACACCAGTTGTTCTGAAGGAGCCCACAGAAGCAAGACTCAATGGGGACTGCAG 
TTrCCATATCCTGAGGATTTCAACCCCCnTACCCTGCCAGTCGATGACCCCAAT^ 
ACCCITCACAATGCCCTTTAAAATCCCACCCGGACTCCTTGGGGAAGATGGATTTGTTGGCT^ 
CGTTCTCTAGCACCCTGCTCTTAACCTTTTCTGTGAAACCTGTTCCTC^ 

SEQ ID NO- 172 1 ACCTTTGTCACAATCCTAACACATTATCGGGAGCAGTGTCTTCCATAATGTAT 
AAAGAACAAGGTAGTTTTTACCTACCACAGTGTCTGTATCGGAGACAGTGATCTCCATATGT^^ 
CTAAGGGTGTAAGTAATTATCGGGAACAGTGTTTCCCATAATTTTCTTCAT^ 
AAGCTTGAAGATCGTTAGTATCTAACATGTATCCCAACTCCTATAATTCCCTATCrm 
TTGCAGAAACATTTTGTGGCATTAAGCATTGGGTGGGTAAATTCAACCACTGTAAAATGA^ 
TACAAAATTTGAAATTTAGCTTGGGTTTTTGTACCm 

GATAGTAGCATACATTTATAATGITGCTATTGCAAGTCATTTACTTATCACATTAm 
CTATAAACTTAGTGCGGCAAGTTTAATCCAGATTGCCTTTGCTTAAAGCAG 
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SEQ ED NO: 1722 ACGCGGGGAGGTTCTGGGAAGATGGCGAAGGTCTCAGAGCTTTACGATGTCA 
CrrGGGAAGAAATGAGAGATAAAATGAGAAAATGGAGAGAAGAAAACTCAAGAAATAGTGAGCA 
AATTGTGGAAGTTGGAGAAGAATTAATTAATGAATATGCTTCTA^GCTGGGAGATGATAmGGA 
TCATATATGAACAGGTGATGArrGCAGCACTAGACTATGGTCGGGATGACTTGGCATTGTTTGTCT 
TCAAGAGCTGAGAANACAGTTCCCTGGCAOTCACAGAGTCANGCNATTAACAGGCATGANATTTG 
AAGCCATGNAAAGATATGATGATGCTATACAGCTATATGATANGATTTTACAAGAAGATCCACTA 
ACACTGGTGAACCCCATCTCrCTAAAAAGCAATAATTACTGGGCGTGGTGCATGTCCTGAATCCCC 

CTNTCA 

SEQ ID NO: 1723 AC'i"rrn"l" l" i'i'rni'il'n 1 TT i T TTTTTTTNGGNTCAAGTTTAATACAAACTAC 
AAAAGATTAATGGGTTGCTCTACTAATACATNATACAAACCAGTAGCCTGCCCACAACGCCAACT 
CAGGCCATTCCTACCAAAGGAANAAAGGCTGGTCTCTCCACCCCCTGTAGGAAAGGCCTGCCITG 

taagacaccacaattcggctgaatctgaagtcttgggtttactaatggaaaaaaaaa atacn 
naggttttgttctcanggctgcccaccgcancctgcactaaaacagcccagcgctcact^ 
ggagaaanattctttgctctitnggacatnaggcttgtggttcactgcaggt™ 
gcacacttcccatgttggnagtgactgaancctgactantctcaaagctatccacagacggcaac 

angnaggcattanagtgattngcg 

seq id no: 1724 acaatacrrggccgaaatctgtcaggtcagcccaactttccttgtcgtgtcaa 
tgctgtgcctcgtcctataccggagaaaaaatggttcatggagcctgcggttattgtttgcctggg 

TGGAAITITACCTTTTGGTrCAATtnTTATTGAAATGTAm 

AGATCTATTATGTCTATGGCTTCATGATGCnGGTGCTGGTTATCCTGTGCATTGNGACTGTCTGTO^ 

GACTATTGTGTGCACATATTTTCTACTAAATGCAGAAGATTACCGGTGGCAATGGACAAGTm 

CTCTGCTGCATCAACTGCAATCTATGTTTACATGTATTCCTTITACTACTAT^^ 

ATGTATGGTTATTCAACATCATTTTACTTTGGATATATGGCGGATTACACACCTTGGGATATGGTG 

GAGCGATTGTTNATGGGACAAGTCCTTTGCCGAAATCTATACNG 

SEQ ID NO: 1 725 ACACATGATGAAATGAAGCAGAAGCTGGGAGTCGGCCTTTCCTCTAGTAACC 
ACCACATGGCTCAGCATCTGTGCCAAACATAGGCGCTCCTAGTCTGGTCAGTGCCAAGAGGCTAC 
CAGAACATGGGGCAGGTGGCTGGTGTTGGTGTCCCAGCCTAAGAGCCACCTGCTGCAGTTACCAT 
GGCATGCTGAGTTGATGCACCAGGTGGCAGCAGCCATCCGTTATTATTTCCAATGGAGACCTAGCC 
CAGGCCAAGGTAAAGTTAGTTAATAGCATTGGGATATAGTCACTGTAATGGTGCTATTAACA AAC 
AGTCAACACCATTGTATTTTTTAACTTCGTGTTCTGTATCTCCTCACC^^ 
GCATCATAATCTTTCTTTATGGTGGGGGCAGACTTTGACTACTCC^^ 
TCCTCCACTGCTAAATTAGAGCAAATCATTGGAATACNGTGTTTTGTCTGAG 

SEQ ID NO: 1726 ACTTTITAAATCATGTTCCCCCTAAACATGGCTGTTAACCCACTGCATGCAGA 
AAOTGGATGTCACTGCCTGACATTCACTTCCAGAGAGGACCTATCCCAAATGTGGAATTGACTGC 
CTATGCCAAGTCCCTGGAAAAGGAGCTTCAGTATTGTGGGGCrCATAAAACATGAATCAAGCAAT 
CCAGCCrCATGGGAAGTCCTGGCACANGTTTITGTAAAGCCCTTGCACAGCTGGAAGAAATC^ 
TCATTATAAGCTATGAGTTGAAATGTTCTGTCAAATGTGTCTCACATCTACACGTGGNTTGG 
TTTTATGGGGCCCTGTCCAGGTAGAAAANAAATGGTATGTAGAGCTTAGATGTCCCTATTGTGACA 
GACCTGGTGTGTNGTATAAT 

SEQ ID NO: 1 727 ACACTTGAAACCAAATrrCTAAAACTTGTTTTTCTTAAA^^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTrCCACACTAGCCAGTCTTCTCACACrrCT 

TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTT^ 

TCTTCAGCAACTTGAGAGCTITCTTCATGTTGTCAAGCACAGAGCTGTATC^ 

CATAGAGACGGTITGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGTCAACAACATA 

ATCCTGGGGACATCTGCCCATCAGGAGAAGGNGTTNNCAGTTGTTCATAACCAGATTGAGGAGGA 

CAACTGCTCTGCCATTTCTGGATTTCTTATTTCACAAACACTTTCm 

TCTCCAANGATGAAAATCATCAAGGGTTGTTGTTGCTTGG 

SEQ E) NO: 1 728 ACCTATirmAATTGAGACAGGCCACTTTATTAAAATAGGTCCA^ 
ACACGATAACATTCATTTrAAATAAAAACTACAAACATGACrrGACTTCTCCA^ 
TCAGGTAGTCCATCCGAGCATCCTTTGTTGACAGTACCATAGAGTTCCATTCAAACTGGGGAAATC 
TGATTACACGATACCCCAGATrTCCAAATGTCGTTmCATAGCAGAATTTTCCm 
ANTATTNNTANAAAGGGCTTTTGAATCCAAAATTCCCAAANAATCOT 
KrGNTTCAACTNNTTCGATATTNNTTCCAAGGNAATTCT^ 
TCCTCCGGAAAAGNTTTTTNTTTTTTCCAl^ANCC^ 
NNGNTTTNCNATTGNTTCTCCTATCCTCGGC 
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SEQ ID NO: 1 729 ACTGGATGGCCCCACAAGATGCTGCC ACTTTAATAAGGCTGCAATACACTGT 
GTATCTTACAGGAGTTTTCTAATACATCCNG 

SEQ ED NO: 1730 ACTGGATGGCCCCACAAGATGCTGCCACTTTAATAAGGCTGCAATACACTGT 
GTATCTTACAGGAGTATTCTTATCCATCCCGTGGAAAAGGTTGCTTAACAACTGC^ 
CGGGCGTTCACCTTCGCNAAArrrGACCAGCTTTTCACATAGGCTTTCAATCAA^ 
TCTGGTTCCAGGATCAAGAGTAGGGGATACCACACTGTTCATCACACTTTCAACATCm 
CTCCTTCAAACACACATAACAGGCTTTAATAATTNGAGCTAAATCAACATGi^ 
TCTTTTCTGAAATCTCAGTTCCGTTAGAmCAAAJWANNACGAACT 
GATGCCANNANGCCGGACCOTATGGTGGNAGCCGGCCCNNGGTAN^fNTCATTGTAGrc^ 
CTTGGCCGGA 

SEQ ID NO: 1 73 1 ACATCATAATCGGCGACACAGGTGTTGGTAAATCATGCTTATTGCTACAGTTT 
ACAGACAAGAGGTTTCAGCCAGTGCATGACCTTACTATTGGTGTAGAGTTCGGTGCTCGAATGATA 
ACTATTGATGGGAAACAGATAAAACTTCAGATATGGGATACGGCAGGGCAAGAATCCTTTCGTTC 
CATCACAAGGTCGCArrACAGAGGTGCAGCAGGAGCTTTACTAGTTTACGATATTACACGGAGAG 
ATACATTCAACCACTTGACAACCTGTTAGAAAATGCCCGCAGCATTCCAATTCCAACATGGTCAT^ 
ATGCITATTGNAATAAAAGTGAmAGAATCTAGAAGAGAAGTAAAAAAAGAAGAAG^^ 
TTTTGCACGAAACATGGCTCITrTCATGGAACCNTTNGTT/^ 
TTTATC>mCAAAAGAATTTTTGAAAAATTCANAAGNGTm 
GCCTANATGTGTTCCA 

SEQ ID NO: 1 732 ACTATAATGCCAACAGGGCATTTCAGAAGATGGACACAAAGAAGAAGGAGG 
AACAGTTGAAGCTTCTTAAGGAGAAATATGGCATCAACACAGAT CCACC AAAATAAATGT^ 
ACATmCATTTGGACTAAATCCCACGAATGACAACTACCACCTTTT^ 
AAATATTGTGATrrCTTATTTGAGGTTCAAAATGACCTGCTTGAAACTTTG 
CATTATGTTAATAAACTTGTAGCTTTTTGTAAAAAAAAAAA^ 
CCTATTNCCAAACAGGCAGAAATCTTTCTATATTAAATTGCACNTACTAAAAy^ 
CTNGAAATTGATATAAGATCATTGCTCATCATTAGGATAAAATACTGAATTTT 
ThrrTCAGAGAAAAATTTCTTTACAANTCTAGTTOT 
TG 

SEQ ID NO: 1733 acatcccaagagatgtagatgaaacaggtattactgtagccagtcttgaaag 
attcagcacatatacttcagataaagatgaaaacaaattaagtgaagcttctggaggtagggct^ 
aaaatggtgaaagaagtgacttggaagaggacaacgagagggagggaacggaaaatggagcca 
ttgatgctgttcctgttgatgaaaatcttttcactggagaggatrrggatga^ 

TAAATACACTTGATTTAGAAGAATGACACCAAACACATCGCTGAAAAAATTAAGTCAGCTCAGC^ 

CGAGTTGAAATTGCTACATrAATTTCTTTCCCCTAGAATCAACAGGAT^ 

CTGGAGGAGTACCTCCTGCAAAAAAGGCATCTTGCCCTCATeTTTOTCTGCm 

GTAAGTCAGAGTAGTTCATGATAATTGAAATTATGGCATGCANAAATGATGATGTT GACTGC CCCC 

AAGAAAAGTGATCTGCOTCATCTTTGGmATTGGGGCTGGCTTANCAAACAC^^ 

NCCTCTTTGCAAGTTTA 

SEQ ID NO: 1734 ACGCGGGGAGTTTTCAACTGACCTCTGGACGCAGAACTTCAGCCATGAAGGT 
AACAGGCATCTTrCTTCTCAGNGCCTTGGCCCTGTTGAGTCTATCTGGTA^ 

CCTGGGAAGAGAGGCCAAATGTTACAATGAACTTAATGGATGCACCAAGATATATGACCCTGTCT 
GTGGGACTGATGGAAATACTTATCCCAATGAATGCGTGTTATGTTTTGANAATCNGAi^ 
ACTTTTTCCTCATTNAAAAArrGGGCCTNGCTGNGAACCAAAGm 
GNNAGGCANAACTGNCTTATTGTGAATAAATAG 

SEQ ID NO: 1735 ACCTCAAATCTGCTCTGGAGTCGATTATGCCACCTGTGTGTCAGGATGCACCT 

gaaagccctcggctcggnccttagaccatcrrcctacattacctggaagggagctgc catct^ 

ctctgcanagggataccttccaatagtaaattatctggttcctcactgaaacaagttat^ 

catatagtcaagagtcagactgacatgataaaatatcatgttcctaatctgttgtctcagataagt 

gaccaagacgggacnttcacatmaagtctacatnctaatcttaaangaataaagcact^ 

gggactaacattctgatangttgccccttaagagtattcanagncatcaaaaggagcccacaot 

cancagtgaangattctacacagggaatctgcagtttgtgcagaatgtmtnnct^ 

tanactgccaaattcaaanttaagctgattgacacagacctncctttnctgaccaatc 

TNNTNANATGCAGGGTTCCCACATATGGAATGCAGACAACGNTTTGT 

SEQ ID NO: 1736 ACGCGGGAACCCAAAACCTATAAGAACTAATAATAATCCACC^CTCm 
GACTCTCTITrTGGACTCAGCCCACCTGCACCCAGGTGAAATAAACAGCCATGTO 
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GCCTGTTTGGTGGTCTCTTCACACGGACGCGCATGAAATTTGGAGACATGACTCGGATCGGGGGA 

CCTCCCnTGGGAGATCAATACCCTGTCCTCTTGTTCTTTGCTCCATGAGAAAG 

CTCAGGTCCTCANAACAACCAGCCTGAGAAACATCTCACCAATTTCAAATCCGGGAAGCAGNC^^ 

TTTTTACTTTATTTTCAACCTKCCTCACTATCCCTC^ 

TCTATCTCTCCCn'CTTTAArrCAATCCrrTCAATTCT^ 

CGTGGCCCAAAACTCTTNCCCGGTCACGGATGGGAAGCACCTTCCTTGGTIOT 

CCTTCTGATATCCa^CGTTNAAGNGTAGACNTGCNGGNACCTGCTTGNCTTT^ 

TTTTTGGGAGGCANACTGCCGGCGGC 

SEQIDN0:1737 ■ ACAGTTGATCCCACTTTGGAATAAATGCCCAGAAGGTAATAAGCATTATCAG 
TGAGTGAGAGCTCTAGGCACAAAATAAGTTCTCATTCAGAAAGTGGACAGAGATATGAAGCAGTG 
AAACATATAGCTTTAAAAACTGGAAATCATTCATGACATTTGTTTTCA^ 
TTTCAAGAACTGTAATTTTCAAAAGTAGAATCAGGCCTGATTAAGTAATAm 
AAAATTACAAAAATAAAAATGAAAACTCTTCTGCCTTGAAAGANATAGAAAACTATAT^^ 
CTGNAATGNCACAGAGATTCATCAGTATTTCATCCTTATGTCGCnTAAAAATGTTAT^^ 
CTTTCCTGTCTGCACTTITAACATAGCAAACCGTGTTAATATGGCATCTCTGTGGTGAAAG 
TTATTTACTGATGATGANAGAACTCAATTACCTCCATGGACTATCTCCCGTGGGAAAA>^ 
CCNGATTTCCAAAACAAAGAACAACAACAAAAACCCCAACTCATGTTAAANTOT 

SEQ ID NO: 1738 ACGCGGGGCTCTCGAGTCACTCCGGCGCAGNGTTGGGACTGTCTGGGTATCG 
GAAAGCAANCCTACGTTGCTCACTATTACGTATAATCCTTTTCTTTTC^ 

CACCATGGAGAGGAGGAGGTGGAGACTTTTGCCTTTCAGGCAGAAATTGCCCAACTCATGTCCCT 
CATCATCNATACCTTCTATTCCAACAAGGAGATTTTCCTTNAGGAGTT^^ 
GCCTTGACAAGATNCGCNTTGACAGCCTGACAACCCTTCGAAAGTTGACAGTGGGOT 
AAANTTACATATCCCCAACCNTTAGGACGNACCTGCCNGGCGGACGTCGAAAGGC 

SEQ ID NO: 1739 ACCTACAGACACTTTTACAGAGTTAATACTAAAATTACAAATTGATGA^^ 
TCGAGGCAAAGCAGGGTAACTGTTCCTTCAGTGTTGCCATCACTGAArrOACClTCACTGNTCC^^ 
CATACTGTTNGGCTGAACACTGACATTCCATAT 

SEQ ID NO: 1740 TCCCCCACCACTGTGATCTATGAGGATAACCAAAGACCCTCTGGGGTCCCTG 
ATCGGTTCTCTGGCTCCATCGACACCTCCTCCAACTCTGCCTCCCTCATCATCTCTGGACTGAAGAC 
TGAGGACGAGGCTGACTACTACTGTCACTCTTATGATTCCACCTATCATGTTCGGGTGTTCGGCGG 
AGGGACCACTCTGACCGTCCTAGGTCAGCCCAAGGCTGCCCCCTCGGTCACTCTGTTCCCACCCT^ 
CTCTGAGGAGCTTCAAAGCCAACANGGCCCACTGGTGTGGCTCATAAGTGACTTTTC 
CGNGACAGNGGCCTGGAAGGCAGATAGNANCCCCGTCAANGCGGGAGTGGAGACCACCACACNC 
TCCAAACNANGCNACAACATGTNCCTCGGCGNGACACGCTAAGGGCG 

SEQ ID NO: 1741 ACTTGTCATCAAAGACCCAGGCAGTOTCTGGAATAGGCirrCCAGCTGCT^ 
TCCTTGGTGTATTCTAACACCTCAGCAACATGACGAAGAATGCTATAAACAGTTTTGGAT^ 
AATrrGTCTTCACATITGATTGCrrTCCTCTGGAGAAACTC^ 

TTCTTTGTCCACCCTAATGACAACCACACACTCATTCCTGCCAATTCGGATGAGm 

ACGGATCGCCrrCTGGGATAATTCACTAAGAAAAATCATGCCTTCAATGTTGTTGTATTC^ 

GCTGACATAAGCCCCATTTCAGCAATGGATCTGCATTCACCATCACTACATCTTCCACCTCAGAA^ 

TTTGTGTGATAAAATCTACACITAACCCGGCATTCTGAGGATGTGTGTGATTCCGCACTG/^ 

GACCGCTTNTCAATACAGAGACCAGNTTGTTCCCCTCACGGCCGCCGTCCTCTAGCGTGCG 

TCCCCGCGTCCTGGCCGNGACCCTANGCGAATCACNCTGGCGNCGTNTAGTGGTCNAGTNGTCCA 

ACTNGGGATATGGATAATTNTCTGGGAATGT 

SEQ ID NO: 1742 ACGCGGGGGCCATATTATCAGCGGTTATTCGGTGAGCGGTGGTGGTTTATTCT 
TCCGTGGAGTrAAGGGCTCCGTGGACATCTCAGGTCTTCAGGGTCTTCCATCTGGAACrATA^ 
GTTCAGAAAACATGTCTCGAAGATATGACTCCAGGACCACTATATTTTCTCCAGAAGGTCGOT 
ACCAAGTTGAATATGCCATGGAAGCTATTGGACATGCAGGCACCTGTTTGGGAATTTT AGC^ 
GATGGTGTTITGCTTGCAGCAGANAAACGCAACATCCACAAGCTTCriTGATC 
GAAAAAATTTATAAACTCAATGAGGACATGGCTTGCAGTGTGCAGCATAAOT 
TCTGACTAATGACTAAGCTCATTGCTCAAAGGTTTTATTACAGTATCAGGACC^^ 
CAGTTGGTACAGCGCTGGTGATATCAACAACTTATANCAATTGGAGGAAACGTCCTTGGGTTTA 
CTGACTCGCCGGACACCTAGGCAATTCACAACTGGCGGCCNTCTATGGACCGAhn^G^^^ 
GNGACATGGCTACNGTTCCTGGGAAATGTTCCTCAATCCACANT 

SEQ ID NO: 1743 ACGTAACTGGAAGCAAGGACGGCTGCATCAAATTATGGGATG GTGTTT CAAA 
TCGATGCATCACAACrmGAGAAAGCACATGACGGTGCTGAAGTTTGTTCTGCCATT^^ 
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AAATTCTAAATACATTCTCTCAAGTGGAAAAGACTCTGTAGCTAAACnTrGGGAAAT^^ 
GACGAACACTGGTCAGATACACGGGCGCXjGGTTTAAGTGGACGCCAGGTGCACCGGACACAGGC 

tgtgtttaaccacaccgaggactatgtgttgctgccgacgagaggacgatcagtctttgctgctgg 
gactcgaggacagccgagcggagaacctgctgcgttggggcacaacaatattgtacctcggccgc 
gaccacgctaaggcg 

seq id no: 1744 actgtatccagcaccacagaaacctcagtgtttttcctctgctggtt tggg gc 
acaaggaagccttagggtatgggnaaaggm'gttattanctagaggttactccatnaggtm 

GNTTG 

SEQ ID NO: 1745 ACTGAACTTTGTGTTGACCATAGCCTTCTTGTCTTTCATCACTTTATCTCC^^^ 
TATGTATCCTTAAAGAATAATAAATGGATrrTAACTGAANAAAAA 

SEQ ID NO: 1 746 ACAAAACTAGGAACACAGAAAAGGACCAGAGAGGATGTTACACTGTAAAGT 
CTTAGGACCTACTCTAAACTTCTGTCCTCATCAAGACCCTCACCTAGAGACCTGAGGGTTC 
TCCACTGGAGAAAGTAAGTAGTCTCCGATCATCACCCn'GCTTCACCGAAGATGGCCTCAGGTC^^ 
TCGCTGTCAAACITAATAGAATTGACCTGGACAGAGATTGAGCCTCCCCGGTAGCTGCCCGCT^ 
TCTTGGTTrCTCATGCCGAAAAGGACTTGCCTTTGGTGAACTrCAAAAC^^ 
CCAGCCTCCGGCTGCACCTCGOTGNCATCAAAGGAGTTGCCGCAACTCGTGAATCC^^ 
TCCTNCTCCTGACCCTTCGGAATGGGGATGATGCCCTTTTTCTCCTTTCTTCT^ 
NGGGCTGAAGCTTATCTTCTTGGCCGANGAGCTTGCTCTGGAGCCTNTTTGrrrc^ 
TNTTTGGAACTACCATGCGCnTITCTTTNTCTCTCTCTATGT^ 

SEQ ID NO: 1747 ACTTTTTCCAGACACirrTTTGAGTGGATGATGm 
TTTACCTTTTTCCTTCCTTATCACTGACACAAAAAGTAGATTAAGAGATG^ 
CCCTTTTACATACTGCTGTCTATGTGGCTGTATCTTGTTTTTCCACTACT^ 
TCATGCAAATGCTGTATTCTTCTTTGGTGGAGATAAAGATTTOTGAGT^ 
CTAAAGTATCTGNATTGCATTAAATATAATATGCACACAGTGCTTTCCGTGGCACTGCATACAATC 
TGAGGCCTCCTCTCTCAGTTTTATATAGATGGCGAGAACCTAAGrrCAGTTGATTTACAAT^^ 
TGACTAAAAAACAAGAAGACAACATTAAACAATATTGTTCAAAAAAAAAAAAAAAAA^ 
CTCGGCCGNACCCGCTAGGCAATCCNCACACTGCGGCCGTCTAGGGTCCGACTCGNCC 

SEQ ID NO: 1 748 acagtgatttggctatagactctcgccccttcagggcagactgtcctcagttc 

ATCCTTATTGAGAGAGAAAAGTTGTGCACCATTTAATACTCCAAGACTATTGACAGTCAC^^ 
GAATCCCTTTGACTGTAACCACGTCTTCACATCCTCTGGTGTGGAGTCGTAAGTGATATTGATi^ 
TGGCACGTTCTGCCGTGGCACATGGAATTTCTTCTGAGCGGCACTCCGACCAATGGTCAGTCTGNG 
GATGAATTCAThrmGCACTTCCTTCATTCTGANAArrCCm^ 

TCTGGCTGCTCGCAGATCTGCCCCACTGCACTGAGCTGCTGTTTTGACGTGTTATAm 
CTTTGACACAGGAACAGGTGCrrGGAGTGGAAGGGGGAAGGGGAACAAGACANGAGCTGTGTTG 
AANGAGGTATTGGACANGGGGATATNAGNTGTCTTGGCCATACTCATCCTTTGTTO 
ANTTAAGGGGAT 

SEQ ID NO: 1749 A Cr il'll'ill"ilin"i lU l" r il"i U Tl l - IT GAAAGCAAATTTCTTTTA^^ 

CANAATTAAACTTCANAGGGACCCAACGTCATACTTCCATTCAGGGACTTGATACAAAAAAm 

GTTTGAACTGCTArrAGCAGGNGGCAGGAGCCACCrrCAAATGAATCTrCAAATTGGAA^ 

GCTTCACCACCTGTNGGGGATAAGTTGCAAATGGAATAATTTAGTATGGrrGTAGCTATT^ 

GACCACCTCNCTGGANACCrrCCCATAACCCCTCTGNTGGNTGCTTTAATCCCATATCCATO 

NTGGTGNNCCCAGNGTANCATATCCTCCATACCCCGAACCTTGGTGTATATCCATAGITGCATAAC 

CTTGATCCAATAGTTCTATATCCTGGTCAGNTTTGATGGGGCCCCACCTTTNCAANACT^ 

ANCCTCTTAGANCCCATTGNTGTGGOCTGTITGTCCTCGNATGGCNaT^ 

ACTTGGGATTCTTTCATNCTTTrr 

SEQ ID NO: 1750 ACGCGGGGGGAATCAAGGATCTAATTCAGCAGCATGTAAAGTTACAAAAAC 
AAATAGAAGACCTACAAGGTCGAACAGCAAATAAGGATCCAATTAAAGCCTTTTATG 
GCCTACACACCTAGGAGTTTATCAGCACCTATArrTACTACTTCACITAACT^ 
AGAGATTGCATAAAAATTNrrrCACGAAATGNCCNACGGGTGGNAATCNAN^^ 
NGTNNNAGCGTTGNACGTTGATAAAT 

SEQ ID NO: 1 75 1 ACAACATGACTTAAAAC lU - lll UU r i 'CTArrAAAACTTAAAGGGGAACAAAAC 
TTGAAAAAGCCCTGTTCTTCAGAAGGTGAGTGGGTTGAGGGAGGCAGTAATATGAAGTGACTGCT 
GTGTATTTTATCTACCAGATTTTTNATATTNGCCACTGNTAAN^^ 
TTANGa^GATANGNGGTTCATNCTATGANGCTTNTTTNAAAT^^ 
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CATTTCGACTTANTCTNAGNGTTTCG 

SEQ ID NO: 1 752 ACAAAGAATCCGTGAAGAGACTATTAAATGTGACATGAATTATGGAGTAGAA 
AAATCTGCTTGATTTATTAATTITATGATATATGTGATCAGTATCTAATTTG^^^ 
TGTTAAAAAATCATTTTTTTTCCTCAGAGTTAAAATTAm 
GATTACTCCAGAGTTTTTTAATITITACACTGGGGCAATAGACT^ 
GTCTAAAGGTGTTTGmrrTATTTCTACCTAATAAGTTTGGGAT^^ 
TGGGANTTATCACCTTTCCCATTCTTTCCCCCAAGCTGGCATTTCAGTAGTTGT^ 
TTTGGTCCTGGGAGGCAGGCCCAGACCCATTTCTTGCTTGCANTCCGGCTGNGTC C^^ 
TACTGTGCATTTCNATTTGGNACTCTGGTAGACAGGTCTCGGATTTCTCTACCTGATT^ 
AAGA 

SEQ ID NO: 1753 ACTACACGCACCTGGGCAACGACTTCCACACGAACAAGCGCGTGTGCGAGGA 
GATCGCCATTATCCCCAGCAAAAAGCTCCGCAACAAGATAGCAGGTTATGTCACGCATCTGATGA 
AGCGAATTNNGAGAGGCCCNGTAAGAGGTATCTTTATCAAGCTGCATGAGGANGATAGAGAATG 
GTGAGACAATTA 

SEQ ID NO: 1 754 acaaagatgactataaacaagatgcagccctcggtttccatgaacagcacac 
tattacagtaaaccaagtttatattccaccatcaagtgtggctctcccatgacttcgctttgt^^ 
gatcattaagaatatcctcaaatccaatagtctcatcattacccctcanaacatccagtgaaagat 
tttgagcttgaaagaaatggaagacgctgaacctgctgcactgccttgaattccatot 
tagcggagcaatagaccctgaatgmrctcagngtggnaaaattnatttaatnto 
aaatititit(>rgannattcaaggggatgactagacaaatgttca^^ 
tncanananatcatgactttc>^aaaggccacttgntgaangtanagnactgnatc^ 
anatcnanttctcrggnttcatm-ccgnnctntgckacagnnct^ 
cccccgtgtcntgccggcggcggtc 

SEQ ID NO: 1 755 ACTTITTTTTTTrTTTn^^ 

CATTAAAAAACAAGGAGATTGGCAAACATATNTTCCAAAGTTGAAGCAGCTCAACNCAGTTCAGT 

TAGGCTAATTTAANAAAAAGCCTTGCATTTTAAANAGCGNGTAGGCATT^^ 

GGAGGTTCGCTATTGTTGCCCAAGCTGGAGTGCAATGGTGTGATTTCGGCTCACTACAAC 

CTNTGGGTTCANGCGATTCTCTGCCTCANCCCCrrGNNTAACTGGGATACAGG^ 

CCCGGGCTTATriTrTATTTTTAGTANANAGGGGGTTCTCCA^^ 

CTACTNANGTTATCNCCCNCTTGGGNTCCGAAAGGNNGGGATTACANGCGTGACCCAGNCCCGGC 
TTnTnTTTTAAAAAGANACAGACAATTTTTC^^ 

CCCTGCCAGGOGGAAGNACANACTTNAGGTTTGGCTGGANCCATCNTTNAAAGGAGTGT^ 
GGGCGCTTTGGATTNGGGA 

SEQ ID NO: 1756 ACAAAATATCCCCACITCCCTTGAGAAAGAGTATATCTAAAATACACTTT 
GAACACAGAATATTAAAACATTATATGCTATAGAAGTGAACACAAATACATTTT CTCC^ 
AATAGTCTAAGGAATATATAAGCATTGATAATATGAAGGAAAATGTTTGAATTTAT^^ 
AAAATTCTCCTGGTTGTAGAGGTGAAAAAAGTAATCAGGTTACCCCAAAAGAAATGTGCTGGCAT 
GNTCTTGATGTCTCCTCTCCAGCTGAGTAAATGGATTACTGCAGTTANACCTGGTGAATCAAGAAG 
GTGGCATTCAGTAGTGAGACTTTTATTATCAAATAGTTCTGACTGAGAAATm 
CTAAGATGCTATTTTGTGAAGCTGTATTTTmCAGGGCTCTCATCi^^ 
AAGAATCAGTTCCITCCTTCCCAACATCTGAAATCCTTTT^ 

NCNAGAGAGGCAACTCATTAhrrCCTGATAANATCCCCTNAGGTGNTNTTNCACATAT^^ 
NTGGGATCCGGNCTGANATTTTTn^TCTCTGTrTAGAAACT 

SEQ ID NO: 1 757 ACGCGGGTTCTAOTCAACAAAGAAAATTTTTGAGTTATAGGAATAAGGACG 
GTAATCTGCATTITGTCTCTTTGTATCTTCAGTAATTTACTTGGTCTCG 
TTTAGGATAAGAATGTGCCTCTCAAGCCTTGACTCCCTGGTATTCrriTr^ 
GIWTTACTTGAGCTTCAGCAACTTAAGAACTTCTGAAGTTOT 

CANGGATCCTTTCTNAAAAAAANAACTGTAAATCTTTCTGGACAGCCNTGACTGTAGCAA 

GATAGCAGAGGNTTGGTGGNTCAGAGTTATACAACTATCCCNGGTGNTGNNTCATTCCAGGGTTA 

CC^^^CTTCTGAGTTTGGTNGTATT^mTGCCCTCNCCCCCNGANTA^^ 

TAAANTGNTGGATAATAOTGNNANCTNCTGNCCCTTTT^^ 

ATCACTGGNACCTTGCGNTACNNCTAGQNTTTmTCKm 

TTNCrATAATGTATATTrrmGTGTAAT 

SEQ ID NO: 1758 ACTTTTTTTTriTrTTT^^ 

TrTa:AGTAAAATATTCACATAATGTCAAAANAATGAAATGATAGCGATATAGCCAACTACC^ 
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ATTAATTCCACATAAATATTTAAAATCTAAAAACCTCANATCAGCANACCGAGTCGAAATGTGA^ 

CTTCAAAGCAAGTATTGCTTTACCCTTGCCTGAATGCAGTCCGTCAT^^ 

TGACCAAATOTTTGCANAGGGrnriTANATATGCTCTOGGGGAGCCGCATCCGCAATCCA^ 

NAANATGTTGTTGAACTGCGTTGCACAAAGTTTCTTGACCTOT 

TTTmGATTCAAAATAACATAGAGCTCCGCGATCAGACriTITCACACCC^ 

CCrrCACAATGACTCCrATCTCTCACTCTGGAAAGCCTGTGATGCCCGAGATNT^^ 

GCCGGAAGTGGCGAACCTGGNGTTCTATNGGACTGGCnTITTCGGGAGATATC 

SEQ ID NO: 1759 ACTAGGCCATGAACTGGGGCACTGGAAGTTGGGACATNCAGNCAAAAATATC 
ATTATTANCCAAATNAATNCTTNCCNGGGTTTTTT^^ 
CTTTTNGCTGCATTNGGTTTmATNATANCCANCCCACTOT 
TNATTTTTTCACCTTACAANGAGGTTCTTTCTT^ 

TOAAGCTGATGCNTTTGCCAAGAAACTNGGGAAGGCTAAAGACTTATATTCTGC^ 
TAACAAAGATAACTTGGGATCCCTGTTCTGCTGGrrGTCTCATGNGCATTATTCTATCCT^ 
GAANACTCAGCCnTGAAACTATGNAGCAANCTGNATNCCAGATCTGGACTGAANAAm 
TTTCTGCCTGNNATGTCCACTTTGATTTTTAACT^^ 

SEQ ID NO: 1760 ACCTGTGTTCATATCCTGAACTGCACCAAGGAATAGCTCCATGCTAAACTGTC 
TTGGCTAAAATACAACCAGTGCCATCATTCCTAATCAGACTAATAAACAGAATGTCTTTT^ 
ATGAGTTAGATGGCTGNGTrrAGGTTTTGTTCAGGATTTACmGGAT^ 
AAGAAAGTTATATNGAGTTTTTATGGATGATTCTATGTGTTTTTA^^ 

AAACATATTAAAAGAGTCTATAGCATCTGTCCAAAAAGATTATATNCTATACAACTATCAANT^ 
ACAAAACCATTTTCATTGCTTCAACTGAGAANATCTATCACACCTTCTCCT 
ACAAGOTGTTTCAGTTTTArrNTCTAAAACTNANATTC^ 
TTTGGTTTNTGAGNCACTCTAAGGAGNTTAAAATCCAACC 

SEQ ID NO: 176 1 ACATAGTGTCGCGANCTCAAATCGGCATTTAGATAGATCCAGTGGTTTAAAC 
GACACGTTTTTGCTTATAAAAAAAGTGCAAAAAAOATGTGGTTTACAAGT^ 
CCITTTTGCTGTAATTGCACCAGTTITAAAGCCTCTGGACAGAGCAGTAT^ 
TTTTCTTAAAAGCITACAGNGTTTGGCTAATTCTCCTCCCC^^ 
TGGCCACTGGTGGCAGGTTAANGGGATCTGCNCTTTAAGAAGCCCNNAAA^^ 
NAGAAATTNGGGGCCGNTTTTTAACCTGGNNGAGANNTAACCANC^^ 
CAACCGGGTTTCNCGTTACCCCClllTrTTTTTTACAANAACCT^ 
TTGGNTTTTTGTTTTAAAGGGCAAAANCTCrrAAm 

SEQ ID NO: 1 762 ACTAAGTAGGTGAGAANCTGAAGTCCTCAAGTGTTCATCTrCCAACTm 
AGTCTGTGGTCTGTCTTTGGATCAGCAATAATTGCCTGAACAGCTACTATGGCTTCGTO 
TCTGTAGCrCTCTGAGCTCCTCTATGTGCAGCAATCGCAGAATTTGAGCAGCTTCATTA^ 
CATCT<XTGNGTCAAACCCAANAATATGTTNGTCTAAANCACAGGAAAGCCCTOT 
TGCCTOANCAACTGNATCCTGTGTCAGGCCCTCNTGAACCAAAATCCNAATNGCm^ 
NANGTAATNANCNTNACCCTGAATNTGAANCTGGTTACCAAANCNTCCNCCAGCCTAA^ 

SEQ ID NO: 1763 ACGCGGGCTAGGTGGCTTTGACCCCCTGGGGGATTAATGGCAGTGTCACAAG 
TCTGGTGGCGGTAGAAGGAGGATGTTATTGGCATCTAGTGAAGAAGAGGCCAGGGGCGCTGCTGA 
ACGTCCTACGATGTGCAGGACATGTCCCCACAGCACAGAACTATCTGGCCCCACGTGTCAATAAG 
GTTCAAAAAGCCTGGGGGATTGCCTTCrGTGCTTCCACGAACACATATCCATGTATTATO^ 
TTGCGGCAATGCCACGAGGTCAArrGAGACTCCCTGACTAGCATACATAATGTTAGGATCTAGGG 
AGrrGCTAATGTCTACCTTGCCTTCCCAGCNTCTATGTGTGGCCTTAGCTTGGCT^ 
CCTNTTTTCCGGGCCTGCACTGGTTTGTGATCCATNTTAOT^ 
AGGCTGGCNTGATGGTTACCCTGAATCCACACTTTNGAGCC 

SEQ ID NO: 1 764 ACTCTTTGGAAGCATTTGGACTTGGCTTGATCGTAGGAGAATCCGGTGTCCAG 
TTCGCTGGGCAGACTTCrcCATGTGrrrCTACATACTGGAACGCCTTCACCAAGC^ 
TCCACGCTTCGGCCCACTGGGAGATTGTTGACGCTCAAATGCITGATGACTCCATO 
ATGAAGAGACCTCTTAGTGCAAGACCAGAACCTTCTAACAGCACACCGTAGTCTCGGGAAATCTG 
CTTAGTTAAGTCTGACAAGAGTGCGATGTTCATGTGGCCCAAACCACCATTCTTTCnTGGTG^ 
ATCCAGGCAAGATGGCTAAAAGTGGGAATCCACTGANCTGCACAACTTACAGTTCACATCGNGAA 
TTCNTAGCTTTNCACTAAACAACAATTTCTGANGACAACAAGGGGAATCAANGATAA^ 
CAANTTTTCCCITAAAGCATCAAGhrrTAGTCTTGACCTTC^ 
AGGCAATCCAC 
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SEQ ID NO: 1 765 acgcgggtgtggagaaaagaaagacctctataaagaaatgaagaaccttgc 

TAATCAGTGGTGAGACTGTGACTAAGACCAmCmAAGAGTTTGGGTTTACAGAT 

AGCATGGTCGATGCGGGACAGAGATAGGACATGGGATTTGAAACTGAAAGACATCATrrTGGCCA 

GGTGGCirACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGTGGGAGTATCGCTAC^ 

AGTTCAAGACCAGCCTGGGCAACATAGTAAGACCCTGTCTCTACAl"lU"l"rrrriTAATTAGCTC 

ATAGTAGCATATGCCTGTAGACCCAGCAACTTAGGAGGCTGAACTGGGAGGATTGCTTGACCTGG 

GAGTTCAAGGAAGCAGTAAGCTGAATCATGCTACTGCCTCCACCTGGCACACAGTGANCTO 

AAAAAAAAAAAAAAAAAAAAAAAGTCCTCGCCGGACCCCTANGGCANTCCAC^ 

CTThWGNTCCACTCGNNCAACTGGCNATATGGCTAGTGTTCCGGGGAATTGTNTCG 

SEQ ID NO: 1766 ACCAAGAACCGCmATCCAGATTAATATAAGTGAAAGCCmAAAT^ 

GACGAAAGGTATCCCTCTAGCCATTCAGTGGCACCAACCGATCCAAAGTCTCCAGCACTCCAACT 
GGCAAAGATAATGCITCTGCTGGGCTGAAACCCATCTTTTAAGACCATA^ 

AAGTTTCAATAGGAGAGCTGTGCCTACACCGGATTTTGCAGCTCCAGGGCCCCATGCATCTCTCTG 

GGCCCCAACTACAACATAGTGATCTGGTTCTACAAAGCCTTTAATAACTCCAAAGATGTTAAGA^ 

rmATCTCTTTCANCACATTGCTCACAGTGAGCrrCACATTCTTGCm 

ATGTAGAGCTGTTTCANCAGAGGANAGCTTCTTCATATTCCCAACAGTTTTO 

TGCNGGACGGTITITAGNATCCTGTGACCAAATGNNGAACTGANTGNATGANGAAGGGANCA^ 

NTNTAGGTNCTGCCCAANACTGCAAAANGAAAGNTGGTTACATGGA 

SEQ ID NO: 1 767 ACGCGGGGCTCTAATCTTCCATTTTCTGTCCCTGAGTGAGTCrCTGGCGTC^ 
AAATTGCCTGTTTTTCTCGCAGGCTCTATTCCGTTCGCTGGTTC^ 

CATGGAGTCCACAGCCACTGCCGCCGTCGCCGCGGAGCTGGTTTCTGCCGACAAAATTGAAGATG 

TTCCTGCTCCTTCTACATCTGCANATAAAGTGGAGAGTCTGGATGTGGATAGTGAAGCTAANAAAC 

TATTGGGTTTAGGACANAAACATCTGGTGATGGGGGATATTCCAGCANTGCAATGCATTNCAGNA 

ACAGNTrATCCTTTTAATTAAAAAGATGGAAGACAGCTAAANGAGNGTGGAGAACCT^ 

NATGGAAANCACTTTGNAGTG 

SEQ ID NO: 1768 ACiUl" in - l ' l UlUUUT rill ' r J-ril- rN GGCCATTTGCTATGTriTAT^ 

GCTTATTAAACATAACATGCAANTAATCAAAGAGAANCNTACNTGACTTAKAGTGAAAAATA>^ 

CTAGAAAAGTTrCACTAGGTAAGTATGCAAATTNTTATTCTAAAAANAC 

GCITCATGTmTTTGCAATATTCTTGGCCTCAATATCTACCCCTA 

TGTATCAAGATAATTTGGGTGCAAGAGAGTAACATCCATATGTTTTAATCCAGCTTNGAGGGACAT 

TAAGATTAAGGATTATAAAACTTGGCTGATTCCTGCAACCAGNAAAAGGTTTGNACATOT 

GTAAAATAAAAACNCTAAriTACAATAANCCTGNNTr>mAGGCANTCGGGATTG^ 

ANAAATAGCCTCCTT(>rTTTTNTTAAAAAANANrrGTTCAAAOT 

AACGTOTAAATrGGTCTTGAACTCCCCT 

SEQ ID NO: 1769 ACTGCAAGACCCATTTTCCCTCCAGTTAATACACrNCCAGGATGGNCNGCAG 
AGGGGGAGACTCTGAGAGAAGCTGGAGGCCCACAAAAGTCCACTGACCCTCTTTCTGTCCCAGAA 
ATGAATAAAGGACCCAGTTGTGCTTTCCTTCCAAAATCCTCAACAAAGTGGTTNGTGCT^ 
AAATGTGGGAATAAAAAAAAATCATGTCCCAAGGCATCTTTGTGTGTGTNC 
CCGAACTCCTAGCGNACACCTCGNGGGANCCGGCCGGAAAACCAACCGAAATGAAGGNGAANAT 
GCTGAGCCGGAATCCGGACAATTATTGNCCGGAAACCAAGTTGGACTACANAGAGTCCAAGAAAC 
TATGATCCNCmACTCNmTGAGGCCACNNAAATANGGAAA^ 

GAGNTTTGCAAAACATNCCTGCTCNNTGATGGNCNGGNAGGAGNAATGGTGGAAANNTCCANNA 
ACNGNTCTGCCTTTGGGCGGGNAGGAAGGAAAATTGGATCACNACGGGATG 

SEQ ID NO: 1 770 ACATAGAACAGCC ACAGCTGATGACAAAAAGCTTCAGAGTTCTCTAAAAAAA 
ACTGGCTGTGAATAATATAGCTGGTATTGAAGAGGTGAACATGATTAAAGATGATGGGACAGTTA 
TTCATTTCAACAATCCCAAAGTCCAAGCTTCCCTTTCTGCTAATACCm 

ANAANCCAANCCANTCACAGAAATGCTTCCTGGAATNTTAAGTCAGCTTGGTGCTGCAGTTAACA 

AGCCTTAGGAAGTTACTGAACAGTCCCCGGCAAGTCTTGGACAGTAAACACCAAACCAGAAGACA 

TTGTTGAGGGAAGATGATGATGTCCAAATCTTGTANAAAATTTTTGNTGAG 

NCTACTAAAAGTTGGTTTTGGANCTGCATGGCTANTTAACAATCANTITGGGTO 

NTGANACrcCCTGTCTATCAGGATNAAAAmriTGGTm 

SEQ ID NO: 1771 ACGCGGGGCCTTTTTCCTCTCTTCAGCGTGGGGCGCCCACAATrTGCGCGCTC 
TCTTTCTGCTGCTCCCCAGCTCTCGGATACAGCCGACACCATGGGTIT^^ 

TGCCGGCCTCCAGGTGCTCAACGATTACCTGGCGGACAAGAGCTACATCGAGGGGTATGTGCCAT 
CACAAGCAGATGTGGCAGTNTTTGAAGCCGTGTCCAGCCCACCGCCTGCCGACTTGTGTCATGCCT 
TACGTTGGTATAATCACOTCAAOTCTTACNAAAAGGAAAAGCCACCTGCCAGGAGTGAATA>^ 
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TTTGGCAAATATGGTCCTGCCNANGTGGAAAACACTACAGGAAGNGAANCTCAGA^^^ 
ATGATACTT 

SEQ ID NO: 1772 GGCACTTACGATGGTCACATTCATATTGGGCATTGGATCTAmAAGCCTAGA 
AGTATAACAGATAATCAGrrCCTTAAGTGTAAATATGTTCTTAAATGTTTGGCCT^^ 
TTCAAATGTATTAGTAATTCTCATATTTTGCAAACAGATCCrmGTGTATAT^^ 
CAAATCTGTTTTTCCTTTTGCTGNGGAAGATGTCTrrATATAA^^ 

GAGACGGGACTCTGGTCAGGTATAGTGTGTAGGCCTAGCCCCATATATTAGGAACTCATAAACTC 

NGAAATCCAAGGATTAAACATTTTATGTCTCCTTCATTCATGCAAGTCATGC^ 

AGGTAAAACTACAAAGTATGGAGGAAAAAAATAGAACTTTCNCATTGGAAA'rrm 

AGGNAAATGTCirrAGCAACTTTGGAATCAGCACCTTTAAls^^ 

AACCAATNAACCNTTTCTTTTTTCOTGCANGGAAAAGAAAA^ 

AAA 

SEQ ID NO: 1 773 ACTACCAGCTTTCACATCAAATTTOGAACGTGGAGGTGTGGAAAAGCTATO 
GATITAAGTTGGACAGAGTCATGTAAACCAACAGCAACTGAACCACTATTTAAGAAAGTC^ 
ATGGGAAACATCTACTTCTAGCTTTTTTCCTATTTTGGCTCCGGCCGTTGG^^ 
ACTGCCCGCGCTCACAGTCCTGCTTCCTTGTCTTTTGCCTCATATCGTCAGGTA^ 
CAGCTGCTCCTCCCAGACAGTTTGATGCATCTCAATTCAGCCAAGGCCCTGTGCCTGGCAC^ 
CTGACTGGATCCCACAGTCGGCGTCTTGTCCCACAGGACCTCCCCAGAACCCACCTTCTGCACCCT 
ATTGTGGCATTGTTTTTTCAGGGAGCTCATTAAGCTCTGCACAGGCTGNTO^ 
GAGGCTTTACTACCAGCCCTT^rIX}GTGGCACCTTTCCCTGAGCTTGG 
TTmTCTNNTTCCTACANACCCCTGTTCCC^ 

SEQ ID NO: 1 774 ACCCCTTTATTCACTGTTGGAACAAACTCATCTCAAAGTAGAAGATGTGAGTG 
CAGTTGAGATTGTTGGAGGCGCTACACGAATTCCAGCTGTGAAGGAAAGAATTGCCAAATTCm 
GGAAAAGATATTAGCACAACACTCAATGCAGATGAAGCAGTAGCCAGAGGATGTGCATTACAGTG 
TGCAATACTTTCCCCGGCATTTAAAGTTAGAGAATTTTCCGTCACAGATGCAGNTCCT^ 
ATCTCTGATCTGGAACCATGATTCAGAAGATACTGAAGGTGTTCATGAAGTCTTTAGTCGAAACC^ 
TGCTGCTCCTTTCTCCAAAGTTCTCACCTTTCTGAGAAGGGGGCCTTT^ 
TCTGATCCCCAAGGAGTTCCATATCCAGAAGCAAAAAATTAGCCGCTTTGTAGTTrc 
TCTGCACAGNAAAGATGGAGAAAAAATTTTANAGTNAAAGTCAAAGTGCGGAAGTCAACNCC^^ 
TNGCArmCACCNATCTCTNCGGCATCTATNGGTGGGANGAAAGTCCCAACTGAAGGANA 

SEQ ID NO: 1775 ACCCATCCAATGAGTCCCCNGAGCCTCCANAAGCTGTTGTCTCCTCTCTGGGG 
ACAGCAGCTCCTGCCmGGAGGCCAAAGCCCCAGATCTCTCCAGCCCCAGAGCTGAAAACACCA 
AGTGCCTATTTGAGGGTGTCTGTCTGGAGACTTAGAGTTTGTCATGTGTGTGTGTGTGTT^ 
TGTGGGTTTATGGGNTTTCTTTCTTTTTTTTCT^^ 

TCCCATGTGCANACAGNGTGTCmATAGATTTTTCTAAGGCTTTCCCCAATG 

CTGATGTTTCTGAAAGTTCCAGGAANTNACACACCCGTTCCCCATTCTNACTTGCCC^^ 

TGACAACCCTCCGGNGTGGATATACCCCCNGGGGGACTCATOGGhmNTTCCCNNACCCCNANTT 

TTNrrATAAANTGNANGGCCTAANAATACCNCTTTTCTGGTTGNAAAAA 

CmCCNCCNTTGA>fTGNTTGAAANNTOT 

SEQ ID NO: 1 776 CANGTTTCCCAGCCCACAGTCATTGCTTCATTCCITGTCTGATCAGATGGTAG 
TTAGAAAAGAAGCTCTCCTACATCCATCTTCTATACCAGGAAAGAGGAAGAGTGGCAAAAGCAGA 
GTCTTGTGCCTTCAAAGATTCATTCTGGTTCTGTTGTGGGCTAAGGCATC 

ATTTTGAGATCAGATAGAGTTGCTCTrCCAGGAGTCAGCACCCACGCTCATGTCCGTCTTTCAGTT 

AAATCTCCTTGGTTTAGTAACTCnGTGGATTTACTGCAGTTAANACAGAAACTTCA 

CTCCCTGACTGCATGGNCACCAGGTGATGTGTCACATCAGGCACTGAAGACAGGGAACGTGGGGG 

ACCCTTTAGCAANCACCTCGTCATCAACTCCCAAGATAGAAATNGGTATNTOGGTAGG 

GGCTNTCCACCAANGGNTNGGCTGGGAATNNATANNACCCANGTTTTTACT 

CNAACCNCCCCTTAANGGGCGAATTTTCAACNACACTGGCGGGCCGTTAOT 

SEQ ID NO: 1777 ACGCXJGGGACGGCAGGCGTCCGCGTCGCTAGCTAGTCGTTCTGAAGCGGCGG 
CCAGAGAAGAGTCAAGGGCACGAGCATCGGGTAGCCATGCCTTTCTTGGACATCCAGAAAAGGTT 
CGGCCTTAACATAGATCGATGGTTGACAATCCAGAGTGGTGAACAGCCCTACAAGATGGCTGGTC 
GATGCCATGCTTTTGAAAAAGAATGGATAGAATGTGCACATGGAATCGGTTATACTCGGGCAGAG 
AAAGAGTGCAAGATAGAATATGATGATTTCGTAGAGTGTTTGCTTCGGCAGAAAACGATGAGACG 
TGCAGGTACCTGCCCGGGCGGCCGCTCGAAAGGGCG 



264 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 1 778 ACCACTGAAACCCTGACCCAGAAAAGTGGCTTGCTTGGACACCCAGCTGCCT 
TTGTTTCTGCATTAAACCAATATrGATCACACATATGACACAGGCTAGTCCTATAAAAGTAA TGAC 
TrCATAGAAATGGCATTATAATTTTTAAOTTGATACTCTACAGGTAGCTATTGAT^^ 
AATAAAACATGCTGCAACCATGGTOTACAACAAAAATACATTTNTTTGGTGA^^ 
CGTATTrACAATGACrrAATATAAGACTGACTTTTATCCTGCTTCATA^ 

CAAGAAAGAATTCAATACTGTGAAATATGCAGCAAGAAGATTGGTCnTTACCTAGGCTGTGm 
TAAGCTCTTGAGTTTTAAACACCAGTNNATTTGTATTAAAAGAAAAAAA/^ 
CTGGCTTTTAATTTTTGCCANCCTAAGGGACATTAAANACNTAAAT^ 
ATAGGCCCCTCTGNCTTTCAAGCAATCATTTNTTGTAAAAAGAAAAA^ 

SEQ ID NO: 1 779 ACTATTTCATGGTCCAAACCTGTTGCCATAGTTGGTAAGGCm 
GTGAAATATTrAGATGAAATTTTCTCTTrrAAAGTTCTT^ 

ATATTAATAAATCTGTAGTGTTTTGTGTTTATATGTTCAGAACCAGAGTAGACTGGATTGA^ 

QGACTGGGTCTAATTTATCATGACTGATAGATCTGGTTAAGTTGTGTAGTAAAG CATTA GG 

CATTCTTGTCACAAAAGTGCCACTAAAACAGCCTCAGGAGAATAAATGACTTGCrm 

CAGGrTTATCTGGGCTCTATCATATAAGACAGGCTTCTGATAGTrrGCAACTGTAAGCAGAA^ 

ACATATAGTTAAAATCCTGGTCTTTCTTGGTAAACAGATTTTAAATGTCT 

CAGGAGAATTCGGGGATTTGGGTTTCNCTGAATAGCATATATATTGATGCATCGGATAGGTCATTA 
TGATTrmACCATTTCGACTTACATAATGAAAACCAANTTCATTm 

SEQ ID NO: 1780 ACGCGGGGGAGGCGGCACTGGTCTCGACGTGGGGCGGCCAGCGATGAAGCC 
GCCCAGTTCAATACAAACAAGTGAGTTTGACTCATCAGATGAAGAGCCTATTGAAGATGAAC^^ 
CTCCAATTCATATATCATGGCTATCTTTGTCACGAGTGAATrGTTCTCAGTTTCT^ 
TCTTCCAGGTTGTAAATTTAAAGATGTTAGAAGAAATGTCCAAAAAGATACAGAAGAACTAA^ 
GCTGTGGCATACAAGACATAmGTmCTGCACCAGAGGGGAACTGTCAAAATATAGAGTCCCA 
AACCTTCTGGATCTCTACCAGCAATGTGGAArrATNACCCATCATCATCCAATCGCAGATGGAGG^ 
ACTCCTGACATAGCCAGCTGCTGTGAAATAATGGAAGAGCTTACAACCTGCCTTAAAA^ 
GAAAAACCTTAATACACTGCTATGGAGGACTTGGGGAGATCTTGTCTTGNAGCTGOT 
CTATACCTGTCTGACACAATATCACCAGANCAAGCCATTANACAGCCTGCG 

SEQ ID NO: 1 78 1 ACTTTTTTTCTTAATTTCACTGACITCAGAGAC^^ 

GTATTGGAATTTCACAAAAGACATAGGACTTAACTGGAAAATGAAAAAAAA AAAG A^ 

AAACTAAACAAAAAATCCCTCTAGGTAGTTTAGGTGAAAAATGTCCCrrrrA TT^ 

GTGArrrCAGAGCATAATGCTATGTTTTTTTGTCTTm 

AGTGCATACAGriTrCTCTAATTrrrAAACCCTTTCCTCCT^ 

ACACrrGAGrrGTGAAGGTTTTGGGCATCCACCCCAGAAAGTGGGAATTTGATT^ 

ACTGGAAGAACATTTTTATGAAGAATTmGTCTANGAGAATATAACAGTGTTACCCA^ 

TCTTTAANGGTGGNTCATTITCTCTGACCTTTTGITACT^ 

GGCCGCTNTAAAANGGGCGAAATTTCCACCACACTGGGCGGGGCCG 

SEQ ID NO: 1782 ACGTCrGCATCGATTATCTTACGTGGGGCAAATGATTTCATGTGTGATGAGAT 
GGAGCGCTCTTTACATGATGCACrmGTGTAGTGAAGAGAGTTTrGGAGTC^^ 
CGGTGGGGGTGCTGTAGAAGCAGCCCTTTCCATATACCrrGAAAACTATGCAACCAGCATGGGGT 
CTCGGGAACAGCTTGCGATTGCAGAGTITGCAAGATCACITCrTGTTA^^ 
TTAATGCTGCCCAGGACTCCACAGATCTGGTTGCAAAATTAAGAGCTTTTCAT^ 
TTAACCCAGAACGTAAAAATCTAAAATGGATTGGTCTTGATTTGAGCAATGGTAAAC^^^ 
AACAAACAAGCAGGGGTGTTTGAACCAACCATAGTTAAAGTTAAGAGTTTGAAAm 
AGCTGCAATCACCATTCTTCGAATTGATGATCTTATTAAATTACATCCNANAAAGTAA^ 
AACATTGGAAAGTTATGAAGAATGCTGrrCACTCTNGAGCCCTTTAATGA 

SEQ ID NO: 1783 ACTCTGGATCCCAAGGTGACTGGTTGTTTAATCGTGTGCATAGAACGAGCCA 
CTCGOTGGTGAAGTCACAACANAGTGCAGGCAAAGAGTATGTGGGGATTGTCCGGCTCCAC^^ 
GCTATTGAAGGGGGGACCCAGCTTTCTAGGGCCCTAGAAACTCTGACAGGTTGCCTTATTCCAGC^ 
ACCCCCACTTATTGCTGCAGTAAAGAGGCAGCTCCGAGTGAGGACCATCTACGAGAGCAAAATGA 
TTGAATACNATCCTGAAAGAANATTANGAATCTTTTGGGTGAGTTGTGAGGCn^ 
GGACATTATGNGTGCACCNTTGGTTITGNTArrGGGAGTNGGNGGTCANATGCATGGANOT 
NTGGrTTTTTCTTNGAANCATTAGTGAAAANGGGCCCACAATGGT 

SEQ ED NO: 1784 ACCTCATAGCCCTGTGTCATTTAGTGTTCAGCACrrTTGGGAACATCAGTO 
TGAACTTTAAATTTrGCTGTCTACTCACTGGGCACGGTGGCTCACACCTGT^^ 
GGAGGCCAAGGCAGGTGGGTCACTTGAGGCCAGGAGTTTGAGACCAGCCrGACCAACATGGCAA 
AACCCCATCTCTACTAAAATACAAAAATTAGCTGGGCATGATGGTGCACTCCTGTAATCCCAG^^ 
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CTTGGTAGGCTGAAGCATQAGAATTGCTTAAACCTGGGAGGCAGAGGTTGCAGTAAGCC^^ 

ATGCCACTGCACTCCAGCCTGGGCAACAGAGTAAGACTCTGTCTTAATAAATAAATAAGAAi^ 

AAACGGAACTGCAGTGCTAACAGTAAmATACATTTTTAAATGTTCTGAGTATGTTI^ 

CTAGTGTAACAATATACTACCCTGAAAGTGCAGTTTTGATTG^^^GGTTGGTGTCm 

AAGTGAACTGTGCCAAGAAGTATTTTTTCAGTGACATGAATGGArrCCTGTT 

SEQ ID NO: 1785 ACl- l - n -i l -i'I rni l n l i 1 1 1 1 1 1 IGAGTCAACAAACTTCTATrnTATTGACA 
GGAGTCCAAAATGAATACAAAAAAAGCrrrCCTTTGCATGTTATACTTAC^ 
TGATAGAGTCAAGTAGACTGAAAACTAAGGATGTAGTTCANCrnTANAGTGA ACCx X 1 1 i l AAAA 
AAGGTTTAATANCATAAGCCACCAGTOTCANAAAACATACAGTCCCACTATCAC^^ 
GAACTTATCAAAANAATTAGATCCTGCATTGATCTGCTAANATAGAGGTrTAAAAATGGTTTv^^ 
CATTAAAAGCCATTTTTATAATCTGTCTGATAACTCTGGGCAATTAANA/^ 
CGTAGCCCTCTATATCATGTGGAATGTGAGAGAATGCCAACTTATGACCAAGATANGTAAAAATA 
AAAGmAAATCTATGAGNTGGAGCTGNTGNATCACTGCNAAATGTCAACCAACAGNGCTTATC^ 
AAGAGATTNTCANCTTGAGCGATGGTTACTTAAACTTGNNGCGATGT 

SEO ID NO: 1786 ACTATGAACACCAGAACAGAAGAGATTTTTTACTATTATGACACA>^ 

GGAAAGAGGGCAAACTAGACATTGTAATGCATAAGATGCAGGAAAAAGTGCAGAGCATTAACTA 
TAACCXrrTTTGACCAGAAACTTTATGTCTATAACGATGGTTACOT 

TTGCAGAAGCCCCAGTAAGCTGTTTAGGAGrTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATA 

GTCTTCTCCACTTACTTAGATATCTGCAGGGGTGTCTAAAAGTGTGTTCATm 

GGTGCATAGTTCTACCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGG^ 

TCATCTGCCTCTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTG^^^ 

GGCACTGTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAG^^ 

AGGAACTTAAAACTCAGTATGGCGTCTAAGGATTCTTTGGACCT 

SEO ID NO- 1787 ACGCGGGATAACCATGCACACTACTATAACCACCCTAACCCTAACTTCCCTA 
ATTCCCCCCATCCTTACCACCCTCGTTAACCCTAACAAAAAAAACTC^^ 
TCCATAGTCGCATCCACCTTTATTATCAGTCTCTTCCCCACAACAATATTCA^^ 
AAGrrATTATCTCGAACTGACACTGAGCCACAACCCAAACAACCCAGCTCTCCCTAAGOT 
TAGACTACTTCTCCATAATATTCATCCCTGTAGCATTGTTCGTTACATGGTCCATCATAGAAT^^ 
ACTGTGATATATAAACTCAGACCCAAACATTAATCAGTTCTTCAAATATCTACTCATOT 
ACCATACTAAT(mTAGTTACCGCTAACAACCTATTCCAACTGTTCANTCGGCTG^^ 
NGGAATrATATCCTTCTTGCTCATCAGTTGATGATACCGCCCNGAGCAGATGCCAANACAAGC^ 
GCCNNTCAAGGCCAATCCTAATACCAACCCGNGATCG 

SEO ID NO: 1788 acaaaagaagcagctcaggaggctgttaaactgtataataatcatgaaattc 

GrrCTGGAAAACATATTGGTGTCTGCATCTCAGTTGCCAACAATAGGCTTTITGTGG^ 
TAAGAGTAAAACCAAGGAACAGATTCTTGAAGAATTTAGCAAAGTAACAGAGGGTCTTAC^^ 

tcattttataccaccaaccggatgacaagaaaaaaaacagaggctitrgcm 

atcacaaaacagctgcccaggcaaggcgtaggttaatgagtggtaaagtcaaggtctgggggaat 

gttggaactgttgaatgggctgatcctatagaagatcctgatcc tgag gttatggcaaaggtaaa 

AGTGCTGTTTGCACGCAACCITGCCAATACTGTAACAGAAGAGATTTTANA A/^^ 

gtttgggaactggaacgagtgaagaaagttnaaagattatgccgtcattcatt^ 
anatggtgctgtcaangctattggaagaaatgaattgggcaaaagactt 

SEQ ID NO- 1789 ACCAAAACAGACATATAGACCAATGGAACAGAACAGAGCCCTCAGAAATAA 
CACCGCACATCTACAACCATCTGGTCTTCGACAAACCTGACAAAAACAAGCAATGGAGA^ 
TCCCrATTTAATAAATGGTGCTGGGAAAACTGGCTAGCCATACGCAGAAAACTGAAACTGGATCC 
TTTCCTTACACCTTAAACAAAATACCATTTGACCTAGCAATCCCATTACTGGGTA^ 

gattataaatcattctactataaagacacacacacgtatgtttattgcggcactgttaac^ 
aaagacitggaaccaacccaaatgcccatcaatgataggctggataaagaaaatgtggcacatat 

ACACCATGGAATACTAAGCAGCCATAAAAAAGGATGAGrrCATGTCATTTGCAGGGACATGGATG 

AAGCTGGAAACCATCATTCTCAGCAAACTAACACAAGAACAGAAAACCAAACACCT^ 

ACTCATAAGTGGGAGTTTAACAATGAGAACACATGGACACAGGGAGGGGAACA 

SEO ID NO: 1 790 acttgcctcatagctggtgaaggattcttctgaacccccaccct^ 

AGGTATCTGTGGTATTGGCAGGATAGGGAATATGCATTACAGAAATGCAGGATTTGACTCTGGGC 

ATGAAAGATGGCAGCAGCXCTAGGGTGACCGTGAACTATAGACCTCGCAGTCTTrrCGOTGAA^ 

AAGAGACAAGTTGACCCTCTGCCCATrrCCTTATGGACCTCACCCATCATGCCAGCAGGGTC^^ 

gaccctggccitgttccaaatcatctgggacatgacccactccccactgtcactgtg™ 
gagaatgtttgtgtggccccaacacccataaggaaaccaggctttaggcccaggggagc^^ 
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aggtaagggctccaccccatcttaagctctctcttccgngck:acaaattcc^^ 

agtaattgttaaaggaatggcaaactgttttgtttttgi^ 

catgtttccttagcaaccctgagatgatttrctttccatm 

seq id no: 1 79 1 actattaagaaaaagaagattgattcttaacctactgaattgtgcagataca 
aaagtgcttagcatgacaaaattaccaaaataaaaacatmagaggttatttggtt ct^ 
rraactattctgtatttctggataatttaacatttgtatttt^ 

AAGTTTAAAAAAGCACACGAAAAATAGTTCAAACTATATATAATCTGTTAIT^ . 

GCTAATCACAAATAACTCAGCANAACAATGCTTGAACATTCAGTTCTACTAAAAAACA 

AGTAGATCCCATCACCTTACCCATTGTTrrGCTATGTTAGACCCTAAAACAGGCTGCCGAC^^ 

GGACAAGTGCAATTCTCTGAGAGCCATCCGGTCAATACAArrGAATGTGAAATTCATGCATGCAA 

GGGTAATTGCCTGAGCTTGTTTCCAAGTTACCATAGTCACTAAATACCAAACACCTA^^^ 

CCTAAGTTCACTTATCAATTACTGGNATGCTCATANTGGCTNGGNGGAAAA 

SEQ ID NO: 1792 ACAAAATTGCCACCAAAAACGATGTTGATGTCCAAATTGACCAGGAGTCCTA 
CCTGCCTGAAGACATAGCTGGTGGAGTTGAGATCTATAATGGAGATCGTAAAATAAAGGTTTCCA 
ACACCCrGGAAAGCCGGCTGGATCTCATAGCCCAGCAGATGATGCCAGAAGTCCGGGGAGCCTTG 
TTTGGTGCAAATGCCAACAGGAAGrrmGGACTAAGCCTTCAGGAGG 
CTCCTGCTGTGATGTGGAAGCTTCTGATATTTGAAGAAACACGAATGTCTCTGTAGCTO 
CTGCCCCAGTATTGCTCTGTATTTATCAGCGATGCCCCTCTGTCACTCATGCCTTGCCT^ 
ACAATGGTGGAAAGCTTCATGTAATATGATCAGGACCCACCTCCAGTTCTTCTGAAAGTGTGACAG 
TGTCCAACCGGTTCTGCAGCACTAGGGGAGGGGGCAGATGGTGGGTTGCATGGGCTTNCTTGGGT 
CTCCACTCTTCGTCTGGCCTAAAAGGTGATGTATTTGGTGTTTGGCCC 

SEQ ID NO: 1793 ACCCACAGAGACTGAGAGTTGGTGCTGGTGGTTGTGGTGGCAGATGATATTA 
CCTGAAGAAGGGACGAATGGGTOCTGGGCAGGACAAAGCATCAGCTGTCCAGTTCAGGCCTCTCC 
TCTTTCCCTGGTGTCTTCATTTTCCTCCGTCTCCCTGCTGTCCCTTACCCT 
CTCCTGGTCTTGGGAGTTGCCTTCTGAGGATACTCCACTGGGGGTACC 

SEQ ID NO: 1794 ACITGAGTCAAAGACGACATTTAGATTCTTCAGCTTTGAAGCAm 
TCArrGTCTAGTCAGGCAGAGGAAAGAAArrCAAAAGCACTTTCr^^ 

CAGTAAGATCCATCTTCTCACGTGAGAAATCATTAATAGGGAGACCTTCAACAAACirCCAGGOT 

TATTCTTGATTACAACAGGGAATGAGTAGAGCAGATCATCAGGAACACCATAGGAGTTGCCATCA 

GAGATAACACCCATGGACACAAACTCTCCCTCTGGGGTTCCAAACCAGATGTCCCTGACGTGGTC 

ACAGATGGCrrrrGCAGCAGACATGGCACTGGATAGTTITCGAGCCTTGATGACAGCAGCGCCAC 

GCTGCTGCACAGTCGTGACAAATTCTCCCTTGAGCCAGCTGTCATCTTTCAGAGCTTCATAAACAC 

CAACTTCCTTTCCTTGCAATTTCACCITGGCATGGTTGACATCT^ 

TCCCCAGATAATGACATTCTTTACATCATTAGCAGTCACACCAAGTT 

SEQ ID NO: 1795 ACATTTCAATAGCACGTTCATCCTCCTCATTAAACTCGTCTTCATGATCCTCCA 
GCrCTTCCAAAGTCATATCTTCATATGTTTTCACCACTGACTGCTGG^ 
TGCCTCCTCTTCCAATTCTTTCAGACTTTCCTTGGGGGGTAAGATACCCT^^ 
TTCCATTCAGTGTCTGCGTTGGGGTCCTGCATCTTGTITCCAGrrCAGrrGCTCAAACC^^ 
CGNCCCCGACCCTCANCTTrCTnTGTCCCCGNGTT 

SEQ ID NO: 1796 AC rr ri' rnT r'i' i 'l H l i 1 1 1 ll 1 1 ITGCTGAAAACTTITrATTGCrrCTTTTGGAT 
ANAAACGGGAATTTATTTGCCAGGAAGGATGATCCCATCATACrrNTGCT^ 
CCTCCTCTTTGCTGATTCTGNGmGGCCCCAATGCAGCCTGTCCTGCGCT^ 
GAAACCTGGCCTACCCAGCACCACATANAAGTCCAGGCCGTANATACCAATGCTTGGGTCATATT 
TGATACCCANATCGATGTGTTCCTGGATCCCAAAACCAAAGTTTCCAGTATCTGAGAAGTO 
TTCTTAACTCATACTCCCGCACCTTTANACCCTTCTCCAANATTTOT 
TGTGCAGNGGACAAGCAATCTTTTCATTTCTCCGGATGCCAAAGGATCTGACAGGTC^ 
TTGGAAAACACAGGGGTCTGCCCTGTGAGCTGCTCCAACACCTTGGCTGCTCXjNGTCAGTCTGT^ 
NCACTCrcCCCAACACAGATNTTGANACANAAGTTTGGCGGA 

SEQ ID NO: 1797 ACGCGGGAGAGCAGGACAGCAAGGACAGCACCTACAGCCTCAGCAGCACCC 
TGACGCTGAGCAAAGCAGACTACGAGAAACACAAACTCTACGCCTGCGAAGTCACCCATCAGGGC 
CTGAGCrCGCCCGTCACAAAGAGCrrCAACAGGGGAGAGTGTTAGA GGGAG AAGTGCCCCCACCT 
GCTCCTCAGTTCCAGCCTGACCCCCTCCCATCCTTTGGCCTCTGACCCTTm 
CCCCTATTGCGGTCCTCCAGCTCATCTTTCACCTCACCCCCCTCCTCCT^^ 
AATGTTGGAGGAGAATGAATAAATAAAGTGAATCTTTGCAAAAAAAAAA 
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SEO ID NO: 1798 acaaaaggaatgtttcctttataaatcacagaagaaaatgacaatatctgtt 

GGATATrrGATATAArrrAATGGTGTTATAAAACCTTTAAGAGGATTC 

AACATCTITATACTTTGAAAAATGTTCCACrrACCOTCAGATATTTGTO 

TTAATACTTTAATTTTGCTCCAACAAGGGCITTATGTrGCTGGTAAG 

ACTATGTATAAAGTGAAAGATAGTITACTTATCTGACTTTGATATTAGATGGCT^ 

CATAATGCAGAGTTTAACCTTGATTCTTCAACAGAGTCCAGATTTAAATGTCTAOT 

GTTAGCTGATATTCTTCCACAATTAATATATTCAATTTCCCATCAGTATATCACT^ 

rrTTTCTAANGAAACTTTCCACAGAATTTTAAACAACTGATGCAT^^ 

AATACTTTGCATTTAAAAACCCCTGTCCACCTGTCACCCAGCACAA 

SEO ID NO- 1799 ACGGATCrrACirCCTGGCTCCCACGCGTCCTGCTGGTGATCCTGCCGGGGGA 
GTGGGGTGTTGCTGCACAGGCmGGAGTCCTGTGGGCGGAATTCCACT 

TGGCTITGGAACCTAGGGAATGATCACCAAGACACACAAAGTAGACCITGGGCTCCCAGAGA^^ 

AAAAGAAGAANAAAGTGGTCAAAGAACCAGAGACTCGATACTCAGTTTTAAACAATGATC^ 

TTTGCTGATGrrmTCCTTTAAGAGCTCATCCCCCrCTAAGAGTGTGGCCCATGGGC^ 

ANATGC(nOTAGTGAANAAAAAGANGAAGAAANAGAAGGGTGTCANCACCCTr^ 

TGTAGANCCTGAGACCACGCTGNCCTGCTATACGGACAGAAAAANTrCACCCAGCCCTm 

ACCACGGTGTITNGGCCNCITGGAGTTTCCTTCANTGGGGGAAAANNAA^^^ 

NTANCCCATTGGCCCATGCCTNTTNGGNGGGAAAAACCTCCCNCAGANCCCm/^^ 

GGAGGA 

SEO ED NO: 1 800 ACTTCACCCAGGCACTCCAGGCCGGGACCGTGTGGGTAAACACCTACAACAT 
CGTCACCTGCCACACGCCATTTGGAGGGTTTAAGGAATCTGGAAACGGGAGGGAGCTGGGTGAGG 
ATGGGCTTAAGGCCTACACAGAGGTAAAGACGGTCACCATCAAGGTTCCTCAGAAGAACTCGT^ 
GAGCAGCTGICAGGGAGGCCCAGTCACAGTCCAGCAATTCCACAACCACCTTGACGAATGCTTGC 
CAAGCTGTTTTAAAGCCAAGAACACCCTTTCTITGTTCCAA^ 

AATAAAGCAATTCAATCAAGGCTGTTCTATTTAAATCAGAGATGGGGACCAGGCTCAGAGTTCT^ 
CCTATCTAACCCCCAACCACAGCCCCCTTGGTGGCCCATGAGrrGCTTCATGAAATOT 
TCTGGAGGACAGATTAAAAACCAGTGATCTGTAATTTGTAGCTCTTCCTGCT^ 
CCCATGGGTGCGCTTGGTGGTTAAGTGGATCGACTCAACTTAAAACACAAG 

SEO ID NO- 1801 ACTTTTATATATACTATCTATGAAGAATTCACTAAAGCATGAATCACCTTAT^ 
ATGAGAAGCTAAAAATGTATCAAAACGAACATAAGTATAGGTAATCCACATCAAACATA^^^ 
CTTCCAAGTCTAGAACATACACTGGTATAAACTGTATTACAACCCAGATTAGTTTGAAATOT 
TCAAAACATTGCTCAGTATTAAGTCTCAGTAGACAAATAATAGGACCACATGAGAAACTGrrCGG 
CAGGTGGCTGAGGAAACCTTAACTTCCAAAGGCTCAAAGTGGTCCTCCAGAGACTGTTACACTCC 
CITAGGTArrrATTTCAGGGAAGGACACTATTAAGGGACACTTTTGAGTATAAAGACAGGTG^ 
CACAAAGTATAGGCAGATACATGCTTGATTTTATCriTCTAATCTACAGAT^^ 
AATGTAATGAATTCTACACCTTTCAAAAGGOAAAAACTGATGAAGTAACAATAAAGGTA 
GATAATGGATCAGATGAAATAATTTAAATGAAGCTTGGCTGTGTCTGAAAKA 

SEO ID NO: 1 802 ACCCATTGGATCTCTAGCATCAAGAACTrGAACTACAACATCTGATGAATCTA 
TCACCTTGTAGAGCTCACCCCATATTCTTTTGGACTGTCCC^^ 

TTTCTCACACCAGTGTCTTCAGTrACCAAATCACGATCCTTGCCCTGGTCATAGCTCTCAGTGGA 

TTTCAGCATTrrCGATAAGAGACTGCATATCACITGCAAATAAGTTTGGTCGm 

AGGGCCAAATGTAGTTTCAAAACTTTCAGTATCAAGAATGTGCACCITCAAGT^^ 

TCGATCATGGAGAGGAGACATTGGTAACTTGCTTTGCTTCATGACAACrrTGTATGG 

AACTGTATCCATTTCCTCTTGAAATITrTGTAATGATGACTGCT^ 

CATTTAATATTTGGCTCTACrmGCCCCTGTGCCAGAAGCCACCGGTTGATTGATATO 
GTTTAATTATTTTACCCACGACTGGTCCTGCCTCCTTTTGCC 

SEO ID NO: 1 803 ACGCGGGATTGCAGCArrATTTCAGTTCAAAATGAACTATATGCCTGGCACC 
GCCAGCCTCATCGAGGACATTGACAAAAAGCACTTGGTTCTGCTTCGAGATGGAAGGACACTTAT 
AGGCrrTTrAAGAAGCATTGATCAATTTGCAAACTTAGTGCTACATCAGACT^ 
TGTGGGCAAAAAATACGGTGATATTCCTCGAGGGATTTTTGTGGTCAGAGGAGAAAATG^ 
TACTAGGAGAAATAGACTTGGAAAAGGAGAGTGACACACCCCTCCAGCAAGTATCCATTGAAGAA 
ATTCTAGAAGAACAAAGGGTGGAACAGCAGACCAAGCTGGAAGCAGAGAAGTTGAAAGTGCAGG 
CCCTGAAGGACCGAGGTCTTTCCATTCCTCGAGCAGATACTCTTGATGAGTACC 

SEO ID NO- 1 804 ACTCAGTCTGAAAAGCTAACAAATACTGCATCTAACCACTCAATGGACCTTA 
CAAAAAGCAAAGACCCACCAGGAGAGAAACCAGCCCAAAATGAAGGTGCACAGAACTCTGCAAC 
GTITAGTGCCAGTAAGCTGTTACAAAATTTAGCACAATGTGGAATGCAGTCATCCATGTCAGTGGA 
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AGAGCAGAGACCCAGCAAACAGCTGTTAACTGGAAACACAGATAAACCGATAGGTATGATTGAT 
AGATTAAATAGCCCTTTGCTCTCAAATAAAACAAATGCAGTTGAAGAAAATAAAGC^ 
TCAACCAACAGGTCCTGAACCAGGGCTTTCTGGTTCTGAAATAGAAAATCTGCTTGA^ 
CCTCGGCCGCGACCACGCTAAGGGCG 

SEQ ID NO: 1 805 ACAGAGTCGCATCCATTCTTTTTGAACAACATGTAGCATGTCTGCAACAGTGG 
ATGATTCACAAACCATATACAGTCCCCATAATCCTGTATCTGTGTAGGAAGTGITGAAAGACTGAA 
AGCTATGGCAAAGATrGCCATGACAAGTGAGCTGGGCCAGCTTGCTAGATAAATTCATTCCTCCCC 
CAAAAGAGCGATCCCAGTTGCCAATCAGCGTGTTTGCAACCATGAGACAGATTGTATCTGGATGT 
GCCCAACCAACAGCTTCAACAGCTATTGCAAGGTGCGCCAAAGGCATCTTGTCATCCCTCACACG 
AATCTCACTTCCTGTGAATTTGCAGGGAGGCAGAGCTGGTATrrCTCCm 
GTCACCGAAATGAAACTTTGCTAAGTCAAGCAATTCATCATGGGAAACACCTCCAGCAGCAGCAA 
GCACTATTCTTGGCCCCTTATAATTGTGGTGGTTATATNATNCACTAAGTCCTTACGAOT 
TTGATATTrrCAGTTGGNNCCCAAAATTGGTCCGTCCAAGTGCAGTATTTTGA 

SEQ ID NO- 1806 acctgttgtcacgcgtcctgtttgccctgagccgcctggctgtagagaagggc 

TACATCCCTGAACCCAGGTGGGACCCGTTCCCGCTACTCACTGCGGTGGTGTGGGGGCTGGTGCTG 

tggctctttgagtatcaccgatccaccctgcagccctcgctgcagtcctccatgacctacctctatg 

AGGACAGCAATGTATGGCACGACATCTCAGACTTCCTCATCTATAACAAGAGCCGTCCCTCCAATT 
AATGCAGCCCTGAGGTGTCTGGCTGTGGCTCAAGATrrGGCCCCATGCAGACCCTCCCAAAGGAT 
ACTGCCTTCTCAAGATCATAGGCCTCANACTCCAACTGGTGTTATCCCAGGGTTCCGTTTGCT^ 
GTAAAAACACTGATTTTAAAATCCCAGTGGGT 

SEQ ID NO: 1 807 acgcggggcttcaagcaacagcgacgcaagatggcagccaccacgggctcg 

GGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACTCGAAGAAGGCCAGAAAGGAGTAG 

gagatggcacagttagctggggtctagaagatgacgaagacatgacgcttacaagatggacagg 

GATGATAATTGGGCCTCCAAGAACAAmATGAAAACCGAATATACAGCCTTAAAATAGAATGTG 
GACCTAAATACCCAGAAGCACCCCCCTTTGTAAGATTTGTAACAAAAArrAATATGAATGGAGm 
AATAGTTCTAATGGAGTGGTGGACCCAAGAGCCATATCAGTGCTAGCAAAATGGCAGAATTCATA 
TAGCATCAAAGTTGTCCTGCAAGAGCTTCGGCGCCTAATGATGTCTAAAGAAAATATGAAACTCC 
CTCAGCCCGCCCGAAGGACAGTGTTACAGCAATTAATCAAAAAGAAAAACCACAGGCCCTTTCCC 
TTCCCCCCAATTCGATTTAATCAGTCTTCATTTTTCCACAGTAGTAAATT^ 

SEQ ID NO: 1808 AC rn ' rrr i'i' rm ' n ' l ' lTl 1 1 1 T r NGGGNGGCCACCACATCTTTATTGCATACr 

caggngaataacttattatacaatgaacactcctccattaggagaccatgcccacttacagaatg 

CAGCCGTAAATGCGGTAAATCTATTTACAGAGGTTGGGGTGCAAGATGANANAAGTATCAGCCCC 

aggaatttgaagtgagaatgatctacaaattctcctgacaaggagcaaccgggcitgtgcta^ 

AGGTCTGAAAGAATTCCTGGCANAGCGTAGGGGGAGATTANATCTCGGAATTGACAGCAAGTTTG 

gggacagtgcaagaagagaggggtgacctgtgaattggtgctggggagctgctgaggcccaatg 
tgaggcaacactagagagatgagtaaatttagggngatctttancctctcctacccaggcaaaaa 
gggttggggagcgggggtgtcaacaagtttggcttccagtgtagattcaaaccantgggctgaga 
naggcmtgcttttttrgctcaaatatgcaggtctgct^ 

SEQ ID NO: 1809 accaagtccaggtataacattcctattggaagccatacttatattttcttgta 

AAGTCCTTTTCAATTAATAAAATATTAGCATAATTGTGTATAGTCAG 

tgttcttatcccatgggaagcagttggttacacgattcttattttataagaaacagct^ 

CTATGGATTAAGTCrTCTGAAGTGAAGGAAATATAGATGTCACCTAAATGATAGTTAA^^ 
•i n ^ rrrr iTrT A GGCATAGAAGCCAGTTCAGGGTCCATAATATTTAGTGACC y^^ 
GCAGCAACCTGGTTCTTAAACACAAAGTAAGTTGCCCATTAACAAATGGCTm 
AAAACTTTCCACAGGTCTAAAAATTGOTCCATTTTATAAm 

GCTGATCCATCATGATGTAAAAGTTCACAATATGGGTCAAATGTAACAGTGCAGAATTGAATATG 
GAGGCATGCATAACCCTTCCTCTTAAAAAATGGCAGGGGNTGNA 

SEQ ED NO: 1 8 10 AAACATGATTTTGCCCTAAAGAACAGCTGAACTGTTGAGAGAAGCAAGGGCT 
TCCTAGCGGCCTTCCAGTGTAGCAGATAATATTACCCTGTGTAACAGAGTATTACAGAGTTTTGCA 
TTTTCCAAGTCTGTAAGTCTAACCATAAACAAAACAAAGACAAACCCAAACATTi^ 
ACAATCTTGTTCCCATATATGATAGAATAGTTCTCATACACTGTTAAGATTGATTTACrilUl 
AGGAGCACTrCrCAGGTTTrTGGCATTTCACATGTCTACACTCAGTGTCCATrGGCTCOT 
GTATCCTTGAAGGTTCGGTAAATTTTGCAGTGAACAAGTAAAAAAGGGCTGGTGTAGGT^^ 

SEQ ID NO: 1811 ACmCCrrCATCGAATGATTATTTGCCTGGAGGAAGAAGTTCTTCCGTO 
TCCATCTGCTTCAGAACATATGCTCAAAGATTGTGAAGCAAAAGATCTCCAGGAGTTCATTCCTCT 
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TATCAACCAGATTACGGCCAAATTCAAGATACAGGTATCCCCGTTTTTACAACAGATGTTCAT^ 

CCTGCTTCATGCAATTTTTGAAGTGCTGCTCCGGCCAGCAGAANAAAATGACC^^ 

AGAGAAGCAGATGTTGCGGAGGAGTTACTTTGCTTTCCTGCAAACAGTCNCAGGCAGTGGGATGA 

GCGAAGTTATANCAAATCAAGGTGCANAGAATGTAGAAAGAGTGTTGGTTACTGTOTCCA^ 

CTWTGAATATTCATATCCAATTTGTCAGAAAACm'GTTTATC^^ 

NTGGGGAGGTAANATGGACCANTGGGATTTGCTGATTTTGTTATAMCACATTGCCCC 

CTACACCrrrAAACAAACCmGCCTGGCNGATCACAAACAGtlTGGC^^ 

SEQ ID NO: 1 8 12 ACACAGTTTCAGAAAAAACNAGGAATGAATACTATTGATGNCTGGTGGCm 
TNGATGATGGAGGN^INGACCTT^m'GATACC^^^ACCTTCTGACGACC/^ 

TGTAAGATCAGAGNATTCATTGGTGGAAAGATAAACTGAATAGACCATGACCGGAGAGCGATGGC 
TNCTTTGCTTANAAGATCCGGATANACTTTTTTGATTCATGGTO 
- CCAAAGAAAGANAATTmTATAGGCTTITGNGNGAAATTATTGAGCCA^ 

ATGATAAAGAGCATOGATATOTGNAGTTTNAAATGAAANGAAGATGAACCA}^ 

AANATAAATGNGCrrTGAACTTTTTTTAAGACCAANGAC^ 

NGTTNATTTNAAGGG 

SEQ E) NO: 1 8 13 ACCAACAGAAAGCAGGCCAGGCTCCCACTCTCATCATCTATGGTGCGTCCAC 
CAGGGCCACTGATGTCCCAGCCAGGrrCACTGCCAGTGGGTCTGCGACAGAGTTCACTCTTATCAT 
CAGCAGCCTGCAGCCTGACGATTCTGCAGTTTATTATTGTCAGCAGTATAAAGACTGGCCTCT^ 
rrTCGGCCCCGGGACCAAGGTGGAAATCCGGCGAACTGTGGCTGCACCATCTGTCTTCATCTO 
GCCATCTGATGAGCAGOTGAAATCTGGACTGCCTCTGTTGTGTGCCrrcCTGAATAACT^ 
GGGAGGCCAAAGTNCCTTGCCGGGCGGNCGNTTNAAOGGTG 

SEQ ID NO: 1814 ACGCGGGGGTTTATCGTGTGAGCACACCATATAnTACAGTAGGAATAGACG 
TAGACACACGAGCATATTTCACCTCCGCTACCATAATCATCGCTATCCCCACCGGCGTCAAAGTAT 
TTAGCTGACTCGCCACACTCCACGGAAGCAATATGAAATGATCTGCTGCAGTGCTCTGAGCCCTAG 
GATTCATCTTTCTTTTCACCGTAGGTGGCCTGACTGGCATTGTATTAGCAAACTCATC^^ 
CGTACCTNGGCCGCGACCACGCTAAGGGCG 

SEQ ID NO: 1 8 1 5 ACAGGCCCTTTGATGGCTTGGGTTACAGACAACCTCATAGCTGGTGCACCAC 
ACACACGAGATAAAACAGGAAGCCTAAAAACCCCAAGCCACACCAAGAAAAATGAGAGAGGGG 
AGGGCGGGGTAACAATGCAGCATCCCGCGGAGGGAACTTAATGCACAAGGAGGGAGAACAGAGG 
GTGGAAGGCAAGCCAGCTTCGTCrrCGCCGCXjCAGCTGCTGTGTGGTGGTCAGGGGACTGAGTTC 
AACAGGTCCTTCAGGAAGCTCTCTGGATCGGTGATTTCTGATAAAAGACCGGCCACATCGAGGAA 
CTCTGAGAAGGTCTTCACCGGCATGAACTCCTCTGGAAGGrmCAGGGCTTTGTCGGCAGC^ 
TAGATGGGCATGTCGTTGTGGCGGATGTACC 

SEQ ID NO: 1816 ACTTCAAGTTAAAGTGAATAACCACTTAAAAAATGTCCATGATGGAATATTC 
CCCTATCTCTAGAATTTTAAGTGCriTGTAATGGGAACTGCCT(^^ 

TGTCAGAAACCAGTTATGTGAATGATCTCTCTGAATCCrAAGGGCTGGTCTCTGCTGAAGGTTG^^ 

AGTGGTCGCTTACTTTGAGTGATCCTCCAACTTCAriTGATGCTAAATAGGAGATACCAGG 

AGACCTTCTCCAAATGAGATCrAAGCCTTTCCATAAGGAATGTAGCTGGTlTCCT^ 

GAAACAGTTAACTTTCAGAAGAGATGGGOTGTTTCTTGCCAATGAGGTCTGAAATGGAGGCOT 

TGCTGGATAAAATGAGGTTCAACTGTTGATTGCAGGAATAAGGCCTTAATATGTTAACCTCAGTO^ 

CATTTATGAAAAGAGGGGACCAGAAGCCAAAGACTTAGTATATTTTCTT^ 

ATAACCTCCATTTAGTCTTTGTTATTTTGTTCTTCCAAGCACAT^ 

SEQ ID NO: 1817 ACTT^^IT l ■ r rrll l " lU" il' l ^lll ^ ' ^T TTCTGGACAGGAAGTANAA^^ 

GTATTAANAGGGGGGCAGCACATTGGAAGCCCTCATGAGTGCAGGGCCCGCCACTTGTCCANAGG 

GCCACGACTGGGGATGTACTCACATTCATITGTCACATATTTCAGGCCCTCATACACCCCT^ 

ATTGNCTAACTCCTATCCCAGTTTNTTTTTATAGTCTAAAAACAAGGAATC^^ 

TCOTCANAGCACTGCTGAAAATGGATCAAACNTGGAGATCCCCCAGATCCCTGT^ 

AAAAAAATTTTATATTAGCACATAGAATACCCTTAGATATATTNTGNTOT 

GTirCCCCCTTTTTGATGATGTCTTCAATTITITCTGAGAC^^ 

GCTTTTAACTTCTTTTGATACTCCAGTGGCAAACCATTTTO 

CTGGGGGGATGGGGGAGCCTTCTTATTGNCATAAATACTTCNACAGTTAT 

SEQ ID NO: 1 8 1 8 ACiTi Ti Tri' l 'I'1 1 i 1 i l li riU"lTlU" iU TCCGGAATnTCrrTAl-ri iri ACAAA 
TTAANACTATGCANATTTCATNTATTTCTGAATCAAAAACACCm 

AATGCAGCCTGANCTGAAAATCAAGAAACTAGAAAAGAAAGNGGTAGANATAACTNT^ 
ATCTGTTAGGTATTTTNTTTAAAAGTAGGGGG'11"111"11'1U1TC1T1U1H"1U1 111 ITTNAANATCTT 
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GGCATTCmGGCACTGmCTAAAGAAAAACTCCKITITCCCAGC^^ 

GTTTGGNTrCAAAANATGTTAACTCAAAATmAGGCCTAGCAGAAAATCACCAA^ 

AGTTAACAGGGGTTAACAGGAAGGAAGTGCCTTTATTAANTTCTCAAGCCAGAGGCTGGAGGC^^ 

CAGCnslATTNAGAGGACAGCATCCTCAGGTGAAAGGNGCCATTCGGGGTGGCATGCACT^ 

ATAAAACACTTNAAAACAAATGATTCGT 

SEO ID NO- 1 8 19 ACGCGGGGACTCTGCGCTTCACCATGGCTrrCATTGCCAAGTCCTTCTATGAC 
CTCANTGCCATCAGCCTGGATGGGGAGAAGGTANATTTCAATACGTTCCGGGGCAGGGC 
GArrGAGAATGTGGCTTCNCTCTGAGGCACAACCACCCGGGACTTCACCCAGCTCAACGAGCTGC 
AATGCCGCTTTCCCANGCGCCTGGCGGNCCNTGGCTTCCCTTGCAACCAAm 
ACTGTCAGAATGAGNGAGATCCTGAACAGCTCAAGCATGTCCTCCTGNGGGTGGATCCANCCCAC 
CTTCACCCTTGTCCAAAAATGTGNGGTGAATGGNAGAACGAGCATCCTGCTTCCCTA^ 
ACAANCTCCCNTACCCTTATGATGACCCATTTTCCCTCATGACCGATCCCAAGCTCATCAm 
CCCTGTGCCCGTCANATOTGGCTGGACTTNAGAANTCTATAGGGCCGAGGGAGAGOT 
ANNCGAChriTCTACATTAATmGCTGGCTAACCCTCTANAGTG 

SEO ID NO- 1 820 ACTTTTTTTTTTTT^^ 

TCCCACCATANAAAGTTAACTrrCTCCCAAGTAGCCAATTTT^^ 

rrTGAAAGATGGNGATTAAAAAAAAAAAATCAAACTGTTAAATAGGAATTTGGANANAC^^ 

TTGTCATTTTCCAATGCACTTGGGAGGCNCAANATTTCNATTGTAAAAGGGAAAACC^^^ 

GCATANTGATAAAGTTNTATTTTCAGAACAAANAGGNGAGTTTATCCTCATGC^ 

CTGAAATAATTTCTATGCCTCAGTGAACATTGAGAGGATATT>rmAG^^^ 

riTGGrrGTGAACACACACTTTTGCTTAATTNGTCTATTTAAAW^ 

CTAAAAAATCCCAATAAGTCTGGAATCAATATTTGAACTCCCCCANATCAAAAGAm 

TGCTCATAGGNGAATCACTACAGCTTTGCTITONCTT^ 

SEO ID NO* 1 82 1 CGATCGAAGGGACTATGTCTTCATTGAArTTTGTGTTGAAGACAGTAAGGAT 
GrrAATGTAAATTTTGAAAAATCCAAACTTACATTCAGTO 
CATITAAATGAAATTGATaTrTTCACTGTATTGATCCAAATG^ 

AGATCAATTTTATGITGTTTACGAAAAGGAGAATCTGGCCAGTCATGGCCAAGGTTAACAAAAGA 

AAGGGCAAAGCTTAATTGGCTTAGTGTCGCTTCAATAATTGGAAAGACTGGGA^^ 

TGAAGACATGTCTAATTrrGATCGTITCrCTGAGATGATGAACAACATGGGTGGTGATGA^ 

AGATTTACCAGAAGTAGATGGAGCAGATGATGATTCACAAGACAGTGATGATGAAAAAATGCCA 

GATCTGGAGTAAGGAATATTGTCATCCCTGGATTTTGAGAAAGAAAATAAOT 

CATAATTGAGAGAATCCTGAGTTGATAGCTCTAANGCAGATTCTGATTTGC 

SEO ID NO: 1 822 ACCTTTGAATCTCTGrTACCTGAGGAAACAGCATTCTCAGCTTOT 
GTCTGGTGGAGAAGGAACTTITGCAGTAGCTCTGAGATCCTGGCTTTCCTO 
AATTCAGTGGGATTAOTATTTTTATCTCTGAATGATTCTGCTGTGAGTCAGG 
GTCTTGCACCTTCTCCrrGTTrrGGTGACATGGCCAAGTGGCTGTCTCTCAGCT^ 
TGATCTGAATCTCAGTCTCATCACAAGAGGATGCAGAAGTTTGACTTTCATCCTGA 
TTrrcrrGCCTCATGTTTAATGTAGCCTOTCAAGGCim 

GAGACCCAGACTCTTGGCTAAGTTCTGCANGTCACTGTACCTGCCGGGCGGNCCGTCGAAAGGGC 
G 

SEO ID NO' 1823 ACAGTTATGCTCAGATGAACACTGGACCCATGTGACAGGGTCAAGCAACTAG 
/^CATGATTCAGAAATCAGTGAAAGATACACTrGGACAGGACCAAGAGGCATTTCACTGCCATC^ 
AACAAGGCAGGAAGGGAITCTAATACACACACCAGGAAGCACTCCTGCCCCTCAGAGGTCAAGG 
AGCTGATCCTATArrGGTATGAGGAATGGCTTATTTrCTGATQACCACATGTGGGACTAr^ 
CCGNCACAAGAAACCCCANAAGGGTTArrGTTn'GCATTATATATCTATACTT^ 
AAATTNACACTTAACGAAATTCAGGATTGATCCCAACCTAGAGCCAGATCCTCTGGGGTCAGGG 
NGAAACAGTTGTCTCATCACCNCGCANGTTNCATTTGThnrr^^ 

ATCCAGAAGGAGT^fNCCCTGTCCTGGTCCNTGCGTGTGAGAGGNGGANTGCAGATCr^T^ 
CCCTNCCGGNAAAACATCACCCTGNAAGTGGANCCCNTTGNACCATCNAAAATG 

SEO ID NO: 1824 ACmATGTTCTTTGCAACTGTTTCCATTATGAGAACGCTGTGCTAm 
GTTACATTTTTCTTGGCCAGGCGAGGTGGTCATGCCTGTAATCCCAN^ 

GGGCGGATCACTTGAGGTAAAGAGTTGAGACCAGCCTGGCTAGCATGGCNGAAACCCAGTCTCTA 
CTAAAAATACAAAAATTAGCCGGGTGAAATTAGCCGGGCGTGGTGGTGTGTGCTTGTAATCCC^^ 
CTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAATCCCGGAGGCAGANGTTGCAGTGAGCCAA 
GATCANGCCAOTGCACTCCAACCTNGGGGTAANAGCGAAACTCTGCTCAAACANAAA 
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SEQ ID NO: 1 825 ACTGTTTACCAAACTGAAGGTGCAGTGCTTGCAAT ATTTT 
GCATCCCGGACATGAAGAACATTCTCCCCTGmCTOTCCAATCTGACGTGTT^ 
CAGGTGTTCCAAAAGCTCCAACTTCTCGAACCCCACAGCTGCTGTTCCCAATCATAC^^ 
GGGCAACCAACroTATAAACTGGTCAAATGGGACGTGTrrAACTGCACGAAAGTTGGGA 
TCAATGCCCTTCTTCCGCATCACTCGAACCATCTCTTTGCTCCCTGCGTCAATA 
CTAGGGTCCGCTTGTTAAATGAGATAAGTGCATCCAATGTTAATTCAAACATm 
AATGTCAGTGGCACAGGGTGCTGTAGTGCAACAATGTAATCTTTAGATTTTACATCATCACCTAGC 
CCATGCGAATGATGCTCATGTAGTCmGTCTTGACTGAGAGAAGTTTGCATANGAAGGCACCT^ 
CAAAGGATGCGATCATGGTCCTCACACATGGATATCAAGTGCTGTCTGA 

SEQ ID NO: 1 826 ACACAAAGGATGTATGAAATGTGGGTTTTGTrGCTGAGGATAACAGGGTATT 
GCAATGCAGTAGTGATCCTACACATCCTTCOTGTCTGATTCACTGAAATCACTCCAGTGGG^ 
GGATGCAGGACATGTGGGAAAACGAGGAGCATGCAGGGCAGGACTGTCAGCAAATCAGATTACT 
AATGTGATTTGACTGCAGGGGTTGGCGTGTTrTAGGAGGGACAGGAGGTTACAAAAGACAGm 
TGACCTGAAGGCTTCAACAATATAATTCTATTCAAGCTrCCAGGACTGACAGAGGAA^ 
TTCAGACAGAGAAAGCTGATNAAAATCrrACCTAGCTATTGTTCCTTCC^ 
ATTAAATTAAAAAAAAGTGGraGCCGTTTCrGACCAGCACTCTCATGATGCCTATTC^^ 
CTGATGACGCCTCTTGAGAGCACATCCCAGNCAGATGTCCCCAGTCThrrTGCGGGGCCTCCTO 
AACGCAGCCAGATNACACTGNATGTTAAACTGTTCTATCTCAAAGCACTATCTCCTC 

SEQ ID NO: 1827 ACCTGTCTTTTCTTTTITCTTTTm 

AGAAAGAATGCAGTATAAATATAGCTTTrCTCTACACGGGAGCAGGGGGAACAGAACCAATCCCC 

AGCTTAGCCACACCCAACATCATGGAAATTACTGTGAACCTGTTGTCTCTTGAGGACAACT 

CAAAACGAAATCCCTAACATTATTAAAATGTTAGGAAACTTTTCAGGTAATTGCT^ 

AAAATACAGAAAGATTACATTCACTCATAATAAAAATCAAGTGTGCCCACGCCATC TGCA AAGGG 

AACITGCACCATCTTGGTTTCACTCGCATTGTTAACAGTGCCCTAAAAGTATACACCT^^ 

TAAAACCATTAGGNAATATCTTGATCATATCCTCTGCATGATGAACTATCACTC 

ATGGGTTACCTTACTAACACTGACAAAGAGATAATAAATGTAATT TTACA GGGATAGGAGA^ 

rrGCAAACCTTTCAGTTTCTTTCATAACTATTGTTTAT^^ 

SEQ ID NO: 1828 ACCCGGCTCTGCATCGCGTCGCCATGATGGGCCATCGTCCAGTGCTCGTGCTC 
AGCCAGAACACAAAGCGTGAATCCGGAAGAAAAGTTCAATCTGGAAACATCAATGCTGCCAAGA 
CTATTGCAGATATCATCCGAACATGTTTGGGACCCAAGTCCATGATGAAGATGCTTTTGGACCCAA 
TGGGAGGCATTGTGATGACCAATGATGGCAATGCCATTCTTCGAGAGATTCAAGTCCAGCATCCA 
GCGGCCAAGTCCATGATCGAAATTAGCCGGACCCAGGATGAAGAGGTTGGAGATGGGACCACATC 
AGTAATTATTCTTGCAGGGGAAATGCTGTCTGTAGCTGAGCACTTCTGGAGCAGCAGATGCACC^ 
ACAGTGGTGATCAGTGCTTACCGCAAGGCATTGGATGATATGATCAGCACCCT AAAG AAAATAAG 
TATCCCAGTCGACATCAGTGACAGTGATATGATGCTGAACATCATCAACAGCTCTTTACTACCAAA 
GCCTCATCGGNGGNCATCrrTGGCTTGCAACATTGCCTGGATGCTGCAAATGG 

SEQ ID NO: 1829 ACACrmGrTACAGTTACATATATGAATAGTTAGCAGAGGAGAAACTCCTCC 
GTTGTTCCTACACCAAGGGAAAAAATGTAAGGACTGTATAGCTCATGAATAATATT AACA TT^ 
TATCACAAAATATTGATAAAATTGGAArrGTATCAATTCATGTTATGGAGCTTAACAT^ 
AAATATTAAGATAATATAAATAAGGCTTTTAGCTTCATATCAAGACCTATTTGAACAOT 
AAGAACATTAAAAGrnriTCCTCATTTATGGTTTGAm 

GNGNCTTAGTATAATGCATAATTATTAGAATATAATTTAAGGCTrrCAGAATTT^ 
TGGCNAAGATATTTNAGTCCAAAACTTCAAATTNCACACCCAAATATGm 
ACTTGGACAAGGCAACATTTTAATrmGCrAAGCTTAACT^ 
TCnrCTCTCTTGChrrGGAAAATNCTGGGTCAGTATGTTACTTGTAAGNC 

SEQ ID NO: 1 830 ACTTTATITrTTTGTTTGTTTT^ 

GATATTTTGCTTTGATGTGTTTCTACATGTAGTTGCACACGGTTCAGTAAAAA 

AGTATGCAAATATTGAAGTATGATGGTTTGACTGTATGGCAGTGTTGTAGCAGCCTCTTGrillllT 

CCCCATTGCCTCTTTrrTTAAAAAACTTATAAAGTCACT^^ 

GAGCAATATTAAGAAGACATTGCTATCTAATTmAATCI^^ 

AGTAGCGTGGTTGATGCTATTGTTTAGCCTTCCCCTCCAAATTGTATACATTGGCT^ 

CAACTTGCGTGCGTGGCAGCGGAAGACGATTCCCATATTCTACATTGCTACTGTTTTGTAT/^^ 

AAATTGGTAAAGATTGCGGCTAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCTCGGCCGC^ 

NCTANGGCG 

SEQ ID NO: 1 83 1 ACGTGGGTGAGGGGATGGAGGAAGGCGAGTTTTCAGAGGCCCGTGAGGACA 
TGGCTGCCCTTGAGAAGGATTATGAGGAGGTTGGAGCAGATAGTGCTGACGGAGAGGATGAGGGT 
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GAAGAGTATTAACCTGTGTGCTGTACATGTGATACATACATTANAAGAAATAAGGAAAAGA^^ 
AAATTTATNTrGTAAATAAATGAAAATAATACnTCTCCCTGATTATAAAGA^ 
GTACTCNTTTGGANNACCTNCTATGANNGAAAAGTCirrAAT^ 
TGNTGCrmTATATTTTTTTATGCATATGAACATm 

TTGTCACTATGTTTTGGGCATATrrATATGGCAGNAAATATATCCTTGGCATNATN™ 

CTCGATGTATCCCGNGTACCTNC>n^GCGGCCGCTNTAAAGNCGNATTCCTNACACTGGC^ 

ACTANTGNATTCGAGCTNGGGCCAACTTNCGAATNATGGNCGANCTGTTCCT 

SEO ID NO: 1 832 ACAAGCTTTTGTCCAAAATGGCACAGCGAGCACAAATGAGTTCCTGTGTGAT 
AAAGACAAAACTTCAACAGTGGCACCCACCATACACACCACTGTGCCATCTCCTACTACA^^ 

tactccaaaggaaaaaccanaagctggaacctattcagttaataatggcaatgataot 

TGGCTACCATGGGGCTGCAGCTGAACATCACTCAGGATAAGGTTGCrrCAGrrATTAACATC^^ 

CCAATACAACTCACTCCACAGGCAGCTGCCGTTCTCACACTGCTCTACITAGACTC^^ 

CCATrAAGTATCTANACTTTGTCTTTGCTGTGAAAAATGAAAACCCGTTTTAT^ 

ACATCAGCATGTATTTTGGrrAATGGCTCCGTTTTCACATTGCNAATAACAATO^ 

GCCCCCTGGGAAGTTCTTATATCTGCAACAAAGAGCAGCTGTTTCAGTGTCTGGAGCATTT^^ 

NATCCTTTGTTCTAANGGCTCACCTTTNATGTGACCAANGAAAGmCTA 

SEO ID NO- 1833 ACTGCCCGACTTCCTCATCTTACTGGGTCCAGCATAAAGCAGATGTCCACTGT 
CTTCCTCACATGCTGTGATCTTGGCTTANAGGTAGGCACAGTGCCGCTCCAGCAGCGA 
CGTTACTTAGGAACAGCAGCTCTTTmCCCTTCTTCTGTAGGAGCTTTCT^ 
GTTrrCCAAATAATACTGAATACAACTTTCAAATTCCm 

AATAAAGGGATCCAGGGCATCAAATCCTTCCmCCCAGCAACTCCTGGGGCAGA^^^ 

GGGCTTAAAGAGAGACCCAGTCTGGCTCAAAGCCGACACAATGGCGCCTCCATGCCAATCATTTT 

TCATCATTTTCCTCAAGrrGTGAACAAGTGCTAATTCCTCGGGGGCAATCGGGCTTTT 

TTTCAGAGTGGTTCTTCCCCAAAGAGCATTGATTCCATCCACGGCCACTAGGAGGTGAAACATACC 

CAAAGAACTTTGCTCITACrCTTTCAGCACAATTCCAACTGATCT 

SEO ID NO: 1 834 actatgctatgttggctaaaactggtgtccatcactacagtggcaataatatt 

GAACTGGGCACAGCATGCGGAAAATACTACAGAGTGTGCACACTGGCT ATCAT TGATCCAGGTGA 

CTCTGACATCATTAGAAGCATGCCANAACAGACTGGTGAAAAGTAAACCITr^ 

TTCACCTGCAAACCrrAAACCTGCAAAATTrrCCTrTAA^ 

AAAC 

SEO ID NO- 1 835 actcaatctgaaagatgtagaagaaggagatgagaaatttgaatgacaccc 

ATCAATCTCTTCACCTCTAAAACACTAAAGTGTTTCCGTTTCCGACGGCACTGm 

TCTGCCAAATACTTGCTrAAACTATTTGACATTTTCTATCm 

CTTTCCTACATAAGTATAATAATGTGGGAATGATTTGGTITrAATTAT;^ 

TAAAGCAAAATTGAAACTCCAAGATGCAAAGTCCAAAGTGGCATTTTGCTACTCTGTCT^ 

TGATAGCTITCCAAAATGAAAGTTACTTGAGGCAGCTCTTGTGGGTGAAAAGTATTrGTACCT^ 

CXjGGCXjGGCGCTCGAAAGGGCG 
SEO ID NO: 1 836 ACTACGACATrrCTGCCAAAAGTAACTACAACTTTGAAAAGCCCTTCCTCT^ 

otgctaggaagctcattggagaccctaacttggaatttgttgccatgcctot 

GAAGrrGTCATGGACCCAGCTITGGCAGCACAGTATGAGCACGACTTAGAGGTTGCTCAGACAAC 

tgctctcccggatgaggatgatgacctgtgagaatgaagctggagcccagcgtcagaagtctagt 

riTATAGGCAGCTGTCCTGTGATGTCAGCGGTGCAGCCGTGTGTGCCACCTCATTATTATCTAGCT 

aagcggaacatgtgcttcatctgtgggatgctgaangagatgagtgggcttcggagtgaatgtgg 

CAGTTTAAAAAATAACTTCATTGTTTGGACCTGCATATTTAGCTGTm 

TGAGTITCATATATAAGACTGCTGCAGTCACATCACAATATTCAGTGGTGAAATCriTGTTGTACT^ 
CATTCCCATTCCTTITCGTTAGAATAAAATAAAGTGNATTTCAAATA 

SEO ID NO- 1 837 ACTGTGTGGCGCCTTATTCTAGGCACTTGTTGGGCAGAATGTCACACCTGCCG 
ATGAAACTCCTGCGTAAGAAGATCGAGAAGCGGAACCTCAAArrGCGGCAGCGGAACCTA^ 

tcagggggcctcaaatctgaccctatcggaaactcaaaatggagatgtatctgaagaaacaatgg 

GAAGTAGAAAGGTTAAAAAATCAAAACAAAAGCCCATGAATGTGGGCTTATCAGAAACTCAAAA 

tggaggcatgtctcaagaagcagtgggaaatataaaagttacaaagtctccccagaaatccactg 

TArmACCAATGGAGAAAGCANCNATGCAGTCTTCCAATTCTAATCAAAAAAGAANANGANGAA 

naagagaaaaxtogtgaatgatgctgagcctgatacgaaaaaagcaaaaactgataac^ 

aaatctgaaaaagaaagtgccgagactncanaaaaaaanaaaaanaataaaaaaagtacgt^ 

gtacaggcataaaattttaagaattcttaatctagggacttgctcctgt^ 
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SEO ID NO* 1838 ACr i -i'i- n - l 'l 11 U 1 1 1 1 n 1 1 1 i l i 1 1 ANAACCTNTGCAATTATTAGTTTATTA 
GTATCATCCAGGACTCANATGTTCAGTATTCCTCCTGAAATTACATAAACAAATGCAi^ 
GAATCCAAGTCTAAATTATATAACAAAACAGCACTCCATCACAAAAGCGTGTAAAATTACAAGAA 
CGCTATTrrAAAATACTGGCACTTTAANAAAACGATAATCTCGAAAACCAC 
GTTCCCTAAACTCTTAAGCAGATAAACATGACTAATGAATGAGTrrGTTITGTA^ 
TCAAATAAATTGAATAATTCATACTGAGATGCAAAGGrrGNCTTCmCTTCCT^ 
TATAAAGGGGCAAACCGTAATATANGAAAATATACCCTATmGAATGTGGCATCTTTGm 
AGCTGGCCArrAATTAAAAATACTTGTNATGGAAAGCTCGTTTCCTGAGCAGCCATAm 
ATGGGAAAAGACATCTATACCAGCATTTTCTAGTCTTCCTGAGACTA 

SEO ID NO- 1839 ACAGAAGTTTTCATCTATGAACATGGCCTCATCATCACCTGCAGCCrrGGCCT 
TOGCCTGTrCTrCAAAAAGCTGCCGCTGCCGCATGGGATCATTCAGCTCAGTA^ 
TCTCTTTCTTCATGACAAACAGCTCAAAGCGCTTAGTCAGACCCTCm 

CCAAAGGGCTCATTATCTGTGGGTGATCACAGATGAATGTAGGATTGATGCAAGTCACTTCCAGG 

AACTCCCCAACAAGCTTGTCAAGGAGCCTGGCTGTGGTCCGAGGTGGAGGGCATTCAACAGCTTT 

TGCCACACAGATATCATCAAGAATmGCGAGTTTCTTCAGTTTCAAAGAGGTTCGm 

CTTCATCCCCAGGGCTITCTCAAGCTCTTCTACCATGTTGATTCCCGGAAGGGTGGGGTG^^ 

ACATCGTAGCTTGGCCCTCTGGGCCATCTGGGTGGAGGTGACCTTGTAACTGCCTGT^ 

CACCATCCCTGAAACCATCTTCTCCGTGAnTCATGAGTCGTGATAGTC 

SEO ID NO- 1840 GCCCTTTACATCGCTGAGAGCCGCCTGTCTTGGTTAGATGGCTCTGGATTAGG 
ATTCTCACTGGAATACCCCACCATTAGTITACATGCATTATCCAGGGACCGAAGTGACTGTCT^ 
AGAGCATTTGTATGTTATGGTGAATGCCAAATTTGAAGAAGAATCAAAAGAACCTGTTG^^ 
AAGAAGAGGAAGACAGTGATGATGATGTTGAACCTATTACTGAAmAGATTTGTGCCTAGTGAT 

aaatcanccgttggaggcaatgttcactgcaatgtgcgaatgccaggccttgcatccagatcctg 

GNGGATGAGGATTCAGATGACTACCATGGAGAAGAATATGATGTGGAAGCACATGAACAAGGAC 

agggggacatccctacattitacacctatgaagaaggattatccctctaacagcagaaggccaag 

CCACACTGGAGAGATTAGAAGGAATGCTTTCTCAGTCrGTGAGCACCAGTATATATGGCTGGGGT 

caggacagaagattcataagagattatgaagatgggatggaggtggatccaccca 

SEO ID NO- 1 841 acgtgcacctcctgtttcaaaaacgcaagcagaaaataggtctggacaaggg 
caagctgcaactcaatttctgctgcaaccctagctcaggtaagttatgt^ 
gatatagtcagtgagtctgttcatgccttcttcctgctgttctagggcatccaggacaatc^^ 

CGGTGTCCTCTTCAAACCTATTCCTTCCATTrrATTCTCCAGTrTGrrC^ 

acgatgcatggataaacatcaacagcaaaggaaaagtaatgccaaacacaaagaccatgactcc 
tccaaacatggagataaggaaatagctcgccaacatgaccaccataacgaacgtngtggggtagc 

GCTTCTTCATCCGGCGAAGGACGTCmATTGTGGGCTGCCACACAAACCCTGTGA^^ 

CACCACGATTarrCCCAGGATCATGTTGAAGGGACTCAGAAACCCCCAATGGAAATCATNATGGC 

AGCCCCACCAGGTAGTTGGTCTGGAATAGACANGTGCTCACTACCGGTG 

SEO ID NO- 1 842 ACGCGGGGCTTmCCTCTCnTCAGCGTGGGGCGCCCACAATTTGCGCGCT^ 
CrrTCTGCTGCTTCCCAGCTCTCGGATACAGCCGACACCATGGGTTTCGGAGACCTGAi^ 

gccggcctccaggtgctcaacgattacctggcggacaagagctacatcgaggggtatgtgccatc 

ACAAGCAGATGTGGCAGTATTTGAAGCCGTGTCCAGCCCACCGCCTGCCGACTTGTGTCATGCCCT 

ACGrrGGTATAATCACATCAAGTCTTACGAAAAGGAAAAGGCCAGCCTGCCAGGAGTGAAGAAA 

GCnTGGGCAAATATGGTCCTGCCGATGTGGAAGACACTACANGGAAGTGGAGCTACAGATAGTA 

AAAGATGATGATTACATTGACCTCmGGATCTGATGATGAGGAGGAAAGTGAANAANCAAAGAG 

CTAAGGGAAGAACGTCTTGCACAATATGAATCAAAGAAAGCCAAAAAACCTGCACTO 

GTCTTCNTCTTACTAGATGTGAACCTTGGATGATGAACAA 

SEO ID NO- 1843 ACTACCAGGTATTGGTTCGTrrACAATTATTGATGGAAATCAGGTCAGCGGA 
GAAGATGCTGGAAACAAmCrTCCTTCAAAGAAGCAGTATCGGCAAGAACCGAGCTGAA^ 
CATGGAATrCTTACAAGAATT.\AATAGCGATGTCTCTGGAAGTTTTGTGGAAGAGAGTC^ 
ACCTTCTAGACAATGATCCCTCATTTTTCTGTAGGTTTACTGTTGTAGTO 
AAGCACrrCACTACGCTTAGCAGATGTCCrCTGGAATTCCCAGATTCCTCTTTTGAT^ 
ATATGGACTAGTTGGTTATATGAGGATCATTTATAAAAAGAACATNCAGTAATAGAATCTCATC^ 
GATAATGCArrAGAGGATCTACGACTAGATAAGCCArrrCCTGAACTGAGAGAACATTTTC^^ 
TATGATTTGGATCATATGGAAAAAAAGGCCACAGTCATACTCCATGGATTGNGATCATAGCTAAA 

TATTTAGCACAGTGGTTAGTGGAACAAATGGACGAAT 

SEO ID NO- 1844 acatcctacccctctcccattcccagagccacctaagagaagtaaaaaacta 

TTGCGATGrrGTCACTGGGAGATTTTTGCTGAATCAAACAACAAAACAG^ 
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GATGTCTTTGGTGCAAAGAAAGAAATCACAGAGGAGGTGAGGCCCATGCTGTTGCCGTTGGCrc^ 
AGGGATCCATGGTCAGCTITGACTCTCACCAAATGCCCAGGTGTGGACCGATGGTTGGGCTGGTO 
TATTGATGCTrTGGTAAAATCCrmCCTCGCTCTGGGCCTTArrCCTCTCT 
GAATTTATGCCCGTTTGCGCCTGCCTTCAAGATTTCTGAAGTTGCTTGC^^ 

AACTGGATTGAGTGrrCGTGGCCCTTCCAGTGGAACCAGTTAACGCCCTGACTGTGGNTATTGTCC 
CCATATCTCCCCATCAAGGTGACACGGTGACAGTTCCTGT 

SEQ ID NO: 1845 ACTTTTmrmrrm^^ 

CCAGGCTGTAGTGCAGGGGCGCGATCTCGGCTCACTGCAACCTCCATCTCATGGTCCAA GTGAT TC 

TCCTGCCTCAACCTCCTGAGTAGCTGGGATCACAAGCACACGCCACCACGCCTGGCTAATTm 

ATTITTAGTTGANACGGGGTTTCACrATGTTTGCCAGGTTGATCTTGAACT^ 

CCACCCACCTTGATNTCCCAAAGTGCTGGTATimAGGCGNGAGTNACTGTGCCCCGGCCAA^ 

AAACrmTTTAATGCTGAAGGCCACACTTmCCCATGC^^^ 

TTCACAACTGATTTGGTAGCCTGACCCTTGGTGTCAAGCTGTTCGNCTTCCACTGC^^ 

GAGAAGTGAAAGCTCATTTGCTTTGTTCTTTTACAAGTAAAGCT 

TGACATTTAATCTGNGCTAAANACACTNTG 

SEQ ID NO: 1 846 GCGTGNmGTTGCCGAGGTACAGGTAGAAGCTTGATTGCTAGGCCCAGGCC 
CACCCAGACCCTCCAATCCTAACAGGTATTTAGGCTTGAGGTTCACTCCCTCCTCAGCTGCACACG 
CAGCCAGGTATAACACTCGCCCTCAGTCACAACGGGGAGGGGGCACCGGTTACATCTACATCACA 
TTATTTATAAAATAAGAATTACATTTCATATAACATGGCCAGAAGGAGCTCTAGTCCCCC^ 
GCTGCCGGGGACAGCATITGAGCCTCTTCTTTGCACAGGCATAACrrA^ 
AGTTAATAGCATTTATACTTAACCACCTCAATGAACCAAGCTTGAAGGAA^ 
GCTTAAATACAAAAATAAATTTITGTTAAAAAACGTTTAAAT^ 

CGCATTCATACTTCTCCCAAAGAGGCTGGGCGTGACAGCAAGGCGCTTGGGCCTGGGGTATGTGG 
TTTCAGAAGGACGGAAGGAAAGGATGGGCTGCAGAGGGCCTGTTTGGGAAAAATAGGA 

SEQ ID NO: 1 847 TTCGCCCTTrmCTTTCGCCCGGG(>GGTACTGNAATT^ 

TGGAGAAGTAATTCAGCTACAGGGTGACCAACACAAGAACATATGCCAGTTCCTCGNAGAGATTG 

GACTGGCTAAGGACGANCAGCTGAAGGTTCATGGGTTITAAGTGCTTGTGGCTCACTGA^ 

ANNGAGGATTTCCmGCAATGAAGTAGAATNACCCTTCNCTCCOT 

NCACAGOTGTATAATGTAACCATNTTGGGGTCCCGCATTNTAACTTGNGACAANTGNAACT^ 
CATGCAATAAACTGAAAAGAAGCCCGTAAGN 

SEQ ID NO: 1 848 GCGNGNTTTGTTTTCGAGGNACCTCGTTTTCAGGTTCATCCATCTCCAGTGGA 
ANGTTTNCAATAAAAGATGAAGAAAATGNGTGTGANCrrrAATAACACATCCCTATAGAAAGTG^ 
ATAAAAGANATACCAAAACTGNAATACAGATATATACAAATATAGGGGCCmm 
GTOTGTCTAGTATGGCCTNGGAAAGAAAACCAAGCAAGCAAGNGTGCTGCXrrANTCT 
TATTTTATTACACATGACNGATATTTTTGTGGNAGGGAAGTGGGATNCTCCTCAG 
GATACNGATNGNATTTATCTCTAAAGAATTAACAACCTTAGAAAATGCCCG>m 
NCTNGAAAGGNCirCGNGGANTTATATAGAGGGGAGCTATAATAAACAKTAACTCT^ 
ATTTAAANTGC(>rANNGNAAGANAAANAATNGNGAGGCTGGANTCACTACCAAGANGAAOT 

C 

SEQ ID NO: 1849 ACCATTITATCACTGTNCAAAGCACTGTCAAATTCCTTTCATCCm 
AAGATCTTTGAATCTTCAGTCTGATTTITAATGTAAGCAAAAACAGAACCATN^^ 
rrGAGAACCTCAGGTGTrCTATAAACAGTCCTTTCCTGNATGTCTTCTATT^ 
TTATTTTGGNGTGGGNGGTATGGNNNATTTTTTGTNTTO 

GCCTNGAGN^r^ATTGCCCCATAACTAAAGGATCAGGATGATGGNAGAACGGAGATCTGGGNNTA 

AANAGCTTTCCCCATTTAANAAAAAATAGATCTTGAGAATCTGATTCNm 

TNCATGNACCTGCCCGGCGGCCCGTCGAAAGGGCG 

SEQ ID NO: 1850 accagcagtgtgtcaggngctgcagagcgttcttggagaaggcccactgagg 
caggttcgtgccctgctgcggccagcctgactagacx:ccaccctgaggtcctgcatttct^ 
gtgtgtaatcacgttccagggcccaaagcccagctctitgttcagttgaot 

AAAAAGTAATTGTAGATGGAAATCAGTTGTGTTTGGCAGGAGAATCAATAAAAATCTTT^^ 
GACAGCrrATGGGGTATTTTAAGCATTCTTAGACTAGTTGAACA^ 

AAATAGTAGAACAAGCAACATAAAACAATGAAGGAAAACCTCACTTGAAGGCCCAGGTCAACAT 
CTAANCCTGTOAGACTTAAANAATCCGAGTCTACCTCrrCANTAGGTTTGTGTGG 
GGGCAAGTGCCCTCTGCTCCCCAGTGCTCCTCTCrcrrCCTANGGCCTTlT^^^ 
CCCTCCGTAGGACTCACAGCCTAATTAGAAGGGNTTAAA 
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SEQ E) NO: 1851 ACCGCGGGGGCGTGGCGGCGGTGGCGGCTGCGGCAACAGCGGGGCCGATGT 
GTAGTTGGTGACTGCCTCTCCANATGCTGAGGNGCCTGTATCATTGGNACAGGCCAGTGCTGAACC 
GTANGTGGAGTANGCTGTGCCTTCTGAAGCAGTATCTATTCACAATGAAGTTGCAGTCTCCCGAAT 
TCCAGTCACirrTCACAGAAGQACTTGAAGAGTCTGACAGANTTATTTGTC 

AATTATGAATAGNAGGGAGGAGCANGANGGATITATTANATGGAGTAAAGCCTCANGATTNTAT^ 
TTCCCCCNTTGGGTGCCTCTT 

SEQ ID NO: 1852 ACCTTTTGGTGCCAGCGTTTCAAGGAGCCCTCACCATGAAACAAGTCAACCC 
CAGCAAGCGTCTAGATCATTTGCAGCGGGCTCGAGAACACTTTATAAACTACTTAACTC^ 
TTGCTATCATGTGGCAGAGTTTGAGCTGCCCAAAACCATGAACAACTCTGCTGAAAATCACACTC 
CAATTCCTCCATGGCTTATCCTAGTCTCGTTGCTATGGCATCTCAAAGACAGGCTAAAATACAGA 
ATCAAGCAGAAGAAAGGAGTTGGGAGCATAGGTTGTCTGCAATGAAATCTGCTGTGGGAAAGTGG 
TCAAGCAGATGATGAGCCGTGTTCGTGAATATTATCTTCTTCACCTTCAGAGGTGGATTGATATCA 
GCTTAAAAGAGATTGAGAGCATTGACCANGAAATAAAGATX:CTGAGGGAAAGAGACTCTTCAAG 
AGAGGCATCAACTTCTAACTCATCTCGCCAGGAGAGGCCmCAGTGAAACCCTTCm 
GAACATGGTCAAGCCAAAGTTITGGAGCTGGTATTCAAGTCTGCCACTATG 

SEQ ID NO: 1853 ACCTGAGGAAGCGGTTTGGAGGCCAGCGGATCCAGGTCTACCTTTCCCTTCTG 
TCCCTGCTGCTCTACATTTTCACCAAGATCTCGGCAGACATCTTCTCGGGGGCCATATTCATCAATC 
TGGCCTTAGGCCTGAATCTGTATTTAGCCATCTITCTCTTATTGGCAATCACT 
TACAGGGGGCCTGGCGGCGGTGATTTACACCGGACACCTTGCAGACGGTGATCATGCTGGTGGGG 
TCTTTAATCCTGACTGGGTTTGCTTTTCACGAAGTGGGAG^ 

SEQ ED NO: 1854 ACTCCAGCAAGAGGAAATATGACACTCCCAAAACGAAGAAGAACTGATTGG 
GGCTTCCACAGCCCTCCTCTCCCAAGAAATCCGGGCTCCTCTCCCAAGAAATCCAGGTGCm 
GACTCCAAAGGGTATCITAAATGCAATCTCTTCTCTCTTAGCCCTTGGCC^ 

GCCCTGCTCTCAGCCATAGTGAAGGACCNCCCTAGGAGTCTGCGAGAGCCTCCTTGGTTCCATCGT 

GAAGCCATAAACAGGAATGCCTTTGGNNATAACCTTONANCCTAGAGGGGCCT 

TGANGTGCTGTGGGTTTATTGCTGGCAACNTGAATTCTCTCAGGGGTCTATGAGGGGCATT^ 

GACTGNCTGACACCATCCCTATCCCTGCTCCCCCTCTCAGAAGAGGGTGGAAGATGAAATGAAAG 

CTATGGGACTCrrGAAGATACCCAGTGTCTATTCTGGGTTAGAGAAGTGCTTACTAAGGGGT^ 

AATAAAACAAATGCCCAAAAAAAAAAAAAAAAANAAAAA 

SEQ ID NO: 1855 ACAAAAATTTGAAAGTGTGACAATGACAATTATGAAATCCTGTGACTGAAAG 
TC(XCTCGAGTGCACTCTGTGGTGCACATGCGCCCGCCCACACAAACTCTGGCATGGAAACATAA 
ACTAATGCAAACCAGTGCTACCCAGAAGCACCAACACGTGTGTTCTCCATTCCACCAATCACAGA 
CCAGTATCTACTCCAAACATCCAGTAACGAAAACTATGGCATCTTCCCAGGAACAGCAAGGCAGG 
CTTCTTACTCACGATGAACCAGCACGAATAAACCCAGCAAAAAGAGAACTGCATACTTAAATTTA 
GGATAGTCATTCATGAGGATCGTCACAATTCCAATATAAGGAACAAATCCCCTGGCTCTCCCCACA 
ACATCTITmCTCTAGCCAATGTTGTCCITGTTTATAGAGGCCTC 
CTCXTTTGGGTCAAAAACTTGATATGCCCATmGCrmCATGAA 
TTAGGAATCTCrCnTCCTTCTATCCmAAACAACAATT^^ 

SEQ ID NO: 1856 ACAGrTGGAGTCTGTGTGriTrCTTGAATGTTTGAGACAGCTTCACCTTG^^ 
TTTGAATTTCTCAGCAGCTGCTAGTTGTGCTTGCTGGGATAAATCTTCGATOT 
ACTATGTAAGTATCTGAAGCAGGGCTCITGTAGACATCTGGTTTTGTGATGACAAAGAGGATATTC 
TTAGATTTCCGGATAGTGACTCTAGTAACTCCTGTAGCCTGCCGAAGACCCAGTTTGGACATAGCC 
TTCCGTGCCTTCrmCACTCCGACTCTGTTTTGCTTTACTGACT^ 

TGCCGCCAGCTGGGCTTGTTGTGTGGrrGCCTGGGTGGAATCCTGTTCTTCAAGCTCTGGT 

SEQ ID NO: 1857 ACAAGTTCGGCTTTGAGCrrCCTCAGGGGCCTCTGGGAACATCCTrCA/^ 
AAAATATGGGTGTGTAGACTACTGGGTGAAGGCTrrrCTTGACCGCCCGAGCCAGCCAACTCA^ 
AGACAAAGAAAAACmGAAGTAGTGGATCTGGTGGATGTCAATACCCCTGATTTAATGGCACCT 
GTGTCTGCTAAAAAAGAAAAGAAAGTTTCCTGCATGTTCATTCCTGATGGGCGGGTGTCT^ 
GCTCGAATTGACAGAAAAGGATTCTGTGAAGGTGATGAGATTTCCATCCATGCTGACTTTGAGAAT 
ACATGTTCCCGAATTGTGGTCCCCAAAGCTGCCATTGTGGCCCGCCACACTTACOTGCCAATGGC 
CAGACCAAGGTGCTGACTCAGAAGTTGTCATCAAGTCAGAGGCAATCATATTATNTCANGGACAT 
GCCCATCATNGCGTGGCAAGAGCCTTCNGGTTCAAAAAGATCAGGCCrmATCCTGGGCTGC^ 
ATNCTTCGAhnTGAATATTCCTTACTGATCCTATGTTAGCGGTCCC 

SEQ ID NO: 1858 ACGCGGGACAOTGGCCAACCATATTTTATTTTITATATTm 
TCCTrGTTCTGAAACCATAAAGTGAGmAACATTTCTGGCTGGGCACAGTGGCTC^^ 
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TTCTAACACTTTAGGAGACCAAGGCAGGAGGATGCCTTGAGCCCAGGAGTTTGAGACCATC 

ACAACATAATGAGAAAAGAAAATACAAAAATTAAGGGGTGTGGTGACACATGCCTGTAGTCTCAA 

CTATTTGGGAGGCTGAGGCAGGAGGATTGOTGAGCCCGAGAGGTTGAGGCTACAGTGAGCCAGG 

ATCGTGCCATTACACTCCAGCCTCGGCGACAAGAGCCAAACTCAAAAAAAAAAAAAACAAAA^ 

AAAAAGTACCTCGGCCGCGACCAAGGGCG 

SEQ ID NO: 1 859 actggatggccccacaagatgctgccactttaataaggctgcaatacactgt 

GTATCTTACAGGAGTATTCTTATCCATCCCGTGGAAAAGGTTGCTTAACAACTGCAGTCT^ 

CGGGCGTTCACCrrCGCGAAATITOACCAGCITrTCACATAGGCTTTQ 

TCTGGTTCCAGGATCAAGAGTAGGGATACCACACTGrrCATCACACTTTCAACATCTT^ 

TCCTTCAGACACACATCACAGGCTTCAATAATTTGAGCTAAATCAACATGAAGTCCACOT^ 

TTCTCnTCTGAAATCTCAGCrCCTTTAGATTTCAGATAAGCACGAAGCTCAG^^ 

CACTGATGTCGATGAAGGCCGGGACGCTCATGGTGCANGCCCGGGCACAGNGGACACTCCACTCG 

CCNCCGCGT 

SEQ ID NO: 1860 ACTGTTTTTCAGTArrTGGGGAGGGTGGTTTGAGCAGCATTrATrGACAAm 
CATTAGTGGGGATGTrrCTATTGAAAACAGGGTTAGGAAGTCATAAAATGTTCCTGCAA 
GTAATAATACCACCAGCGTTTATCTTACTGTTTTCATGTTCTAAGTGCATGCATCTGAGTAAAA 
ATCTGGGCTGCAGTCCAGTCTGAGAGATGCCAGCAAAGGCTTCCTAGGCCANTTCANTCCAGTAA 
ATCCCTCTTCGATCTTCTKTTCCACACAGACAGCAOTGATGAGCATGCCCATG/^ 
ATTTTGGGGAAAATGAAANAGTTGTATTCTTNNTTGAGGTAGNAATTCCACm 
ACATTTrrGATrATTNTTATCACCCTTCANTGANGTTGTNm 
N 

SEQ ID NO: 1861 ACGCGGGGAGGAAAGCCGTGCGTTGCGTTCCAAGGCATCTGTGAGCCCGCGG 
AGTATACACCATGAGCAAAGCTCACCCTCCCGAGTTGAAAAAATTTATGGACAAGAAGTTATCAT 
TGAAATTAAATGGTGGCAGACATGTCCAAGGAATATTGCGGGGATTTGATCCCm 
TGATAGATGAATGTGTGGAGATGGCGACTAGTGGACAACAGAACAATATTGGAATGGTGGTAATA 
CGAGGAAATAGTATCATCATGTTAGAAGCCTTGGAACGAGTATAAATAATGGCTGTTCAGCAGAG 
AAACCCATGTCCTCTCTCCATAGGGCCTGTTTTACTATGATGTAAAAATTAGGTCATO^ 
CCGGCGGGCGNTCGAAAGGGCG 

SEQ ID NO: 1 862 ACATATTrAATTATGTAAAATATTAAAACACCACCATrACAAAACTTCTAAAA 
TATTTTTAAATTCAGTAATAAArrrrrAAAATArrTm 

GTCTTAATAAGAGAGCATTCAATTTGCTGCTGGCATACAAAGGAAGCTACTTTGAAGT^^ 
CTATACACAAGAAATGTTCTTATCTGGTTTCAGGTTTTTTAGCTTTCT^ 

AACAGAAGGGAGAGATGAGCAAAACAGAGGTGTATTACTGAAAATTCATACTAATCTTTAGGC^ 

AGCTCATCAATGCCAGTATCTCATAATAGACAGAGATTTTGCATATAATAAAACCCCACCACCATG 

TAATAATAACCACCTTAATGGCTTTTTCGGTAAAAAAATTTAAG 

CTTTTAAAAAATAATTTAAAAAAGAATTCATGGTGCTTCATTTA^ 

CXjCAGATATGGATTCTCTCACATTAAGGTACCAGCAAATGGNGAGCCTG 

SEQ ID NO: 1 863 ACGCGGGAGAGGTGGTGGGGACCAGGGCTATGGGAGTGGCAGGTATTATGA 
CAGTCGACCTGGAGGGTATGGATATGGATATGGACGTTCCANANACTATAATGGCANAAACCAGG 
GTGGTTATGACCGCTACTCAGGAGGAAATTACAGAGACAATTATGACAACTGAAATGAGACATGC 
ACATAATATAGATACACAAGGAATAATTTCTGATCCAGGATCGTCCTCCA AATGGCrrG TAm 
TAAAGGTTITTGGAGCTGCACTGAANCATCTTATTTTATAGTATATCAACCTT^ 
AGACCTGCCAAGGNAGCTGAANACCTTTTANACAGTrCCCATCTn^^ 
ATTTAAAGACAAAATTTGGGACCGTTTTGGATNAACCTGAKTANTT^ 

gat^^g^wccncttnatgttaotggnacctagcatnanr^gnttcc^ggna^ 
gacttaaaaanaattaatgtngnatncnggggncatntttaaaaaaccc 

SEQ ID NO: 1 864 acaagttcacactcgccccacgggcaagtatatcatagagggaggaagccac 
tgcttcggtcttgattgtggagtgaggttggccccagtgatctagccagccagtatagaattcaga 
attgatcaagggtccntgggctcacacttcctctggcrraggaaagcatctgtc^ 
tgttccaaagtccaccgtggtgtagaggccctgcngggccccacatttcaggaatgttttatgtg 
tccatcagtggtaaacanaaccacatcatcccccanatngtggcgaaagcgcttctgcaggaagc 
gcatgtagtcaaaatcgcaggcaaagtatctgccatattcattttcaacctgcactgtgtot 
ggcccttcattcctgatagnggagagccttatnnttgggcnagaangactnccaa^ 
cacancmgccangtaaactgggtctgagtgaccngg 
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SEQ ID NO: 1 865 ACCCAAATATAAAATAOTGTTTAAATGTAGAGTrrCATATCCTTTAAACCTC 
ACCATAAACATCTCTGTGGAATCmCATAGGGAACAATTATAGTGTTGATAAGTC^ 
ATCAGCACATACCCAGAAAAGTAATTTTGATCAGAGAAGTTGCTCCTACAGAAATAACAATGTGT 
TAATGACAAGAAAAAGAACCCACATCTTTGAGCTGCAGANATGCCAGCATCTAACAACCTGATGC 
CTTATGCTGGGACACATGACITATTTTTGAGTCGArrATTTAT^ 
ATGCCACCATGTCTGGCTAATTTGGTTmATTTTATTrrAm^ 
TGTTGC(XAGGCTGGTGTCAAACTCCTGGCCTCAGGCAATCCTCCTG 
TGGGATTCAAGTGTGAGTCACCACACCTGGCCTTATTTTCGAGTTTTAAAGGCAAT^ 
AGTGAGACTCCATGTTGAAGTCAGTATGGTCTGGGGGGTNGAAAAA 

SEQ ID NO: 1 866 ACACCTTGAAGGCGAGGTTAATTAAATCCTGTTGTGGAGTTTGAGGGCCGGA 
ATTTAATTmGGAGTTTTATTTAATATCGGGAGCAGATTGGGTAATAA^ 
AGACGGCCTTTTGACCTTTTAGGGTCTAGGGCTGTAAAGTGTCTCAGGGTT^ 
ATGAACTGGGCTGGGTTTTTATATTTGATGAAAAAGAGCCTAAACGCnTCTGAm 
AAAAGGAGCATTAACmGACTATGTCmAGCTCCAGCCACCrrm 

ANGTGGGGGAGGGCTAGTCACGGAACGAAACTGTAAGCCGGACCAGGTGTGAGGAGGGGAGGTG 
ATAAAAAGATrACAGGOTGGAGGAAGTGGAGCCTGAGGAAGAArrGGGGACCTAACTTGGCGTG 
GAGAAGGANGGGGAGAGGTCATATGGGTTTGTAGAAAAAGGAAGATTANACACNCTCTGCAACG 
CCCTGGGGrrGGGACTTGANGGGGACAAGGTGGGGAAGGNAAAAGAAANANAA 

SEQ ID NO: 1867 ACAGCCAACGGTTrCCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGrrGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGGGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGGAAAGCGGTCTG 
CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACITGCTGCTGATGAAGATGATGAC 
GATGATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTTTGATGATGAGGAAGCTGA 
AGAAAAAAGCGCCAGTGAAGAAATCTATACGAGATACTCCAGCCAAAAATGCACAAAAAGTCAA 
ATCANAATGGAAAAGACTCANAACCATTATCAACACCAAGATCAAANGGACAAGAATCCTTCAAT 
GAAACAANGAAAAAACTCCTNAAACACCNAAANGANCTATTTCT^^™ 
AANTGNNAGCTANTTTNTAAAAAGCGCATTTGACAGTCCTGGGC 

SEQ ID NO: 1 868 acgcgggtattgaaggtggagtagcaaccgggcattatattatctcttggaa 
aaggacctcagcaatggagaatatccccatcatcacaactgtcatcactctgccgcacgtgattgt 
ggagaatatccctctccatgtgaatgcagatctgccgtcatttgggcgtgtcagagagtcgttacc 

TGTCAAGTATCACCTACAGAATAAGACCGACITAGTTCAAGATGTAGAAATTTCTGTGGAGCCCA 

gtgatgccttcatgttctcaggtctcaaacagattcgattacgtatcctccctggcacggagcagg 

aaatgctatataatttctatcctctgatggctggataccagcagctgccatctctcaacatca^ 

tgcttagatttcctaacitcacaaatcagctgctcaggcgttitataccta 

gccacanggtcggactcatggatgatacctctattgctgctgcatgatgttcaaaaccggcccttg 

gctgttgttacaaaaatgtttgggcanagctatgcanggngtttcant 

SEQ ID NO: 1 869 accgggggaggcaagatggcggcaaccaagaggaaacggcgtggaggcttt 
gcagttcaggcgaagaagccaaaaagaaacgaaatagatgcggagccgccagctaagcggcacg 
ccacagcagaggaggtggaggaggaagagagggaccggatcccaggccccgtttgcaagggaaa 
gtggaaaaataaggaacggattctcatcrmcttccagaggaataaatm 
aatgcaggacntgagaatgttgatgcctcattctaaagcagatactaaaatggatcgtaaggata 
agctatltgtgattaacgaggtttgtgaaatgaagaactgtaataaatgcatctatm 
agaaaaaacaggatctctatatgtggcmcaaattcacctcacggaccatctgcta^ 
ttcaaaatattcataccctcgctgaactgaagatgactggaaactgtttgaaaggttct^ 
ttttgtctttttgacccctgctttttgatgaattaccacatta^^ 

seq id no: 1 870 . acgcggggctaagtgtttccggtggattcccagggactgtcggaggtgtgga 
ctctgcctgcctacctggtctgggaagatattctaccatatctccctagagcacgaaatot^ 
cacccgcactacttcggccccaacttgctcaacacggtgaagcagaagcrcttcacc 
ggggacctgcacagggaagtatggcmgtaattgctgtcaccaccattgacaatattggtgct^ 
tgtgatccanccaggccgaggcirrgtcctttatccagttaagtaccn^ 

GGGCG 

SEQ ID NO: 1871 aacagttggagtctgtgtgttttcttgaatgtttgagacagcttcacot 

CTTTGAATTTCTCAGCAGCTGCTAGTTGTGCTTGCTGGGATAAATCTTCGATCT^ 
AACTATGTAAGTACTGGCTTTrCAAAAAAACAGAACAAAAAAAACCCAAAAAAAG^^ 
ATAAGCCAGGTGGACCTGAAAGAGCCAAGGGGCAGGAACAATGGCTGTTTCAGCAGCTACTCAAT 
AGCGAAGGGTCAGCGTCCCACTCCCATGCCTTGGCACCGCATCAGATGTCCTGGAGCCCTTGATAC 
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CATCCTAGAAGGCTrrGTTTGAGGCTCCTTAATAAAAGAGGGCAAGGCGAAGAGAAAGAAGACCA 
GGACAGCCAGAAGCAAGGACCACTGGACGCCTTGGCCAGGTGCCTGCATCTCCACAAGCAGGACC 
TCATGGTGGTGGGTCGGGTTCTAGANGGAANGNTCTGCATGTCCTCCCGCGTACCTCGGGCGGGA 
CCACGCTNAAGGGCG 

SEQ ID NO: 1 872 ACTriM"l"rri"lUUTl'l"lUll"lTGGTAGTTCTCACAGTTTAATAGAAAGAATGCA 
TTTTTTTGGTATGCAGTTCCTTTTAGAAATAACAAAATAGv^^ 
GAAAACAGGGAAATGTCCTTCAArrGAGCAmCTmCCTCACATTAAT/^ 
TCCTATGTTAACAGGAAATAGGAAmCAGATAACAAATGTGAAAAGTTOAAATAATCCTGACA^ 
ATTTTTCAAATCAATTGTTGAAAATATTTATTTAGAAAAAAAT^ 
CCNNAGAAAAACACNCITCNTGTNAACANCNGAACACTATGCTAAGm 

GCCACACAAATCTGTGGGNGGCTAAGGATGANGATTCCNAGNAAAANGANAAAAGAANATTTGA 

NAGAGAGGNNGAATOTAACCCTTTGCTTACTGCAGGNCTTGN^^ 

AAATNATTAAGTTTA 

SEQ ID NO: 1 873 ACGCGGGATCCAACCCTGAAGATATTTAGAGATGGTGAAGAAGCAGGTGCTT 
ATGATGGACCTAGGACTGCTGATGGAATTGTCGGCCACTTGAAGAAGCAGGCAGGACCAGCTTCA 
GTGCCTCTCAGGACTGAGGAAGAATTTAAGAAATTCATTAGTGATAAAGATGCCTCTATAGTAGG 
rrTTTTCGATGATTCATTCAGTGAGGCTCACTCCGAGTTCCTAAAAGCAGC^^ 
TAACTACCGATTTGCACATACNGAATGTTGAGTCTCTGGTGAACGAGTATGATGATAATGGAGAG 
GGTATCATCnTATTTCGTCCITCACATCTCACTAACAAGTTTGAGGACAAGA 
GAGCANAAAATGACCAGTGGCAAAATTAAAAAGTTTATNCAGGAAAACATTTTTGGT^ 
CCATTGACAGAAGACAATAAAGATTTGATACAGGGCAAGGACCTACTTTTGCTTACTATG^ 
GGGCITATGAAAAGAACGCTTNAANGGTTCCAACCTACTGGANAAACAGGGGT 

SEQ ID NO: 1 874 ACGCGGGGACCGCGGGGCGGACGGGAGCGAGTATGTCCGCTCTGACTCX5GCT 
GGCGTCTITCGCTCGCGTTGGAGGCCGCCTTTTCAGAAGCGGCTGCGCACGGACTGCTGGAGATGG 
TGGAGTCCGTCATGCCGGTGGTGGTGTACTTn^lU"l"n"lTlUl"rrrrril'rri"rTlT 

SEQ ID NO: 1 875 ACGCGGGAGGAACTGCTCAGTTAGGACCCAGACGGAACCATGGAAGCCCCA 
GCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGGTTCCACTGGAGAAGTAGTGATGACGCAG 
TCGCCAGCCACCCTGTCTGTGTCTCCAGGGGAAAGGGCCACCCTCTCCTGCAGGGCCAGTCAGACT 
ATTAGCACCAACTTAGCCTGGTATCAGCACAAACCTGGCCAGGCTCCCANGCTCCTCTTGTTr^ 
ATAGACACTAGGGCCACCGGCATCCCAGCCAGGTTCAGTGGCAGTGGGTCTGGGACAGAGTTCAC 
TCTCACCATCAGCAGTGTNCANTCTGAAGACTTTGCAGTTTATTATTGTCAACAAGTA^ 
GGCCTNGGGTCACTTTCAGCCCTGGGACCAAAGTGGAGATCAGACGAACTGTGGCTGCACCATCT 
GXmTATCITCCGCCATCTGATGAGCCAATTGAAAATOTGGAA 
TGAANAACTTCTATCCCANAGAAGGGC 

SEQ ID NO: 1 876 ACGGATGTGGCAGCGAGAGGACTAGACATTCCTGAAGTCGACTGGATTGTTC 
AGTATGACCCTCCGGATGACCCTAAGGAATATArrCATCGTGTGGGTAGAACAGCCAGAGGCCTA 
AATGGGAGAGGGCATGCCTTGCTCATTrrGCGCCCAGAAGAATTGGGTTTTCTO 
CAATCCAAGGTTCCATTAAGTGAATTrGACTTTTCCTGGTCTAAAAm 

TTGAGAAATTGATTGAAAAGAATTACTTTCTTCATAAOTCAGCCCAGGAAGCATATAAGTCATA 

TACGAGCCTATGATTCCCATTCTCTGAAACAGATCTTTAATGTTAATAACCTAAATTTG^^ 

TGCTCTGTCATrTGGTTTCAAGGTGCCTCCCTTCGTTGATCTGAACCGTCAACAGTA^^ 

AGCAGAAAAAGCGAGGAGGTGGTGGTGGATTTGGCTACCCAGAAAACCAANAAAGTTGAGAAAT 

CCAAAATCTTTAAACACATTTAGCAAGAAATTATTCTGACAGCNGGC 

SEQ ID NO: 1 877 ACGTTGAAGGACrn'GCTGGGTTCTGAGTGTTTGTCCCTCACATAGGATTCCA 
. GAACAGTGCTGCTGGGTTATGAGCGTTTGTCCCTCACATAGGATTCCAGAACACTGATACTAAGGT 
CTGAATGTTTGTCCCTCAGATAAGATTACAGAACACATCTACGAGGGTCTGTATGATTGTCCC^^ 
CATAGGATTCCAGAACACAGTGGCTGGGTTCTGAGTGTTTGTCCCTCATATAGAAmCAGAACAC 
TGCTACAAATTTCTGAACGTTTGTCGCTCACAGAGGATTCCAAGAACACTGTGGCTGGGT^^ 
TGCCCCCCACATAGGATrCCANAATACTGCTGCTGGGTTCTGANTGTTTGNCCTCACGTANGAT^^ 
CAAAACACTGCTATGANGGTCTGAATATTTTTGCCTCTTAAAAGGATTCCANAACA 
GTTGTGTrGNTTGTCCCTCACAAGGGACTCCAANGNACTGNTGCAGG>nTITG^^ 
TCACATAGGAATTCCAGAACACTITTTACNANGGTCTGNAATGT 

SEQ ID NO: 1 878 ACAGATGGTGATTACAGAAGCCCAGAAGGTTGATACCAGAGCCAAGAACGC 
TGGGGTTACAATCCAAGACACACTCAACACATTAGACGGCCTCCTGCATCTGATGGACCAGCCTCT 
CAGTGTAGATGAAGAGGGGCTGGTCTTACTGGAGCAGAAGCTTTCCCGAGCCAAGACCCAGATCA 



wo 02/29086 



PCTAJSOl/30732 



ACAGCCAACTGCGGCCCATGATGTCAGAGCTGGAAGAGAGGGCACGTCAGCAGAGGGGCCACCT 

CCATTTGCTGGAGACAAGCATAGATGGGATTCTGGCTGATGTGAAGAACTTOGAGAACATTAGGG 

ACAACCTGCCCCCAGGCTGCTACAATACCCAGGCTCTTGAGCAACAGTGAAGCTGCCATAAATAT 

TTCTCAACTGAGGTTCTTGGGATACAGATCTCANGGCTCGGGAGCCATGTCATGTAAAGTGGGTGG 

GATGGGGACAirTCAACATGTTTAATXjGGTATGCTCANGTCAACTTGACCTGACCCCA^ 

TCCCATGGCCAGGTTGGTTGTCITATTGNACCATACrCCTTGCTTTCm 

SEQ ID NO: 1879 ACGCGGGGTTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGCGAG 
AGAGCGAGAGTCCATCTCAAGAAAAATAACAAAAAAAGAAAAAAAAGAAAAAAGAAAAGCTCT 
CTGAACTGGGCTCCCTTCTGAGAGTGAGGAGGAGAGCCGGGCACAGTGGCTCACGCCTGTCATCC 
CAGCACGTTGGGAGGCTGAAGCGGGAGGATCGCTTGAGGCTATGAArrCAAGACCAGCTTGGGCA 
ACATAGTGAGACCCCATCTCTACAAAAAATATAAAAGTTAGCCAGGCATGGTGGCGTGTTCCTGT 
AGTCCTAGCCACACTCAGGAGGCTGAGGTGGGAGGATTGCTTGAACCCGGGAGGTGGAGGCTGCA 
GTTGAGCCGTGATTGCACCCCTTGCATTCAAGCCTGGGTGACAGAGCAAGACCCTTGCTAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAGTACCCCGGGGGATTTTTGTGAAAAAN^ 
GTTGTGGCCGCGTTTCCNACCTCCANCAGAAATTGGGTTTITTTNCGC^ 

SEQ ID NO: 1880 ACATCCATAAGCCAATTCTTCACTAATTAAACAAGCATCAAAAGTTCCAAGT 
CCAAGACCTCCACAGTTCTCTGGAATGTGTGTGTTCATTAAACCAAGTTCCCAGGCTCTTCT^ 
AGGGGGACTGGATATTCACCAGTTTTATCATATTCTGCAGCCACTGGGATGATTTCCTCT 
AATTTACGAGCAGTAGCTTGAAATTCTTTCTGCTGTTCGGNGAACTCAAAACT 
GGTTCACGTTGTCGATTGGCTTTTGTATGCTGTGATCTCCAATGAAi^ 

ACCCTGCAGCATCGCCCGAACCCCGCTGCCATGTTGGCTCCCGTTCCGGCCACTCGGACAATAATA 
CACGGGTCACGGCCTTGACATACTCCCCGAACGCGGGACTCCTCTGGTCCCCCCCGNGTACCT^ 

SEQ ID NO: 1881 ACmGTAGAGATTGAOTCCTAAGCTACTTAAGACAAC^ 
AAAAAATGTAGAACCATTTGGAAAAATGAAATTTAGTAGTTCCAAGTTTCAAAG 
TTTTATTCCATTCAATAAAGAACAAAACCAATAGTGTTrrrATTACTTTC^ 
TTTTAATCTGAGCCTTGCAGACTTTCATTTGGAGTTTGAACCCGTri^ 
AGAACrrAATTAACGTGAGATTGGCAATrGAAATGCAGGTGCAGTTTTCTGTTAATGTCA^ 
TGTTTAGGTAATAAGAAATATTAAGTAATTGGCmAGATTTTGTAATTT^^ 
CTAGATTTCGTATTCTAGTAGTCAATGTATTTTCAGTGAAATGCAAAAATATTCCCGTTATC^ 
CCAGTATTAATTTTTGAGATCTTACCCGCTTGTCACTTGAATCCCGTGA^^ 
ATAAGCAACATTTTGATTTTTGAAGTGTGTAAGACCATCTCT 

SEQ ID NO: 1 882 AClU" iUl U"i UUU l'll"l'14'ri" lU ' i - ri ' lU ' l 'GGAAAAAGTAGTTAGCATrrAATGAAA 
CTCX:CTCCATGTGGCTTCAAGCCACCAGGACACAGGCCCCCCCAACACTNTTAATCTTCT^ 
CTCTTCTGCTGAANAATTTGGCCTTCACGATGACAGGCTGCTITGGGAGCTT^ 
TTTGTAGTAGCCCGATCGCNCCACATCAATGATGGGAGCAGCCCCAGTCTTGTTTTTAGCAGCA^ 
CACCCGTGTm'GTTCACTGACCAAAGTCCACAATTTGTCAAGGTTGACAGTTGGGCAAAAGCT 
GTTCCTCnTAAGTGGGGGN 

SEQ ID NO: 1883 ACTTrTTTTTTTTTTTT^^ 

AACTTTTAACCTTATCTTCCTCnTCTCCnTAG(XCrr/^^ 
ACAAGAACrrGATCAGATTATTAAATCTTGGAAACCTCATTTTTACC^ 

ACGTGCATATTCTCTTACAAATGTAGTATAAATGTTATGGATAGATATAAGGAAATATTGGC ATAG 
TATAGGNAATTAGTGAAAAAGACACAACITCCCAAACCATAATNAAAGATTAACNTGAAAC^^ 
ACNCTACTTAAAAAAT 

SEQ ID NO: 1 884 ACACAAAGAGGGGGTGGGTGTCGGATGCAGAGTGTGTGGCCTGATGCTCCAC 
GGCGTGCAGGACGGGGGGCTAATAGTAGGTTTCCITCTCCACCCAGCCGCCAGGGCGTCGCCTGA 
TGATGAOTTTTCTOACTTCGTCATATACGAAGATGAGAAGAGAGTAGGGGAAGGCACAGAACCAC 
CAGGTAGGTTTGAGGGGATACATCCTAAGAGCAACCGCCCATTCCAGGGCAGTAGGAAAGGAAA 
GCAGCCAGGGCTGTCTCTTCAAAGAGGCCAAATATCAAGATCTrGTTCTTCATCCCCTGCTG 
ACCGAATTCCTCCTGGCrrACAGATGACCAANTCGGCCCACTGCACCACCACGATCTGACNAAGA 
AGGCTGTGTGGCAGGTGAACTCCACCGATTmCTCTGNTCATAGGGTCCACTGGCTGCCCGTAGC 
TNTCTTTCCANATCGTTGATCCAACGGTCATCCCAm'CCANTCNGATGGCCCAAACANGTO 
TGGGAGGGAAACCCGTTTTATCCNNAATTNACTAAAGTTAANTTATAGAA 

SEQ ID NO: 1885 ACACTGTTGGTGTTATATGGGGATGGGGTTCTCGGTAATTTTGTTTATTAm 
TGTTTATTATTATGTTTTATCATTAATTATTCAATAAAriTTTAl^ 

AATCTTCTGTGGGGGTGGGAGGGACAAAAGATTACAAACCAAAACTCAGGAGATGGTAACACTG 
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CCONCCirATTTCTATAACTCAAAAMTTTTIWITrNTTro^^ 

SEO ID NO- 1886 ACGCGGGGGAACACCTGGCGAGTCCTCGGTGTCGGTC^CC^^^ 
^GSoaiGTTCANAArTATAAGGCTGTCTGCAGAGATITG^^ 

nrATrTTTAGTTCAGCATCCTTGGCTGTGGAATATGTAGATTCACTTITACCTGAGAATCCT 
^frm-SrCAAAAAA^ 

SSS^tS^SSX^gJ™^^ 

AGATGCTTTGGTrGNTGCCAGTCATTGAAAAATACTrrGGCACT 

otJrt m Kin. 1 RR7 ACAGrrGAAGGTTGTGACCGGACATTTGTATGOCCAGCTCACTTTAAATACCA 
?:Sc^biCTCATC^i2SoAC^^ 

?SSctgJSaggctgaagct^^ 

IrT^SfiCTGTGGTAA^^^ 
rPAAAA^^^SGG^^ACTJAGGS^ 

Sgagc^g^gc^ct^^^ 

CrGATGTOA^AO^CANCCTGGTGACC^^^ 



^Vn m NO- 1 888 ACTCGCCACGATGAGCAGCACCTTAGCTAAGATCGCGGAGATAGAAGCAGA 

?I^?^SSaca^tcato^^ 

^r^^CTcScAGATAS^GOTGCCAAGATCCAGCTCCT^^ 

SS?§SSLS&^^GGrA^A^^^ 
S?^^CTG^^GTCCTOAAACCmGG 

GGCmGGCATTCGCTTGAACAACAAACCCCCCAACATTGGCnTAAAAAA 

SEO ID NO- 1889 ACGCGGGGAGCAAGCTCCAGTGCTACOTGTCCCTGGCATTTTAGG^^ 
rf«--fAnG<^GTCATGGATCAQGTAAT^^ 

rtJT-AAT^GGATn^ 
TA^^CA??GTTCGTCO^ 

GGCCGGCCGTTACTrAOTGNNTCCNANCTTGGGNCCNACCrroGNCGAA 

AACAACTGTTCTTNTGGCTGACATTAAA 
QTjr^ m \rn. 1 RQ 1 ArTrri" l - i ' n " i Ti i 1 1 i TTTTTTGATGCATTCAAATArrTATTGAGCAGCTAAGG 

'^^aSISaSggcg^SSS^^ 

r,r AAGGTrCACTGAATCACAGCAGTCANAANAAAOTGCTTTAGGGAACCAANACM 

gSS^oto^t^^^ 
taatttacatgtagtgccccagganaa 
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SEQ ID NO: 1 892 ACITAGAAGTAGAAGAACATAAQATGTATrrCTGACTAAAACAA ATXKS 
TTCACATGTGCTTTATTAGACTCTGGGAGAGAAAATTAACCAAGTGCTTCAGAACAGGl 
ATTTAATTCTTCACGGTAAGAAAATGAAGTTCTAATGAACTGTTTCT^ 
CAAGAGTTATTCTGTTTGTTTAAAAAATAAGAAACCTCTTTAAGC^ 
TTTTTTAAAAACATAATACTGTCCAGGCAAGGCACTGTAAAA 

AGTGGAAGAATTTAAATTTGGCGCTACGATCAAAACTACTGAATTAGTAGAAATAATG^ 

AAGCTTACCAACAAAAGAACCCTCAGCAGAATAACAAAAACTTTGCTCAGGACATTO 

ATTGAAGACGGAAACCGGAAACCGTTTTCTTGTANGCCCCTAOAGGCAGATCAGGT/^ 

ATAGTNGAGGGAAAGGGAGAGAATGGAAATAAACTCAATATTATGCAGATTTATGCCTTATT^ 

TAGCCTTITITAANGGTTGGGTCTTTCANGCTGGTTm 

NAATTrAACCTGGGGGGATTANGTTTTTATTTTTTA^ 

G 

. SEQ ID NO: 1 893 ACTGGAAAATAGCAAGGATGCTCATGTTGGGAAAGAAAAGGTGAAAAAAA^ 
AAAAAACTTGATGAATAAAAGGCCAGCCAAAGAAGACAGGAAGTAGGCTCATCAACCGGGAACT 
TGGACAGCAATAAAATCAGAAAAGCACTTACCTTTTCCAGGGAATCACTCCGATAA^^^ 
CAATCCTTTATTTGGTTTATTAATAGCATCAGTCTCTGAATATTTTAT^ 
TTCAAGACAAATTAGGTAAATAAAGCTTTCAAACACTTTATTTAAATGTGA/^ 
AAAAAAAGTTTTTTTTCAAAAAAAOGCTCTTGGAAGTGAAC^^ 

GATTATTCCAGAATGAACACATTATAGAGrrAAAATTAGTTACGCCTATT ACAAT TTAAA 
CCATATAATTAAAAATTATTANACCTATGCAACTrATTATGAAAGTAGGAT^^ 
AAGTTCATGTGATGCTTTTTACCTCAANAAATGGNAAACCACCCATAATTGCATTACT^ 
ACITCCCCrnTGTNCCCTGCCCCGGGCGGC 

SEQ ID NO: 1 894 acgacgtctcatctgggaaaagaatctaaagtttgtgatgcttcacaacctg 

GAGCATTCAATGGGAATGCACTCATACGATCTGGGCATGAACCACCTGGGAGACATGACCAGTGA 
AGAAGTGATGTCrrTGATGAGTTCCCTGAGAGTTCCCAGCCAGTGGCAGAGAAATATCACATATA 
AGTCAAACCCTAATTGGATATTGCCTGATTCTGTGGACTGGAGAGAGAAAGGGTGTGTTACTGAA 
GTGAAATATCAAGGTTCrrGTGGTGCTTGCTGGGCTTTCAGTGCTGTGGGG 

CTGAAGCTGAAAACAGGAAAGCTGGTGTCTCTCAGTGCCCAGAACCTGGTGGATTGCrCAACTG^ 
AAAATATGGAAACAAAGGCTGCAATGGTGGCTTCATGACAACGGCTTTCCAG 

SEQ ID NO: 1 895 ACGCGGGGGCCCGAGCAGCGGTGACAGGAACCTGGAGCGAAGATGTAGCCC 
CAACCTCTCCCGAGAGGTGCTCTACGAAATCTTTCGCTCCCTACACACCCTGGTTGGACAGOT 
CCTCAGAGATGATGTGGTGAAAATTACAATCGArTGGAACAAGCTCCAGA GCCTCTC GGCATTCC 
AGCCTGCATTGCTCTTTAGTGCACTTGAACAACACATTTTATATTTACAGCCT^^ 
TCAGTCTCCGATTAAAGAGGAGAATACAACTGCTGTTGAAGAGATAGGAAGAACAGAAATGGGG 
AACAAAAATGAAGTAAATGACAAArmCCATTGGCGACCTACAAGAGGAAGAAAAGCACAAAG 
AAAGTGATTTAAGAGATOTGAAAAAGACACAGATCCATTTTGATCCAGAAGTAGNTCAGATAA^ 
GCTGGAAAAGCAGAAATTGACAGACGAATATCTGCAriTATTGAAAGAAAGCAAGCTGAAATCAA 
TGAAAACAACCGTCAGGGAATTITGCATGmTTGATTGNAATCAAGAAAATA GGTG GTGC^ 
ACTGATGCGATTTTTTACCCCTTACCCCGGGATTTAAAAAGTCACCGTAAAAAG 
TGGTGAATACCATACCGGNC>JCAGACITNNAACCITGAAGGGAAm 
AAACCTAANAGC 

SEQ ID NO : 1 896 ACTCCAGTTGCTGGATATCCAAGCCTCTCAGCTTCGGAG AATGGAGTCTTCC^ 
TCAATCATATCTCACAGACTGTGGATA1TCATAAGGAGAAAGTGGCACGAAGAGAGATTGGTATT 
TTGACAACAAATAAGAATACATCAAGAACTCACAAAATAATAGCACCTGCGAATATGGAGCGCCC 
TGTAAGGTATATTCGGAAACCTATCGATTACACAGTTCTGGATGATGTGGGCCATGGTGTCAAGCA 
TGGAAATAACCAGCCTGCAAGAACTGGCACACTGTCGAGAACAAATCCTCCTACTCAGAAACCGC 
CAAGTCCTCCCATGTCAGG(XGGGGAACACTGGGACGGAATACTCCrTATAAAACCCTGG^ 
GTTAAACCCCCAACAGTTCCTAATGACTATATGACCAGTCCTGCTAGGCTTGGAAGTC^^ 
CCAGGCAGGACAGCATCTTTAAATCAGAGACCAAGGACACACAGTGGAAGTAGTGGAGGAAGTG 
GAAGTCGAAGAAAACAAGTGGTAGCAGTAGTATTGGCATTCCCATTGGTGNGCCTACACCrrCGC 
CACCCACTATTGGACCAGCAAGCCCCGGGCTCAANCTTCTGGGT CCCAG TATGGCACAATGACCA 
GGCCAGATNT>rrTGNACAACAACTTNTACTACTTTITr 
A 

SEQ ID NO: 1 897 ACGCGGGACTACACTCCAGCTTAGGAGATAGAATGAGGCCTCATCTCTAAAA 
AAATAAATAAATAAAAAATGAAAGAAACAACATATTTTGCAGGTTGCCACCCTGGGGTGACrc 
TTTCACTAArrAGGTGCCATGATGTGCCATCCCAAACrrACCCCTCTCTCTCCATAGAOT 
TCTCTACTTlTCTGAGACTCAGAACTTCCCTGCCAAAGTCACATTCnTAGCAACA^ 
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GCTATITGAAAAAGAATAGAATGCAGCCCAAAGCAGATGGAGGGAAATGACAGTCCGCAGTCAG 

CCAGGCCTCACCAAAGAATACAGGCCCTCTGCCAGGTCCCTCCTACCCTGGAGAAATCTGAGTCTT 

CAAATTGCCAACTCCCTGGACTGGTCTAAGTATTTTTCTGCCAGCGTTGGA^^ 

AATTCCCCTAAGCAATTTCTCAGTTCCTCTCACTTCCrCTGGTTCOT 

TTATAGAAATAAGTTTTTGCAAAGCCTTCATGTTCTGACATGACCTAGAGCCTCA 

ACCCCATTGCTGACCTCAAACAGGCCGGGGGAATCAGAAAAAAAAAAAAAAAAAAAAAAAAGTA 

CCTTNGGNCGNGACCCCCCCTTAAGGGCGAAATTCCCAACACACTGGGNGCCGGTTNCTAGNGGG 

AT 

SEQ ID NO: 1 898 ACTCTGGATCCCAAGGTGACTGGTTGTTTAATCGTGTGCATAGAACGAGCCA 
CTCGCTTGGTGAAGTCACAACAGAGTGCAGGCAAAGAGTATGTGGGGATTGT(XGGCTGCACAAT 
GCTATTGAAGGGGGGACCCAGCTTrCTAGGGCCCTAGAAACTCTGACAGGTGCCTTATTCCAGCG 
ACCCCCACTTATTGCTGCAGTAAAGAGGCAGCTCCGAGTGAGGACCATCTACGAGAGCAAAATGA 
TTGAATACXJATCCTGAAAGAAGATTAGGAATCTTITGGGTGAGTTGTGAGGCTGGCACCT^ 
GGACATTATGTGTGCACCTTGGTTTGTTATTGGGAGTTGGTGGTCAGATGCAGGAGCTTCGGAGGG 
TTCGTTCTGGAGTCATGAGTGAAAAGGACCACATGGTGACAATGCATGATGTGCTTGATGCTCANT 
GGCTGTATTGATAACCACAAGGATGAGAGTTACCTGCCGGCGAGTTGTTTACCCm 
TGrrGACATCTCATAAACNGCTTGGTTATGAAAGACAGTGCAGTAAATGCCATCTGCTATGGGGGC 
CCAAAATTATX^CTTTCCAGGTGTTTCTTCGATATTGAGGACCGGCATTGGA GGGCA AATCAAGGA^ 
AAATTGTGGGTmTCACCCCCCAAAGGGAGAAAGCCAATCNTNTATGGNCrTTITG^^ 
A 

SEQ ID NO: 1899 ACAGAAGAAACAGAATTGGCAAATGTTCAGTTGACAATTATGAAAGCTGGAT 
AAACAAATACCTGCAGGGTITATTTTTACTACTTTTCT^^ 
TAAAAAGTTTAAGTTACCAAAATCTCTTCACTGTCnTrTrr^^ 

CTCAAACCCTCTAAGACTGCCAAAAGCTACGCCACAATCAAATTCTGCAGGTTCTTTCAAAT^^ 

TTTCACTTAGCTCTTCTACAATAAAGCCTCACTACTCAAAGTATGGTCOT 

AATTGTTTGGAATGTTATTAGAAAAATAAAATGTTTTTCTTC^^ 

AAGCATmACCAAAGAAGACAGTCAAATAACCAATAAACATATAGAAAAGTGTTCAAGTTAATT 

ACTTGTCAGGGAAATGCAAATTATAACCACAAAGAGTGTCTCTGCCATCCACCANAATGGC^ 

ATGAAAACACAAAACAGACATGTCAAGCATCGATAATGATATGCAGCAATGGGAGTCATGCNCTA 

NGAATGGGCAAACTGGTAAAACTAGAAAACTAATTGGCAATAGCTACTAANGGTTGGACAA^ 

GTNTNTNTACCANCCCANCAArrCCTTTCCTANAAANTTCCCCCCCAGAAAGGGCATTT 

SEQ ID NO: 1900 ACATAGGTAACCAAAGTATATAGCTTATTTGGTGAATCTTCATCCTCATTACG 
rmCTGGACAGCCGCACACGGATTCGGTATGGCACATTCCnTATrCCTTTGGCCCA 
TTGAGCCTGGTGTCAATGCGCACATCTGGAGTTCCCATCTCCTTCATGGCAAATTTCCGAATCTC 
TGAGTGCCCGAGGTGCACGCTTCTTGAAGCCCACTCCATGGATGCGCTTGTGAATGTTGATGGTO^ 
ATTCTCGGGTTACCACTTCGTTGATGGCAGAACGGCCCTTTTTCTTCTCGCC^^ 
AGCCATTCTGCAGCGTCCAAGTTGGAAAGGAAGCCCCGCGT 

SEQ ID NO: 1901 ACCGCGGGGGGGCGACTGAGCGGNCAAACGGAAGTGTNGGTTNCGGTCTGA 
GACATNACCGCCAAGCTGGGCATCGGGGAGATGGCCGAGACTGANCCCAAGACCGTGCAGGACC 

tcacctnggtggtgcagacactcctgcagcagatgcaagataaatttcaagaccatgtctgacca 

GATCATTGGGAGAATTGATGATATGAGTAGCGCATTGATGATCTGGAAAANAATATCGCGGACCr 

catgacacaggctggggtggaagaactggaaagtgaaaacaagatncctgccacgcaaaagagt 

tgaangttgctaataatrtatactggaatcrggcatttttcca^ 

ttmtgcagctaactactatgtgtanacaaggttntatattataangntatc 

antttattaattaagtktgtanagngnatttccccccagtttctttga^ 

tctctgaatng 

seq id no: 1902 acctgg1tggcctggtatcagcagcgaccagggaaagcccctaacctcctga 
tctataaggcgtctagtttagaaagtggggtcccatcaaggttcagcggcagtggaactgggaca 
gaattcactctcaccatcagcagcctgcagcctgatgattttgcaacttattactgcc/^^ 
aatacctacccctacatttttggccaggggaccaagctggagatcaaacgaactgtggctgcacc 
atctgtcttcatcttcccgccatctgatgagcagttgaaatctggaactgcctctgttgtgtgcct^ 
ctgaataacitctatcccagagaggccaaagtactccacaagaaacagaaggaacttccag tggt 
agcaaaagcaatgtgaggagtgggaagagagttccaagtggcaggatggtcaacattcgcatttt 

CCAGCAAGAAGTCACGTAAGAATAGACAAGAGTCCATTGGATrTGGCAATCCTGATGTCAGGGTA 
AAGCTAATGGGTGGGAGATTAATTCCCATTAAGTTCTTTCTTGACCTTAGTATGTGGGGGTT^ 
TTTAAAAGATTTCTGCATTCAAAAGTTCAGATGANTNNGAAGTNm 
GGAGCCAACTTGCAAATGTTCTTTTTNCATTTAANATGGNAAAATO 
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AGGC 

SEQ ID NO: 1903 ACTTTATAGAATTCACATrCACCTGAAGGTTCACAAAATCTGCTGGTATGTCA 
ACATAGCAAGCACCTGGACGACCATAGATACTGCTTCTCACTGCCTTTTCAATAACAAAAGG^^ 
AGCTrCTATGCTGCTTGGGCGGGCAGAGAACTTGGTATATAATCTACAAGOT 
ACTCCTGGAAAGCTCCCATTGTTTCITGGTTTCmCAG 

AGCAGTTCATGTTTGCAmGCCATACCGCCCAAGGCATGGATGAGACCTGGGCCAGAAACAACA 
AGGCAGACTCCTGGCCTGCTTGTCAGATATCCAATCGCGGAGGCAGCATAACAAGCCGCTTGCT^ 
ATTCCTCATCCCGATGTACC 

SEQ ID NO: 1904 ACACCGGGTGGCATTAAGGGGTGAAGATGTCCCCCTTACGGAGCAGACCGTG 
TCTCAGGTGCTGCAGTCAGCCAAAGAACAGATCAAGTGGTCACTCCTTCGGTGAAGACCTCACTGT 
TCCTGGCTCTTCATCCTCTTCAAAAAATTTGCATGTCTGCTGTGAATTT^ 
GATGCTCTCAGGGTCATCTCOGGGATCACAGGGATCCITAAATCTCCATTCTGTrrGTGG^^ 
CCTCAACCrrCCCCTACACX:CTTCCTATT(nTTTTCAT^ 

ANCATATTTANATNATAGGGCAGGGGAAGCACCCTCmCTTTCTAGACTGGATTA 

CTCCCTTGCCCTGACATTTTTGTAAATTNNTGTGCCCC™ 

AGTANGAGAAAGGAATGTGNATTANTAT 

SEQ ID NO: 1905 ACCATCTTGACAGCATCCAGGGCAATGTTGCAAGCCAAAGATGACCACCGAC 
TGATGGCTTTGGTAGTAATAGAGCTGTTGATGATGTTCAGCATCATATCACTGTCACTGATGTCGA 
CTGGGATACTTATTTTCTTTAGGGTGCTGATCATATCATCCAATGCCITGC^^ 
CACTGTTGGGTGCATCTGCTGCTCCAGGAAGTGCTCAGCTACAGACAGCATTTCCCCTGCAAGAAT 
AATTACTGATGTGGTCCCATCTCCAACCTCTTCATCCTGGGTCCGGCTAATTTCGATCATGGA 
GCCGCTGGATGCTGGACTTGAATCTCTCGAAGAATGGCATTGCCATCATTGGTCATCACA^ 
CCCATTGGGTCCAAAAGCATCTTCATCATGGACTTGGGTCCCAAACATGTTC GGAT GATATCTGCA 
ATAGTCTTGGCAGCATTGATGTTTCCAGArrGAACTTTTCTTCCGGATTCACGC^ 
TGAGCACGANCACTGGACGATGGCCCATCATGGCGAACGCGATGCANAACCCGGGTACCTCTGGC 
CGNGACCACCGCTAAGGGGC 

SEQ ID NO: 1 906 ACCCGGGAGAGGCATCCCGGGTATCGGTCGCCGACCACTCCCTGCACCTAAG 
CAAAGCGAAGATTTCCAAOCCAGCGCCCTACTGGGAAGGAACAGCTGTGATCGATGGAGAATTTA 
AGGAGCTGAAGTTAACTGATTATCGTGGGAAATACTTGGTTTTCTTOT 
ATTTGTGTGTCCAACTGAAATTATCGCrmGGCGACAGACTTGAAGAAT^^ 
TGAAGTGGTAGCATGCTCTGTTGATTCACAGTITACCCATTTGGCCTGGATTAATACCCCT^ 
ACAAGGAGGACTTGGGCCAATAAGGATTCCACTTCTTTCAGATTTGACCCATCAGA 
ACTATGGTGTATACCTAGAGGACTCAGGCCACACTCrrAGAGGTCTCTTCATTATTGATGACAAAG 
GAATCCTAAGACAAATTACTCTGAATGATCrrCCCTGTGGGTAGATCAGTGGATGAGACACTACGT 
TTTGGTTCAAGCTTTCCAGTACC 

SEQ ID NO : 1 907 ACATAGGTAACCX^AGTATATAGCrrATTTGGTGAATCTTCATCCTCATTACG 
TTTTCTGGACAGCCGCACACGGArrCGGTATGGCACATTCCTTArrCCTTTGGCCCAGA 
TTGAGCCTGGTGTCAATGCGCACATCTGGAGTTCCCATCTCCTTCATGGCAAATTTCCG^ 
TGAGTGCCCGAGGTGCACGCTTCTTGAAGCCCACTCCATGGATGCGCTTGTGAATGTTGATGGTGT 
ATTCTCGGGTTACCACTTCGTTGATGGCAGAACGGCCCTTTTTCT^ 

AGCCATTCTGCAGCGTCCAAGTTGGAAAGGAAAGCCCCGCGTACCTCGGCCNGCGACCAC 

SEQ ID NO : 1 908 ACAAGGTGTTTTCCAGCGTGCCTCAGCAAAATGGAAAGACGATGTTCAACTT 
TGGCTCTCCTATGTGGCTTTTTGTAAGAAGTGGGCTACTAAAACTCGACTTAGC^^ 
GCCATGTTCGCGATTCATTCCAACAAACCAGCTITGTGGATTATGGCAGCCAAATGGGAAATGG^ 
AGATCGATTGTCTTCAGAAAGCGCAAGGCAAa'AmCTTCGCGCACTGCGCm 
CCCAAAACTTTATAAAGAATACTTTAGGATGGAGCTGATGCATGCTGAAAAACTGAGGAAGGAGA 
AGGAAGAATTTGAAAAAGCCAGTOTGGATGTGGAGAATCCTGATrATTCTGAAGAAATCC^ 
GGCGAGTTGGCATGGATCATCTACAAAAATTCTGTAAGCATAATTAAAGGTGCAGAATTTCACGT 
GTCACTGCITrCGATTGCACAGCTATTXGACTTTGCCAAAGATCT^ 

CCTTCAGGCTCTACACACAGATGATCCTCTCACTTGGGGATTATGTGGCAAGGGCGAGAATTANA 
AGATTNAGTCANAGACCGGAAAGAACANCCTACAACCGAAAACAAGCCCAAACCATTGGANGGT 
CGGCCCCGGAAAGGAGGAAAAGGTGCTTGTGCTTGTGTATTAAANAAGCAAmGAAA^ 
CACCAGAAAG 

SEQ ID NO: 1909 AC r iT iiu - i - ii - riiN ' ii - iu - i ' i i"iu - i4U i GGATCNCATTTAACrr^^ 
CTCAAAATTCrGNGACAAATTTTTGGTCAAGTTGTTTCCATTAAAAAGTO^ 
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GGTGTCANACACCATGGNGTGCCAGTCTTCCTCAACTGTAGGTCANTCATTNAGTGAACAACACAT 

TANCCAACTTCAAAAAAGCATTCNGGCANAGATTCCCTGTGAAGATGAACAAGACCAAG^ 

NCGGACCACTGGCAACAAAGGAAAANCCACGGCTGGAAA'llUUViU^GTTAATCCCCTTGAGCCAA 

AAAATGCANAAAAGCTCAAGGTGAANATTGCTGACCTTGGAAATGCTTGTTGGGNGCACA^ 

TTCACTTGAAAATNTrCAAACAANGCAATATNGGTTCCTTGNAAAGTTCT 

TAATCCCCTGCTGACATITGGAGCACGGCOTTGCATGANCAAAAArrNTGGGAAm 

GCCNGAAANAGATGGATCATCCCTGCATCCCATGGAArrAATTGTGGGAAAANATTANGACCCCT 

TTTTTTANAGGCrCITTGAANCTGNCCANANGGTANNr^^ 

AANAAANACbTITNGGNCNCGGACCCACCCTTAAGGGNNGAATTT 

SEQ ID NO: 1910 A Cn ' iTlTrrnTrn ' n ' l llTiU Tlll- lll 'CNAGGCAAAGTCTTGCTCTGTCCCC 
CAGGCTGGAGTGCAGCGGCCCNATOTGGCTGGCTGCATGCTCCACCTCCCAGGTTCAT GCCAT^ 
TCCTGCTTCAGCCTNACAANTAGCTGGGACNACACACGCCCGCCACCACACCCAGNTAAririir 
CATTTTTAGTAAAAAATGGTCTGNGTTANCCAGGATGGTCrCGATCTCCTGACOT 
TGCCTTGGCCTCCCAAAGNGCTGGGATTACAGGCGTGAGCCACCGCACCTGGCCTATTCAGGGAA 
GGTTTCAAGCANAAANATGAACTGAGTTGGCTTTGGANANATTCACAGGACTCCCACAGGCAG 
TGAAATAAGGG(>JTTGTANATGGACAAAGGCACATATNACCACCAATGGGGACAGTOTGACTGG 
GGGACNCTNAGGGGCTGCTGTGGCnTNlGAAATGGGAGGGCTTGCCACCATTNATGGA ANAT^ 
AATGAGGCAAAATAANGTTTGGTT^NNGGAGCA^r^TACCNTTO 
AAAACCCTTCTTTN 

SEQ ID NO: 1911 ACAGTGAGCAGCCTTCTTCCTGGAAAGACATACAGCTTCAGACTACGTGCAG 
CTAACAAAATGGGGTTTGGACCArmCAGAAAAATGTGATATTACTACAGCCCCTGGGCCAC^^ 
GATCAGTGCAAGCCCCCTCAAGTGACATGTAGATCTGCAACTTGTGCACAAGTGAATTGGGAGGT 
TCCTTTGAGTAATGGAACAGATGTCACTGAATATCGACTGGAGTGGGGAGGAGTTGAAGGAAGTA 
TGCAGATATGTTACTGTGGGCCTGGTCTCAGTTAGGAAATAAAAGGACTTTCACCAGCAACT^ 
ATTATTGTAGGGTCCAGGCTCTGAGTGTTGTGGGTGCAGGCCCTTTCAGTGAAGTAGTAGCCTGTG 
TGACTCCACCATCAGTTCCTGGCATTGTGACCTGTCTTCAAGAAATAAGCGATGATGAGATAGAAA 
ATCCCCATTATTCACCTTCTACATGCCTTGCAATAAAGCTGGGAAAAGCCTTTG 
GGAAATCCTTGCCTACAGCATAAACTTTGGGAGATAAACAATCCCTACAGTGGGAAAGGTTACAA 
GCTATATTATTCAACAATTTGCAACCCAGATTCCACATTCCAGAAATTCCNAATTTC^ 
GAATANCCmTGGAGCTTGGTCCCTTTNAAGCCATTTGATTAAAATTAAAA 
TNCC 

SEQ ID NO: 1912 ACGCGGGGACTTCCTGCGGGTGCACAGGCTGTGGTCGTCTATCTCCCTGTTGT 
TCTTCCCATCGGCGAAGATGGCCCIXjGAGACGGTGCCGAAGGACCTGCGGCATCTGCGGGCCTGT 
TTGCTGTGTTCGCTGGTCAAGACTATAGACCAGTTTGAATATGATGGTTGTGACAATTGTGATGCA 
TATCTACAAATGAAGGGTAACXGAGAGATGGTATATGACIXJCACTAGCTCTTCCTTTGA 
ATTGCGATGATGAGTCCAGAGGACAGCTGGGTCTCCAAGTGGCAGCGAGTCAGTAACTTTAAGCC 
AGGTGTATATGCGGTGTCAGTCACTGGTCGCCTGCCCCAAGGAATCGTGCCGGGACCTGAAAAGT 
CTAGGAGTGGCCTACAAATCCAGAGACACAGCTATAAAGACCTATCAAGATGCAAGGCTTGCAGC 

atcnttgcttctccacctcctgcctctgcttamcttgttctggaact 
aaatactttcttaccctcccaattcaanactcaancttgactggttgaaaagag^ 

TTTTTAATCATTITrAACOTTTCrrrGGGACTACCAAG^ 
TGGNTTNGATTT 

SEQ ID NO: 1913 ACrAGTGTCCATGGCTTGAGTGAGTTGCACAGCTTTTCGCAGGATGACAAOT 
TCTCAATCAGATCCTGAAGTGACAAAGGGTGGCTTCCATCITGAGCTTTAGTCC^ 
ATTTCTCTACATTCCCTGCACAAATATAGCAGAGACATGCITGAGTCTGCAGGAGGCTAT^ 
CATTTTCAAGCCTGGTTCCCAAAAGATCACAAAGGGCTGAAAATrCATCCGGCTTTG^^ 
ATACTGCAGCTAAAGCCTCTCTCCAATlTITAAGATCACAAGACTCAACAATCTCm 
CATCACCACTGCAGTGATGAGCCTGGTAATTTTGCTTTGGGATTTTGCGAAGTAT^^ 
CGAGCCAAGAGTTCTTGTCCACCTGCTATGGCCAATATAATGGGCATCGGCCATGCGGNTATCATG 
TAAAACAANGGGCAACAGCCTCTCAAATTGCCGTNANCAAAAGCCTGAGTAATTAAACCAT^ 
TGGTCCCCCNCTGACAGAAAATAITAAANGTTCCCTNCNANAOTGAGGGGTTGGAA>^ 
TCTTNCTTTTTTOGCTCTTTAAAGGNGCCnTCNm 

AAAATTTCNAAGGGTCGTTTTCCCNCCNAANATTNATCCCTCAANAT^ 

SEQ ID NO: 1914 ACTCCAACTCAAGTTTACAAGTTACACCTTTGCCACAGCCTTGGCTAAATCTT 
GAACTAGTGCAGAATTCAGCTGTGGTAGAGTGCTGATCTTAGCATGCTTCGATGTGGCATACTTGT 
TCTTGACAGTCATGTGCTTTGTAAGTCCTTGATTTACCATGACTACATTCITAG 
AACTGGAAGAAGAGATrCTrCAGTATATGACAGGTAATGTTGTAGAGTTGGTGTCCATTCACCATT 
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atccagaattttcagtgctaagcaaaaagctcctgctgcaatttgagaaggaggaaagtgcac^^ 

tgtcatagtccaacatagttagttccatcaggtatttggccaaagtattgttgctcgacatcaacc 

tctctaatcttagatgctctccgaaggaaagtgcaaaggtagaggccgaccc^accaaaagtta^ 

agctcttanaatctttcatttcccccgcgtacctgccccgggcngncgctncna^ 

tcancacactggcngcccggttctanngggattccgnagctcgggaccaaancttnggcgta;^ 

CNA 

SEQ ID NO: 1 9 15 CNGTGAGACACAAGATCTCTCTTTTACeAAAGTTGAGAACAGAGCTGGTGGA 
TTAATTAATAGTCTTCGATATCTGGCCATGGGTAACCTCATTGTAACTATCATCAG^ 
GATGATCITGAAGTGTCACATACACTAAAGTCCAAACACTATGTCAGATGGGGGTAAAATCCATT 
AAAGAACAGGAAAAAATAATTATAAGATGATAAGCAAATGTTTCAGCCCAATGTCAACCCAG^^ 
AAAAAAAATTAATGCTGTGTAAAATGGTTGAATTAGTTTGCAAACTATATAAAGACATATGCAGT 
AAAAAGTCTGTTAATGCACATCCrGTGGGAATGGAGTGTTCTAACCAATTGCCTm 
GAGCTCTCCTATATTATCATACTCAGATAACCAAATTAAAAGAATTAGAATATGATTTTTAAT^ 
CTTAACATTAAACrCTTCTAACTTTCITCm 

TGCCTCTGAGTCATTGTTATAAAAAATCAGNTATCACTATACCATGCTNTAGGAGACTGGGCAAAA 
CCCTNGT 

SEQ ID NO: 1 9 1 6 ACCrrCTTrCTGTAAATCTGAAGAACCTGATTCTATTACCAAATCCATTAGT^ 
CACCATCTGTTTCCTCTGAAACTATGGACAAACCTGTAGATTTGTCAACTAGAAAGGAAATT^ 
CAGATTCTACAAGCCAAGGGGAAAGCAAGATAGTTTCATTTGGATTTGGAAGTAGCACAGGGCTC 
TCATITGCAGACITGGCTTCCAGTAATTCTGGAGATTTTGCTm 
. AATGGGCAAATACTGGAGCAGCTGTGTTTGGAACACAGTCAGTCGGAACCCAGTCAGCCGGTAAA 
GTTGGTGAAGATGAAGATGGTAGTGATGAAGAAGTAGTTCATAATGAAGATATCCATTTTGAACC 
AATAGTGTCACTACCAGAGGTAGAAGTAAAATCTGGAGAAGAAGATGAAGAAATTTTGTrrAAAG 
AGAGAGCCAAACTTTATAGATGGGATCGGGATGTCAGTCANTGGAAGGAGCGCCGGTGTTGGAGA 
TATAAAGATTCTrTGGCATACAATGAAOAATTATTTCCGGATCCTAATGAGAAGAAGACC^^ 
TTTTAAAGGNNNGTGCAAACCCCGTITrTACTAAAAACAATGGGAATTAAA^ 
CNAATAATGCTTTATTTrGGGCIT^CCCCAAATTTTCCT^ 
CCTT 

SEQ ID NO: 191 7 GGGACi'i'm'nTrm-rmririrnrirGGC^ 

GCAGTATTTTGAGATGGACATTGCCTCTTCATTGTATTTCTCATCAATTCArrATT^^ 

AGCTTGACAAGCAATrAACTTTAAAATGGTAGATTCCGTAACTTTAAATTGGTAGCTT^ 

TTAAAATnTTTTGGCATATGCANATAATOTTCTCATCAGTAGTAANAATCT^ 

TCCCCAATGGAGGTATGGCATATAATCTTTTCTGCCmACTTATCAATTCACCAAGGAGCTG 

CTCTGCATCTAGGCCATCATACTGCCAGGCTGGTTATGACTCAAAANATGTTATCTGAAAAAAGTC 

TATANAAAAAAAAAGTrrCCCCrCCCTNATCAACAAAAGCCCACCCTCTAAGAGACATTCy^ 

GAACTATCACAATTCrrAATCAGTTACAATrrACAAACAGATAAGTTTAAAATAAA^ 

AATTTTTGAAGCATACCTTAACATCITGGTmGas^^ 

CCCrAAAAAAAAAACTTGNTTTACNCCCAACrrGGAAAATTCCCCCGCGT 

SEQ ID NO: 1918 CCTGCTGGCATAGTrCTTTGACCCGTTCATATrrGGGCAAGTGATTTGACTGT 
TGGATATTCTrGCTGGATTCTTCCTTCTTACGTAGAAATTTGCCTCm 
GCCAAATTTTGGCCTrCTTGTTTGTTCGAAACCTGTTACCTGGCTTTTCTGGGTCCA 
GACAGACTTGCCGTCCACATCAGGAGGTGTGTCGAGCCCAGCAATATCCAGGATCGTGGGGGCCA 
AGTCAATGTTGAGAACGATCTGTGGGACTATTGATCCTGGTTCTACACTTGGACCACGAATAAAAA 
AAGGCACACGAATATCAAAGTCATATGGCATGGATITCCCCrTGACXAGTCCAAACTGCCCAATAT 
GGTAACCATGGTCGGCGGTGTAAATGATGTAAGTATTCTCCAGCTCCCCCGTCTCCACGAGCATGT 
TATACAGCCTCTCCACAGAATCATCCACTGACATCAAAGTCTGGAGCCTTTTGCGCTGTAN^^ 
TTGTAAATTCCATGTGGATGGGCAGCATTGGGTCCTGTGTACCTCGGCCGCGATCACGC 

SEQ ID NO: 1919 ACGCGGGGAGCTGGCACCTTGGCGCTGTTGGTGGCGGCGGAGACAGCTGTGA 
AGTGTGAGGTTCTTTGTCTGCTGGCAGCTAGGGGCGACGAGGCGGGACGTCATGGAAGTGAAGGA 
TGCCAATrCTGCGCTTCTCAGTAACTACGAGGTATTTCAGTTACTAACTGATCTGAAAGAGCAGCG 
TAAAGAAAGTGGAAAGAATAAACACAGCTCTGGGCAACAGAACTTGAACACTATCACCTATGAA 
ACGTTAAAATACATATCAAAAACACCATGCAGGCACCAGAGTCCTGAAATTGTCAGAGAArrrCT 
CACAGCATTGAAAAGCCACAAGTTGACCAAAGCTGAGAAGCTCCAGCTGCTGAACCACCGGCCTG 
TGACTGCTGTGGAGATCCAGCTGATGGTGGAAGAGAGTGAATAGCGGCTCACGGAGGAGCAGATT 
GAAGCTCTTCTCCACACCGTCACCAGCATTCTGCCTGCAGAGCCAGAGGCTGAGCAGAAGAAGAA 
TACAAACAGCAATGTGGCAATGGACGAAAGAGGACCCANCATANAAAGANCCCAGCTGGCCCCG 
GCGTTTCATGAAATCAAGAAAGGCCTGGCAGCCCATTTTCCTTGGAAhrmGAA^ 
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TTTTGANTTTTATCCTCATCCCAACAAGGCCCTGGCTTTTGTGGNTTAGTTGGGGTACC^ 
GGCGGGCC 

SEQ ID NO: 1920 ACAAGAACAAAGGCAGCTACGTCACCTATGAACCTACAGAAGGTGAGCCCA 
GTGCCATCGTCCAGATGGAGAGTGACTTGGCCAAGGGCAGCGAGAAAGAGGAATATTTCATCTAA 
TGACTCCCAGGCCCCAAGGAGCTTATTCCTGGCTCCATCGCTAACACGTTGACTGCrrATTATGG 
AAAGTTTTCTCTGAAGCCAGGGAGAAGCATTGATTGATGTGGGCAAATCCAAGCTCCAGCCAGGT 
CGCAGTCCCAAATGCCGACATCACTGACTCCAGGGACCAGGGACATGGAGAAAGCTGTTTATGAT 
ATCTTTAACCAGGCCCTCITACTAGAGCTGGTGTTTGTGACTGGCCAACAAGATGTGGCT^^ 
GGGGACATCTGAGTATGTGCCCAGTCATCTTTTTTCACAGGTTGAAGGGAGAGAAAAGATm 
GTTAAGGTCATTGGCTGCTCTACTCTGTCCCCTACCTGGTCACCTAGTGATAGCCCCAGTGGAAAT 
ACTGNCCATACAAAGGTCTTCCCCAGAGGCTGGATACCCACAGTAAAAAGGCCNGGCCCANGGA 
AGGGGGTANGGAGACThrTTGAAGAATCTTACCCTCCNTGAATAAATNNGGCCTCAC C^ 
TITGAGNCCCTrrCCTTTTCCGGGGTTCCCCAAACNAACCNTTAATGGCrrACGGNGA'l 
TCAAATA 

SEQ ID NO: 1921 ACACCAAGCACCTATTTTTATAACTTAGCTTCCCATGGAGAGATAATGGCTTG 
CGTGCATTTTATGTATCCATAACATACATACAAGGCTCGGTCTTTTCAATGGGATAACAG^^ 
ACTCTTCGATTTGAATTGTAATGAATCTGGTGACAAGGATTTTTCTCTA^ 
CCAGAACnrrTAATGTCAAGATGAAAAAGGGTGTAAGGTGTTATATTTrCTTC/^ 
CAGGAGGCTAACTCCACAATTTCCCTCATGTTTCTCATTCAGAAAAAAAAATATT/^ 
CAGAATrATTTGATGATTGCTTCTTTGTGCTGATGTTTCAGTTCCTGAAG 
ATTTTCTAAGGTCAGGTTATTGACTTAGGGTTGTATAAACATTTT^ 
GAGAACGACCTTCCCTTAATCGTCTTCn'AGATCCTTGAACTTTm 
GGGTGTGTCTTTCTTGTAACAATTTCTTCAANAGTTTCTrCTGGTTTCT^ 
CTAGTGTATTTTATTTCANGACCTGTAATGATTCGTNCCTTCCTCGNGGCI^^ 
NCAGGCACTCCATCAATGAATTTTGGGGGATTTTTTAATAAATTCGGCT^ 

SEQ ID NO: 1922 ACTAAATATTGCTGAGAGCATCCACCCCAGGAAGGACITTACCTTCCAGGAG 
CTCCAAACTGGCACCACCCCCAGTGCTCACATGGCTGACTTTATCCTCCGTGTTCCATTTGGCACA 
GCAAGTGGCAGTGTCTCCACCACCTATGATGGTGATGCAGCCCCTANAAGTGGCTTTCACCACCTC 
ATCCATGAGAGCTTTGGTTCCCCGGGCAAAAGCTTCCCATTCAAATACCCCCACAGGACCATO 
CACAATCTGCTTAGCCCGAGTGACAGCCTCAGCATACTTCTTGCTGCTTTCAGGACCACAGTCCAA 
GCCCATCCAGCCAGCAGGTATGCCANAAGCCACAGTGGCTTGGCCAGTCTTGGCATTCTCATCAA 
ACTTGTCAGCAGTGACAAAGTCAACAGGCAAGGNAATCTTAACACCATTCTTNTNACN 
ATTAGGTCTTTGANAATCTTGGGNTCCCTmTTATCAAACAAGAGAAAGGTGNCANTO^ 
GTTGAAGCNCCITTAAGGAA 

SEQ ID NO: 1923 ACGCGGGGGGCACCTGTGGCCACACACGGGCCCAGCTTCTCCAGGGCCTGGG 
TTTCAACCTCACTGAGAGGTCTGAGACTGAGATCCACCAGGGTTTCCAGCACCTGCACCAACTCTT 
TGCAAAGTCAGACACCAGCTrAGAAATGACCATGGGCAATGCCTTGTTTCTTGATGGCAGCCTGG 
AGTTGCTGGAGrcATTCTCAGCAGACATCAAGCACTACTATGAGTCAGAGGTCTTGGCTATGAATT 
TCCAGGACTGGGCAACAGCCAGCAGACAGATCAACAGCTATGTCAAGAATAAGACACAGGGGAA 
AATTGTCOACTTGTmCAGGGCTGGATAGCCCAGCCATCCTCGTCCTGGTCAACTATATCTTCT^ 
AAAGGCACATGGACACAGCCCTTTGACCTGGCAAGCACCAGGGAGGAGAACTTCTATGTGGACGA 
AACAACTGTGGTGAAGGTGCCCATGATGTTTNCAGTCGAACACCATTAAGTTCCTTNATG^ 
AACCmCCCTGCCAGCTGGGTGCAGATAACTACGTGGGCAATGGGACTGGTTTTCTT^ 
CGGA 

SEQ ID NO: 1924 ACACAGTAlll'lTTTATATCTATGirrrCTGTTCTCTGGCGCAAGATATTCTGC 
ACCTGGGTGCTTTCAGGCCTTGACCAAATAGCATGAATAGTAGGAAATGAGAACTTTCCCTCT^ 
AGATCTTCACAAAAACTTTTGTTTTCACTATATTCTTTGGAGTGTAGA 
mGGAAAAAGAGCCCAAGTGTATTAAGTAGCGGTTTTAAATCTTCTTTGTAATCAGAG 
GCATGAGACCTACTGCTAATCCAAACAGTCCACCTGTTrrCTGCAGCACCATAGCTTTATATTCTTC 
TTCAGTGGGACAAGTGTAATTATCCCTCCAGTAAATATCTAGGCCTTGTCCCTGATGGAGTTCCAA 
AAGCTGGCGGGTAAAAAGCTTCACTGCATCTGGGTGATCAAGGGTTAAGACTTTCTCC 
GGAAATACACGTAATTGGCAGAATTGATGACAGATGGGATTCCATAGATGCTGTGGGCCACTGGA 
AANCACGCGGAGTTTTTGAGTTGTCTTCAATATCATCGATGAGNAACTGGCATTTTGCACATTC^ 
GCNCTTCNAATAATAATNTNTAGCTTGGCCCTTCTGGAACmCANCCCAATO^ 
CGAAAGTITGGTTTITACTTNNTTNACCCTNGGTA/^ 
AA 
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SEQ ID NO: 1925 ACGCGGGGTTCCCCAGACCTGCTCGCAGCACCCTGCTGTCTTCCCGQTCCGGC 
CCGCTGCCCGCGGCGCCAGCACCATGCTCITCTATTCTITmCAAGTCCC^ 
GGTCGTGGAACTAAAGAATGACCTGAGCATCTGTGGAACCCTCCATTCTGTGGATCAGTATCTCAA 
CATCAAACTAACTGACATCAGTGTCACAGACCCTGAGAAATACCCTCACATGTTATCAGTGAAGA 
ACTGCrrCATTCGGGGCTCAGTGGTCCGATACGTGCAGCTGCCAGCAGATGAGGTCGACACACAG 
TTGCTACAGGATGCGGCAAGGAAGGAAGCCCTGCAGCAGAAACAGTGATGGCTCCTCCTCCTCTT 
CCCCTCCCrCTTTCATTGGTGACCCATAACCCCAAGTCCCAGCCCAGAACCCCTAACCCCC^ 
TTGAAGGGGTTTTGTTTTTTTACTAATGATGGTTm 

GAGGAGTNATAGGGAACAGCTATCCTCTTTTTGAGAAGGGGAGGATAAGTAGGCTGGGGAAACTT 

CAAAGCCmCCCAGTCCCCANCACCTGCCTTTCTNACTTACTTCTCT^ 

TTTCCTAGGTCNTTTTCCAGGGGCANNAGGTGAmCATTTTGGGGGATGGG^^ 

SEQ ID NO: 1926 ACITACTCTATCTTTTCTTTAATTAACATTC^ 

AAAAATAAGATTTTTCTTTAGAGAGCAGAAGCAGAAGAGTAAAATTTAT^^ 

frrrrTTATGCCATCTGTCTCAAATCAAAGAGTCATCATAGTAGGAAATAACATGTTAGrrG 

TGGCATGAGTGTGCATTCCAGTAATTCTTAATTGATATTTGATTAATTCCATACCTTTGA^ 

ATGCTAGTTCAAAATAAGACTGCTCAGmCCAAGGGTTTTCAAGCCTACT^ 

TCTCTAGTCTCTGATTAGCCATGACTGTATTGGACTTTGAACATTTTCT^ 

CTAAACTAATCTCAmGGATGTGTAAGTCrmGTAAAGGCAAGAATAAATAATA^^ 

TTTATTAGTTITCTCAGTATTTTCCCAAATATTAGAATATTTAOT 

ACCCCATATGTTCTGTGGAGAATAGTAGCTTTATCnTrGATATAATACATAGGTCT^ 

TAATACTTCGCCAATTGGArrAGATTTTCAAAGTAAAATTTAANAGGTTATCTC^ 

ANGGGTCAAATmrnrrGGrrAATTTAAAGCTCCCAAAT^ 

SEQ ID NO: 1927 ACTACirCAGGTAATCATTGTTTTACTTAAAGTTCAGATTCC^ 

AGATGAATATTCCCTGGTTATACTTTGTCAATAGTTITCTCATTGCTACAGTGTATTGGT^ 

TCACAAGCTTAATTTAAAAGACArrGGATTACCTTTGGATCCATTTGTCAACTGGAAG 

CArrCCACTTACAATTCCTAATTrrGAGCAAATTGAAAAGCCTATATCAATAATGAm 

TATTAATTAAAAGTTACAGCTGTCATAAGATCATAATTTTATGAACAGAAAGAACTCAGGACA^^^ 

TAAAAAATAAACTGAACTAAAACAACrmGCCCCCTGACTGATAGCATrrCAG/^^ 

GAAGGGCTATGATACCAGTTATTAAATAGTGTTTTATTTTAAAAACAAAATAATTC^^ 

TTATAGTTATTCAGGGACACTATATTACAAATATTACmGTTATTAACACAAAAAAGT^^ 

GTTAACATTTGGCTATCTGATGTTTGNGrrACCTCAAAAAAAAACTACCTGGATGCA^ 

GTAAATCTGANATTTCACCTGACACTTTrAAGAAATCAACCCAAACATT^^ 

AAATTGGAANCCC 

SEQ ID NO: 1 928 ACCTCTACTAACCATAATTGCATCACTAAGCCTGTTAGTTTOAGAGGGTCTTA 
AATTTGTTAAAACTGGAAAATCTITGTATAGGGACTCCATTCATTT^^ 

CACCGTATTCAGAAATTCATTGCCATCGAATCCTCCCACTGCATAAATGGTGTTCCCTACAGTTGC 

AATCCCAGCATTGCTCCITGGTGAAGTCATATTTCCCATCATCTTCCArrCATTTCTAGTO 

TACATTTCCACACAACTGATGGCATGAGAACCATCAAAGCCACCACATACAAACAGTTTTC^^ 

AGAACAGCCACTCCAGCTCCTCGCCTAGCCACATTCATGGGTGCAATTAAAGTCCAGGTATTAm 

TCAGGATTGTATCGTTCTACTGTGTTCAGACAATTCCAAGATTCTGCACCTCCGATTATGTACGCG 

GGGCCTTTTTTCTTTTTTCCGGCGTTCAAGATGTCGAAGCGA^ 

AAATTCCGGATTTCCTTGGGGTCnTCCGGGTAGGAGCTGTAATCAATTGTGCTGACCACACANGGA 
GCCAAAAACCCTGGTTATCATCTTCCCGTGAAANGGGGATCAAAGGGGACNGGNTGAACCAAACT 
TCCCGCTTGCTTGGTGGNGGGTGACATTGGTGGATGGCCNCNAGNCAANAAAANGGCNAAACC 

SEQ ID NO: 1929 ACGCGGGGACCTCCGGCCTAGCCATGTGATTTCACTTCCACTCCATAACGCTC 
CTCATACTAGGCCTACTAACCAACACACTAACCATATACCAATGATGGCGCGATGTAACACGAGA 
AAGCACATACCAAGGCCACCACACACCACCTGTCCAAAAAGGCCTTCGATACGGGATAATCCTAT 
TTATTACCTCAGAAGTTITmOTCGCAGGATTm 

ACCCCCCAATTAGGAGGGCACTGGCCCCCAACAGGCATCACCCCGCTAAATCCCCTAGAAGTCCC 
ACTCCTAAACACATCCGTATTACTCGCATCAGGAGTATCAATCACCTGAGCTCACCATAGTCTAAT 
AGAAAACAACCX3AAACCAAATAArrCAAGCACTGCrrATTACAATTTTACT^ 
CCTCCTACAAGCCTCAGAGTACC 

SEQ ID NO: 1930 ACAAAAGGTGAGGAGTGAGGAGATAGGGTAGTTCTTCCTTGGCTGGCTGGCT 
TCATAATCCCTGGGCCCCGCAGATAATTAAATCGACrrmCTGTCTCAGGCATT^ 
TTGGAGGTTCCCTGCTGGGTAGTTATCTTTGTATCTGATGGACCCATCTCAATTTAAA^ 
CAGGTTCGGAGGTTCATGCITGTCATCCCAGCACTTTGGGAGGCTCAGAGGTGCCATTGGOT 
CCCAAGAGTTTGAGACCAGCCTGGGCAACCTGGTGAAACCTCTTCTCCATTAAAAATACAAAAi^ 
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TTAGCCAGGCATGGTGGTATGTGCCTGTAGTCCCAGCTACTCAGGGAGCTGAGGTGGAAGGATTG 
OTGAGCCTGGGAGGTCAAGGCTGCAGTGAACCrrGATTGTGCCACTGCACrC^^ 
AAAGACCCTGTCTCAAGAAAAGTCTGTAATTCTTTCCTAACCCTTAATATCCA^ 
GGTTTTCTrTACCTCTGGGGOCTATTTTATGCCGGCTCTCTCC^ 

AAGAAAACATTCTTCACATCCCTGGCCTGAAAAAAACAATTTTCAAGGGAGAATCG^^^ 
CAGCCCCrrmTGGGGCTTCCCCAACTTAAATGGGTTGTACCNCGGG 

SEQ E) NO: 1 93 1 ACCCATAAAAATCCTTATCACTGGGGCCGGGCGCAGTGGCTCTTAACATAAA 
CACTTTAATGTCAATTAAGGCTTCAACTCAATGTTAAGTCAAACTCAAGTCAATGCT^ 
CTAGTGTCAATTAATGCnTGATGGTTTGCTATACACATTGTATTTGGAACAACT^ 
AATTTTCTGATATCCTGCAAGGAGTGAGCTCAGACTGAAGACTTTGGCACACTGATGA CAm 
AGATTTCTGTCCAGTATGCATTCTCTGATGTCTAGTGAGGTGTGAATGTGAAT^ 
ACAATCATCACACTTGTGAGGTTTCTCTCCTGTATGAATTCTCCTATGTTrrGCAT^^ 
TGACTGAGACCTTGTCACAATCATGACATTTGTAAGGrn'CTCTCCAGCGTTGAGTTCACCGA 
ACTGCAAGGTATGAACAATGTCrGAAAAATCTGCCACATTTATTACCCTAGTATGATCTCTOT 
TTATGGATTCTCCAGTGATTC(>ATGGTTGTAGCATTACTGAAGACTTTGTGACAATCATTACC^ 
GTAAAGTTTNCCTACCCATGGATTGCCNTGACGGTNAACAAAGTGTTGACTTGCCT^ 
TTTGCCNCACCCCmANCCTTGGGAAAGGTTTCTCTrCCAA 

SEQ ID NO: 1 932 ACGCGGGGAAGAAAAAGAAGAGACATATGATGATATTGATGGTTTTGACTCC 
CCAAGTTGTGGTTCCCAGTGCAGACCCACTATCTTGCCTGGGAGTGTGGGGATAAAAGAGCCTAC 
AGGAGAAAGAAGAAGAAGATATTTATGAAGTCTTGCCAGATGAAGAGCATGATCTAGAAGAGGA 
TGAGAGTGGCACTCGACGAAAAGGAGTAGACTATGCCAGTTACTACCAGGGCCTATGGGATTGCC 
ATGGTGACCAGCCAGATGAACTGTCCTTCCAACGGGGTGACCTCATCCGTATTCTGAGCAAGGAG 
TATAACATGTATGGCTGGTGGGTGGGAGAACTGAACAGCCTCGTTGGGATTGTTCCAAAGGAGTA 
TCTCACCACTGCCirrGAAGTGGAAGAAAGATGAAACCCANGAAATATATTTm 
GCCTTTATGAGGAAACTGATCATCAAAAGTTCX:CACTCCCTACTTCTGCCACCCCACC^ 
TGGACTTCCTCTCTTTTGCOTGAAAGAAGACCCAAGTCm 
AAGCTACCAAGTAATACAAAGTGGGGAAGAAGGCACGTTTNTITNAAACCCTG 
GCCCTAANTCAOTAGCTTCATCCCCCATTTNrmAAOTGNG^ 
TTT 

SEQ ID NO: 1 933 ACCCCTTAACCCCTTCTCCTTCACCCTTAGCAGCAAGTCCCACTTTTCTAGGG 
GGCAAGAAACCCCAAACCCOTCCCTCCGTGTCTTTACGCTCTCTTTTCTCTGGGm 
ACTATGGCAACCTTCCATCCTCCATTCCTCCTTCTCCCTTAGCCTGTGTGCTCAAGAACT^ 
TCITCAACTCACACCTGACCTAAAACCTAAATGCCTCATrTTCTTCT 
ATACAAACTTGACAATGGCTCTAAATGGCCAGAAAATGGNTACmCGATT^ 
AANNTAAANAAA^T^TT^^mCAATAAAATGGGCAAATGGTCTN^^ 
TrrrrmACACATTCGTCCCTTTNCnTN^^ 

SEQ ID NO: 1934 actaaaagacctaatgcatggcctcatcaccttaatgctggattctcggattg 

AAGATCTTGAGGAAGGACAACAGGTCATCCGCTCTGTGAACCTCTTGGTGGTGAAGGTTCTGGA^ 
AAGTCAACCANACCAACATCCTGAGTGCCCTACTTGTTTTGCTCCAANACAGCCTGCT^ 

ccagttcttccaaattctcggagcttgttatggagtgtctctggagaa tggt tcgactgnt^ 
ataccatcaatagcattaacctaaacagaattcttctggatatccacattttc^^^ 
ccnaagagaaactggagcaatggcaaaaggggattitccataggganccctaaagacccctgct^ 
acaccttt^m'gcaaattaaaaanggccnaagaatccttgac^^ 

CNAAAAACCANGNCTGANCCTGGAANGCCCTTNT 

SEQ ID NO: 1935 ACTTAACTGGATAAAGGACAAAGCCTNGGCCTGGCTGGATCACACCAGCACC 
AATATTGNCAATGGTGGTGACAGCAATTACAAAGCCATACTTCCCTGTGCAGGTCCCCTCCACCTC 
GTGAAAAGCTTCTGCTTCACCGNGTTGAGCAAGTTGGGGCCGAAGTANCGCGGGTGCAACANGAT 
TTCGTGCTCTAGGGAGATATGGTNGAACATCTTCCCANACCAGGTANGCAGGCNAAGTCCANACC 
TCCNACCCCNNCGTACCTNGGGNCGM^AACCACGCTAAGGGCGAATTCCANCCACTGTC^ 
TTACTATAGGATCCTANCTCGGNACCAAGCTTTGGCCGTAAANANTGGANATANCTTGAT^ 
NGGAAANANTGNTAT(XAGTCACAATTmCCCN>fNAACATACCGAGCC^ 
TAANNCCTTGGGGTGCCTANTGA 

SEQ ID NO: 1936 ACGCGGGGGGGGCCACTTAACCATCCGCCTTGCCCTGGGTGGCTGCACCAAT 
CGGCCGTTCTACCGCATTGTGGCTGCTCACAACAAGTGTCCCAGGGATGGCCGTTTCGTATAGCAG 
CTGGGCTCTATGATCCATTGCCCAACAGTCATGGAGAAAAACTCGTTGCCCTCAACCTANACAGG 
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ATCCGTCATTGGATrGGCTGCGGGGCCCACCTCTCTAAGCCTATGGAAAAGCTTCTGGGT 

GGCmCTCCCTCTGCATCCTATGATGATCACAAATGCTGAGAGACTGCGAAGGAAACGGGCA^ 

TGAAGTCCTTGTTACTTTTNAGAAAACANGATACAGANGCTTCAGATACTGAGC^ 

ATAANTGAGCTGACmAGTGANGCATTAGCAAGTGGGAACCANGGTCAAGGNCCTTTTGN^^ 

ACTTGAANCGATCTTAATmGGTAGAATTNGGAGTTCA 

SEQ ID NO: 1937 ACGCGGGAGAATATCCCTTCTGCTCTGGTAGAGGAAAGCTGAGTGTAGGGCT 

gatgttacggaggatgtagtcctcctatgtotcctggatttttctccct^ 
gcattcaacccagcttgctcactcccatttgcatntggatcccagagaagtagcattcaaacccag 

CTTGTTCACTCCCATTTGCAACmTTCITrGCrrGCANGCAGT^^ 

CATNTTCCTCAATCNCnTmGGACCTGACCGATOAGGCTGAAGAAAN 

GTGCTTACANATCTCCATGGACCAACNGGCTGGNATCAGGTCCTNCCTC^^ 

gngcctacnaggccrmcctanatggtaancgaagccncttggtggttctaanccctgc^ 
tattncaagnnaaccccatttcttgaggaacca 

SEQ ID NO: 1 93 8 ACGANAACAGAACCAATCTAAAAATGGCTG ATGTTACnTANGAGCCTGAAA 

aaaacaggagatccttgaagacccagccaccccttctagaatggtcaataatgggcacctttcca 
aagctactaagaggcacttggcatttnaggagtttgncttatggttgcataaaannatccc^ 
cccanacctggcaccctttatggttcaaagttaagaacgggaagaatgggtggcaaggtggctcc 
tggaanagctcacccagcacagctgccctgagctcggggccttggtttctgaccctggggatam 

ANATTTAATAAATOTTATATTAAATTOCACAGAAGAAATAGATAATATAAAATCTGATGGGGhTO 
GGGNAAGAAAANCGNTAOTGGACTCATTCNGGGAATCCCAAAGCTGGGAGAAGNTCCATTNANG 
GCCCAACTTGACCTCCCTCTTTGANCCCATTNG 

SEQ ID NO: 1939 ACGCGGAAACGGACCACAGTTCTAGAGTCTGAGGGGACCCGAGAGTCGGCC 
ATCAATGTGGCAGAAGGGAAGAAACAGGCCCAGATCCTGGCCTCCGAAGCAGAAAAGGCTGAAC 
AGAAAATCAGGCAGCAGGAGAGGCCAGTGCAGTTCTGGCGAAGGCCAAGGCTAAAGCTGAAGCT 
ATTCGAATCCTGGCTGCAGCTCTGACACAACATAATGGAGATGCAGCAGCTTCACTGACTGTGG 
GAGCAGTATGTCAGCGCGTTCTCCAAACTGGCCAAGGACTCCAACACTATCCTACTGCCCTCCAAC 
CCTGGCGATGTCACCAGCATGGTGGCTCAGGCCATGGGTGTATATGGAGCCCTCACCAAAGCCCC 
AGTGCCAGGGGACTCCAACTNNTCTTCAGTGGGGAGCACATAAATGTCCATGGTCCT^ 
ACCACGCTAANGGCG 

SEQ ID NO: 1940 ACAGCTGTCTGCATTGAAAATTCATGCATGGAGAAAGGGAGTAAGCAAGGG 
AGAAACGGTGCGATTCACATATTCCGCGAGATCATCAAGCCAGCAGAGAAATCCCTCCATGAAAA 
GTTAAAACAAGAAAGCCGCTTTAGCACCTTCCTCAGCCTACTTGAAGCTGCAGAC^ 
TCCTGACACAACCrGGAGACTGGACATTATTTGTGCCAACCAATGATGCTITrAAGGGAATGA 
GGTGAAGAAAAAGAAATTCTGTACGGGACAAAAATGCTCTTCAAAACATCATTC^ 
ACACCAGGAGTTTTCATTGNAAAAGGATTTGAACCTGGGGGTTCTAACATTITy^ 

anggnagcaaaatctttctggaanaangtaantgatacccttcotggggga^ 

seq id no: 1941 acatgaattagaagcgtgcatctaggattatggccaaactgttttaaaaatg 
cagaaatgtaaaattacatcttgaaaatatgaagagatggtctacacacttcaaaaatcaaatgt 
tgctataccagaqatgtatgacaatcacgggattcaagtgacaagcagtaagatctcaaaaatta 
atactggtcaaagataatgggaatatttttgcatttcactgaaaatacattgactact^ 
aaatctagcaggaactcanggaaaaaaattacaaaatctatagccaattacttaatam 
acctaaacaacagcatgacattaacnagaaaactgcacctgcanttttattg^^ 
gntaattnagtt(:tocnaaaa>jtgaanatgnaatccaaaact^ 

SEQ ID NO: 1 942 ACGCGGGGCCCTACTGCCGATGCCAAATATTTGAGAGAAGGG AACTTTTGCT 
GAGGTTTTCTCTGAGGTTTTTTTGATGCTTTATAGGAAACTATTT^ 

CAAGGACACAGTGGATGTGTTTTCCCTGACTCCAGCAGGGCAAGGAATGTANCCGAGAGGTTGTG 
TGGGCTGGGCTCTGGTGCCCTCTTCCCTGGCCAGGACACCTCTCCTCCTGATO 
TCTTTCTGTCTGTTTACCTGTCTCCCTGCCTGCCCATCTGCATCr^^ 
TCTGGGGGCTGAGACCACCCTTGCTGCCCCTTCITCTGCTTAAGAATG^ 

TGGCTCACCCCTGTNACCCCACCNCTTTKGGNGGGGGGNGACGGCAGAAACCTGNGGCAGGATTC 

TANAACNACCTGACCTACATGGANAAACTCC^WCTCTGGTAAAAATACAAAATAACCGGGCA^ 

NGGGCACGCCTTAATCCG 

SEQ ID NO: 1943 ACAAAACCAAATGTTNGTTACTATAAOTCTGCATCACAATTAAAATCCAAA 
CAGrrTTTTAAAAACAGTCAACTCAATCAAAACCCACTACTTCAGAATCi^ 
CACANTAACATTAAATATGGTTAAGACTCGGAATGCAGAAATTTGGTTGGNTGGAAAGCTAAT^^ 
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AACTTCCAACTTGNTCAAATAGAAAT^ACAAAT^M.GGC^^NAATTGTGN^ 
AOTCCNOTGNAATCACCAACACNTGGNCAGCTATGTGAGAATTAAAAGTCCTGAGAAT^^ 
AANTTTNAGGNGTTCCTmrATACAANANATATTGATGTNCTGTTCTTCCCA 
NNGCNTNTAGNGGAATNACACGCACAACCCTTNCITT 

SEQ ID NO: 1 944 ACTAAATGCACAATTTAAACATCAGTTATACACTGTCATTAGTTTTCCTCrrA 
AAAAAATCAGTrCCTGGGCrrTTANATGTGTTTTCTTTTTAAAA 

CACAATTGGTCCATACTGACACCAmGGTAACAATCCTATCAAAGATCCACATGTTCTTGCTCT^ 

TTCAGGCCACTCCATGGCC/lGCGGGAGTCTTGAGTCCTCTCTGCGTGTrGTTGTGGCA^ 

TGGTCCCGGGACTCTGCCTAGAGCANGACGGCCTGNGCTANCTOCrrGATGGTGGACGCAGC^ 

CTCCAGQNATTNCTCTGTTNGTNCCACCGNGACCACAACCC GANT GCTGGGKGGAGGGAGACACT 

TNTTTmNTTITCTGANTTOGNCCCTGANGAAAGCNANACW 

TACCTGNACAATNNGGCNANTTGTNGCNANACNAATGGTCnr^^ 

GNNAAAGGACTCCCCACACTTT 

SEQ ID NO: 1945 ACTTGGCAGGGTTClTGCCACAGACACATTTGGCrCCAGGCTGCAGTTCACAG 
AGTGGTTTGAAGGGGATGCAAAGGCTTTTAGCTCCCATGGATGGAGCACCAGGTTCAAGATCrTG 
ATCCCTGGAGTGGTCTTTTTGATCCAGTCCrCACAGTCAATTTCCCCACAGAATGGAATCT^ 
ATCTTTCCAGAATCTAGTATCrrCTGAAAGTCTTCCATrGTATrAGCCACAACCATATGAAGT^ 
AGGTCTTCAAAAGCCCTTGTGAAAAGGGTGACCTGGATGTCTTCCANAATAGCTTGAAGm 
CTNTGNCTCATTTTTNAGCAACTGTCANCTNTTT 

SEQ ID NO: 1 946 GGGGCAAGACTGTGGATTTCACGCAGGATAGTAATTATTTGTTAACCGGGGG 
ACAGGATAAACTGTTACGCATATATGACTTGAACAAACCTGAAGCAGAACCTAAGGAAATTAGTG 
GTCATCTTCTGGTATAAAAAAAGCTCTGTGGTGCAGTGAGGATAAACAGATTCT TTCT GCT^ 
CAAAACTGrrCGACmGGGATCATGCTACTATGACAGAAGTGAAATCTCTA^ 
TGTTAGTAGTATGGAATATATTCCTGAGGGAGAGATTITGGTTATAACTTATGGACGATCTATT^ 
TTTTCATAGTGCAGTAAGTITGGACCCAArrAAATCCmGAAGCTCCT^ 
TCTTTNTTCCTGAGAAAGATTTCTTGTGCAGNCGGNGAANATTTTAACTT^ 
AGTGGANANATTAGAATNCTCCANGGGCACTTGGNCCTATCCTGNGGAATTrAGTCCTGAGGGAA 

CCTNTTGC 

SEQ ID NO: 1 947 ACTTGTTGTCAACCAAACCAGTGATACTTTGCAGAATTGCACATTAGAACTA 
GCTACACTAGGGGATCTGAAACTTGTGGAAAAGCCGNCTCCTTTGACTCTTGCTCCTC^^ 
GCAAATATAAAGCTAACGTCAAAGTAGCATCAACAGAAAATGGAATAATTTTTGGTAATAT^^ 
TATGATGTCTCTGGAGCAACAAGTGACAGAAATTGTGTGGTTCTCAGTGATATTCACATCGACATC 
ATGGACTATATCCAGCCTGCAACTTGCACTGATGCANAATNCCGTCAGATGTGGGCCGAATTTGA 
ATGGNAAAACAAAGTGACAGTTTAACACCACATGGTTGATTTTAAATGACTACTTACAGGCACAT 
ATTAAAGTCAACCAATNTGAAATTGOaTGACTTCCAAAAAAAAGGCCCNT^ 
CTTTATTGGCAGCCNACCCTTTT 

SEQ ID NO: 1 948 actcagtataaatgcagatgcttatgacagcgacatagaaggcccatgcaac 
gaagaagcagctgctcccgaggcaccagaaaatacagtccaaagtgaagctggtcagatagatg 
acctggagaaagcattgagaaaagtgtgaatgagattctaqgactggcagagtctagcccaaacg 
aacccaaagcagccaccctggctcttcctccaccagaagatgttcaaccttctgcacagcaactg 
gagctgctagaacttgagatgagggcaagagcgattaaagccctaatgaaagctggtgatataaa 
aaagccagcctaggtatttaacttgattttgaattitaggtatgtt^ 
taattttggatctaaaattratttgggggncttatatgttattrrcat^ 
cattttagccctaaarracctggtggtgnttctttttatm 
tactggctatgaattctggcatangttggtagcctgg 

seq id no: 1949 acacacctttgtccctgggtaaattatattcattatgcccactgatgcagcac 
gcataaaccaacacccctgcatggctgaacagggcctaatctaggactgatgggagaagggcrrg 
caaccaagatcaaggtgtttcrrccgctaatactgtctaccaagctgatccctacaaaaatgcatat 
aaaagcaggcaagtttagctactgtgttgcaagagaaaccaggaccttgrraaatagttctc^ 
attaccarrrattctctcaagggaagcttaaaaaaaaaancancaacaaaacaaca^ 
ggccaccrcatgaatccaacaagcattantgtggcatttcagtggagaangaaacttggggggaa 
aatccntcaaggttggnagaaaagctcccaattaactgtcctgncntnttttcctcattcaaga^ 
atccntr>mtranacnctctgncaataaatggggtcangcttanggcntgn 
gacatataaccgatitnttga 
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SEO ID NO- 1950 ACnriTITITTriT^^ 

TCNGCGGTTGANTAGTTCCCATTGAACAGACACAAGGNCAAAACGTCCAGGCTGANCTGG^^^ 

GrrArrCNGGGAGTGCCATTGGGNGCrrCCATATTTNCAATTTCTGCTTG 

TGTCNANCAGGGGGTAATCATATATTAAGGTTTNCTTAGCGTTACGACITAC^ 

TAATCAAGGNAAGCTNTCCCGAGCTTNCATCNGACAATTrGC^^ 

TGGATCACTTTTGCTCAGGGTTGGGNCT 

SEO ID NO- 1951 ACTTT NTi - n - i - 1 1 1 1 U 1 1 1 1 i 1 1 INGAGAAATGTAAAACTCAGnTAATTTCA 
AGTrrGTATAAANAATGTATGCCNCANTTTGTATTTTATA^ 

NAAATAAAAAAAGTGATNCGTTAGCTGrrATGAAGGGNGAAAACATTATATAAACTTCA^ 
TGCTITCTGCATCTGCATOTATGTAATTTCATGGNTCTCTACCAAm 
TATAGCCAAATTATTGTTCACTITCATGCCCCAACATGGGAAGGGGTAGGNGAANNTNT^ 
ATACAATTTATTGTCTGCTCCTCCCTCAAAAGAGCCCTGTGTNTGGCCANTCTACACT^ 

SEO ID NO- 1952 ACCATCGCACACACTATTGACGTCATTGGAAAGAAGGAAGACGACmGTCT 
GCTGCCTrCrmGAGTGGCAAGCCACTGCACTGGACCCATCTCTGCTAm 
TTCAAGGATGACCTCAOTCTGCAATGGTTTTGAANAAATTCAGNGAAGTA^^ 
GAAACATATITCATGATGGGTAAACCACAAGAACCrrTAATGGGGGGCAGTAGTGTG^ 
AAGGAAGTCTTCTTGATCCTTTCGTGCCTCCACATTANATAGATCCCTGCCACCAACACCCATGTC 
GCCACCAGCAAAGACAGCAGGAGGAGAGGCAGCCAGCCTCCGGNTTrGNTTTTGTrGTIT 
GGGAAANGGACGCCTTGTTTGTGGGCAAAACCACAAATTGTTCNTTTOT 
GCCNCCAANTANGGAAAANNTGGN 

SEO ID NO- 1953 ACITI i 1 U 1 1 1 1 1 U i ITITITCCACACCTGCCCTrrATrGGTCTCTTO 
NAGNGGCTCXAGGCCrrrCACGCCTOTCAAACACCACCCATGAGGGm 

ctgtgaaggcccaaagcttacccaagtcttggagcccaagttgaatcaccaaccanaggg™ 

AGAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGATATC^ 

gggaatgctgaggtccagcagtgtctcctgaaggcatgctgcatcctaaggctcctcaggactgg 

ATGAANTAGGAGATCTGTGTGTTGAGCAAGTTCACATOTATATGGCAACTTTAAGGAG^ 

atntcaggctcaatgtnnatgnttgggaaagngccgcttgaancgtcgaaaggctctccttccgg 

CCrITGAAGAACTTTTAAAAGTTCCAGCC^n^TNTGANCW^ 

tgggatcggtcatgaggaaaatgggtcatcataagg 

SEO ID NO- 1954 ACTACnTOVGGTAATCATTGTmAOTAAAGTTCAGATTCCAG^^ 
AGATGAATATrCCCTGGTTATACmGTCAATAGTmCTCATTGCTAC^^^ 
TCACAAGCTAArrrAAAAGACATTGGATTAC^ 

ATTCCACTTACAATTCCTAATCTTGAGCAAATTGAAAAGCCTATATCAAT^^ 

ArrAATTAAAAGTTACAGCTGTCATAAGATCATAATTTTATGAACAGAAAGAACTCACGA^^ 

AAAAAATAAACTGAACTAAAACAACrmGCCCCCTGACTGATAGCATTTCA^ 

AAGGGCOTATGATNCCAAGTTATTAAATAGAGGTGTANTTTTAAAAAC^^ 

GTTTTATTTGTITTTTNAGGGACCCTATT 

SEO ID NO- 1955 CAGGTACTCTrTAAAAAGGGACTGCAGGGCTGGGTGTAGTGGCTCACACCTC 
TAATCCCAGCACTTTGGGAGGCCAAAGCAGGTGGGTCACTTGAGGCCAGGAGm^ 
TGACCACGTGGCAAAAGCCCATCTCTACTAAAATACAAAAArrAGCTGGACATGATC^^ 
CTGTAATCCCAGCTACTTGGTAGGCTGAAGCATGANAATTGCrrAAACCTG^^ 
CAGTAAGCCAAGATCATGCCACTGCACTCCAGCCTGGGCAACANAGTAAGACTCTGTC^^ 
ATAAATANGAAAATAATACGGGACTGCAGTGCTAACAGTAATTTATACATITrrA^^^ 
GTATGTTTAGCTGGGCTANTGNACAATOTACTACCCNGAANGTGCAGTNm 
CmGGGT>mGGAAGNGAACTGTNCAAAANATT™rrGNC^^ 
NGANAAATNTGTTCCTTTCTTACT 

SEQ ID NO: 1956 ACAOTATTCATTrATGCrrGAAArrCCAGTCCTAGACCAAG^ 
AGCATTGACGTTCITGCCATCCANAAGAGCTGACAGTGTCAGmAATACCTGGCTTTAGA 
GTGTATCCTAAACCTATCAGGCTGGAGTTGTTCACTTTAGCCGANAAGCAGGCGTC/^^ 
TGATACTTGGCrGCrATTCCGAAGCGTGTGrrACTGTTTCCTGCTGTCCAGGCAAG£^ 
GTCrCCAACTTCTTGTTCACTTTCTGGTAAATGGAGCCGCAAACTCT^ 
GNGAAGCTGGAATTCATNAGTCTTGNAGCCAACTGCAAAGTTGCTCTGGGTCACT^^ 
AGTCTCAAAAArrATCTGGTAGCCGGGCAANCAGCCTNGTAACCTAGCACCAAAGNACCCCGGAT 
GGGAAGCCCANAATGTNCAAAATCNTTGCGNANCCAGGTANTGTGCTCCCGCTTGNACCTCGGCC 

NGANCCNCTAAGGCG 
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SEQ ID NO: 1957 ACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCITNTGCTGAGGGGGCA 
GGCGGAGCTTGAGGAAACCGCAGATAAGTTTTTTTCTCTTTGAAAGATAG 
TTAAAAAATAAGTCAATAGGTTACTAAGATATTGCTTAGCNGTTAAGTTT^ 
AGCTTAAGAtTTTAAGAGAAAATATGAAGACTTAGAAGAGTAGCATGAGGAAGQAAA AGAT AAA 
AGGTTTCTAAAACATGACGGAGGTTGAGATGAAGCTTCTTCATGGAGTAAAAAATGTATT^ 
GAAAATTGAGAGAAAGGACTCCCNAGCCCCGAATTAATTCCAATAGAAGGGCAATGCT^ 
ITAAAATGGAGGGTGANCTNANCAGCTTTAAAGTTTAGTTTAAAAGGTGGNGGGNGNATTA^^ 
TANrmGAAGGCGAACNTTrrAAAANGAGGATTNNANCCGAA 

SEQ ID NO: 1958 ACACATACACACCTAAAGAGTCATGGCCTTCTTAAACAGCTTTOT 
TCTGGAAATATCCTTTGGTTCATTTTTATTGCCCCrCTCTAGGCAAA^ 
GTATCAGTGATTATTTCCTAGCACTTGTAAGCAAATATCCITACCAAGAGGAACCATTC/^ 
ATAATATCGTAAAGCGTGGAGTTAAGATGTGTTTTTAAAAAATATACAGGCTT^ 
ATCTGAAAAAGGTAACATCCACAATGACATTCTATTACAGAGTTCITACAATCACCCTAGCCT^ 
ACACTCTGGTATAATACTCTTTCTTCAATTCTGTTTAACAGAATAAAAAGTAAC^ 
CAATGCATTTGAACTTAAAATATACCTGCCCACAGGAATTAAGTAGTTTATTCC^^ 
AATTTCTCAATCCCX:TTTACCTNTGNCTrAAGGGGTAAGCACATAGTC^^ 
ACCAAGTACTGTATGGATTATTGNATACA 

SEQ ID NO: 1959 AC IU - iU - ll -r i - ill U UU - iUN GG'rriUU-llU-i-i-iTCTAAGCCATTACTTm 
AAANATTTGNGAAACTCTTCACATCATGGNGANAGTTTGWGATTAATAAAAAN^^^ 
GAAATGOTGGAGGGGAACAAGTTCTCAGCCTGNGANATCCNACCATCCCATTAANTTTGAAGTT 
TCTCTTGNTTAATANAAAAACAAAGGGGNGGGNGAAAAAAAGGAGGAACA^ 
TGACAAT>rrTCCAAATGTGNGGAAANAACAACCNATTCACCAACrCCNTT^^ 
TGTCTACATNTCACTCTTTGNTTTGGGNCTTTCTGGCTGAAAC^^ 
GANAAGACCCCTGGTTTTNNAAAGACCTAGGAGGAAAANGCCTTGAGGGATGCNT^ 
AAAAAACTAGAANGCCTGGGCTCCCAAATTTTNAACC 

SEQ ID NO: I960 ACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCAGTGGTTTAAAC 
GGCACGTTTTTGCTTATAAAAAAAGTGCAAAAAAGATGTGGTTTACAAGTTAAAGCT^ 
CCTTTTTGCTGAATTGCACCAGTTTTAAAGCCTCTGGACAGAGCAGTAT^ 
TTTCTTAAAAGCTTACAGTGmGGCTAATTCrCCTCCCCTrm 

GGACACTGGTGGCAGGTTAAGGGATACTGTCACirrAAGAAGCCTGCAATTGAAGNGTAAAC^^ 

GAGAAATTAGGGGCTGATTTITrAAACTGTGTGAGATATTAACCAGCCGGCCCTGTTATAAAA 

GGAAATNCAAACAGCGATTACACCGATTACACCCCCTTTATATATTTTTTACA^ 

AAATAATCNAACGGTTTCATCTNTTTGGCTTTTTTGT^^ 

ATAAAAATTTAAAGGTTA 

SEQ ID NO: 1961 ACAAGACACTACGGGAACAGTTTGCCTCCCTCCCAGCCTCAA CCACAATTCTT 
(XATGCTGGGGCTGATGTGGGCTAGTAAGACTCCAGrrCTTAGAGGCGCTGTAGTATn^^ 
TTGTCTCATCTTTGGATACTTCTTTTAAGTGGGAGTCTCAGGCAACTCAAGT^ 
TTTTGTTTGTTTTTTGAAACAGGATCTTGCTCTGTC^ 

GCCCAGTGCAGCCTCGACCACCTGGCTCAAGCAAATCCTCCCATCTNCATC TCCA AAG 

TGACAGGCGTGANCCACAAGCrCCCANCCTANGCCCTTAATCTTGCTGGTATTTTCCATG 

AGGNCTGGTCATCTGAGCTCACGCTGGTTACACAGNTCTAGGGGGCTGCTCTCTAACTC NNAG GG 

GGTTITNGTGANGCTCTTGTGGCCNAANCAAACTGNATNTTG^ 

ACCACMSfGCCTGGATCTNCACTGGAAGCCANTTGGTGCCC 

SEQ ID NO: 1 962 ACCCTCAGAATTGATGTGTTGAGATCCTGACCCCTAAGGTGATGGTCCTAGG 
AGGTGGGGTCTTCAGGTGGGCGGAGCCTCATTCATGGCATGAGTGCCCTTGTCCAAGGGGCCCCA 
AAGGCTTCTGGCTGTCTCn'GGCCATGTGAGGACACAGTGGGAAGGCAGCCATCTACAACCCAGG 
AAGCAGCTCTCATTGGAACCCTGACTGTGCTGTCACCGTGATCGCAGACTTCCAGCCTCCAGAACT 
GTGAGGAGTGTCTGTTGTGTATAAGCCACCCAATCTCTGGTGTTCTGGGATAGCACCCATACACAC 
TAANACAGGGGTTNCTGAGTGCCTGGAGAGCCAGCTCCITCCAAGAGGCTGGGTCCCTCCTTOT 

mcTCNTTTmriTCTrTATTnTrr^ 

TGTNGTGATCANAmTAANTGAAGCTCATTNTTGGCTAAAGGANCCTTCCCCT^ 

SEQ ID NO: 1 963 ACTTTCTGCTGCTGAAGTAATGCAATGGTCTCAATCTCTGGAAAAACW 
CCAACCAAACTGGTCAAAATGTCTTTGGAAOTTTCCTAAAGTCTGAATTCAGTGAGGAGAAT^ 
AGTTCTGGCTGNTTGTGAAGACTATAAGAAAACAGAGTCTGATCTTTTGCCCTGTAAAGC^ 
AGATATATAAAGCATTTGTGCATTCAGATGCTGCTAAACAAATCAATATTGACTTCCGCACTCGAG 
AATCTACAGCCAAGAAGATTAAAGCACCACCCCCCGTGTTTTGATGAAGCACAAAAAAGCATATA 

293^^^ 
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TACTCTTATGGAAAAGGACTCTTATCCCAGGTTCCTCAAATCAGATATTTACTTAA^ 
GACCTGCAGGCTAATAGCCTAAGTGACTGGNCCCTGCTGAAGGGAATTAACAGATAGTTTCAANC 
GCANAAGGATGTGCCAGATGGCTCCTGGGGAACAGCTTGGCCnTTrTGGGNGOT 
AAAAACAAATGACTCAAATGGTTAACCNTGAAGT 

SEQ ID NO: 1964 ACCACGCTGGTCTAATGCAAAAATGGAGATTGCTACAAAGGACCCTTTAAAC 
CCTATTAAACAAGATGTGAAAAAAGGAAAACTTCGCTATGTTGCGAATTTGTTCCCGTATAA^ 
ATATANTGGAACTATGGTGCCATCCCTCAGACTTGGGAAGACCCAGGGCACAATGATAAACATAC 
TGGCTGTTGTGGTGACAATGACCCAATTGATGTGTGTOAAATTGGAAGCAAGGTATGTGCAAGAG 
GTGAAATAATTGGCGTGAAAGTTCTAGGCATATTGGCTATGATTGACGAAGGGGAAACCGACTGG 
AAAGTCATTGCCATTAATGTGGTGATCCTGATGCACCAATTATAATGATNTCAATGATGTCAACCG 
GCTGAACCTGCTACTTAGAACCTCTGNGGCTGGTTTAAAAGGATAAGGTTNCT^ 
AATGAGTTTGGGTTTANTGCGAATTANAGATAAGGCTTTCCTTTGTATTATTAA^ 
TTNAAAGCATTANTGCTAAAAAAC 

SEQ ID NO: 1965 ACCTGTAAATGTCCTGAATCCAGCATTGTTGAGCTGTCATCAACATTCTTGTG 
TCTGTTTTACTGTTACAATATTAGGTGAATATGGAAGTAAAGGCATTCCACAGGATCATC^ 
AAAAAGAATTCTGGTCCTGTTTTCTAAAAAAAAAACTGTTGTAGAAAT^ 
TATTAGTCAGAGTTrCAGCrrrCTTCAGCTGCCAGTGTGTTACTCATCm 
ATCAGAGATTTTTGGTTTGTCACATATGATCTCITAACACTm 
CrrTGGGGAAAAATTCTTGGGTATTCTmCCTTACCAGATTATGGTATT^ 
GTTCATACCCGGTTAAGTTNTTGCTAATATTrCCANAAGATTNTTGTNTTGGTGA^^ 
GGATGGGGGGNTTTTTTGGTCTTGTTGTCCTCGGCCCGACCCCCTANGGCG 

SEQ ID NO: 1 966 ACTTCATTCCTITn'ATGGCTGAGTAANGGNAAGGAATATTCCATGGA^ 
TITATTAATCCACTCATCAGTTGATGGACATTTGGGATGTTTGGGCITAC^ 
GTAAGCATATCTTTGCCTACTCCTTCCCACAAATTAAGGAAAATCAGTAGCA AAATG AAGGGCAA 
GCANCTAAAACAGCCAGCAACTGGCACACAGACAAGGAAAAGCACCTCTGGTTTITAG 
AATGAAGAGTGGGGCCGAGATTTAATTTCNANCTAGGTTNAAACCCCTAACACTTTCAAGOT 
GTTNTGAGTTGAGTCNATCCACrrAANCATNAAGCGCCCCATGGGAAAGTCT TGGATC AGCANGA 
NGAGCTCCAAATrACCGATCACTGNNTTTNATCTGTCCTCCNGGACTCCTAGAT 
CTCATNGGCCACCAANGGGGCTGNGGNTGGGGGTTANATAGGNAAAACrCTTAACCTGTCCCATT 
TTTNATTAAAAGAA 

SEQ ID NO: 1967 ACGCGGGGACCTCATTCATTrCTACCGGTCTCTAGTAGTGCAGCTTCGGCTGG 
TGTCATCGGTGTCCTTCCTCCGCTGCCGCCCCCGCAAGGCTTCGCCGTCATCGAGGCCATTTCCAG 
CGACTTGTCGCACGCTmCTATATACTTCGTTCCCCGCCAACCGCAACCATTGACGCCATC^^ 
GTTATTCGAGTGACCGAGACCGCGGCCGGGACCGAGGGTTTGGTGCACCTCGATTTGGAGGAAGT 
AGGGCAGGGCCCirATCTGGAAAGAAGTTTGGAAACCCTGGGGAGAAATTANTTAAAA^ 
GGAATCrrGGATGAGCTGCTAAATTTGAGAANAATTTTTATCAAANAGCAC 

CGCACAGCACAAGAAGGNGGAAACATACAGAAGAAGCNAGGGAATTACAGTTAGAAGNCCAACN 

TGCCCCAAGCCAGTTCTAAATTTTATGAANNCCAAmCCTTGCAAATGCATO 

ANAAAAAAATTNNCTTGACCCNCTrGTTTTT/^ 

SEQ ID NO: 1968 GCGTGGTCGNGGCCGAGGTACGTTTCATGACCAAAATTTATCATCCTAATGTA 
GACAAGTTGGGAAGAATATGTTTAGATATmGAAAGATAAGTGGTCCCCAGCACTGCAGATC^ 
CACAGTTCTGCTATCGATCCAGGCCTTGTTAAGTGCTCCCAATCCAGATGATCCATTANCAAATGA 
TGTAGCGGAGCANGNGGAAGACCGACGAAGCCCAAGCCATAGAAACAGCTAGAGCATGGACTAG 
GCTATATGCCATGAATAATATTTAAATTGATACGATCATCAAGTGTGCATCACTTCTCCTGTO 
CAANACTTCCTCCTCTTTGTTTGCATTTAATGGGACACANGTCTTTAGAAAC^ 
AANGCCCAGNACNTTTTCANGCCTTTGGTGATTAAATGCACATTANCAA^ 
GATTCACTCTCnTAAAGCATGAGCANAGGCTTGAAANTNTCATCTGGGATT 

SEQ ID NO: 1969 ACGCGGGGAGCACAGTTCTGTCCAGAGAAGGAAGGCAGAATAAACTTATrCA 
TTCCCAGGAACTCTTGGGGTAGGTGTGTGTTmCACATOTAAAGGCTCACAGACCCTGCGCTGG 
ACAAATGTTCCATTCCTGAAGGACCTCTCCAGAATCCGGATTGCTGAATCT TCCCT GTTGCCTAGA 
AGGGCTCCAAACCACCCCTTGACAATGGGAAACTGGGTGGTTAACCACTGGTTTTCAGTm 
CTGGTTGTTTGGTTAGGGCTGAATGirrTCCTGTTTGTGGATGCCTTCCTGA^ 
GACAAATACTACTACACAAGAAAAATCCrrGGGTCAACATTGGCCTGTGCCCGAGCGTCTGCTCTC 
TGTTTGAATTTTAACAGCACGCTGATCCTGCriTCCTGTGTGTCGCAATCT^ 
CACCTGCTCATTTTGCACCCCACACTGAGAAAGCAATTGGATCACAACCTTANC^ 
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GTGGCCTATATGATCTTGCTACATACAGCTATTCACATATTTGC 

SEQ ID NO: 1970 ACAAAATGTGGATCACACTAAAGACTCTGAGCCACAGTCTCCTGGCATTTGA 
ATATTGGTCATGTTTCTTTATTAOjGTITCTTrGTTCTTAA 
TAAGTTTCTTTGmGCTTTTTAAAACATATCAAGi^^ 

GTGCCAAACAGAAAGAGAAAGGGGACAGGCCAAGCCTTTATGCCTGAGAAATTGTGGGGGAAAA 

TAGAGGGACAGATGAAGTGAGTCGGTAGGGTGCTGGGGTCTCTCTCTTCACACCCATTCTT^^ 

AAGACAACACTTTrrGCANANGTGGGGCTTGGCCTTTAACAACAACC^ 

CTACTAAAACTGGGCCAGGAACTTGGTTCCCTGACATAATAGGGGAGATTGCAAGCCTCTAGAAA 
AAGCTCCATGTGGCCACTTCTACATTGCACAGGAGCCAAGGTGANGACTGCNAAAGCANACAAGT 
GGATGGGTGANCTGAAACACGGTANTAGTTCCATGGGCCCCAAAATNT 

SEQ ID NO: 197 1 ACTTCTGATAGCTCATCACTTTCTGTGTCAGAACTTTGAAAAGCAA^ 
ACATCGACTTCCirCGAGACTGCCTACTGATTCTTCTAGAAGACAGAGATCT^ 
GGACTGACTCCAAAGAGCCTCAGGTAAAGCAGAGGCCATTGGCAGCCTAAGGTTGGCAGTO^ 
GAGGCTGCGGAAGGTTGAAAATGAAAAGGAGAGTAAGCAGAGGGATGATAAGACCCAAAACCTC 
TGATGGGATCAGGGTCTGTAGGAAGAGANAAATGAAGCTGGGGAGAATCCACTCANGGAAGGGC 
CANCANAANGCCTGTANTAAAGCCTCCAGGATGTTGCAATGGANCAACTGTGCANAGCTTAATGA 
GCTCCCTGAAAAACAATGCCNCAATAGGGTGCAAAAGGNGGGTTCTGGGGAGGCCTGTGGGACA 
AGACGCCGACTGTGGGATCCAGTCNCACAAGTGCCAGCACAGGGCCTTGGCTGAATTGANATGCA 
TCAAACTGTTGGGANGAACACTT 

SEQ ID NO: 1972 ACGAGAAAAGGGTCCGAGCACAAGCCAAGAAGTTTGCGCCCTCATAAGCAG 
CGACCTTGTGGCATCGTCAGAAGGAAGGGATTGGTTTGGCAAGAACTTGTTTACAACATT^ 
AATCTAAAGTTGCTCCATACAATGACTAGTCACCTGGGGGGGTTGGGCGGGCGCCATCTTCCATT^ 
CCGCCGCGGGTGTGCGGTCTCGATTCGCTGAATTGCCCGTTTCCATACAGGGTCTCTT^ 
TTTTGTATTTTTGATTGTTATGTAAAACTCGCrmATm 
GNAAAAATATAAAACTTTTTATACn'GGGGAAGCCCCCANGGGCGAmTO 
AGGCATGNTTTTTACCCGGGNAAAANTGNACITGGCTAGOTGGGTGTAAGNAA^^ 
TCCTGCCtUUUl'I"lU-l"l'AAAACTTTOAAAAACNGGGGTGGGl"lTl"l"ri"lA 

SEQ ID NO: 1973 ACACGAGAAGCTCCGAGGATGGCTGAAGTCCAACGTCTCTGATGCGGTGGCT 
CAGAGCACCCGTATCATTNATGGAGGCTCTGTGACTGGGGCAACCTGCAAGGAGCTGGCCAGCCA 
GCCTGATGTGGATGGCrrCCTTGGGGGTGGTGCTTCCCTCNAGCCCGAATTCNTGGACAT^^ 
TGCCAAACAhTTGANCCCCATCCATCTTCCCTACCCTTCCTGCCAAGCCANGGACTAATCA^ 
AANCCCAGTAACOTGNCCTTTCCCrGCATATGCTTCTTGATGGNGTA^ 
GCCTCAhnsrCAAACTGNTTTCTTCCTTTANNTGTTTATAT^ 
NNCNAAGAGCCAAATCCXnTTGTTCCACmAhrrrATAAATGGm'GGNAAC^ 
CAANGGTNGGCTTTNNCCTTTGGCNTGA 

SEQ ID NO: 1 974 GGGACnTTTTTTTTrrTTTTTTT^^ 

GANATITACTGGCANAATTTTrCACAATANATTTTAAATCTGTATCTG 

AGTTTGNGTCCNCAGGCTCATTCTCANATANATGCTGCTCCTTATTTT^ 

CAGGTTGNGTrrGTTOTAAAAAANAACAANACGTTTCTATTCTTG^^ 

CITGATTrrCTTCATCCCTITCCCTTTT^ 

CTGNTGCTGCOTGAAAACCCANCITNCCTTTATTATAN 

ATAGrn'GGGATCTTNATOITAACQ^AAGT>rmGATACCNTO 

CAGGACTCCTTTACAGNGGGGNGNATCTTG 

SEQ ID NO: 1975 ACAGAACACCCTAGACAATTCAAGGCATCTTAATCTCCATCAAGAACAAAAA 

aaaataatrmgtcatgctgttaaatccatcattatggataaaactcgtgca^ 
gatccaacaaacactgatgtccaagctggcatattggcaactaatacgttaactggtggtcaata 

GAGAGTTrAAAAGATCTTCCCTTTCTTCTTGTTCTTTTCC^ 

aaatacgttgttgagcttcccaotttgctttctcatcctcgaactgacgacgt^^ 

TTrGGCTGNGCTTCCAAATCTTTTTmrrGNTCA 
CTTTCTTCTTCCATTTGTGCCAGAAGGGCTOTANTCAAGCTGCCC^ 
CCATTATAAGTCACAGCTGCAAGGTTTCTGCTTCTGl^GNTCTCATAGTGGACATTA 
TCTTTCAAGTCCTGCATGTGTGTTCnTATCACATATTCTTANG 

SEQ ID NO : 1 976 ACGCGGGATGTGAGCTCCTGAAAAATCTGGCCTTGTCTGGTTTTAGACAGATT 
CATGTTATAGATATGGACACTATAGATGTITCCAATCTAAATAGGCAGTTTTTATTTAGGCCT 
GATATTGGAAGACCTAAGGCTGAAGTTGCTGCAGAATTTCTAAATGACAGATTTCCT 
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GTAGTTCCACAmCAACAAGATrCAAGATmAACOACACTTTCTATCOACAAmCATATTATTG 

TATGTGGACTGGACTCTATCATCGCCAGAAGATGGATAAATGGCATGCTOATATCTCTrCTAAATT 

ATGAAGATOGNGNCTTAAATNCNANCTCAmGGCCCCTTnGATAGAATGGGGGGGACAGAAGG 

rriTAAAGGAAATGCCCCGGGNGATTCTGCCTGGAATGACTGCTTQTATCGAATGCCCGCTGGAA^ 

TrrrcCACCACAGOTTAATTTrCCCATGGCACCATrGCATCTATGCCCAAGCTACCAGAACACTGG 

>m-GAGTTTGAANGATGTTGCAATGGCCTAAAGANCAACCTTTTGG 

SEO ID NO- 1 977 ACAAAAAGCTGCATTGGAGGATAGCAGTGGCAGCTCCGAOTTACAAGAAATT 
ATGAGAAOACGACAGGAAAAAATCAGTGCTGCCGCTAGTGATTCAGGAGTGGAATCTTrTGATG^ 
AGGAAGCAGTCACTAATTTGTTTGTTTGTATTTAAACTCCATrGTrTTTGGCATTATrCCAACATG^ 
TTTGTTTTAAGAAGCCTTGAAGGGAATGTCAGATTCATTTrrCTTGATGTAATTTATCACCATA^ 
AAAAAACCCATGCNAACCINAGTGAGCACAGGAmGCTTCTOGCCCATTATTTTrATTA^ 
AAAAAATmAACTGGATTTTTTGGNCTTGGAAAAAANTTmCTTACTT^ 
TCCnTATrrAGACTAANTATTrrATCCCCATNCCANGGTATAAACAGGAATTGNTTTGATAG^^^ 
GGGAGTTATTCCTCCNACAAAGCAACATGTTGNCCTGATTCAAAATCNAANCAGTTCCAATTTGCC 
TGNGAAATGGNOCTGGATTNAGGCTACTCACTGGAGGCTACCTTG 

SEO ID NO- 1 978 ACTTCATTCCACATrCAATCAAAGCAAGTGCAACCAGCACAGGTG(XCrTCC 
CAATCCTGCAACACAATGCACTGCAACACAGCAACCTGGCTCTrCACGAAATTTGGTTm^^ 
GmAACCAATCATCTACTATCTGATTAGGGGGTGGAGCTCCATCATCAAATGGCCAATCri^^ 

GTGGATTCCrrCTrmCAACTGGAGCTTTATCATATO^^ 

CACTCCATACTTCn-AAGTTCCTCTGTGAACTTGTTGAGAGTANCATTGGTAGGffrrGTOAOTrATC 

AAAAAACGCATGTTCTCATAGOAAATCTCCACAGGGGCTGGGCCGGTCATTATGGCAA^^^ 

TGTCAACGTGCGTGTGAATQTGATNGGGAAAGTGAAAAAAAAAAAATCAATAAATCTTGAM^ 

•ITCACANGCAGAAAACATrAAAAAGACCACTAAAATGCmTTATCAATCAGTGTrTrCTCTATTC 

AACITGTTATTCCTTATGAAGCTrCTCTCTTCAAGATAAGCAAAGTAT 

SEO ID NO- 1979 ACGCGGGATCATCAATGCCNAACAATGAGCCCCATCCATCTrCCCTACCCTTC 
CTGCCAAGCCAGGGACTAAGCAGCCCAGAAGCCCAGTAACTGCCCrrTCCCTGC^^ 
TGGTGTCATCTGCrCCrrCCTGTGGCCTCATCCAAACTGTATCTTCCTTTACTGhnTATATOT 
CTGTAATOGTTGGGACCAAGCCAATCCCTTCmCACrrACTATAATGGTTGGAACTAAACGTNAC^ 
AAGGNGGCTr^^^^ICTTGGCTGAOAAATTGGAAAGGCGTGGTGGGANTTGCT^CrGGGTrCCCTAA 
GCCCmmJAGGGCAGAAAAGAAAACA-rrCTTITCXamTm^MANCaKj 
NTNAAAANGGCNNGAAGNGCTCNC^mTITCCATGGTOa^CCGGG^OTCTGTGCTGTGTATT^ 

ACCACCCATGNGAGGGAATAAACCTGGCCCTTTGGAAN 

SEO ID NO- 1 980 ACAAGGTGAATATCTTAACCAGACTTGCCGCAGAATroAACAAATTTATO^ 
l55AAAX>!GTGACrGAGGACACAAOCAGTGTTCrGCGTTCCCC^^^ 
TCTCTGTCAGGCCTGGAGACGCGGTAGCAGAAGGTCAAGAAATrrGTGTGA'TTaAAGCC^^^ 
ATGCAGAATAOTATGACAGCTGGGAAAACTGGCACGGTGAAATCTGTGCACTGTCAAGCTGGAGA 

ScAOTTGGAGAAGGG^ATCTGCTCGT^^ 

CCCAArrTAATTAACC>mTrGCATGATGCmACACACAATTGGATTCAAGCATrNTACAGGAAC 

ACCCCTGTGCAGCTACGTITACCGTCGTCAmATTCCACAGAGrCAAGACCAATA 

AiC\TCACCAATGGGAAATTTrCA-rTOATATAAATACTrrGTCCTCGCCGCGACCa}^ 

SEO ID NO- 1981 ACCTCGTCTCAAAACCTGGCTGCTATCAGAGAAGAAGGGTGTTTGGGGTGTO 
mACAAAG^^^ 

GCCTTTrACGTCCTGTTTATAAAATGAA-rrCCAAAGCACCCAAGTCATCAACTGCCAACCAAGGGG 

ACGGGGATGAANAACCTGTTGGAGACCTGAACCCAGTGTAGGAGAGTOAGC^^^ 

CCCCAGGATOA(>CCACAGCATCTCCCCCTGCTATATGTGGGQAAAACTCATGGTCAC»AAC^ 

ATITATCCTmANOGGACTACANAAAGGCCNGCTOCTrrGANCTATn^GGAAA>n^^ 

NNAGANAGCATA-rrA-rcT(XGOATTAAATTNCACCan-CGOTGNTAAGATTCATACCTCCTT^ 

NAAACCTGTC 

SEQ ID NO: 1982 ACCTGCATCAGCATTAGTAATCAACCTOTTAATCOUGGT^ 
TGAAATrATTCCrOCAAGCCAATTrTGTCCACGTGlTGAGATCATTGCTACAATO^^ 
TGAQAAQAGATOTCTGAATCCAGAATCGAAGGCCATCAAGAATTTACTGAAAGCAGTTAQCAAGG 

ATOAGVCNGAGGCTGCCTCTCCCATCACTTCCCTACATGGAGTATATGTCAAGCCATAATO 

^SmSG^ACTAAAAGONGACCAATCOTGGGNCACCCAAA™^ 

^mOGGAAAGGGTAATG^^■CATCATCCTAAGCTA^TCAOTAATAACTCTACCCrc 

AAAGCTCTACTGAOOTGCTATGTTOTA^^ 
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TTCCCATCTTCCAAGGGTATAAGGAATTTTCTGCTTTGGGGTTTATCAG 

SEQ ID NO: 1983 ACACTTCTGmATATTTAAACAACAAAGAAAAAAGCATCTACACACTTAAA 
AAATTAATTCAATATTCCTAAATCTATTTTAACTCATTTTAA^ 

GCAGAGTTAAGAATGGAATAAGGTGGGGAGAAGAAGGGGACCACGAAGAAAAACACTTAGACA 

ATTACTTGTCTGTTGTGGGTAAAGCAACAGGAATCCTGGGAGATACAAAGAAATCAGTAACAACT 

TTGCTCATAACTGATATTTTCCCCTCATGTTTGNITITA^^ 

GCTCCCITCCTGGCCTAACANOANGGGNCTTTAACCGA(>IGGCNTGGTCCC^ 

GCCATAAGCTTATAANAATCTTGGACCCTCCCCTGTCCATAGTCATAANTATTTCTGAGTCCCC^ 

ACTCTGGCTGNAATAACCCTTCGTAGCCTTCGNAACTriTGTTCTNGGG 

ATATTCTTCTATGTTTCTGGNGCAAGA 

SEQ ID NO: 1 984 ACGCGGGGGTGGCGCTTTCTGGGTAAAGATGGACGTCCACGATCTCTTTCGC 
CGGCTCGGCGCGGGGGCCAAATTCGACACGAGACGCTTCTCGGCAGACGCA GCTCGAT rCCAGAT 
AGGAAAAAGGAAATATGACTTTGATTCTTCGGAGGTGCTTCAGGGACrGGACTTT^ 
AGAAGTCTGTCCCANGTGTGTGTGGAGCATCACAAACACATCANAAGCCCCAAAATGGNGAGAA 
AAAAGAAGAGAGCCTNACTGAAAGGAAGAGGGAACCAAANCATGAAAAAAAGGGAGACGATGA 
CirCAGAAATTGCTTTCCAAGAAGAAAGTGCTTCrrAT^ 
ATAATTGGATACAAAAAAAGrrCANANNNGAAANNTACNAACTTCCGGAj^ 
AANAAAAGAAAAAATNAACTTCTTGCGGAATNAACCTAANTTCACGTNCAAGGACC^ 
NACGCNATTGTTTCATTTACACTTGNCC 

SEQ ID NO: 1985 ACGGGGATGTGATGGATGTCTrCATCCCCAAGCCATTCAGGGCCTTTGCCTTT 
GTTACATTTGCAGATGATCAGATTGCGCAGTCTCTTTGTGGAGAGGACTTGATCATTAAAGGAATC 
AGCGTTCATATATCCAATGCCGAACCTAAGCACAATAGCAATAGACAGTTAGAAAGAAGTGGAAG 
ATTTGGTGGTAATCCAGGTGGCnTrGGGAATCAGGGTGGATTTGGTAATAGCAAAGGGGGTGGAG 
CTGGTTTGGGAAACAATCAAGGTAGTAATATGGGTGGNGGGATGAACTTTGGNGCGTTCAACATT 
AATCCAGCCCTGATGGNTTGCGGCCAGGCCACACNACAAACCAATTNGGGTONTGATGGGCATOT 
ANCCACCAGCANACNNGTCAGGCCCANTCGGGTTAATAACCAAANCAANGCCACATGCANNGGG 
AGCCAACX;CGGCCrmGGTTCTGGAAATAACTCrrATAGTGCTNTAATTCT 
TTGGGGATCAGCATTCATGCAAGGGCCGGGCANTGGGTTTAATTGGANGCT 

SEQ ID NO: 1 986 ACTTGAACTGGCAGGAAATGCATCAAAAGACTTAAAGGTAAAGCGTATTACC 

cctcgtcacttgcaacttgctattcgtggagatgaagaattggattctcrcatcaaggct^ 
gctggtggtggtgtcattccacacatccacaaatctctgattgggaagaaaggacaacagaagac 

TGTCTAAAGGATGCCTGGATTCCTTGTTATCTCAGGACTCTAAATACTCTAACAGCTGTCCAGTGTT 

GGTGATTCCAGTGGACTGTATCTCTGTGAAAAACACAATTTTGCCTTm 

AGm-GGAAGTTTATTANCTTrTCAAACCAACCAAAATm 

AGTGGANCTTGNGGCTTNAAANAANCTTTTGATTCTGAATTANNAGGTT^ 

TTTAAAAAACTGTTNGGNTTCNATTGGGANGCAANAANTWr^^ 

TGCCCGGGCNGCCT 

SEQ ID NO : 1 987 ACGCGGGACCAGAAAGAGTAAACCCTTTTATGACAGGGGCTGCAG AACAAA 
TCAAGCGCATCCTTGCTAATTTCAAAAACTACCAGTTCTTTATTGGTGA^ ^ 
GCATGGTTQCTCTATTGGACTACCGTGAGGATGGTGTGACCCCATATATGATTTTCTTTAAGGATC 
GTTTANAAATGGAAAAATGTTAACAAATGTGGCAATTATTTTGGATCTATC^ 
TGGCTTCTGCTTGTCATCCACACAACACCANGGACTTAAGACAAATGGGA 
CTCTTATTTATTTTGGNCTGGGAATTATTTWGAGT^ 
AGGGANGGTNNCTAAAAATAAATNGCATTTTAA 

SEQ ID NO: 1988 ACTTTCTCAAATCCCTATAAAGAATTACTTGTTATCTATTAAAAATCTCCm 
CCCAAATCTGTATTATGGTTTCGCCAGACAAATATTATGTGATGTAGCCCAGAAAATCCACACCTT 
TTGTCCrrmATGrrTTAATITGAATAATCATGTGCITAGTCCOT 

AACACGATATCAGCACATTCCCTTGTCATCATCAAGGTGTTGGCTCAGATTTATTCTTAATTGGAT 

GTTTAAAGCATGTACGCGGGGACTGAAAGGATTCTTITCTACATTATACATGT GTGTT GNCATATT 

TGGCTTTTGCTATATCTTTAACrrCATTGGTAAAATTTTNGGATO 

TAAAAACCTATTTTTGAAAAAACAAACTTTGGCTTGATAATCAm 

C 

SEQ ID NO: 1989 ACAAATACTTGTGCCACAATGACTCCTCCCCCTAGCTCCACGGCATGATACAT 
ACAACCAGTrTGTATACACTAGGCCTGCTOAGGCCATmAAACTATGAGGA CTTCT ^ 
AACTAAAGCTGCAGGGTGCCGGGGGAGTGGGGCAGCTTCATTTGCGACTGCTCTTTATTCTCTCCA 
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QjQYYirnTCAAGTCGrCTAlGTTAGCTGrGGAGGAGGAGGAGGJCACAGTTCCTGAAGAGTGAG 

TCTGGTTAGAATCCAGGTCTGAATGCTGGTGCTGCTCCCGTGACTCCGGAGCTGAAANAGTITGCT 

TGTGGAGCATGTCTGNGGAAAAAGCGACAGTAGGAACTGCTGGTTTGGANAGCA AAGA AGGTCA 

AAGGANGTCGGTCATCTTTGCTTTGTGTTGCCANACCACATCGCTGNCGGAGGATC^ 

CAAGTAGACAGATGGCCCCACTTCTTCCCCArrTGTGrrACCTATGGAGGACACTGTGCTTGTC^ 

CGTGGGCACACATGTGACTTCCATCTGANGGGAGATCCTGTTGAAGT 

SEQ ID NO: 1 990 ACGCGGGGGCTCTCCATCATGGCGCAGGATCAAGGTGAAAAGGAOAACCCC 
ATGCGGGAACTTCGCATCCGCAAACTCTGTCTCAACATCTGTGTTGGGGAGAGTGGAGACAGACT 
GACGCGAGCAGCCAAGGTGTTGGAGCAGCTCACAGGGCAGACCCCTGTGTTTTCCAAAGCTAGAT 
ACACTGTCAGATCCTTTGGCATCCGGAGAAATGAAAAGArrGCTGTCCACTGCACAGTTCGAGGG 
GCCAAGGCAGAAGAAATCTTGGAGAANGGTCTAAAGGTGCGGGAGTOTGAGTTAAGAAAA^ 
ACTTCTCANATACTGGAAACTTTGGGTTTTGGATCCAAGGNACNNATCGNATCTGG^^^ 
TTGACCCANGCTTTGGTTTCTACCGGCCTGGACTTTTNTTGTGGNGCT^ 

GCATCGCAACAANAATCGCANGGACAAGCTGTNTTGGGGCCAAACACAGAATCNGCAAAGANGA 

GGNCATGCCTGGTTCCAGCAGAATm-GATGGGATCNTNCriTCTTGCAAATAAA^^ 

CCAG 

SEQ ID NO: 1 99 1 ACTGCACGGCAATTGAAGCATAGCTACTACAGAATAACTCACCTTCCAACAA 
TTCCTGAAATGGTCCCTTAACTGGATTATTACAGCACCAAAAAACTTCTCTGA^^ 
AACCTTGTTCTATGGATTCCATAATGTTACAATGGATT^GCTATGAAGCCTCAAAACATCACGA 
GATAAGCATGATGGTCTCAGACITGGGAAAACTGCCTAATATTATGCrGTAGTGGAATTATGm 
GATTTGAATTCATCTGTGAAGCATTCAAATCAAAAGCTAAAAGCCTAAATG^ 
CAAGNCCTGAGAAAGGNAAACTGGGAATCmWTTTTC^ 

GGATCAAj^TTATTTAAGGGGGmTGAAAATGCTATTGGNAGGNGGNCNACCTA NN 

rrCAGTCTTCCCNCANCTTCAATCCTGGCAATNATTCTAATCTACTCT^ 

TGAGGTTANGTAAAAGCAACATTr 

SEQ ID NO: 1992 ACTGGTCCAGGAGTTATCCAGGATAGATTTrCACCCACCATGGGACGTCATC 
GTTCAAATCAACTCTTCAATGGCCATGGGGGACACATCATGCCTCCCACACAATCGCAGTTTGGAG 
AGATGGGAGGCAAGTTTATGAAAAGCCAGGGGCTAAGCCAGCTCTACCATAACCAGAGTCAGGG 
ACTCTTATCCCAGCrGCAAGGACAGTCGAAGGATATGCCACCTCGGTTTTCTAAG 
TTAATGCAGATGAGATTAGCCTGAGGCCTGCTCAGTCGTTCCTAATGAATAAAAATCAAGTGCCA 
AAGCTTCAGCCCCAGATACTATGATTCCTTCTAGTGCACAACCACCACGCANTTAAACACCCAOT 
TTGGGACAGAACACCrrAAGCTTGGTCTCAAAACTAATCCACCACnTATCCAAGGAAAAGCC^^ 
AAGACCAGCAAAAAGCCCCACCGGTCAAAGGAAGACTCCTTAAACTACTGAAACTGGTGTGACTG 
AATATCTAAATAGTGGAAATGCAATGAGGCTGTCAATGGTGTAAGANAAATGAA 

SEQ ID NO: 1993 ACTTrrTTTTTTTTTT^^ 

TCANAGCAGTAATCTTCCATACATAAGT^mm'CCC^GCCAATAATTCA^ 

ATGArrAGTAAAGAAAAATATNAGANTTAACAGGCCCTrTAAATTTGTT^ 

TTAAAAAGTGTTTAAAGTTTGNAATTCCTAGNAGGAAAACATTNTNTC 

CAAAC<>fCTGGAAATGCTTCAGCTGCATTrGGGGGAGAGGGGTAGGGATTrTCm 

CACTCnrrTGATNAAANGNAAGAAGGCClTNGNCCNCN^ 

GGGGCCGNTTATTANTGGGANCCCNAGCTCGGANCCAAACTTNGGGNAAATCATGGNCATTAGCT 

GTTNCNNGGGGNAAATTGTNATCCNTNACAATTCCNACANNTTACG^^ 

AA 

seq id no: 1994 acattgtgcccttgatnttatctccaagtggcagritttaaaa^^ 

acctggatataaattaattgtgcctgccaccaccatccaacagacctggtgctctaatgccaagtt 

atacacgggacagttgctggcatgtcitcattggctatataaaatgtggccaagaagataggctct 

cagtaagaagtctgatggtgagcantaactgtccxrrgcttrctggtataaagctct^ 

catgtgaatctgggtgggataatggactcagctctgtctgctcaatgccattgngcag agaanc a 

cccttatgcataaanctttttaatgcttgaaaaanatagno^^ 

aanaggngaanttaattggacngtctngggnacm'caaaagcttttng 

aatggaactattccotcaataggcaaaagtgtaacaacctatctanatggatagtot 

gcacangctntgtttataaatacatcactggntacx:gtcca 

SEQ ED NO: 1 995 acactcttccagcggagacatttggaccttgttgggaacagtgctttatccat 
tccttgccatctggagattaattagcaatrtctrgtttagtaatccgcctcccacacagaot 
gagagtaacatcgtcagaacccccaaaccctgcatcatctagcaaatcagaaaaaagggaacca 
gtgagaaaaagagtgctggaaaaacgtggagacgactttaaaaaggaggggaaaatttatagat 
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TAAGGACTCAAGATGATGGTGAAGATGAAAACAACACTTGGAATGGAAATTCCACTCAACAGATG 

TAGTGTGCAAGTATAATATGTGCAATAATCATTGGTTCCTCTTATOAATTNAATTCACCT 

TTCTTGGAGAAGTGGGGACTGCmATATTTCCAACTGGTCTATAAAATGTCT 

AGNGGGGTGTGGGTTGAANGNGGTTTAACTCAGAAAAGTAAANACAGGAAATAACTCTCTC 

NCCTTGCTTATATGGCACCACrGCTAGACCCTAAAAGAAC>[AAAAATCTGCC 

SEQ ID NO: 1996 ACTGCmAAAACACAACTCCAGAGCCCCTCCCCAAGCTCCCCTCCCCAAGCT 
CCTGAAGACCCGGTTTCTGAGGGAGGGAAATTGCTAOTGGATTGAGAGTAGCTGGAATGTAAGT 
GACCCCAGGCTTTGCCTCAGGGCCTTTAGCCTATGTCCCCCCCACATAAAGAGAGCT^ 
CTGACTGAAGAGCTGACGTTTTGCTITITCATATGCCAATTAA^ 
CTCCAGCCATCCAGGAGTGGCTGTCCTTTTCAGTTTTGTCTTTTATAT^ 
ATrTANAANCCTNGCNCTCNCTAATTAGATTAAACAGAACAAGGCTTGNTTTGGT^ 
CAAAAGTCCAACAGACACACACTGAGCAGGNGTTTTACACNCACATTCCCTTTTTGCCCOT 
AGAAANGTGCAGGNAAGGrmTOCCAACAAGAAAGCACATNGAAAATAATTTGTACTCT^ 
CCATTANCATGTGTANGGGTTACGGNGAGGACACTGTGTTGTTTCATAAA 

SEQ ID NO: 1997 ACGCGGGGGCCTGCAGCTCCGCCATGGCTCCTAAAGGCAGCTCCAAACAGCA 
GTCTGAGGAGGACCTGCTCCTGCAGGATTTCAGCCGCAATCTCTCGGCCAAGTCCTCCGCGCTC^ 
CTTCGGAAACGCGTTCATCGTGTCTGCCATCCCCATCTGGTTATACTGGCGAATATGGCATATGGA 
TCTTATTCAGTCTGCTGTTrrGTATAGTGTGATGACCCTAGTAAGCACATATTTGGTAGCC^ 
TACAAGAATGTGAAAmGTTCTCAAGCACAAAGTAGCACAGAAGAGGGAGGATGCTGTTTCCAA 
AGAANGGACTCGAAAAATTTCTGAGCTTGTAATANAAAAGAATGTCTCGGAAAGG^ 
GAAAGAAATCTrGTGGAANAANAATGAAGTTGCTGATrATGAAGCCACAACATTTTCCATCTTTTA 
TAACAACACTCTGTTCCTGGTCGNGGNCATTGTTGCTTCCTTCTTCATATTGAAA^ 
CAGTGAACTACATATTGNCCATAAGTGCTTATCAGGACTCNCGCCTCTG 

SEQ ID NO: 1998 ACTTTrrmTTTTTTT^^ 

GGCTGGANTGCAGTGACNCNACCTTGGCTNACTACAACCTATGCCTCCCCAhnrrCAAGTAAT^ 

Ca^CCTTAACCTCCTGANTAGCTGGGATTACAGGCNCCTGCCNCCACACCCANCTy^ 

TTTTANNAAAAANAGGGTTTCACCNCATTGNCCAGGCTGGTCTTGAACTCCTGG^^^ 

CACCCGCCTTAGGCNCCCANAGGGCTNGGATTACAGGCATGAACCCCCNTATCTGGCCn^ 

TTTAATTTTTCAAATGAGAANGTNGGGTTAGGNAATTTm 

CCACAArrArrTTNCTGANACWn-GCAANCAACTGATCTT^ 

TTGATGTCATCTOGGCAAATCAGTCCAATGACAGCCACAAGTGGTCAAAAAGCTITNTTACAGCCC 
TNTTTGGTCACTAGGCNACTACCTANGAGTCAACTGTGA 

SEQ ID NO: 1999 ACGCGGGATTCGAGTAGCGGCTCTTCCAAGCTCAAAGAAGCAGAGGCCGCTG 
TTCGTTTCCTTTAGGTCnTCCACTAAAGTCGGAGTATOTCTTCCA^^ 

CGTTCCAAGGAGCGCGAGGTCGGGATGGATCTTGAAGGGGACCGCAATGGAGGAGCAAAGAAGA 
AGAACTTTTTTAAACTGAACAATAAAAGTGAAAAAGATAAGAAGGAAAAGAA^ 
TGTATTrTCAATGTTTCGCTArrCAAATTGGCTTTGACAAGTTGTAT^^ 
TTGCbrrTATNCATGGGGCTGGACTTTCCTTTTATGATGCTGGGGGTTT 

TTTGCCAAATGCNGGAAATTTAANAAGATCTGATGTCAAACATCNCTATAGAAGTGNTATCAATG 

ATACAGGGTTCTTCATGAATCTGGNGAAGACATGACCANGTTTGCCTNTTTATTACAGGGG^ 

GGCTGGGGTGCTGNTGCTGTTTNCTTTCAGGTTCATTTGGTGCCCTG 

SEQ ID NO: 2000 ACCAGTTTTGGTGTCAACTAGAAAAGGTCTTATTGAAGTTAAAACAGATGAG 
TTTCCTCGCCATGGGAGCAACATAGAAGCCATGTCCAAGCTAAAGCCTTACTT^ 
ACGGGAACGGTCACCCCAGCCAATGCTTCAGGAATAAATGATGGTGCTGCAGCTGTTGTTCTTATG 
AAGAAGTCAGAAGCTGATAAACGTGGGCTTACACCTTTAGCACGGATAGTTTCCTGGTCX^^ 
GGGTGTGGAGCCnrCCATTATGGGAATAGGACCAATTCCAGCCATAAAGCAAGCTGTTAC^^ 
NCAGGTTGGTCACTGGAAGAAGTTGGCATATTrTGAAATCAATGAANCCCTTTGC/^ 
GTTGCAATTAGTTAAAAGAACTTGGATTAAACCCAAAAAAAGGTCAATATTGAAGGAGGGGCT^ 
AGCCITGGGCCACCCTCTTGGAGCATCTGGCTGTCGAATTCTTGTGACCCTGTTACAC 
GAGAATGGGCAGAAGTCOTGGTGTTGNAGCCCTGTGCATTGGGGGGNGGG 

SEQ ID NO: 2001 ACATTATTTTTACCAAACAACATTGTTTCCCACTACATCCATGGTTATGTATA 
AAAAAACGCCATTGTCAATTAAGATGTTAAAAAATTATTTCCATAAGAGTATTAAATTCTAAT^ 
TGATTCACITAACTTCCTACriTATAAACACTAGGTAAAGTAATCT^ 

ATCCCAAGCAGTAATCACAGACGATGAAGCACCTTACAGGACCCCTCCACCCTCAAACGCGCATG 

TCCAGAGAAGT^m•AATGCCTTAATAGACTACTGAAGGTTAACTATTTACATCATO 

TTTTNAAGCTGNATTACAGTGCTTrrAAATTCTCCTTTTA^ 
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CTTTACCGGNGTrrGAAAGGAAAAAAAAAANAATCCCTTAGTANCrAATGGNCT^ 

CCTGAATTTTTAAACTCCAACAAAGCATTbrrrm 

CTGTmGATCTCmGTAAAANAACTCTCTGTTCATNC^ 

SEQ ID NO: 2002 ACGCGGGCTTTCCAACTTGGACGCTGCAGAATGGCTCCCGCAAAGAAGGGTG 
GCGAGAAGAAAAAGGGCCGTTCTGCCATCAACGAAGTGGTAACCCGAGAATACACCATCAACATT 
CACAAGCGCATCCATGGAGTGGGCTTCAAGAAGCGTGCACCTCGGGCACTCAAAGANATTCGGAA 
AmGCCATGAAGGANATGGGAACTCCAGATGTGCGCATTGACACCAGGCTCAACAAAGCTGTCT 
GGGCCAAAGGAATAAGGAATGTGCCATACCGAATCCCGTGTGCGGCTGTNCAGAAAACGTAATGA 
GGATGAAGATTCACCAAATAAGCTNTATACTTTGGGTTCCTATGTACCTNGGGN^ 
NAGGGCG 

SEQ ID NO: 2003 ACGCGGGGGGGGTCGACTGACGGTAACGGGGCAGAGAGGCTGTTCGCAGAG 
TTGCGGAAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAGGAC AGTA TTCCAGTTACTGAACT 
TTCAGCAAGTGGACCTTTTGAAAGTCATGATCITCTTCGGAAAGGT^^ 
ACTTTTGCCTAGTCATCCCCrrGAATrATCANAAAAAAATTTCCAGCTCAACCi^ 
TTTTTCCACACTGAGAAACATTCAGGGTCTATTTGCTCTGCTAAAATT^ 
AGTGCANCAGGTTCAGCGTCrrCCATTTNTTTCAAGCTCAAATrCTTT^ 
GGGTAATGAATGAGACTATTNGGANTTTGAGGATTTTCmAATGATC 
NTGGGAGAGCCACACTTCGATGGTGGAANTTAAACTTG 

SEQ ID NO: 2004 ACTAAAGATCTGAATTAAGAGTTCTTTTAACAAAGCATAATGTGGTAATrCAT 
CAAAAGCAGGGTCAAAAGACAAAAGGGGCCGAGAACCTTTCAAACAGTrTCCAGTCATC^ 
TCAGCGAGGGTATGAATATTTTGAACAAGGAATTTAGCAGATG GTCC GTGAGGNGAATTTGAAAG 
CCACATATAGAGATCCTGTTTTTTOTANCrrCAAAATAGATGC^^ 
CAAACCTNGTTNATCACAAATAGCTTATCCTTACNGATCCATTTTAOTAT^ 
NATNAACArrCTTNAGTCCCTGNATNAATNGGCTTGGTCnTAAAATTTAT^^ 
NNNGGAAAATNCCNTTCrrATITrTNCNCmCCOT 
TNTTTTTNCTNCACCNTNCTCTGNTGNGGGCGNGCCGCTTTAA 
CTTmGGCTTCTTTCGCCTGAACTNG 

SEQ ID NO: 2005 ACACCTGAAATCCAGGCCAATGAAGTTCGGAAAGTGAAGAAATATGAACAG 
GGATTCATCACAGACCCTGTGGTCCTCAGCCCCAAGGATCGCGTGCGGGATGTTTTTGAGGCCAAG 
GCCCGGCATGGTTTCTGCGGTATCCCAATCACAGACACAGGCCGGATGGGGAGCCGCTTGGTGGG 
CATCATCTCCTCCAGGGACATTGATTTTCTCAAAGAGGAGGAACATGACTGTTTC^ 
AATGACAAAGAGGGAAGACrrGGTGGTAGCCCCTGCANGCATCACACTGAAGGAGGCAAATGAA 
ATTTTGCAGCXSCAGNAAGAANGGAAAAGTTGCCCATTTGTTAATGAAAATGATGAGOT 
ATNATTGCCCGGACAGACTGNAGAAAAATCGGGACTCCCACimfCCTCCAAANArrCCA^ 
AANTGCTNKTNNGGGCAGCCATTGGCCTTOTGAAGAATGACAAGTA 
AAGCTTGTGNGGA>rrGTNhrrGGTTTTGGCCTCriTCCCAGGAAAT^^ 

SEQ ID NO: 2006 ACGCGGGGAGGTAACAGCTCTTGCACCTGTTTCTCTTGCACCTGACGTGCAGC 
TGCTCCTACCCACCTCTCXTGGCTGAGCCTTGCCTGATACAGCAGCCCGGAGGCA^^^ 
CCGAGTCTCACCCrCCCAGGCAGCTCCTACACTCAACTGCTTCTCTAGGAAAGGNCTCAOT 
CCTGGAGCAGTCGGGATTACANAAAGCCCCATCCTTGGOTANGGAGCGCCATGACGACTGAAAT 
TGGNTGGTGGAAGCTGACnrrCCTCCGGAAAAAGAAATNCACNCCCAA^ 
CTTGACACCNTATGCCNAAACANAGGGGAGATGCANAAACCCCCTGAAGGCCTGACCCTTG 
CCCCAACAGNGACTTTAACACCCGCCTGGAGAAGATTGTGGCAAAGAGCACAAAAGGGCANGCA 
CGTCNANGNNTCCAACTCAGGACGCTTCNAGGAGAATAAGAAAGTGAGAGCCCNCTGGCAGANA 
ACCCTAACCTCTTGGTGATANCQAGGAAAGACGGCNTTCAAATGAAAGGCT 

SEQ ID NO: 2007 ACGCGGGGTCTCTTCCTCGGCGCTGCCTACGGAGGTGGCAGCCATCTCCTTCT 
CGGCATCATGGCCGCCCTCAGACCCCTTGTGAAGCCCAAGATCGTCAAAAAGAGAACCAAGAAGT 
TCATCCGGCACCAGTCAGACCGATATGTCAAAATTAAGCGTAACTGGCGGAAACCCANAGGCATT 
GACAACAGGGTTCGTAAGAAGATTCAAGGGCCAGATCITGATGCCCAACATTGGTTATGGAAGCA 
ACAAANAAACAAAGCACATGCTGCCAGTGGCrrCCGGAAGTTCCTGGTCCACAACGTCAAAGGAN 
CTGGGAAGTGCTGCTGATGNTGTAACAAATCITACTGNGCCGAGATCGCTCACAATNm 
AGAACCGCAAAGTCATNGTGGAAAGAGCTGCCCACTGGCATCAGAGTCNCCAACCCCAATGCCAG 
GCTGCGCAGTGAAGAAAATGAGTAAGCAGCTCATGTGCACCGTTTCTGTTTAATAAATGTAAAA^ 
CTGCCAAAAAAAAAAAAAAAANAAAANATATGTCOTGCCXjGGCGGCCGTrC/^ 
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SEQ ID NO: 2008 ACAAAAATTTAAAGCATTCCTrrCTTTAATTTTGTAAT^ 

GCTCAGAATGTCAGTTCTGTmAAGTAACAGAATTGATAACTGAGCAAGGAAACGTAATTTGG 

TATAAAATTCTTGCTTTAATAAAAATTCCTTAAACAGTGAAAAA/^^ 

TCTATTTCAGNGACTCTTANATTTTTGTTTATTTrrAA 

GGATTACTAAAATAAAAGCATTAAGAAACTCCCCAAAACTCCAAATACATCTGGAGTCAT^ 

CTAATAGGGACAATATTTTATTACCAAAAATGGTAGAAAGGGTTAGAAGGGCCTAAGTCTCAGGA 

TAAAGGATAGTTCATTGCATITAACTAATTCACATTCAGAAAAATGTTm 

TTCTTATTTTGCCATGGATTGATAAAACTTTATCACAAATTANAACAGAGAATTAAGGAGGTC^ 

GTTTAAATCCAGACCCTGNTGCTNTTTACATCCCCTA 

SEQ ID NO: 2009 ACGCGGGGTTCCACGGAAACACTGGTGGACAGATTCTAGTGCTGAGAAGAAA 
CACGTTTGGTTTGGAGAGTCCATGGATGGTGGTTTTCAGTTTAGCTACGGCAATCCCGAAOT 
GAAGATGTCCTTGATGTGCAGCTGGCATTCCTTCGACrrCTCTCCAGCCGAGCTTCCCAGAACATC 
ACATATCGCTGCAAAAATAGCATTGCATACATGQATCAGGCCAGTGGA7VATGTAAAQAAGGCCCT 
GAAGCTGATGGGGTCAAATGAAGGTGAATTCAAGGCTGAAGGAAATANCAAATTCCCTCACAGTT 
CrGGAGGATGGTTGCCGAACACACTGGGGAATGNANCAAACAGCTTTGAATATCGAACCACAAGG 
CTGGANACTACTATGGANATITGACCTATACTTGGGGGTCTGATAAAAAATTGGGNGACGTGGCC 
TGTTGTTTTNTAACAANTTTTTTGAATCCACCAAAAATT^ 
AGNAGGGGNGCNAATANTTTTTTTTTTATTNGG/^^ 

SEQ ID NO: 2010 CGCGGGGAGTTCCGTCGCAGCCGGGATTTGGGTCGCGGTTCTTGTTTGTGGAT 
CGCTGTGATCGACACTTGACAATGCAGATCTTCNTGAANACTNTGACTGGTAAGACCATCACCCT^ 
TAGGTTGAGCCCAGTGACACCATCGAAAATGTCAAGGCANAGATTCAAGATNAGGA ANGC ATCCC 
TCTGACCAGCANAGGCTNATCITTGCTGGAAAANANTTGGAAGATGGGCGCACCC^^ 
NAACATCCATGAAATATTCACCTmAChrrGNNCTCCGNTAAAAAGGT/^ 
ANACACTTACTGGCAANACNNTATCCTTNANGGGGACCTATNANATCATT^ 

SEQ ID NO: 20 1 1 ACTACCACTTITATGCTAGrrGGCATCTGCCmCTATACAA^ 

ATCGACATTGACCTATTTCCAGAAATACAATTTTAGATATCATGCAAATrrCATGACCAGTA^ 
CTGCTGCTACAATGTCXrrAACTGAAAGATGATCATTTGTAGTTGCCTTAAAAT^^ 
CCAAAATGGTCTCTAACATTTCCTTACAGAACTACTTCTTACTTCTTTGCCCTGC^ 
AACTACTTCTTTTTTCAAAAGAAAGTCAGCCATATCTCCAT^^ 

'I"i"ri-1U"1U"1U'GAGACGGAGTCTCACTCTGTCACCCAGGCTGGACTGCAATGACGCGATCTTGGTCAC 
TGCAACCTCCGCATCCGGGGTTCAAGCCATTCTNCTGCCCAGCCTNCCAAGTAACTGGGATTACAG 
GCATGTGTCACCATGCCCACTAATTTTTTTGTATTTTTAAGTAN^ 
GCCAGTCTGGGNTTGGAACTCTGACCTTGNGATCCACTCC 

SEQ ID NO: 2012 ACCACTGAAACCCTGACCCAOAAAAGTGGCTTGCTTGGACACCCAGCTGCCT 
TTGTTTCTGCATTAAACCAATATTGATCACACATATGACACAGGCTAGTCCTATAAAAGTAATGAC 
TTCATAGAAATGGCATTATAATTTTrAAGTTGATACTCTACAGGTAGCTATTGATAT^ 
AATAAAACATGCTGCAACCATGGTATACAACAAAAATACATITCTTTGGTGATTGAAATTAAGGC 
CGTATTTACAATGACTTAATATAAGACTGACTTTTATCCTGCrTCATAA 
CAAOAAAGAATTCAATACTGTGAAATATGCAGCAAAGAAGATTGTCITTACTAM 
ACTOTGATTTCAGCCCAGTAGATTGATTAAAGAAAAAAATGGGCCTTACTTTGC^^ 
GCTAGGCTTAACAAATACCACNAAACAATGNCrTGTTTCACATATNTGNA^ 
AATAGAAAAGGACTAAAAAAGGGGNTTTAGNGAAAAATNNCmCAAAAJN^^ 
ATCTGNGCCACAAAGGGAT 

SEQ ID NO: 20 13 ACin'iU-lUlUTrrrri'rin'll'lTrfCQATCTAAAANATTTTATTrr 

ggtaggcagrranaaatttcaaagtctaacaatgacattcntgaagngggcncagcctm 

tcaggctatgtntacagtaaccrtgnggaactggttcagccanatcttcactttcatga^ 

gggtctgtccttttcrrtccaaaggacrccmtcatattccatcg^^ 

atatatatnctacttccaacacccgattcatcctggttcaatcaaagcctgntt™ 

taanct^^caggaaatcgaaggtgtnaatg^^^gccgtggctctcttggaag 

CTTTCNAAAATCCATT 

SEQ ID NO: 20 14 ACTGCAATGTATAAACTTTGGAGGATG AAGGACAGGAGACTTAGAACGTCGA 
CGACGCTGAGTTCTCACTGCTCCTCGTGAAGACTTCCTTGTAGACCCGTGCCAACTGT^^ 
GCACATGTGGTTTTAGGTTTGGTATCATCTGrrGCTGTTCTGACTGGTGCTCTGACAC^^ 
CTCCAACAGACAGAGATGCTACAGACCCTGATGCATCCNCTCTTANTITmGAAAGCAGACT^ 
GACTCTCCTCCTCTCCATCCTTGGCTTCGGATATCATTGTGCANGTTTTO 
TTNNTNANCTCNTNNTNAAAGGNGNCACI^ 
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ATTTCNGAGCCC 

SEQ ID NO: 20 1 5 acititcaatccaagaaaaaaaaaataaaccgagacatggtcatgagttcag 

GATTATATATATTACAATTTGCOTGnATAATACATTTGTGGCTTTATGATAAAAAT^ 

ACATATGGAATTCAAGCTGATTTGCGTAACTGTCACAAGAAAAAAAGCATTAAATGCAT^ 

AATAAGTATTITCATTAAmCAGAATCTCAAAACAGCATTAGACCTTGCOT 

TTAGTTCAACACTAAGCTAGCTAAATAACCTTCTTGAAAGGTTTAAACACAATO 

GCTTTCCCTCTAATCTCAACmACATAAATGGACCAGGTGAGGAAAAAAGGTAAC^ 

CACATATTGGCTAATCATCACCCCTAACTAGAATNCATTATGNCCTTAAAAGAACGGGAAAOT^ 

AAATTACGATTCATAACCAAAGAATCATTTAGGGTCAGTANGAAATOTTTGAACCTTAAAAOT 

NTACTAAAAANAThnTOJTTNGAGAAAAGGNCCCAANAArrTNT^ 

AAAGG 

SEQ ID NO: 2016 ACCAGAAGTATAAGTTTATGGAACTCAACCTTGCTCAAAAGAAAAGAAGGCT 
AAAAGGTCAGATTCCrGAAATTAAACAGACTTTGGAAATTCTAAAATACA^^ 
AGTCCACCAACTCAATGGAGACCAGATTCTTGCTGGCAGATAACCTGTATTGCAAAGCITCAGTTC 
CTCCTACCGATAAAGTGTGTCTGTGGTTGGGGGCTAATGTAATGCTTGAATATGATATTGATG^ 
CTCAGGCATTGTTGGAAAAGAAmATCGCTGCCCAAAGAATCTTGATTCCCCGGAGGAAGACC^ 
GCTTTCITCGAGATCAATTTACTACCCAGAAGTCATATGGCCAGGGTITATAATO 
AGAAGAACAGGATGCTCmCAAGACAAAGCATATGTGGCAATAAAATGNGGTTAATTTCCA^^ 
GTATC TAAATNCCTTATCTTNAGGTGGCTACTTGATGTTTACACAAAATTAA^ 
TTTTTAACAAATGTTNAAATTTG 

SEQ ID NO: 20 1 7 CTCAGAACATCACCTACCACTGCAAGAACAGCATTGCATACATGGATGAGGA 
GACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTTGAACTTGTTGC^^ 
CAACAGCAGGTTCACTTACACTGTTCTTGTAGATGGCTGCTCTAAAAAGACAAATGA^^ 
AGACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCCnTCCTTGATATT^ 
ACATCGGTGGTGCTGACCAGGAATTCTTTGTGGACATTGCCCAGTCTGTTTCAAATAAATGAACTC 
AATCTAAATTAAAAAAGAAAGAAAmGAAAACTTCTCTTGCCATTCTTCT^ 
CTGATCCTTCATTTTTCTGACATTACTTGCrrAATTGGGGCAAAGAGA^ 
NTTGTGCATNCAGTTCATTACTCCTTCCCCGTCCCAAAATTNAATTT^ 
ATGTAACTTNAANAACCNA^WAATTGTNAAATNACCC^WNTT^^ 
TCCCGCGG 

SEQ ID NO: 2018 ACAAGCCTCACCAAGGGCAACCCCAGAAAAGTGAATGAGTTTGTCTTCTCCA 
ATCATGACTTCCTCGATAAGTTTGCAACnTCCAAGCTTCACCAGTTCTGGGTGATCAAAGGTA^^ 
GCAATITCACCACCTGTGACAAGAGCTAGGCXjTTCCACACCTGCAAAATCTGCATGCTCAATAGCC 
ATGACACCAGCAGCACCAAAOAGCTGTTCAGGATAATTATAAAITAATTGCCTGTTAATAAAGCA 
ATTTATTCCATGCTTAAGAATACGTrCAACTTTCTCCTTCATTTm 

CTGCAACCTTTGCTGTAGAGTCAACTCTTACCCGGGACCAAATATCTTATTTGTCTGTATC^ 

AGTATTrGCAATAAGAATTTAGCATTTTCAATCCNTTrGGTGTTTACTa 

CCTCTCTAATANGGATTGCAACTCCTCNGTTTTm'ATATAAT^ 

TGTTTAANTA 

SEQ ID NO: 20 1 9 ACGCGGGGGCAAACCAGCTCTAGGCGGCTCTGGGTAAGTTGTCGTTCTGTGG 
GCTGCGGAACGCANACrrCGGCTGGAOTGCCTGCGGTGACACCTGCTCCCCTCTGAGAGOT 
GTrCTCCGGCCTGCCTTCACTGGrrrGTGTCCAGAGCCGGACTGATTCTCTCAAm 
GCCTGTTAAACAAGAAAACGAAAAACCCCrrCCAGAAAACATGGATGCATTTGAAAAAGTGAGA 
ACAAAATTAGAAACACAGCCACAAGAAGAATATGAAATCATCAATGTGGAAGTTAAACATGGTG 
GTTTTGTTTATTACCAAGAAGGTTGTTGCTNGGTCGTTCCAAAGATGAAG>^ 
TTATTAAGCTTTArmATTTNGAGGACTTAAGTAACCAGCCCTTAT^ 
AATGAAAATTTTGCTNCCANNAANACTTAAATCTGAACTNTACT 

SEQ ID NO: 2020 ACrilUllUn"ll"lUnU-llU"iU"ll"lUll"l"ll'l'CAGTTCAAGTrrAATACAAACTAC 
AAAANATTAGGGGGTNGCTNTACTAATACATCATACAAACCAGTAGCCTGCCCACAACGCCAACT 
CAGGCCNTTCCTACCAAAGGAANAAAGGNTGGTCTOTCCACCCCCTGTAGGAAAGGCCT 
TAAAACACCACAATTCGGCTGAATCTGAAGTCTNGNGTTTTACTAANGGA/^^ 
NAGGTTTNGTTNTCATGGNTGCCCACCGAAGCCTGCACTAAAACAGCX:CAGCGCTCACT^ 
NAAAAATTTCTTGT^^T^GGGACATCAGGCTTGAGGGTTCACTGCAGGTTCCAGCCAGCTO 
ACrrCCCITrTTGCAATAAACTGAAGGCNTACTAGTrrAAAGT^^ 
Cr^TCAGNGNTTGCCNAAAAACCTTNT^m^AAGAAAAAAGGCCTNAAA 
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SEQ ID NO: 202 1 ACGAGAAAAACGATTTGGAAATTGGCTGAAAGATGCACGTGACTGGACAATT 
TCCAGAAACAGATACTGGGGCACCCCCATCCCACTGTGGGTCAGCGATGACTTTGAGGAGGTGGT 
ATGCATTGGGTCAGTGGCGGAACTTGAAGAACTGTCAGGAGCAAAGATCTCAGATCTCCACAG 
AGAGTGTTGACCACCTGACCArrCCTTCACGCTGTGGGAAGGGATCCrrGCACCGCATCTCTGAAG 
TGTTTGACTGTTGGTTTGAGAGTGGCAGCATGCCCTATGCTCAGGTTCATTACCCGTTTGAAAAC^ 
AGAGGGAGTITGAGGATGCTTTTCCTGCAATTCATTGCCGAGGGCATCACCAAACCAGAG 
TTTrACCTGCTGGTGCTGCNCGGTCKrCTTGNCACCGCTTTAAGAOT 
TGCAANGATGGCAAAANTGNNCAACCGGAAAGArmCNATCATTCCTCTTCAAAm 

SEQ ID NO: 2022 ACTTTGCCTACGGCAGCAACCTGCTGACAGAGAGGATCCACCTCCGAAACCC 
CTCGGCGGCGTTCTTCTGTGTGGCCCGCCTGCAGGATrrrAAGCTTGACTTTGGCAAI^^ 
CAAAACAAGTCAAACTTGGCATGGAGGGATAGCCACCATTTTTCAGAGTCCTGGCGATGAAGTGT 
GGGGAGTAGTATGGAAAATGAACAAAAGCAATTTAAATTCTCTGGATGAGCAAGAAGGG 
AAGTGGAATGTATGTTGTAATAGAAGTTAAAGTTGCAACTCAAGAAGGAAAAGNANTAACCTGTC 
GAANTTATCTGATGACAAATTCNAAAGTGCTCCCCATCCNACAGTATAAAANGATTATT^ 
GTGCAAAAGAAATGTTTTCCGTTGAGrrCAAAATNAGTTNAANCATANACCAATTGAC^ 
NGGTNTANAANAATTNANANTTATNANAGGGGANCAANNTmAAANNNAC^ 
TTTTCTNNATAATTmAAACTGAGAAGNGCGGGGGTTrTA 

SEQ ID NO: 2023 ACGCGGGAGTCAGACCCAGTCAGGACACAGCATGGACATGAGGGTCCCCGC 
TCAGCTCCTGGGGCTCCTGCTGCTCTGGCTCCCAGGTGCCAAATGTGACATCCAGATGACCCAGTC 
TCCTTCCACCCTGTCTGCATCTCTAGGAGACAGAGTCACCATCACTTGCCGGGCCAGTCAGGGTCT 
TAGTAGTTGGATGGCCTGGTATCAACACAAATCGGGGAAAGCCCCTAAACTCCTGATCTATAAGA 
CGTCTACCrrAGAAAGTGGGGTCCCCTCAAGGTTCAGCGGCAGTGOATCTGGGACAGTATTCACTC 
TCACCATCAATAACCTGCAGCCTGATGATTCGAACTATTACTGCAACAATTAATACTTCCCT^ 
CTGArrACTTrCGGCCCTGGGACCAGAGTGGATGTAAGCNACTTGNGCTGACATCm 
CCCCTTTTGTNANCAGTTGAATCTGGAACTGTTOrrGNNGNCTGrrAATAT^^ 
GNCTTGCCGGACNCCTAAGGNA>™AAANmTGGGCNGTATNNTG 
GGANATNGAAAAGNTTT 

SEQ ID NO: 2024 ACATGCAAAATCACTGGCAAAGGCTGTAAAGAGATTGTAGGCTGGGGATTTC 
CACTAGGCACCTGAGTAGTAGCAACTACAGTAGCTGTATTGGGTGCAGGAAGAACTGACGTTGCT 
GTGAGGGAACCCGAAGTCACTGGAGTCTGCAATTTAGGTTGACTACTGACAACTGCTGGAGGAGT 
GACAAGATTGGTTGAAGATACTGTATTCACAGGAGTITGAAAGGATGGTGGTGCACTCTCGCTTAC 
ATTTCTTTTGGACTCCAGCATCTGTCTCACTGTGCCTGCATTTCT^ 
TTGACATCATTTACAGTTTTTCCTGGTGATCTGGTGGGTTGGGTGGATGTCATGCT^ 
TTCTTTGGCTGNTTNAAGCGmGGTAACCTGCTATCTTGCT^ 
Gm^CATNAATTTTTCNCTCGGTTTrATTTTGCAAGNTT 

SEQ ID NO: 2025 ACAAGACATTTCAmCTCTTAATGTTTACAACAAGCTTGTTGCCAGGGCTGA 
TCTTGAACTCCTGGCCTCAAACGATCCTCCCAGCTCAGTCTCACAAAGTGTTGGGATGTC^ 
ACTAATGACTATCTTAACTCTTGTGTTTCAATGTTTATGCCTTCTTTTATCT^ 
TATGTCircrAGAACAATGTTGAACAGAAATGGTGAGAGCAGACATCCTTGCTTT 
ATTATATATGATGTTAGGTATAGATTTTTCTCACAGATGCCrmATCAGATTGAGGAAm 
CCTACmGCCGAAAGNTmOTAGTATGAGGGGGTGCTGNATTTTGTCAA^ 
AATTGANATGATTGGTTCTGAGCATCGAANGGGGAmCTCTTATTCTGTCGANGT 

SEQ ID NO: 2026 ACAGCCAACGGTTTCCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGQAAAGCGGTCTG 
CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTTGCTGCTGATGAAGATGATGAC 
GATGATGATGAAGAGGATGATCATGAAGATGATGATGATGATGATTTTGATGATGAGGAAGCTGA 
AGAAAAAGCGCCAGTGAAGAAATCTATTCAAATCTCCACCAAAAATTmCAAA>r^ 
GGAAAAGACTCAAACCATATCTCACNAAATCAAAGGACAANATCCTTCA>n^}AACGGANAA^ 
AAAACCAAAGGCNTTTTTGTNAANNATTAAGCAA^^^CCGCNANTTN7^^ 
NANGCAATTTTNATTTTTAAAATTTCCGT 

SEQ ID NO: 2027 ACTCrrGATGAAAGACCGTGAAACCAACAAATCAAGAGGATTTGCTTTT 

ACCTTTGAAAGCCCAGCAGACGCTAAGGATGCAGCCAGAOACATGAATGGAAAGTCATTAGATGG 
AAAAGCCATCAAGGTGGAACAAGCCACCAAACCATCATTTGAAAGTGGTAGACGTGGACCGCCTC 
CACCTCCAAGAAGTAGAGGCCCTCCAAGAGGTCTTAGAGGTGGAAGAGGAGGAAGTGGAGGAAC 
CAGGGGACCTCCCTCACGGGGAGGACACATGGATGACGGTGGATATTCCATGAATTTAACATGAG 
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TTCTTCAGGGGACCCTCC(>GTAAAAAGAGGACCCCA(XAAGAAGCGGGGGGTCCTTCTCTAAGA 

GATCTGCNCTTAGGCCmrCGCAGTACAGTGAATGGGAGGAAACTCTTTTCACG^ 

TNGAGKrCAO^TGAGGGAACCGTGNCNTTGNAAAAThrrTTTNW 

GAAGTnTTCANC 

seq id no: 2028 acacataaatcacctggaaccttgttaaaatgcagatcctgactcaggaggt 
ctgagttagagcccaggatttcatatttctagccagctccatgatgagctgctggtccgcagatca 
cgcttgcaggttttgaccagagtcagtgrrggttagagtaagaggatgaggcagacatctgggaa 
aagtccagctggggcaagcatttgaagtctgccttcctaccaggtcaaaatcaaggcaacgaot 
tccatagataactatcaaagcitgagggggtgccttgaacccaactcctaaatoc^ 
cccacctcttgtgtctcctgctaacaaacattccacactcttgcatattgtnaagtaacot 
ccagcttctggttaataaagatggntagagngactccrmaagcagtactaggcnctcaaaaga 
actcaggttantctggtctnaatngcccanctagntgccccattotattgcg;^ 

AACATrmCNTAGCTAAAANNCAGGGGCTAAAGTCNCCCrr 

SEQ ID NO: 2029 ACGCGGGGACACTGGACTCCCGTGAGCTGGAAGGAACAGATTTAATATCTAG 
GGGCTGGGTATCCCCACATCACTCATTTGGGGGGTCAAGGGACCCGGGCAATATAGTATTCTGCTC 
AGTGTCTGGAGATCATCTACCCAGGCTGGGGCTTCTGGGACAGGCGAGGACCCACGGACCCTGGA 
AGAACTGGTCX:AGGGGACTGAACTCCCGGCATCnTTACAGAGCAGAGCATGATCACArrCCTGCC 
GCTGCTGCTGGGGCTCAGCCTGGGCTGCACAGGACAGGTGGCITCGTGGCCATGTGGAAAGCA^ 
TGCTGTOGGATGATGCTGGGACTCCAAAGGATTTCACATATGCAThrrCrrCA^ 
ACTGCTGGGATCAAAGGAGAATAANATGGCCCTTGCGATTTGGGGAGNTAAT 

SEQ ID NO: 2030 ACTGTGCAGGCAGATTCACAGGGTGGTGGTAAAGCATCCACAATGGCTCTGG 
CAGCATCAGGATCACACTTGAAGGGGCTCTCAGACAAAGTTGTArrCATGCAACTGATTCCTTTTC 
CATTCGTTTTCITAGTCACTAATGCTTTCCAATGGTCATGAGTGC^^ 
AGTCCTTATCTTTAAATTCTGCATTAAACGCAAACTCATTTT^ 
CCTTCTAAACCAGTCCACAGTAGCTTNTAAGTAGCCAGGTTTCAGCCGTTTGAC^^^ 
ATTATAATTGGCTGCATCAGGATCATCCCATTAATGGCAATGACTTTCNAGCGGGrrCC(^^ 
ATCATAGCCAATTGCNTAGAACTTGCNCGCAATTATTTCACTCTTCACAACCT^^ 
ACNATTANTTGGTNTTGTACCNCACAACCT^WTTTATATTNTNCTGG 
GACCNTNTCNAATTATCTTTmGGGACAATTCCAAANACNAANTTC^^ 
AGGNCCTTAACA 

SEQ ID NO: 203 1 GGCCGAGGTACGCGGGAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAG 
GACAGTATTCCAGTTACTGAACTTTCAGCAAGTGGACCTTTrGAAAGTCATGATC^ 
GGTTTTTCTTGTGTGAAAAATGAACTTTTGCCTAGTCATCCCCTTG 
AGCTCAACCAAGATAAAATGAATTTTTCCACACTGAGAAACATTCAGGGTCTA^ 
AArrACANGATGGAATTCAAGGCAGTGCAGCNGGTTCAGCGTCITCCATTTCm 
TTTCACTGGATGTTTrGAGGGGAATGATGAGACTATTGGATTGAGGATTTTCTA^^ 
AGNNAAN>OTGGGANNCCNATTTGTGGTGNAATAAACTGGT^^ 

SEQ ID NO: 2032 ACGGCTCCATGGGATTAAAGGAAGCAATGACATCCrGATCTGrrCCrTGATCT 
TTGGGCATTGGAGTTGGCGAGAGGTGTCAGAACAAAGAGAACATCTTACTGAAAACAAGTTCATA 
AGATGAGAAAAATCTACGAGCTTCTTATTTACAACACTGCTGCCCCCTTTCCT^ 
ATGGATGTTCATGCAACTTAAGTGTGTTGTTCCTGAACnTrCTGTAATGTTTC^ 
CAAACTAANAAGTATAACGTCTITAAAAGATTGTCATCAACKCCATANTOT 
CACTGCCTGAAATTTITANTTNNTTNAGGGAGTACATTGGTGGTO^ 
ATTTTCO^GNNANGCAATGGTTCCTCTTTNANAGNrrGTATTANA^ 
TTNNTTATTANGCCTAAATATATAATGAANAATA 

SEQ ID NO: 2033 GCCGTAGGTCCGGCCGANGTAC;rri"l'rri"rri"l'l"i"ri"ri"ri'l"lGTAAAACAAGC 
AAATTTTATTAAAGGAAAATTTTGCAGGTTTAAGGTTTG 
TNACTmCACCAOTCT>riTCTGGCATGCTTTTAOT 

CAGGGTGCACACTCTGTAGTATATTCCNCATACTGTCCCANTTCAATATNANTGCNACT^^ 
TTGGACACTTGTTTNANCAAACATAGCNTAGGANCCT 

SEQ ID NO: 2034 CCCTTAGCGTGGCGCGGCCGAGGTACAGATCCGAAGTTTCAAGGGCAAACGT 
GTGAGATGTGTCAANACCTGCCTTGGTGTCTGTGCTGAGCATAAAAAATGTGTTCAGNGCNAAGC 
OTCAATTAAGGNAAAAAGAAAGACCCCTGCCCCCNGGAATNGTCCTTATTTTAACA^^ 
AGAAAGTCCGGACAAATTGCCCCANCCGGTCCAACCTGATCCTGTGTCCCATTGTAAGGANAAGG 
ATGTTGNCACTGTTGGTTCTlSrmACrTNTTCAGTGArrGG 



wo 02/29086 



PCT/USO 1/30732 



NGGAAAATCCAAAGGGCCCCTGGCCAAACNTATTCCAATTGNACCTGGTGNGGTNGCTGNAATTG 

TTCTTATGGCCTTGGOTACTGCTGATATGGANNCmAAAT^^ 

GTTAANTTGAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 2035 ACCTCTCCCAAGAAGTTCATTATATATCAGAAATTAAAATGAAATATAAGCA 
TATATTGCATGCTGCATArrrCTATCGTTCAAACCAGCCAACCAACAAACTAAA^ 
CAAAACACTGAGTATTAAGTAATTTCATCTGCCTnTTTGATCCT^ 
GTGTTmCTTTAAATTATTCATCAAACTACTACATATACA^ 

GACATTTGAGATGGGGTAGTGGGCTGCACATAAGTANTTCCAAGAACCACTCATTCT 

SEQ ID NO: 2036 ACGCGGGGCTCTTCCTGCTCTCCATCATGGCGCAGGATCAAGGTGAAAAGGA 
GAACCCCATGCGGGAACTTCGCATCCGCAAACTCTGTCTCAACATCTGTGTTGGGGAGAGTGGAG 
ACAGACTGACGCGAGCAGCCAAGGTGTTGGAGCAGCTCACAGGGCANACCCCTGTGTTTTCCAAA 
GCTAGATACACTGTCAGATCCTTTGGCATCCGGAGAAATGAAAAGATTGCTGTCCACTGCACAGTT 
CGAGGGGCCAAGGCAGAAGAAATCTTGGAGAAGGGTCTAANGGTGCGGGGAGTATGAATTAAGA 
AAAAACAACTTCTCAGATACTGGAAACTTTGNTTTTGGATCCAGGAACACATGGTCT 
ATTGACCCAAGCNTNGTNTNTNGGCTGGACTTTTTTTGGTGTTGG 

NAAAAACNCAGANAGGTTGTTTGGGGCCAACCAAATANNAANAGAGCATGCCTGTTCCNCAN/^ 
TTTATGGATNTTCTTCTG 

SEQ ID NO: 2037 ACCAAAGAACAGATAAAAGGAGGAACAGGAGACGAAAAGAAAGCGAAAGA 
GAAAATTGAAAAGAAAGGAGAGAAGAAGGAGAAAAAACAGCAATCAATAGCTGGAAGTGCCGA 
CTCTAAGCCAATAGATGTTTCCCGTCTGGATCTTCGAArrGGTTGCATCATAACTGCT^ 
CCCTGATGCAGATTCTTTGTATGTGGAAGAAGTTNATGTCGGAGAAATANCCCCAAGGACA^ 
TCAGTGGCNTGTTGAATNATNTTCCTGTTGAACNGNAC 

SEQ ID NO: 203 8 ACCTGGAGTGATGGATGGCGTTCCCTCGGCTAATAACTATCAGGGTGGATTT 
AGAACAACACTCATGGCTAAGGATCTGGGATTGGCACAAGACTCTGCTACCAGCACAAAGAGCCC 
AATCCTTCTTGGCAGTCTGGCCCATCAGATCTACAGGATGATGTGTGCAAAGGGCTACTCAAAGA 
AAGACriTCTCATCCGTGTTCCAGTTCCTACGAGAGGAGGAGACCTTCTGAGTGTGCCCTTTGGCC^ 
CGGACACTGTTGGGAACCAAACTCTGOTGAGCCTCCTTTTAGCTCACTCCACAAGTA^ 
AAATCAAAGTCACCTATCTGCnTTTGATTGCTAGGNCACAGTAATCCCTAGGATTTN 
TTIT^GNTTTAACAAA^mTNTCCGAATTT^I^^ 

ANCrmAAANGTTGNNCCCAGCTCTCACTAANATGAATATTNAATCNGNTT^ 

ANAAACAATITITGNCACNGGTGAACCCTTTAAAGAAAAAGGATTNTGGGAG 

AACCCGATT 

SEQ ID NO: 2039 ACCAAAGAACAGATAAAAGGAGGAACAGGAGACGAAAAGAAAGCGAAAGA 
GAAAATTGAAAAGAAAGGAGAGAAGAAGGAGAAAAAACAGCAATCAATAGCTGGAAGTGCCGA 
CTCTAAGCCAATAGATGTTTCCCGTCTGGATCTTCGAATTGGTTGCATCATAACTGCTAGAA^ 
CCCTGATGCAGATTCTTTGTATGTGGAAGAAGTNGATGTCGGAGAAATAGCCCCAAGGCAGITGT 
CAGTNGCCTNGTGAATCATGTTCNTNTTGAACANATGCAAAATCGGTTGGGN^^ 
TGAANCTGCAA 

SEQ ID NO: 2040 CCCTTAGCCGTGGCGCGGCCGAGGTGGACCITCAGGGGATCAAAGCAAAGTT 
CCAAGAGAAGTATCAGAAGTCTCTCTCTGACATGGTTCGCTCAGATACCTCCGGGGACTTCCGGAA 
ACTGCTAGTAAGCCCTCTTGCACTGAGCCAAGCCAGGGCAATAGGAACACAGGGTGGAACCACCT 
TTGTCAAGAGCACATTCCAAATCAAACTTGCAAATGAGACTCCCGCACGAAAACCCTTAAGAGTC 
CCGGATTACTTTCTTGGCAGCTTAAGTGGCGCAGCCAGGCCAAGCTGTGTAAGTTAAGGGCAGTA 
ACGTTAAAGATGCGTGGNCAGGGCACCrrGAACTXrrGCTTAGCAAGCATCTAGGCTGOT 
TTrCTTTTAGCATGGTAACTGGATGTTrTCTAACACTAATGAAATC^ 
GCATTTNNATGGGCACAATTTAGAAG 

SEQ ID NO: 204 1 ACGCGGGCGCTGTGGAAATTGGGTCTTGGGCTGGGTGGCATCTGGCAGTCAT 
GGGTAACACTTGCTTTT CCAGTTA ATGTGGCCATGTGATTCCAAGTGTCATGTTGCT^ 
GATTGTTGTGTGACTTGTTTTTTTGTTTTTGTTTTTGT^^ 

GGAAACTTTCTGATGCCTCCGGATTGTGTTAGTAGTAGCCATCAGGAGGGTCTCCAACTAAAACAC 
TTGTTCCTGCTTGCTCCTITCCCCTCTCATTGTTCANCATTTC^ 

GCTGCACGCACATGTGTCCITGTGGTTATAGCTAGAAGGACAGGAGTCTCCTGCTGATGCGTGATA 
. ACTTAAGCrrGGGGGAGAAAGTCTTTTCCACTGCCTAACTAAACA^^ 
. TCATITCTATGTGTGGGGGTAANCTGGCAGTAANArrGAAACrr AATTTAAA^ 

TTCCTTAATGTTATTACCTCTAACNAGNGNTTGGAANTCCCATCCAAAATTTGGA^ 
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TTCTTT 

SEQ ID NO: 2042 ACGCGGGACTACTGGAANTGCACAAACTGGCCACTGACAAAAATGACCCCC 
ATTTGTGTGACTTCATTGAGACACATTACCTGAATGAGCAGGTGAAAGCCATCAAAGAATTGGGT 
GACCACGTGACCAACTTGCGCAAGATGGGAGCGCCCGAATCTGGCTTGGCGGAATATCTC^^ 
CAAGCACACCCTGGGAGACAGTGATAATGAAAGCTAAGCCTCGGGCTAATTTCCCCATAGCCGTG 
GGGTGACTTCCTGGTCACCAAGGCAGTGCATGCATGTTGGGGGTTTCCTTTACCTTTTCT 
GTACC 

SEQ ID NO: 2043 ACGCGGGGAGTCCGCTGGTCCCGAGCACGAGCTGTG AGGGG ATTCACTTGTG 
TGCGGAACTCCTCGGAACCATGGCGTCCCmCCCnTGCACCTGTTAACATCm 
TGATGAAGAGAGAGCAGAGACAGCTCGTCTGACTTCTTTTATTGGTGCCATCGCCATTGGAGACT^ 
GGTAAAGAGCACCTTGGGACCCAAAGGCATGGACAAAATTCTTCTAAGCAGTGGACGAGATGCCT 
CTCTTATGGTAACCAATGATGGTGCCACTATTCTAAAAAACATTGGTGTTGACAATCCAGCAGCTT 
AAAGTTTTAATTGATATGTCAAGGGTTCAAAATGATGAAATTGGTGATGGCACTACC^ 
GTTTAGCNCNANAATTATTAAGGGAAGCCAAAmiTTAATTGC 

SEQ ID NO: 2044 GCGTGGCGCGGCCGAGGTACCCCCTTTCCATAGAAGGGGGAAGCCCTCTTTC 
rrGTCTGCCAACCCGATCACGACCACTTGAGTAGAGATCACTTCGGCTGCTTGAGTAACTGTCTCG 
ACTTCCACCATATCCGTCACGTGAGCTGCTGTAATCATCATAGCGACTGCTTCCACCATAAGATGG 
CGGGGGCCCTCGTGTAGGTGGAGCACTACGTGAGTTACCATAACTCTCATATGAATCTCTGTAGGA 
ACCTCCAOTGGATGATCTGAATAGTCACGATCACGACCATATCCATCTCTATCGCTATATCCTOT 
GATGGATAGGCATCACGTGAACTGGAATGACCATAATCACGGTAAGGATAATCTCGTGGNGGGNG 
GTGCATAATCTCTAGTATCACNAGAACTrGGGTAATCTCTGCTTGAATAGCTGNCrn'^^ 
TACCCATCOTCTNTTTGGGGACAAATAAACTTCTOTACNAAAGGNCAGCGGTTNCNT^ 
GACCTCCATAACNNTCTNTmCACGTGATACAGNGAGCnTmCTCCA^^ 

SEQ ID NO: 2045 ACAGAGTGACATCGGCAGTTGCAGCAGCAGCAGTAGCGGCAGGAGGAGGGC 
TGTATACTGGCGCCTGGCTGGGATATGTGTrTCCTTGAACGGAACTGACCATTGTATTTGGGTAAG 
CGGCTGAGTCTGCTGACTAGGAAGGGCTGGGTTTGCGGATGGTGGGACCACTGCCTGCTGANAGA 
AAACAGCTGCTCCGGGGGAAAACTGTAGCTCTGGAGGTGGCTCATCTGCGCGTTCCCTGCAACCA 
GGTAGGCACCACITGGAGGAGGCCTGCATCACCTGAGAACCAAAAACACCAGATGACTGCATATA 
ATATGGCTGATTCTTGTAACTTTGCATACATGGAATACATCGGATCTTCGTCATTAAOT 
AAAGGAAAGGGCCTCCATCACirrCACATTAAGTrCTGANAGTTCTGAATGTm 
TTCCANCTmCATCAATNANAGGTNCCATmTGGTGACACATTGCTTTA/^^ 
GGTANGTCTGGCTGANNCATACTGGGNCnTSITAGCTCGGNCCGCGACCCCGCTT 

SEQ ID NO: 2046 ACGCGGGGGCAGTGAGTTCGACACACCATGCCGACTGTCAGCGTGAAGCGTG 
ATCTGCTCTTCCAAGCCCTGGGCCGCACCTACACTGACGAAGAATTTGATGAACTATGTTT^ 
TTGGTCIXjGAGCTTGATGAAATTACATCTGAGAAGGAAATAATAAGTAAAGAACAAGGT^ 

aaggcagcaggagcctctgatgttgrrctttacaaaattgacgtccctgccaatagatatgatctc 

ctgtgtctggaangattggttcgaggacttcangtotcaaagaaaggataaaaggcttcagtgt 

ataaaacgggtaatgcctgaiggaaaaatccaaaattgattatcacagaagaaacagctaagatc 

cgtncrmgcggtagcagcanttctccgtaatattaaagtttactaaggatccatttgac^ 

catttgaaccttcnagggagaaarracatotaaaatatttgcanggaaaa^ 

ccatttgnt 

seq id no: 2047 acgcgggggtagccggagccggcgacgtgaggcgggcgttgctcgcgcgac 
aagtagttgctgggacagcgaaatggaggggtgtgtgtctaacctaatggtctgcaacctgccta 
cagcgggaagctggaagagttgaaggaaagtattctggccgataaatccctggctctagaa ctga 
ccaggacagcaoaactgcattgcactgggcatgctcaactggacatacagaaattgttgaattttt 
gttgcaacrrggagtgccantgaatgataaanaccatgcagggtggtctcctcttnatattgcggg 
ttotgctggccgggatganattgtaaaanccttrtgggaaaaagngctcaaaatgaatc 
aatnaaaangggnttgtncctntggccnggacnccncttnn^ 
gncgtthn^tatitggatcccagctctggccaaattgggggaaaaatgggnataan^^ 
tggggaaantttttcccctncnatttccacaacatatangncggnaanataaaatg^ 
nggngcccc>rmaagggggccccctcnctttttgtggngggcccactntoccr^ 
aaaancntgt 

seq id no: 2048 gccgtggcgcggccgaggtactttggcctctctgggatagaagttattcagc 
aggcacacaacagaggcagttccagantcaactgctcatcagatggcgggaagatgaagacag 
atggtgcagccacagttcgtttgatttccaccttggtccccttggccgaacgtccacgtgggganc 
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CACTATAATATTGCTGACAGTAATAAACTGGCCACATCTTCAGGGCTGCAAGGCTTGCCGANGGT 

GANAGTGAAATCTGCCCAGACCCCTGCCACTGAATCNGTCAGGGGACCCCGGAGTNCCCGCTTAA 

AATGCCCANTAAAATGAGCAGTTTAAGGAGGOTGCCTGGCTTTCTGCTGATACCAANCCTAAGTA 

NTmTTATTGGTTGGAAGGNTNTANTAAAANACTTTGGCTGGGACNTGCATTTGATO 

CNCCCAGATANAC 

SEQ ID NO: 2049 ACAAGTTTGAACTGGATACCTCTGAAAOAAAGATTGAATrrGACTCTGCCTCT 
GGCACCTACACTCTCTACnTAATCATTGGAGATGCCACTTTGAAGAACCCAATOT 
GCTGATGTGGTCATCAAGTTCCCTGAGGAAGAAGCTCCCTCGACTGTCTTGTCCCAGAACCrm 
ACTCCAAAACAGGAAATTCAGCACCTGTTCCGCGAGCCTGAGAAGAGGCCCCCCACCGTGGGTGT 
CCAATACATTCACTGCCCTGATCCTCTCGCCGTTGCITCTGCTCTNCGCTCTGNGGATCC^ 
TGCCAATGTCTCCAACTTTAACnTITGCTCCTAAGCACGATTOTAT^ 
TATGCTGGGGACrCATGTATGGTCTACTGGGACTCACTTANCATTGTTCCAAGACCT^ 
TGNCCGGGCGGGCCGCTNAAANGGGCGA 

SEQ E) NO: 2050 ACTGAAAACATGAAAAACAGCAAAATCCAAGGGTGAACTTTGACCTANATTG 

ataaccaaatctagcaaacccacatgtatatataaaatcagaotgaacactagccggggcggtgg 
ctcgcccctgtaatcttagcactttgggaggccaaggcaggtggatcacgaggtcaggagatcaa 
gaccatcctggccaacatggtaaaaccatgtctctactaaaatacaaaaaattagccgggcatgg 

TGGTGCATGCCTGTAGTCCCAGCTACTCAGGCGGCTGATACAAGGGGAATCGCTTGAATCTGCA^ 

ggtggaggttgcagtgagctgagattgtgccactgcactccagcctggcaacagaatgagactct 
atctcannaaaananatatnnaaaaaaaaaaaantcc 

SEQ ID NO: 205 1 ACTGCAAGACCCATCTTCCCTCCAGTTAATACACTCCCAGGATXjGGCTGCAG 
AGGGGGAGACTCTGAGAGAAGCTGGAGGCCCACAAAAGTCCACTGACCTTCITTCTGTCCCA^ 
ATGAATAAAGGACCCAGTTGTGCnTrCCrrCCAAAATCCTCAACAAAGTTGm 

aatgtgggaataaaaaaatcatgtcccaggtcatctttgtgtgtgtgcgggggaggtggatggga 
ggaaaaggcattgtattaatagatactgctgctataaaatgacatnaaattatagccotg 
gttotgtaaacaatgccgnrrttttaggttatttggcact^ 
tttntnaagtcangtgcttnrititccaaaaataatc(^^ 

seq id no: 2052 acotctmcagaagtaaagcctgcaggccctactgttgagcagcagggag 
aaaatgggcgccgatctggaaggaaggatgttgcccacctttgnaccttgao^agaaaccaaaat 
tnttcarratttngcttatctngttgaccggccaacgtgatnanatcctg^^ 
acnrggangaagcaaaaggganacttgcagcttnctotgctgaaa 
tcaaaattgaacagcctgccatcggtcttgcnacagatcgcaancctcctccca3wacaacgtgg 
ggacctgtttttgcgccccaccccaaatcgccaaaaacttggaactggaacaaagtgactgtcca 
attggagttctgctggngatcrrrgaatcntcgncctgatgctaccccangtgggactttggc^ 
cgcaagtggcaatgggxmitnctcaaagagggaangaggntgcnacagnaacccggt^^ 
ctctgacccnanaagntnttttnatccatggggtnaaaggagggccngccact^ 
cnaaaaaaaattttaaatttttttccccctttan^^ 
tttcccann 

seq id no: 2053 acacatraagcatccccagttcccctcgcacaccccttttcccagccactagt 
aaccatccttctactctctatatccatgagttcaattgtmgacttttagatc^ 
gagaacatgcaatggttggctggttctggcttaatgtacrraatatagtgacctctantt^ 
atganttotaactggccctggatttttgacctttattt^ 
cttatttttaaaccct^atatacaatttcctctaaotata^ 

gncataacatannaccttnggccgngaccaccctaangggcaaattncancncactggngggcc 
gttctagtgggatccgagcttnggaccaaancttgggggaaacatgggcaaaactgttm 
gggaaattgttttccnctcaaaattccncaaaaaatacnagccggaancataa^ 
cggggngcctatngngggncccnotcccatttntnngggtgcggccaacnggcccxm 

seq id no: 2054 actccagatggcgccaaagaataaataggcaggtctctatgataaaagaaca 
aagggaaagcagtttagctcgattgttgatcaccttcaccaagcgccgtcttgc^^ 
ctgagcagtggagattttatcaacaccagcaggcatcacagactgacacaactcttttatct^ 
tgggggaagctgctctacarmgtagcttgctgacactctgtcnatgactgcataatagctg 

AAATCATTANCACTCTGTTTCAGCAGGTAGATGATAATGCCTAAACCAGGCAGGCGCCAGTATGG 

AACCACTGGAACCTTGGGTATCTTGCCGNGTCCATrrTATTAACTGGTTTCT 

GAAAAAAACNCCNNANCNTGCTGGAANGGAGGGGAACrCTGTACATGAGTGACTGGCAATATTC 

CATTACATTGGCACAATTGNrmATACCAGNTCAAACnKATNTTTTm 

ATTATCGNCTTGAAATTTAACTGACCNAGCITNTCNAACCNCCCAAACGAC^^ 
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TGGCNCTGGNATCrTCAATTTTTCCCTTGGNCTCCT^ 
NTTA 

SEQ m NO: 2055 ACTAAAGATGATGATTNKITNGTGGNATGCCTGAAGGGGCNGGCNCCAT^^ 
AAAAAGTATTGGAAAACATTAACCAAGAAAAAGGCTCCTTGTCCCTCCTAATCCATATCCANATA 
CCCCCANCCAAANATCTTAAAANGGATAGACCAGCCNTATNCCAACATGCCAGGATTCCAAAAGC 
ATGCCTCANCTAATTAGGGTNTTCCCTACAGCAAACTGGGGTANAGGGGCTCCATCCrGCA^ 
CCANAAAGCCnTACCTCGCCTAGGCCAAGGGTATGGAATCTAAATGAANACTGNCCACAAACAGT 
GGCTNTAAAACCTGGGCTTAAATAAAATAATATTTTNCTTCATTACATGG 
GANATNGAAAAGATTGATTTrACTACCTTTGGGGGCTGAAAAACTTCCCCTTGrrGGTO 
GGTTTAAAGGAAAATAACAGGAAAACCTGTCTNNAAAAAAACATAAATNA^^ 
CNCTTTTT 

SEQ ID NO: 2056 ACTTONTTTTTTTTTTm^ 

ACANAOTGGTGATAGCCCCTAAACATAATGTTAAAATTTGGATTAATGGGCTAT^ 

GGTGGAAACCGATGAATAAAAAAAATCCTGCTACACTATATGCTTGANTGGANTANGGCTGAAAC 

TGGGGTGGGCCrrCTATGGCTGAGGGGAATCANGGGTGGANACCTAATTGGGCTGATTTGCCTGC 

TGCTGCTAAGAAGAAGCCTAATAGTGGGGTGAAGGCTTGGATTAACGTTTAAAANGGCTAm 

TGNGGGTCTCATGAN1TGGAGTGTAGGATAAATCATGCTAAGGGGANGATGAAACCCCATNTCCC 

CGATACCGGTTGTATAAGATTTCCTTGAATGGCTTCTrGTGTTGGCCATT^ 

TAACTGGTTGAGCCAANAAAGGTTirmTTCCTTNCCCCCn^ 

GGTI>nTACCGGGAACTTAANATrrATITGNTATITNGAAAA^ 

NTTTAATGTTTGGGT^mG>^TmATTNCCNNN 

AAN 

SEQ ID NO: 2057 ACTGTAATCCAACACTTCnTrGTTAGCACAGCCACCCAGTTCCCACACCCGGG 
GTGCTTTTITGCCCTTCrCCTTTGGAGCATCTGACTTCGTGGACT^ 
CATGCITCTGAATGAACTCCTCGCGOTCTGCGGATCAACrrCCTT^ 

TCANGACCCACTGGAAGACCTGACTTTCTGCANGGACTGGTrrGCTTGGTACCCAAAGGACCATN 

AAAAACCTTCrIT^^^TTGGCCCCTT^^IT^ 

GNGTCTCAAT^r^mGACCTCANAGGTTTNTTGGGCCTm 

NGNGGGANCCCGGAT^m:ANTGCCCTCNTTTTTrmANAAAGGAGACCCCAAG/^^ 

AAAAAAAAAAAANGCCATTTAATAAAACnTAAANCACTrGGl^GTGGA 

TGCCCOGGNNGGGCNGTTTAAAAGGGNGAATTTCNNNANCAATGNNGGNCCGG 

ATACCAACTCTGGANCCNAACCT^^VNGGAAAAAANGGGNTAAANTGGTT^ 

CC 

SEQ ID NO: 2058 ACAAACATGGGTGAGCAAAGTTCAACnx:AGGTAATAAGTGATTAAAAAACA 
AAACATGTAGTGGGTCACAACCCAGACTGGTTTTCACTGTGCAACTTTCCTCANGGAATAATACC 
ATTCACAGAAAGAAAGGAATNTGCmCCAAGCCTTTGrrCATAAAGAAAATG^ 
TTCATCATNAAGCTTAAAAAAAAAAAAAAGGAANGAAAAAGGGGTGAANCCGTTTGGCCCAAA^ 
AAAAGOTAAATATAATGGGGTATACCTACTTATAGATATGTTGCCATTTAAACCCNTGGATNTAT 
ACATTrrGCAATTAAATGAATTGGCTCAGTAAAGTGAAAAAGTGTTTTCTA 
TGCTGATATGCCNCACATTTTGCTNTGCATGGNGACTGTGTAACrCNAAAT^ 
TGNCNG>m'ANCCCTCAOTANGCATTCTGTGGACAACATGCCTCCAATTCTTGAANAAA^ 
ACNCCCTTCTTTCAAAAAAAAGGGTTTTGCCCNGCCCTACAT^^ 

SEQ ID NO: 2059 ACCATTTTATTTAGTGTTGTAGGAAATGTTGGGTTACTTCTrAAAAACGAAAC 
CAAAGAAATTCAAAAGTCCCAAAGAAAGAAAGCAGGAAATAATAATTCTATAATCCAAAAA 
TGGGCGATCCTTCAGTTGGAGGAAGAGGGCGTCAGTTAAGTAGCTCACACAGTAGATATGGAGAC 
ACCATATGGAGATACGGAGTTAAGTTTGGTGGATACTAGGAATrAAGrrCTCCACCTAAGGCAATT 
AATirrrCAGCCTTGAGAGATAATTAGTAGTTCTAGAAAAAGAAAAAAGTTGACTGGGANAA^ 
TGGGAGGGAGGATGGTGCGTCATTTAACATTAAATTGCCTCTCTCTACAGTATGTGGCTGAGCATC 
ATGGCCTTCCTCCCTCTGCCCCGTTGAACTCCCACTGTTAGCAAACTGAGAGCACATGCCTGTGTC 
rrCTTCATACCGCATGTGCACACACACCTGTATCTCCTTCATCAAGGCATATGCAGACCCCTCCCA 
ATCGGGGGTCCCAGCTTGTTCAATTrTGGCAAAAGGGCANAGTrACTCCTTC^^ 
CCrrGATGAGGTGAACACACTGGAATAAGATGGAGGGCAGGATACCTGCCAAAGCCTGAGGAAT 
GANAT 

SEQ ED NO: 2060 ACCCATCATTACTCCCACTNAGAAAGAAGAAGTAAATGAATGTGGTGAAAGT 
ATrGACAGAAATAATCTGAAACGGTCACAAAGCCATCTTCCTTACTTTACTCCT 
GATAGTGCGGTTATCAAAGCTGGATATTGTGTAAAACAAGGAGCAGTGATGAAAAACTGGAAGA 
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GAAGATATTITCAATTGGATGAAAACACAATAGGGCTACrrTCAAATCTGAACTGG/^^ 

TOTCGCGTAATACCCTTAAAAGAGGTCATAAAGTCCAGGAATGTAAGCAAAGCGACATAATGAT 

GAGGGACAACCTCnTGAAATTGTAACAACGTCTCGAACTTTCTATGTGCAGCTGATACC^ 

AATGCACAGTTGGATTAAAGCAGTCTCTGGCNCATTGTACACAGCGGGGTCCCGGCAAATCT^ 

TCTTnvrTGACATCC<XCCGGTCCTTCANAATCCCAAACACGCTTTCCGTC^ 

GNCACCTTAAATTTCCACAGCCTTmGNAGCAACTTTTTGGTC^ 

CGAGGATTTTACNAATCTTTTGCCCAAGGTCAAGCCAGGGGACTTNAAGGTCCA^^ 

NAAA 

SEQ ID NO: 206 1 ACAAGATCTACCCCGGANTTTGGAGGCGCTACGCCAGGACCGACGGGAAGG 
TmCCAGmCTTAATGCGAAATGCGAGTCNGGCTTTCCTTTCC^ 

AACTGGACroTCCTCTACAGAAGGAAGCACAAAAGGGGACAGTCGGAANAAATTCAAAAN>^^ 

NACCCCGCCGANCAGTCAATTTCCAGAGGGCChnn'ACTGGGCATCTCTTGCTGAT^^ 

GAGGAATCAGAACCCTGAAGTTANAAAGGCTCAAOIAGAACAANCTATCANGGGCTGNTAAGGA 

AGCAAAAAAGGTTAANCAAGCrrCTAAAAAACTGCArrGGTTGTTGTTAAGGa^CT^^ 

GCCCCCTAACCAAAANATTGTGAANCCTGTGAAAAGTTCACrrCCCCGAGTTGGNGGAAACCNCT 

AACTNGCAATTAAATTTTAAATAAGAATGGATTTTACTCC 

SEQ ID NO: 2062 ACCTGTCTTTTCTTTTTTCTTTIT^ 

AGAAAGAATGCAGTATAAATATAGCTTTTCTCTACACGGGAGCAGGGGGAACAGAACCAATCCCC 

AGCTTAGCCACACCCAACATCATGGAAATTACTGTGAACCTGTTGTCTCTTGAGGACAA^ 

CAAAACGAAATCCCTAACArrATTAAAATGTTAGGAACTTTTCAGGTAAT^^ 

AAATACAGAAAGATTACATTCCTCATAATAAAAATCAAAGTGGCCCACGCCATCTGCAAAGGGAA 

CTTGCACCATCnTGGTTTCACTCX}CATTGGTTAACAGTGCCCTAAAAAGTATACAC 

ATAAAACCATTANGGTAATATCrrGGATCATATCCTCTGCATGATGAACTATCACTCAAATTATCT 

GTCATGGGTTACCTTACTAACACTGCCAAAGAAGATAATrAATGGATAATTTTAACAGGGATA 

AGAAACXnTGCAAACCTTTCAAGGTTTCnTTCATAACTATTGGm 

NNTTATCATCAANCTTATNCNTTATAGGCCAATATAGTACCAATGGCTAATTTA^ 

N 

SEQ ID NO: 2063 ACTGGCTCCACCCCITGGTGCTGGCAGTGTTTGGGGACATTATGCTGGAAAG 
AGCTCCTAGCATCAGAGGATTAACACTAGCAGArrCTGTTCCATCTrrGCACTGTTGCTTACCT^ 
GATTTTCTTAACTGTTCTTGTGCAATCGACAATGTGCTAACCTGCT^^ 
TGCATTACAGGCTGCATTCTTGCCrrACTGTATAGAAAAAGAAAAAAAGGCTGGGTTACT^ 
CATTTTTAAGC>rmAATACCTITATCTTCnTGGAAAT^ 
AAACNACAGGTNTTG>rrrGTAAANGGAATTITAAATTOGNCCATTr^ 
ACTTAANAATATNCCNTGTGNTTNACAGNNGTGAGGGGCCTGmATNTNAT^ 
NnTGTNAAANGGGAAAGTGNTTCTTATGGG 

SEQ ID NO: 2064 ACATATAGGTGGAATGAATTCTATCCTTGACATACTGAGGCCAAATTACAGC 
CTmCAATAGCTCCAATCTCArrAGACAGTCCCAAGCCATAGTAGCACAATGCTCCAAGACCA^ 
AGCAGCCCCTTCCAGCAACAAACCATCTTCCATCTGATCAAATTTAAAATATTT^^ 
GTTCCAATGGCrcCCTTTTGAANTC^m'GGCCC^ 

gnggctatttcctggctagggtttaacaagccnnttngatctttng 
agggaggcrttx3tgaaaagcttggtx3gaaaaccctnaaaagna>^ 

CTTGCANCAACATGGT 

seq id no: 2065 actaagtaaatgttcctgctattatttmaattatatt^ 
cctccttgtgcacctcttgtttcctttctgaattatgttgacactgaagaccaatgg 
cgggttaatttaaaaagaaactggggcctcataanggactaaaagaaaattaaactagcct 
gtactaccaatnctttttataaanataatggttngaaaccrggggataaangtt^^ 
cttggttgcnanccaatgggttgtggggaacctgtggtttctgaaggggtgaattaaaaaacc^ 

CTTGThrmAGTTCCCTGGAGCCThrrrTTTNAAAAGTCANAT^ 

GCGGGCCGTNNAAANGGCCAATTCCAACNACTGGCGGNCGTTCTANTGGATCCGANCTCGGACCA 
AACTTGGCGTNATATGGGCANAGCTNNTTCCTTNGGGAAATTGGTATNCC^ 
NATTCCAANCCGGAAGCATTAAAGTGKTAAAGCTGGGGGGGCTTATGAAGGGACCTACNTACAT^ 
NATTGCGTTGCCCCACTGGCCNTTTCAA 

SEQ ID NO: 2066 ACGAAGAAAGCATITCCCAAGCAATGAGTCTCTTAATGGAAAAAATAAAAGA 
GCAATGTAATTAAACTTTCCTTCAAAGAAAGGAAAGAAACCTAAATCTGTTATG 
AAGATAAATATTTCCTAGTCCCATAATATGTGTTTGTTTTACAAAGATATGOT 
TCTATGAAGCAGGGATCTGTGGCAATCCAAATTCATCCACCAGAACrCCATCCTTGTITI^ 
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CAGTGGGGAACACCTTCTGGAATTGCANGTGCAGATGCTGCCTCATCCAAATAAGAACTGCTTC^^ 

CANCCAGAAGCTCATCACCTAGTGCATCCAACTCTGCTrCTAAATCATCTTCATCCAGTTCTGGGG 

TGCCATAACTGCGACTCAGTGCnTCTTGGATTTCATTTGCATCTTCCATCATATCCTCTACTGGTCT 

TGNAAATCCCTAATCTGGTCGATCTTCACmGCTTTGTATGCCTTCTTCATTTCOT 

GTTTCTAGCATCAACCGNGGTCTTTGGTGTNCCTTCAAAAGACTGGATNGTATAATTGGC^ 

CTGTTGAATGAACTGGTGGGGCAAAATTTGCCCGNTGCTGGTCATACOTCCnT^^ 

A 

SEQ ID NO: 2067 accagctactgaatacaccggatctagatatgcccagttctacaaatcagac 
agcagcaatggacactcttaatgmctatgtcagctgccatggcaggccttaacacacacacctc 
toctgttccgcagactgcagtgaaacaattccagggcatgcccccttgcacatacacaatgccaa 
gtcagttrcttccacaacaggccacttactttcccccgtcaccaccaagctcagagcctggaag 
cagatagacaagcagagatgctccagaatttaaccccacctccatcctatgctgctacaattgctt 

CTAAACTGGCAATTCACAATCCAAATTTACCCACCACCCTGCCAGTTAACTCACAAAACATCCAAC 

ctgtcagatacaatagaaggagtaaccccgatttggagaaacgacgcatccactactgcgattac 

CATGGTTGCACAAAAGTITATACCAAGTCTTCTCATTTAAAAGCTCACCTGAGGACTCACACTGGT 

gaaaagccatacaagtgtacctgcgggcggccgtcgaaagggcg 

seq id no: 2068 actgcacggcaattgaagcatagctactacagaataactcaccrrccaacaa 
ttcctgaaatgggtcccttactgggattattacagcaccaaaaaacttctctgaagccm 
caaccrrgttctatgggattccataatggtaccaatgggattaaagctatgaaccctcaaancatc 
acgagaataaccatgatgggtctaagacttgggaaaacrggcctaaattatgntg^ 
atgttaaaattgaattcatctgggaagcattcaaatcaancttaaagnctaatctgaaatgct^ 
acagccrgaaaggnaactgggaatctcatttctatcattgactaacm 
atrtaagnggnattgaaaatgctttggagggagtcacacttatactatcaactattagtc^ 
agcttcaatcactggcattattctaatcctctccrcttaaattttaag 
agcaacatttcgcaaatgtgcctcgoncgcgacccccgttaanggc 

SEQ ID NO: 2069 ACTTTNTTTITriTrrTT^^ 

TTGAACAGGTTCAGCTATTACTGAAACrrGTAATTTCTAAACTTAAGTTGGGGCAAATGGC^^ 

TGCAGAATAATGCCATCACTGGGCACTGCGAATGCCATGACTGAAAAATTAACAGCCACCCNTNA 

GGCGCAGGACCAGGTGCAGGGTCCACTCTTTCTGGATGrTGTATCAAAAAAGAOTGCGGNCATTN 

TTTCAAGC^IT^^^^'GCCTTGAAAAAANAACCT^^^GCTGGTCC^ 

ATCTTGGCCITTACATrrrCGATGGGGTCN(nX}GGCTCCACCTCCAA>^ 

AGGTCTTCACNAAAATCTTNATTCCACCTTTTAAACGCANGACCAGGTGCANGGGTCACTC^^ 
GATNTTGGAATCAAAAAAANTGCGGCCNNTTCACTGmUCCTGCAAANATAACC^ 
GGGAAGGGATGCCrmrm^TCCTGGAhnmT^GCCTT^ 
CNAirmCAAGGNANATTGGTCTTGCCNGTNANGOTTTTTTAAA^ 

SEQ ID NO: 2070 ACAAACTCCCAATTGCACCATTAATTATGGGGTCACATCCAGTTGACAATAA 
ATGGACCCCTTCCrrCTCCTCAATCACCAGTGCCAACAGTGTTGATCTTCCTGCCTGTTTCT^ 
AAATTTCCCCAGCCAATCCCAGTATCTAGAGCATTTGTCAGAAACTGCANAACTGCACAGGAATTC 
CATTGTTGAAACTCAACCAACTTATGCCCCCTGTATGAACTGATCACTCANTTTGAGCTATCAAAG 
GACCCrcACCCCATACCTTTGAATCACAACATGAGATTTTATGCTGNTCTTCCTGGCAAGAACACT 
GCTATTTCCTCAACAAGGATGCTCCTCTrCCAGATGGCCGAAGTCTACAGGGAACCCTrGTTAGCA 
AAATCACCTrrCAGCACCCTGGCCGAGTTCCTCTTATCTAAATCTGATCAj^ 
TAACACCCCTCATTTGGAAGCTGGGTNAAAAAAACTATTCTGAAANAAAATT^ 
CCAAATTTGAANTGNNCCTTTTTCAAAGTCTCGmAACNTATI^^ 
ACTCCC 

SEQ ID NO: 207 1 ACTTGTGGGCCAGCTTAAGCAGCTGAGTAGCTGTTTGGCGGTCCAGGGCCTG 
GGTGAACTGGTTAATCGCAGGAAGCACTTrCACCCGCTTATAGAGGATGGGCTCTC^^ 
AAACCTGATATAGCGGGGCCATTTTCACAAAGCNGGTGAAGGGTCTTTTTTTG G^ 
TTGCCAAATGCCAAATTNrrAAGGCTTTTTITNAAA^ 
CTGNTTTTTTAACNAAAAGCTTGGGCCCGGAACCCACCTTTTm 

GCATTCTTGGGCCGGCGGGAAGAAAAAAACCCCCNfNNNTCCNTNGGCCGNAACAACCl^ 
CNAATTTCAANACACTTGCGGGCCGTACTTANTGGANTCCNAACCTTNGGNANCNA 

SEQ ID NO: 2072 CACTACAGAGCAGTrGGGGTATGATGGGCATGTTAGCCAGCCAGCAGAACCA 
GTCANGCCCATCGGGTAATAACCAAAACCAANGGCAACATGCAGAGGGAGCCAAACCAGGCCTT 
CGGTTCTGGAAATAACTCITATAGTGGCTCTAATTCTGGTGCAGCAATTGGTTGGGGGATC AGCA T 
CCAATGCAGGGTCGGNCAGTNGGTTTTAATGGAGGCTTTGGCTTCAAGCATGGTTm 
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TGCTGGGGAAATGTAACAGGGGGGGTTGTGGNT 

SEQ ID NO: 2073 ACCCTTCCNCTTACCTATGCCCATGTGCCTGCCCTTCCGGCGGGCCAAGGTGT 
TTTTCCGGCNTCGAGCCCGGGGAATGGACCGTCACAGGCnTGCGGATGATCAGCCCATCrrTTGAT 
GANCTTNCGGATCTGCTGACGGTAAGTTGGCArrGGGCGATTTANATTGGTNTCATTGGTNGT^ 
NCCACACCTTTNTmrrGNCNAC 

SEQ ID NO: 2074 ACTTTTTTTTTTTTTrT^^ 

AAAAACTCCATTATCCCAAGCAAAAAGCACAGAAGGTGGAGTTTGGCITCAAGAGATGT^ 

AAAATCTTAGGCCTAGCAGANAATCACCAAATTTATGGAGAAGTTAAAAGGGGTTTAACAGGGAA 

GGAAGTGarTrrATTAAGTTCTAAACNANAGGCTGGGAGNCACANNTTAATC^^ 

CCTCAGNGAAAGGTGAACCCATTCGGGGGTGGCATGTCACXTCAAGAATTAACCCCACTTAAAAA 

CAAATGATTTCNTAGGATANCACNNNGCCTGGTGCCrGTGAACCCTGAGGCNC^ 

GCNCTGGTTOTGAATNNGGANACCCAAAATTmrmCTO 

TmACCTTAAACTTTGTAAANCANa^TAAAACCCNACCCCNTCCAANAT^^ 

ACNTTThriTOTGGCANCCCAAANGAAAGGACCTNNTATT^ 

SEQ ID NO: 2075 ACCCTTCCNCTTACCTATGCCCATGTGCCTGCCCTTCCGGCGGGCCAAGGTGT 
TTTTCCGGCNTCGAGCCCNGGGAATGGACCGTCACAGGCTTGCGGATGATCAGCCCATCrm 
GANCTTNCGGATCTGCTGACGGTAAGTNGGCATTGGGCGATTTmATTGGTOTCArrGW 
NCCAAACCTTTGrnTTTGNCNA 

SEQ ID NO: 2076 ACCCTCTCCTTGTTCTCrTCAGGAATCAAGGGTCCTGATGCGGCAAAGGATQA 
AGTTTTAGGAAGCTCTGGGTCACAACGCTCTGCCCCCTGGCATTTANAGATAm^ 
TTTGCCATACACTGGCACCAGTGTGNTNAAGTCTGANGCCCANCTATNCACANGTCTTO 
ATCCTCCAAATGNACCANGATTAATGGTCAAATNTATGGCTCNTATA 

SEQ ID NO: 2077 ACrTTTTTTTTTTTTTTTT^^ 

CACAGANCAGTHTNATGAAGGNGGTrrTCTCCTGACTCCATGCATCrrTNACACA^ 

TAAATATG<X(>rGTNATCrGCCCCACCTCAGGNCTGGAAANNTGGCACTTAGNAAGGGGGGC>^ 

ATGCn-AAGTCTCANGANGGTrTTTAANGGCATTTTTGCGGA 

SEQ ID NO: 2078 ACACAATGATATAACCAGCTATAAGTTTAAAAGCTTAAAGCACTGTGTGAGT 
GCTGGAGAACCAATTACCCCTGACGTGACTGAAAAATGGAGAAACAAGACGGGCCTGGATATCTA 
CGAAGGATATGGACAGACTGAAACGGTGCTAATCTGTGGAAATTTTAAGGGAATGAAAATTAAAC 
CTGGCrCAATGGGAAAACCTTCTCCTGCTTrCGATGTTAAGATTG 

TCCTCCTGGACAAGAAGGGAGATATTGGCATTCAAGTCTACCCACCGCCATTTTGGCCTT^ 

NTTACGTAGATATCCTTCAAAACAGCTTCCACTCTACGAGGCAATTTCTATATCACT^^ 

GATATATGGATAAAGKNGGGTTTTCTGTTTGTTGCAANANCAGATGATTGCATA^^ 

ATCGANTTGNACCATTNAGGNANAAATGCCCTGATGACTCCTNAGrrGAGAGTCACTGTGTCACN 

NCCNAACCCATNGAGGAGAGGAGAAAGCnTGCGnTAATTCTGNT 

SEQ ID NO: 2079 ACAAACCACGGATCTTGTGTCAGAAACACATGTTGAGACTCCTCCATTCCTTC 
CAGAATTTTCAGAGATGGGGTAGACCCACCTCAATCATCCTCAGCATCAGTTTGCTAA^ 
GCrCAATGACAAGCTCTCCTGCCATCTCCAAGCCCACrrn'CATAGTTCCGCTCTGTCm 
AGCACTTTAGGCACTATrCTAAGTCCTGGAGTATATCACTCTTGOTCAGAGCTAAATA^ 
ATGAACACACrrACTCANACAAGTCCTGGATAGCTGCCATTGCAAGTACATACTCAGGAGATGAA 
NAAGGAAGCCTTAAANGGCTTCAGAATAGACATNCTAATCAAGATGTGGCCAGACAAAGACACA 
AACTirrTACCCTAAAGANCCAATATTTCTGGATTNAANACCTCOT 

CAGGAGGNGAGTGAT^^^^AAAC^^^GAGGTTGGGACAGAGCTATGCCANAATCCACATTTGGGGGC 
AAGCAGAAGATTGTTGAGCAGATTTAAAACCGCTGGNAAC 

SEQ ID NO : 2080 ACAAGCACTTAATTAAAGCAGAAGAGCCCAAGAAGAAGAAGGGAAAAGTGG 
AAGTGAGAGCCATTAATTTGGGGACAGATTATGAATATGGGGTOTTAAATATGCATCN 
TATGATATGACCCTGGCAGANAGTTATGCCCAGT^^'GGTCACANCCTCTG(>^ACTATC 
ANANGNCNAGGAANGGNTTTCNATGNCACCCAGNANCNNTATATGGTGTAGTTNGT 

SEQ ID NO: 208 1 ACGCGGGGGTCTCTGGTTTCTGGCCCCTTGTCTGCAGAGATGGCTCCCAATGC 
TTCCTGCCrCTGTGTGCATGTCCGTrCCGAGGAATGGGATTTAATGACCTTT 
GACAGCGTGAAAAAAATCAAAOAACATGTCCGGTCTAAGACCAANGTTCCTGTGCAGGACCAGGT 
TCrmGCTGGGCTCCAAGATCTTAAAGCCNCGGAAAAGCCTCTCATCTTATGGCATr^ 
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AAAAAGACCATCCAC^^ITACCCTGANAGTG^^^GNAAGCCCAGTGATGAGGAACTGCCT^ 
TGGQNANTNTGGT 

SEQ m NO: 2082 ACGCGGGGAGAGCGCCGAGGAGCCGGTCTAGGACGCAGCAGATTGGTTTATC 
TTGGAAGCTAAAGGGCATTGCTCATCCTGAAGATCAGCTGACCATTGACAATCAGCCATGTCATCC 
AGGCCTCTTGAAAGTCCACCTCCTTACAGGCCTGATGAATTCAAACCGAATCATTATGC ACC/^ 
AATGACATATATGGTGGAGAGATGCATGTTCGACCAATGCTCTCTCAGCCAGCCTACTCl 1 1 1 i AC 
CCAAAAAGATGAAATTCTTCACTTNTACAAATGGACCTNTTCTTCAGGAGTGATO 
ATGCTCATTATTGGGGATGTGCATTGCCT^TCTTTGCTGTGTGGNCTCCACGC^ 
GCTATGGAACTTCCTTTTANGAGGTANGTAGGTACCCTTATGGAGGAAGTGOT 
GTGGTATGOTATGGTOTGGrrOTGNTATGGa>fCGGAGGTATACAACCAAAACACA^ 
rTGCCATGGTGCCTTTGTTCATTGCCCGTGGGACriTGTACCAGGTAAAAAACTAAAG 

SEQ ID NO: 2083 ACGCGGGGAGAGCGCCGAGGAGCCGGTCTAGGACGCAGCAGATTGGTTTATC 
rrGGAAGCTAAAGGGCATTGCrCATCCTGAANATCAGCTGACCATTGACAATCAGCCATGTCATCC 
AGGCCTCTTGAAAGTCCACCrCCTTACAGGCCTGATGAATTCAAACCGAATCATTATG CACCAA GC 
AATGACATATATGGTGGAGAGATGCATGTTCGACCAATGCTCTCTCANCCANCCTACT>ri^^ 
CAGAANATGAAATTNTTCACnrrmCAATGGACCTCTCCTCCA^ 
NCTTAATTATTGNNATGNGCATTGCCATNTTTGCTGGGTGGCCTCNCCCrrG CCT^ 
ATNGNAANTCCCTTTTTNGGAGNAAGNGTAGTTNCCCTTANGNAGAAGAGNT^ 
NAAANGNTTNGGTTNGTOTGGGTATGGTrTGCTACGNGGGCTTACAAACCCAA^ 
TTAANGTCGCrTGCrGCTTrTNTTAATGC 

SEQ ID NO: 2084 ACGCGGGGACGCTCGACCCCAGGATrCCCCCGGCTCGCCTGCCCGCCATGGC 
CGACAAGGAAGCANCCTTCNACGACNCAGTGGAAGAACNAGTGATCAACCAGGAATACAAAATA 
TGGAAAAAGAACACCCCrmCmATGATTCNGTGATGACCCNTGCTOT 
CCTGas^NAAT^TCITCCTGAT^rITCC:mTCC^ 

SEQ ID NO: 2085 ACAAACAATGTTTAmGTTTGTAAAGTGCCAGGTTTATATTTAAGTAAACAT 
TAAAATCTGCGTTGAAGCAGTGAGGCCGCATCriTrAACTGGCTGTGCTGGTTAAAC^ 
ATCCCGCTXjTGCTAAGCTCACTAAAGGGTCACCCCTCACTTTAAAGCCAAGACTGCCATTGTCA^ 

gctatggtaagtcacagccagccaggccrrctggcaaaaggtgatactaccagcactataaacag 

acggactggttgtgaggnanctcacagttttaaaagatgcttgtnangaacat^^ 

ttgngnggctagtttggnaacact 

SEQ ID NO: 2086 accggggacaggtgcagtccctcacctgtgaagtggatgcccttaaaggaac 
caatgagtccctggaacgccagatgcgtgaaatggaagagaactttgccgttgaagctgctaact 
accaagacactattggccgcctgcaggatgagattcagaatatgaaggaggaaatggctcgtcac 

CTTCGTtfAATACCAAGACCTGCTCAATGrrAAGATGGCCCTTGACATTmG 

AAGCTGNTGGAAGGCGAGGAGAGCAGGATrTCmGCTCTTCCAAACrm 

GGGAAACTAATNTTGGATTCACTCCTNTQGTTGATACCAACTCAAAAGGACCn^ 

NTGGAACTOGAAATGGCAGGTTATAACNAAAATTTNTNGTAATANGANNACTTG 

CACCCTCTGTGCTGCATOTATTACCACTAAAAAAAAAAAAAAAAAGTTTCCTCT^ 

SEQ ID NO: 2087 ACTCCACAGAOAGATGCAGACAAAGTAAACAATGAAGGTTGTTTTATAAAGG 
TGATGACCATrATAGAGTCAGAAATGGGAGTCGTTGCAGGAATTTCCTTTGGAGTTGCTTGCT^ 
AACrGATTGGAATCTTTCTCGCCTACTGCGTCTCTCGTGCCATAACAAATAACC^^ 
TGTAACCCAATGTATCTGTGGGCCTATTCCTCTCTACCTTTAAGGACATTTAGGGTCCCCCCT^ 
ATrAAAAAGTTGTTrcGCTGGAAAACTGCAACACTACTTACTGATAGACCAAAAAACT^ 
TAGGGTTGNTTCAATNAAGATGTATGTAGACCTAAAACTCACCAATANGCTTGATrCATCAAAATC 
CGTGCTCCAGNGGGCTGATCAACAAAATTNATTGrrGNTATGTTCrAANCCACCTN(^ 
ATGGTNAAATO^TGAACCCTGTTCCTTTGAACACTGGAAAACTAGT AATTGG AAATAAAGA^ 
NGGTCCTTTGCTGTNTITrTCTAAANGGGGCTTTGAAGGCCT 

SEQ ID NO: 2088 ACCATTCAGGACATAGGCACGGGCAAGGACTTCATGTCTAAAATACCAAAAG 
CAATOGTAACAAAAGCCAAAArrGACAAATGOGATCTAATTAAACTAAA GAGCTT CTGCACAGCA 
AANGAAACTACCATCAGAGTGAACAGACAACCTACAAAATGGGAGAAAATrmGCAATCTACTO 
ATCTGACAAANAGCTAT^ATCCATNAATCTGCANATGAACTC^^'CAT^OmTACA TGA^ 
NGNCCTTTNAACNANTGGGCAATNGGrrrGNACATATTCTNTTGAAANG/^ 
TNNGACCATGATAAANTGCTCTC 



312 



wo 02/29086 



PCT/USO 1/30732 



SEQ ID NO: 2089 ACll U -lUU" l U U - riU " lUl - l l f T l l T T T TATCCTCCAAACAGATTTATTGAATACAG 
CAAAATTCTATATACAAAGTGACCTGGACCTGCTGCTTCAAAACATGATCCTTTOT 
TTGATAGNCGGNCCATANAGCArTATAAAGCAATTGACTCTTAAATAAACAAAAAAGTGCCTAAT 
GCACATTAAATGAATGGCCTAACrACTGGAACTTTANTANTTCTATAAGGT^ 
GATCCAGTTCCTANTGACNGGCTTGCTGAAAAACAAATATGAGCNTCANNNNGNCT^^ 
CTGNCACCGGNAAGCCCTCNTGTTTATGGANCNAATGTCCCATTATCGATTCTANACNACCACN^^ 

AATTCANGGGGGCAAAGGTT 

SEQ ID NO: 2090 ACTGGTCCAGGAGTTATCCAGGATAGATnTCACCCACCATGGOACGTCATC 
GTTCAAATCAACTCTTCAATGGCCATGGGGGACACATCATGCCTCCCACACAATCGCAGTTTGGAG 
AGATGGGAGGCAAGTTTATGAAAAGCCAGGGGCTAANCCAGCTCTA CCATAA CCAGANTCAGGG 
ACTCTTATCCCAGCTGCAAGGACNGGCNNAAGGATATNCCACCTCGGTTITITANGANANGACAG 
ChrrANTGCAGATAGANATTTGCCrGGAGGNCTGCTNAGTTOATNCCGTANGA 

SEQ ID NO: 209 1 ACCCCTTAACCCCCTCTCCTTCAa^CTTAGCAGCAAGTCCCACIT^ 
GGCAAGAAACCCCAAACCC(JITCCCTCCGTGTCTTTACGCTCTCTTTTCT 
ACTATGGGCAACCrrCCATCCTCCATTCCTCCTTCTCCCITAGCCTGTGTGCTCAAGAAC^ 
CTCITCAACTCACACCTGACCTAAAACCTAAATGCCTCATTTTCTTCTGCAACAC^^ 
ATACAAACTTGACAATGGCTCTAAATGGCAGAAAAATGGACTTTCGATTTCrc 
CTAAA>mATTrrTTGTCAAAAAAATGGGCAAAATGGTCTGAGNGCCTGA TGN 
CACATCCGGCCCTrCCTANCTCrGTGCCCAGGCAACTCGTNCAAATCTTCIT^ 
CCTAATCCAACCCAAGCGTGCTGAGTGGTATATTDnTITCNANAACCCArrGA^ 
NGCAACTAGGCCAATTTTCTCAGNTCCTCT 

SEQ ID NO: 2092 AC'in'il-l U ' lTr i'rilUTITTNTTrn^GGCCAGAAAAAATAATCCGT^ 

AAAACCTGGAGGATACTATTCCACTCCCCCAGATGAGGAGGCTGAGGAGACCAGACCCCTACATC 
ACCTCGNATCCACTTOTGATACT^m^<ACGAGGCANNAGGCAAAN 

AAAANCAATrCCAAGGGCTGCTGCAGNTACCACCAGCACATTTrrCCTCAGCCANCCCCCA^ 
NTTCACACAGNCOTCCTTATGGGATTGCCTTCTCGTANAAAT 

SEQ ID NO : 2093 ACGAGGACTGGATGGAAAGGTGATTTGTGGCTCCCGAGTG AGGGTTG AACTA 
TCNACAGGCATGCCTCGGAGATCACGTTTTGATANACCACCTGCCCGACGTNCCTTNGATCCAAA^ 
GATATATGCTATGANTGTGGa^AAAAGGGNCATTATGCTTATGATTGCCNTC 

SEQ ID NO: 2094 ACATTGACAGACirn'CAGTATTGTAAGACCAAGAAGACTrCTCTACATGGCA 
ATAGATGGAGTGGCACCACGTGCTANAATGAACCAGCAGCGTTCAAGGAGGTTCANGGCATCAAA 
AGANGGAATGGAAGCAGCANTCGANAAGCANCAAGTCAGGGAAGAAATATTTGGCAAAAGGTGG 
CTTTCTTTNTCCAGAA 

SEQ ID NO: 2095 ACTGGCCTGCrGCTGGCCCGCAGGCTTCTCAATAGGTTTGGCATGGACAAGA 
TCTATGAAGGCCAAGTGGAGGTGACTGGTGATGAATACAATGTGGAAAGCATTGATG GTCAG CCA 
GGTGCCTTCACCTGCTATTTGGATGCAGGCCTTGCCAGAACTACCACTGGCAATAAAGTT^ 
GCCCTGAAGGGAGCTGTGGATGGAGGCrTGTCTATCCCTCACAGTACTGCGATTAAAAAAAAAGC 
ACTTCTGCAAAGGAACCATGTTCCAACACCGCAAACAAGGTGTTCTGCTTAAACAGAGTAAGATC 
ACCACCCCCATCCATCCCTTNCTTTCCTGTTCCCTCCACTTGATTGTGTCA^^ 
GGTAGGGATGCTCAGCCACCTAAGGCAAGGATNCTTGGGAGGTGGAAGGCTTGCATGNTTNANCA 
CACCAAACTGANCGCAAAAGGTCACTGCTCATCTAAATCrCTGGATGTTCT^ 
NANTCCAATGCAGGGCCTGGTTGTCNTGTCCGGT 

SEQ ED NO: 2096 ACTCCCTACGGCACTAGTCTACAGGGGGAAGGACGCTCTGTGCTGGCAGCGG 
TGGCTCACATGGCCTGTCTGCACTGTAACCACAGGCTGGGATGTAGCCAGGACTTGGTCTCCTTGG 
AAGACAGGTCTGATGTTTGGCCAATCCAGTCCrrCAGACCCTGCCTGAAACrrGTATCTTACGTGA 
ACTTAAAGAATAAAATGCATTTCTACCCCGATCTCGCCCCCAGGA 

SEQ ID NO: 2097 ACAGGGAAGTGTGAGGAGAGCAGCACCCCAGGAACCACCGCAGCCAGCATG 
CCCCCGTCTGACGAATGACAGGAAGATTGTTGAAGGCCATGAGGGAAAAAATAAACCCCAGCTCT 
GAATCACCTACCTTCACCATCTGTATATACAAAGAATrCTTCGGAGCrrGTCTTATTTGCTATAGAA 
AACAATACAGAGCTTTTGGGAATGGACTCACTGATTTTCAGTCTTT^ 

TCTGTGATCTGAGGGTATAAAGACATGTCCACCANGTCTGAGCCCTCAAAATGTCCTGATTACAAT 
GCTGGCTGTCCAACTGCCTGTrCAATAAAAGTAAACTCAGCAGAACACCCATNTTT^^ 
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GNTrmATOTTTTTNTTNAAAANTGrrCNTTGNCCGNACCCCCT 

SEQ ID NO: 2098 ACGCGGGGACGTCGCTnTTGATCCTTCGATGTCGGCTCTrCCTATCATTGTG 
AAGCAGAATTCACCAAGCGTTGGATTGTTCACCCACCACTGCCCTGGGAAAAAAAAAAAAAAAA^ 

SEQ ID NO: 2099 ACGCGGGGAGGCAITGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCA 
GGCGGAGCTTGAGGAAACCGCAGATAAGTTITmCTCTTTGAAAGAT AGAGA T^ 
TTAAAAAAATATAGTCAATAGGTTACTAAGATATTGCTTAGCGTTAAGTTm 
TAGCTTAAGATTTTAAGAGAAAOTATGAAGACTTAGAAGAGTANCNTGAGGAAGGAA^ 
AAGGTTTCTAAAACATGACGGAGGTTGANATQAANCTTCTTCATGGANTAAAAAATGT ^^ 
AGAAAATTGANAGAAAGGACTACAGACCCCCGAATTATTCCCAATAGAAGGGCATTGCTmAGA 
TAAAANTGAAGGTGACTTAAACANCTTAAAGTTTA^m>^AAAAGTTGTAGGTGAT^ 
NGAAGGCNNCCrrTNAAAAAANANATTAACCCGAAGGGTG>rrTAA^ 
ACNCAGGGAANAATTGCGTNATrrAAAGCCrANrrACNCCTTTTACTTAACN^ 
GAAANATTANTTGGGAAGTGGTAGGATNAAACAATTTGGAAAAAATTNAAA 

SEQ ED NO: 2 1 00 ACTCrCTGTCCACGATCATGGNAACCATCCAGTCCTTGAAGCGGCCAGTG AA 
GAGCTCCAGGCGTGCGGTCAGGTCCCGGTTACCGTACCCTTAAAAGGGGACAAACGAGTGCCCAT 
TGAACTACACATTACAGCTGCCACAGGGGAATTGGACAGTATGTGTCCANAGACCAGCGGNGTGA 
GAAGAGGAGTCCCNTCCTOTCCAGGCACAGGTAATANATCCTGATCCCCAAGGACGAGGCATGGC 
TTGCTGTCACACAATGAGGGTAGGAGGAGTTCGTGTGGAACCCATGGATCCCCTTGGGGGCCrrTCT 
NTTTACCCrrGTCCCAT 

SEQ ID NO: 2101 ACATrCCACATTTTAATAAArTAACCACAAGAAAATA ATCCCACATATA CAA 
GGTCAGGGGTGGGGAAGAGTATTAATGGTATCTTAATTATACCCAGTCTGGririliiiliri 
GGGGGTAAAAATCAAATGCAACCCCATCTTGTTITANNAATmGAAAACTAA^ 
ATGGNCAGNGTTCCTTTCAAACATGTGAGTTCTTTAACAAAAATGAAATAAACCNG 
ATTTCTAArrAATCACCGOTGGCCATTACACAGGTTTTGTTGTrrGGGGTGGGGAGGGG^ 
TTCCCTTTTGCATAATATAGTCNATGCACrAACAATTATGTATATrCAAACTTGAT^^ 
CNAT 

SEQ ID NO: 2102 ACTTCAAGGAGAATTCCCACCACTGGAGCTGGGCTGTGCAGTGGCTACAGAA 
GAAGATGTCAGAACATTACTGGACACCACAGAGTAATGTCTCTAATGAAACATCAACTGGAAAAA 
CCTTTCAGCGAACCATTTCAGCTCAGGACACGTTAGCGTATGCCACAGCTTTTGTTGAATGA^^^ 
NANCAATCAGGAANCATTATTGGGTCGGGAGAGTATTCCTGCCAATGANACCGGANACAGGCrrC 
TACAGCAGGGTTNAAAATCTCCCNTGATGArrGTTGAGTTGAGAAGTGNCCTTGNTGATGTTGATC 
CTTAGAGGAATATOCCCANCCTGAAAGGAGTAAANACNCATTACTGANTGCTCANCACCTT^ 
GANTCAAAATTTTCGAANCCCTTTGAANACCCTGAAWrrGGAOT 
GCCGGATCGTGNTNTrrCTCAAGGAAACC>nrrTTTAANCCN(rTAA 
AGTGGGGAAAACTCCGCrmGGATCACTTGCCCGGGACCGTNGGCrriTNTO 
TTNGGGGAATrrTCGNAAATAAACCCCCGTGGCAANll"rill"riUCCCCAACTTC^ 

SEQ ID NO: 2103 ACGCGGGGCnTmCCCCGGTTGCTGCrrGCTGTGAGTGTCTCrAGGGTGATA 
CGTGGGTGAGAAAGGTCCTGGTCCGCGCCAGAGCCCAGCGCGTCTCGTCGCCATGCCTCGGAAAA 
TTGAGGAANTCAAAGGACTTCCTTGCTCACAGCCCGACTAAAGGATGCCAAA TCTNG CT 
AAAAAAAATAAGGACANCGTNGAATTTTAAAAGTTCCAATNCANCANAATACCm 
CC^rmACTGNCAAANAGAAGGCANAGAAACTNAACCAGTCC^^^TGCCCCCCG^r^ 
AGGACTTGAATTGAACNAACACNCTGmTGNACCTGrmATTTTAAAATNCT 

SEQ ID NO: 2104 ACTTACATATCCTACATTTGACTACATTATTTCCAAACCAAGTATTCCATCCA 
AAGGAACATACTGCTATAATAGAGACCAAGGAGGGACTGTTTAAGGTTGCCAAGGTGAAGCGAG 
CTGAGAGGCTTTGTCCTCATGCCAGTAACTCTGAAATCTCTCrrAATTCCTGCT^ 
GATTGCCATGGTTTCCCCAAGTAGGTANCTGCnTAANCAmTAAAGCCCAATTGTCTGT^ 
ATNAAAGGTCrIOTGAATTmTGAAG^fNGGTGTTTAATTCa^GGNGGACT 
NTCCnTNTTGGTAAAGNGTGGCTAATAAAAANAATCCNCCTTNAAAAGCTGNAAA^ 
NCCnTAAATTGACCAACTTAACTGAATGGGCNTNAGGAAAGTnriTNG 
AACCC^r^C^^TCCCCCCAG^^^GGNTGCCTTGNANACn^^AANN^W 

NTCnTAAAAACAGGAAATCCAAGGGCTGGNTTAANGGNCTGNGNAANACCTGTTTG^^^ 
TAACTTAAACCCCTTNNATAAATTGGGGGAACCCCCTAAAAAAATAAAAN 

SEQ ID NO: 2105 acaagtccttgtagatctcctgcaggagcgggtgaagactcatgtctgtctcc 

GTCrrcnTGATCACCTGCAGTAGCTOCACGTGTTCCGTGACAATCTGTCTGAGGT^ 
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GGAGCAGCTTGGCAAACAACTCTGAGGACTCAGGGNGGTTAANCTTNAGCTGGANCT^^ 

GGTACNAGGTTGTCTTGAlSnrGTCTTNAATGGGCnTNACNr^ 

AAAATANTGNCNGCATTAAATATTGCCAAGTCGTTGCTT1SFNAATT(>JAGGN^ 

CAAACCAAAACrrGGCTCAATAAANTCNCCAAAAGNCTTThn^AGGC^^ 

TAAANCCTONCCOTajAATTTNAAAACCa^ATmro 

AThn^INGGGACTCAATTTTTTNAGGAAAGTACTTGGTC 

SEQ ID NO: 2106 ACGCGGGTAAAGATGTC14"llTITATrrrACllUU"llU"l'AAGCACCAAATTTTG 
TTG' rr r i ' l lT i TCTCCCCTCCCCACAGATCCCATCTCAAATCATTCTGTTAACCACCATTCCAACAG 
GTCGAGGAGAGCTTAAACACCnTCTTCCTCTGCCTTGTTNCTCTTTN^^ 
GTATTAATGTTTTTGCATACTTTGCATCTTTATTNAAAAGNGTAAACTTTC^ 
CATGCCCATATATGAAGGAGATGGGTGGGTCAAAAAGGGATATCAAATGAANTGATAGGGGTCA 
CAKTGGGGAAATTGAAGTGGTCCATAACATTGCCAAAATAGNGTGCCACTAISIAAATGGNGAAAA 
GGCTGTC l ' i ' il ' iTl"l - l 1 1 ' l 1 r AAAANAAAAGTTNTTACCArrGTmTTGGNNAGGCAGGTTO 
CCTACAAGTCTTGANTTAANAAGGAANGAGGGAAAAAAAGAAAANCCCCNTCCCCATATTA^ 
AAAAAAAACCANNCTTGTTTAGGAGTNCATTNAACCCTNAGGAC^^ 
TGAAC^^'GGGCCGNAACCCNCTAANGG^M^^ATTCNNCCNNANTGGNGGCCGN^ 
CAACNN 

SEQ ID NO: 2 107 ACCATCTCAAAGCTAGATGCrCGAATCCAGCAAAAGAGAGAGGAGCAGCGT 
CGAAGAAGGGCAAGTAGTGTCTTGGCACAGAGAAGAGCCCAGAGTATAGAGCGGAAGCAAGAGA 
GTGAGCCACGTATTGTTAGTAGAATTTTCCAGTGTTTGTGCTTGAAGTGGGTGGAAGTGTTCT 
TTCAGCTCCrrrTGrriTATQvrANTTTTATTTrCTXj 
GGGGACCCTCACTTCNNTGAANAT^mTGGT(:OTGGNTTGAAAT^ 
NGTTTTTTGGGNTNCTCCCTITGNTTNGNGTTTACAAAAA 

SEQ ID NO: 2 1 08 ACGCGGGGAGCTGGAGTGCGTTCTGCCGAAGCTTGTGGTTGCACGCCCATCG 

tcttaggggctaccttccgtgaccatgtccaagtctctgaagaagttggtggaggagagccggga 
gaanaaccagcccgaggtggacatgagtgaccggggcatctccaacatgctggatgtcaacggcc 
tctttaccttatcccatatcacacaantgntcctnanccataacaagctaacaatgg^^^ 
acatngnaaaactgatoaattitgnangtgctaaacttttm 

seq id no: 2109 acctaacccagctagtgtgttrrccccaatttcaaagctactcattactgttc 

AGrrTCCCTTTCCrATAACGTTTCTTCTTTm 

ATATCTAGTAAATAAGAAAAATGAACAGGGGGTCCATATGGGAAAGTATTCTGGCTCTGAGAACT 

TCTAATAATGCACAATCrTTGGATTATGGAGACGGNCCTGGTAACTTNNAGAGTG^^ 

AGAAAGGTTTATTNCTCrrCCCCTTTCATTAAAAGCTCTAAAAAAC^^ 

TACAAAGAGACAACTTCGCCAAATm'GTAANATAAATTCTGCCANAAGTTCCACAAAACA 

ATTGAGTTTTTNNTTGNTACCTTTGAAACCAATCATAAAN^ 

SEQ ID NO: 2110 ACCCAAGGATGTCCTGGAGTATGTTGTATTCGAAAAGCAGTTGACAAACCCC 
TATGGAAGCTGGAGAATGCATACCAAGATCGTTCCCCCATGGGCACCCCCTAAGCAGCCCATCCT 
TAAGACGGTGATGATCCCTGGCCCrCAGCTGAAACCAGAAGAAGAATATGAAGAGGCACAAGGG 
AGAGGCCCAGAAGCCTCAGCTAGCCTGATGACAAAAATGACTTCTAGGGGTGAAGCCTGGGTGAT 
GAGGCTGCTGGAAANCTTGAAATCTCCCriTCCCTTCATGCTATAAAAA^ 
TCCATCTGCTCAGGTCITTTTCACCAGTCTNATATTCAQCA CCATGA CT GGTTG 
CANGGTGGGCAGGTATAACATGGGCATTGGACAATTTTTCTTTmAAATT^ 
GANTCTAAGATGAAAAGACANNTNTGTmAAAAAAACATTGGATTOT 
ANGGGACTGTGAGAAACACCCAGCAGCmTCTNTTTTGGAATCAACAGGGCAGGGGA^ 
TTGNAAArrGAATGTTGNCAGGGGTGTNGGAAAAriTmGNTGAGTTCTNCACAm 
TCANGC 

SEQ ID NO: 2111 CCGGCCGAGGTACAAGCAGCnTCGlTGAAG'nTAGAAGATAAGAAACATGT 
CATCATATTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCriU'innTCTGGAGAATAA A 
GTAATCCTCTCCCAAATAAGCACACACATTTTCAATTCrrCATGTTTGAGTGATTTTAAAA 
GTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCnTm^ 
QGTTCACTOAGTAACTAAAATITAGCAAACCTGTGTTTGCATATTTTT^ 

AATrAATGTCATAAGTGATTTGGAGCTTrGNTAAAGGGACCAGANAGAAGGAGCNCCTGCAGNCT 
TTrGTTTTTTTAAAACCTOANAACNTTACCAOT 
ACATCTGGG^^S^mGGAAACAAGTGGN^»J^rITTTTA 
NNTCCNTNNAACAGGNACAGGTGGATGCATTC 
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SEQ ID NO: 2112 acgcaggggtatttgaaaactcaaggttatccagatgttccaggtcctctga 

ACAATCCAGACTACCCCGGCACCAGGAGCAATCCATACTCTGTAGCCTCCAGAACACGTCCAGAC 

TATCCTGGGTCTCTGGCAGAACCAAATTATCCTAGATCTCTGAGTAATCCAACTATTCTGG 

GAAGCAATGCATACTCTGCAGCCICTAGAACAAGCCCAGACCTTCCTACCm 

AATTATAGTGAATTCAGAGTCATTCCTTACCACCGAGCATCATCCANACAACCAGACTACCCTGGA 

TCrCAACGAATTCCTGNTTTTGGCAGGCTCCANCANCAAGTGGNAACTATGCAGGCTCC 

ATCCAGATATTTNGGTCNrrAAAACCCGGCTACCCTGGAGCTCAANAC CACr m 

CAAGAANCCATTTGAACCTTCCAGATCCANAAGATCTGGACATNAAGTITrAAAATC^^ 

ANANTCTTTGGAAAGCTGATNnSfTCrAGGCGCTGACTrCAACCTAACTO 

CCAAACWCCAATGCTGAGGNAATCANAATTTGCAAGCCCTTGGANANAACCTTGT^^ 

GGCTGANA 

SEQ ID NO: 2113 ACCGCCAGCTCTCTGCTCTCCACAGGGCrCCCCGCCCCACCCGGCCTGATAAA 
GCGCGCCGACTGGGCTACAAGGCCAAGCAAGGTTACGTTATATATAGGATTCGTGTTCGCCGTGG 
TGGCCGAAAACGCCCAGTTCCTAAGGGTGCAACTTACGGCAAGCCTGTCCATCATGGTGTTAACC 
AGCTAAAGTTTGCTCGAAGCCTTCAGTCCGTTGCAGAGGANCGAGCTGGACGCCACTGTGGGGCT 
CTGAGAGTCCTGAATTCTACTGGGGTTGGTGAAGATTCCACATACAAATTTTTTGAG 
ATTGATCCATTCCATAAAGCTATCAGAANAAATCCrGACACCCAGTGGATCACCAAACCATTCCA 
CAGCACANGGGAGATGCGTGGGCTGACATCTGCAGGCCGAAAGANCCGTGGCCITGAAAAGGGC 
CANAAGTTCCACCACACTTTTGGTGNCTCTCNCCGGGCAANTTGNAAAAGGCGCAATACT^ 
GCTCCACCXJTTNCCGTTATATAANTAAAAGTTNGNAAAATTCATACTmATA/^ 
CA 

SEQ ID NO: 2114 ACAGTTCTTTCCAATCTGTGTCTTGAACTCTGACAGTArrGTCTCGAAAATCA 
CGCAGATATNNAAACCTCCACAGGAGTGGGTCAmGAAGCAGTAAAGAGGTCACGACAAACCGC 
AGACAAAGACAAGACGGAACGAACATCCAGAAGTCGGAAOATCCGTAGTrrCAGTTCCAATGGG 
AGGACGATCAACCCAAATACATCTGGTAGGTTCAGTGCITGTCGGGTAAAAGCCAAAAG AGGA TA 
CACCAGCTGGTCTTTAAAGAGGCGAGAGAGTTCTGAAGATCTTTGTATATGTTGGCTACATT^ 
CCTAGTTTCrCTTTGAAATAAAAAGATTCTGGGTAGCAAGCTGGNAATCTTm 
TTAATTGNTGATrmAAGTGTCCCATTTACAACAANTCAGGTTTCCCAA^ 

ANTANCNGGATCTGCCCTTTCATA/^AGGATGCTTGTACCTTGTCCGGGGCNGCCTNTTNA/^^ 
GCAAANTTCAA 

SEQ ID NO: 2 1 1 5 ACTCAGTAGTGCCCTGCrrCTAGGGCTCTGAATACGGGCTTAAAGTCATCTTG 
TCCTGCTGGAATTTGCTGTGCAGAGCCATAAGCCTCCCATTTTGTTAGCGTCAGCTAGGCCAAT^^ 
GAACAGACCGGGACCTTGTCTCACACTGATGATACCTCACATGTTGACCGGCTATGTGAACTGCCT 
ATITCCrATGCTGGAGTTTTGATTTTTAACTAACOCAAATCTG 
GAAAACAAAACAAAATAATGCrrTTCGAAATTGTTTCTAGGACTTTAA;^ 
AAAATTCmATTTCAGAATGCAACAATAGATTCCATTAATATAGACTCAGATCAAAACA G^^ 
CTGCTAAGCTAANATAGATGGTGGTrGArrCCACTGGGTTTGGATCAATCAATAACAAACCI^ 
CTTTGACATACTCTNAAATTTTGTTGTTTGGGGGGAGGGNGTGTGTGTG 

SEQ ID NO: 21 1 6 ACATTGACAAACACATAACTGAGGCATTAATACCTCTTTATAATAATGCAAG 

ttgaaatgcraacaaagcata^acacttctgcaaaaattccacaaggcacagttgttcattcaac 

agaaaaagtcaaaaccacitggtttttaaatgaaaatccrrcacatcc^^ 

aataataacttatgatattaataacmrctttaaaaagcaattaccacaaaacagc^ 

gcatgaaaagacttattttcctactatgtaccrragaaagaatagat^ 

tagcataaagctaagctaccagaaaaaactttaacaagttacgtgttitccam 

tcagctrgcttagaccagcncccattaactaanacagan>mattgtccttcg 

aggcgattccagccctggcggcgtt 

seq id no: 21 1 7 actagaagtatacaccacccagcccggggtccagrntacacgggcaacttc 
ctggatggcacattaaagggcaagaatggagctgtctatcccaagcactccggtttctgcctgga 
gactcagaactggcctgatgcagtcaatcagccccgcttccctcctgtgctgctgagcctggtgag 
gagtatgactacaccacctggtrcaagttttctgtggcttaaggaagtgtgaagata^ 
ccagggctaggctcaccacctgtctcctgtcagaaaaaaggtgaagattaagaagctttcaaa 
attctatggattaaaatcatacaaatgggggcrcrrctgaaaatcagtctgggoam 
ttncagtgactggctccagccatgtntatgaccagctoaattccctgtgcagrrcaa^^^ 
tgaaccaaccaacatggtcx>tcatctaacccttgaccttaccagggactcx;antgct^ 
TNCAT^r^ITNCAACTGGCTr^T^^mT^N^ 

TG>nTrTATTTCCTCCTCNTTAACNTCAACCATrGTNANCANNACTO 
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TTTACCT 

SEQ ID NO: 2 1 1 8 AC ACCm'fCAGGGTCGTTAAAGACCACCTCAAAAGACTTGATCTTOT 

ATCACCATGATGGAACTGAGTTGGTmAAGAGTTAGAAATGACGGTGGAAGAAAAAAAAAGCTC 

CAAATCGAGGAAACCCCTTTGCAAAAAATTATTTCACTTTAAGGAATTAAGG^^ 

TTGAGCITAAAAATAAAATAAGATTTAAACAAATGAATNTTAACTACTAGGT ^ 

CCTAAAAAATATACGCCGTTGGTTACACTAAGCCNAArrCAGAGAAAAAGCCTnTI^ 

TGCTCGAAAAAAmATCCCGTCTCACAANCACTCCTTTGGAGAAAAAAGAGGGG^ 

NCOTACCTGNCCGGGCTGTCCTTTTAAATGGGCGANTTTCAACANACCGGTGNGNCGGTACTTAG 

NNGOTCCNAACTTrGGTrNCAANCTrGGNN^ 

SEQ ID NO: 2 1 1 9 ACAAATTTACATTCATGAGGAATGrrAAAAAAAArrCAACTAAAAAACCCAC 
rrrrrCCrGTGACCCATAATCCCACATTTTACAGTGCAGGGGAGAAGGGGATTAGGGGi^ 
AACAAGTCTCTNCCAAAAAGAAATGATGTAAATTTCACATTCCCTCTCCCACAGGATCCA^ 
NGAGAGWAATTTACAATTCATCTnTrCAGCTGTANATTCCTTTGCT^ 
NTCCACATCCATTTCTTOATNrmGTCTTTGNNTGGGNCTTCC^ 
AGGTCTTTTTNGGGCNNTTWrTTC^^ 

A^^^C^T^CTATTCTATCTCCATATGC^^^TAGGNCTGGTAAAAAGAATCCC^ 

SEQ ID NO: 2 1 20 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AQGCTATCCCATCACCCTTTATTTGGAGAAGGAACGAGAGAAGGAAATTAGTGATGATGAGGCAG 
AGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATNAAGATGATGAAGAAAAGCCCAAGATCGAAG 
ATGTGGGTrCANATGAGGAGGATGACAGCGGTAGGATTAGAAGAANAAAACTTAGAAGATCNAA 
GAGAAATTCCITGATCT^TGAAGAACTTTACCATGACCANGNCTT^^ 
GANATTCCCCCAAAAGG^INTTNTNGTGAATTCTAaS[ANGACCTCACT^^^ 
NCTTCGCATTC 

SEQ ID NO: 2 1 2 1 Acn"n"n"rrn"n"rn"i j j i u i iNGGAATGGAGrrrcACTCCTGTTGCCCAGG 

CTGGAGCGCATGGCGCAATCTCGGCTCATCGCAACCTCTGCCTCCTGGGTrrAAGCG^ 

CCTNAACCTCCTGAGTAGCTGGGATTACAGGNGGCTGCCACCACCCCTGNCTAATTTTGTAT^^ 

AANAAAGGGGGGGTTTCTCCATGTTGGNCANGCTGGTCCCTAATrCCCGACCrrANGGGA NC^ 

CAGGAACAArrrCATAAChrmATTAAATTACNGGCATATAATNAACCTh^ 

CCAAATTGNCCCAAAATGCATTTTA 

SEQ ID NO: 2 1 22 ACCAAGGGATGGAAGAAGTAAATATAGCTCAGGTAGCACnTATACTCAGGC 
AGATCTCAGCCCTCTACTGAGTCCCTTANCCAAACANGTTTCTTTCAAAAAAGCC ^^ 
AAGCAGGGACTGNCCCTGCATITCATATTNCACTGOTAAAAGNTGGGTTTTGAAAT^ 
NNTGCACAAATTGGGCCAAAAAAACATTGCCTTGANGAANATTTGi^^ 
ANAAGAATAAATACCG^rmCTGNCCAAAGANATG^^ITATAGNGCCCTGGAAATGNTCC^i 
GNAhn^CTTNGCAGAATGCTTANGGAGANNAAAGTTTGANGNAAACCAACAGGAAm 
ATCACA 

SEQ ID NO: 2123 GTACAATTCATCTAAOTCCGGAAAGCACTTTCAGTCCAAATGCANAAACCG 
TCCCACATGCCCACCAGGAGCAAGCrrCAAAATGTTCAGCTTGCTTACATTAAGCAAGAG^^ 
CAGGGATGTTTCTGAAGGCCCrrGNTGATACCATTATCCTCANTTOT 
TGCGCmGATACCGGNACNGATTrmATTTTGCCTn'GNCAGCOTCT^ 
TANCCrrrrTTGATANCATTCCAAGCITrAAG 
NTTATITNGANCCTmAACnTrATTnT 

SEQ ID NO: 2 1 24 ACGCGGGTGGCTCAGAGCACCCGTATCATTTATGGAGGCTCTGTGACTGGGG 
CAACCTGCAAGGAACTGGCCAGCCAGCCTGATGTGGATGGGTTCCnTGGGGOTGGGGCTTCCCTT 
AAACCCGAATTCGTGGANATTATTAATGCCAAACAATGAACCCCATCATTTTNCC^ 
CAAGCCAOGGACTANCAACCCAAAAACCCAANAACrGCCCTTTCCTTCATATGOT^ 
GCATCTGCTCirCCTGGGGGCCCTATCCAAACTGGATCrrCCCTTACnXjGTTATATC^^ 
ATGGTTGGGACCAAGCCAATCCCrmTTCAmTACTTAATGGGTGGGAACTAAACGTCAC^ 
GNGGCTTTNTChrrGGCrGAAAAAATGGNAAGGCGTGGGGGGGAAm 
GGCCTAATGAAGGGCAAAAAAAAAAAACCATTOrmTCCnTr^ 
ATCCCTTNAAAANANAAGGANTTGCnGCCCTTTTCCANTGGGQCCC^^ 
GTNAACCCCCCATNTTOAGGGAATAAACCCTGGCACTTGGACAA 

SEQ ID NO: 2 1 25 ACAGACAGTCCATCTCTGTTCTGGCCGGGTCCACCGTGGAAGATGTCCTGAA 
GAAGCCCATGAGTTAGGAGGATTCACATATGAAACACAGGCCTCCTTTGTAGGCCCCTACTTAACC 
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TCCGTGATGGGGAAAGCGGCCGGANAAAGGGAGTTCTGGCAACTTCTNCGAAACCCCAACACCCC 

ANTTGTTGCAAGGGATTOCTGACTACCAACCCAAGGATGGAAAAAACATTGANCTN 

AGCTNGGTNNCCCCTGACTTCTTATTCCANAANCTTGAACACT^ 

GANOTCCCTNGANAAGAACTTGNCTGACCCTNNTGNNACTTCTGNGNAC^ 

TGGGANTCACCCCANCCCCAANGCCCTTITGGGGNCCTATTCCATTGGCCCNCCTTTGGANC^^ 

AAACCAAGCNTTTTTCCTNGGAAANTTrmGACCAAATm 

TCCATTANAGCCCCNCCCATTGGGCTNANNGGNATNAAACANTTTAAAATCCTTGGGAAAAAA 

GGAATCCCAAGGNCCAAGTTTOn'GNAAAANCCTrC>rrTNTC^ 

AGCXriTCTN 

SEQ ID NO: 2126 acgcggggagcgacaatattgactcacgcctactccagagtggtcctgagag 

TCCTGGAAGCAGCCGTGGCGGCCAAGAAGCGATTTAGTGTATACGTCACANNAGTCACAGCCTGA 

TTTGTCAGGTAAGAAAATGGCCAAAGCCCTTNTGCCACCTTAAACGTNCCTGTCACTG^ 

GATNCTGCCTNNNNGCTACATTCATGGAAAAAGCAAATN^ 

SEQ ID NO: 2 127 ACCGCAAGGGAAAGATGAAAAATTATAACCAAGCATAATATAGCAAGGGAC 
TAACCCCTATACCTTCTGCATAATGAATTAACTAGAAAATAACmGCAANGGAGAG 
rTAAGACCCCCGAAACCAGACGAGCTACCTAAGAACAGCTAAAAGAGCNCCCCCGTCTNTTGTAG 
CCAAAATAGTGGGAAGATITATAGGTAAAGGCGACNAACCTTNCCGACCCTNGTGATAGCCTGGT 
TG^^^CCAANATANAAT^W^ANTTNAACTTTTAAATTTGCCNCANA>^ 
AATTTAACTGGTANNCCCAAANNAGGGACNACNTrrmGGACCT^ 
GG 

SEQ ID NO: 2 1 28 ACTTGCCCCTTCCCCAGAAAAGCGGGACTTGCTGCTAAGGGTGAAGGACCAA 
GGCAGTTGTCCCTGCGTGGTCTGACACCCTTGAAACGTGGGTGTATAATCAGAGAGGCATCCCTGC 
AATGATTAAACACCAAGGGAAGGCTGCCriTCCCAGTCTGTGACCAGCGCCGGAGTTITGGGTCCA 
CGGATAAAACGTGTCTCTTTTGTCTCTACCAGAAAATGAAAGGAATTGAAAT^ 
AGATTGAAGTGTAATGGCCAAGATTGAAAGGAGAAAGTGGTTGAGGGATAGTGAAGGGAAGTTG 
GAGAAGAGAGTAAAAAGAGCTGCTTACCAAATTGAAATn'GGTGAGATGTTTOT 
GTCTNAGGACCTGAGGTC 

SEQ ID NO: 2 1 29 Acirr iu^ - ri - i ' rri ' i 1 1 1 rrrrn'i'riT r NCCGGGAGGCAANAGGACCAACCC 

TCCAAGTCCCGGGGCCCNTGTCCACCCACCATATCCTAAACCCAATCTTrrCTACC™ 

ANGGTTACAAACAGAGAGGO^AGCAAANAAGGNTGGGGCCCAACGGAGGGGANAAAATTTATAT 

CCCGGGAAACGTGGGGCAACAGCATCATANACTTGAATGAACCCAAGGGCCAAGCAGCCAAGCA 

AGGACTAATTCANAGCACTNAAAAACOCTTANTTAAACCGGGGGGCCCCl^GTGCTAAG 

GCCGGGCCCAAACCCNCnsriTATAGGGTTCACANANAGTCTGGAGTCCACGTCACTAAAGTNTAA 

AATTCTACrCGAAAATGAACCCCANCCTCCCITITGANANATGAATCGGT^^ 

GACAGATCATTTACATAAACCCCCACACTCTCT^^mCT^r^TCT^^ 

SEQ ID NO: 2130 acgcgggaagaagtgttcggagagatggaagaccatgcctgcaaaggagaa 
gtcgaagtttgaagatatggcaaaaagtgacaaagcrcgctatgacagggagatgaaaaattac 
ngttcctcccaaaggtgataagaaagggqgaagaaaanaggcccccaatgctccttaanaggcc 
accatctgncttcttcctgtttggctctgaacatcgcccaaagatcnaaaggttgaacn^ 
ctatcccntnggggattcttgcaaaagaaactgggotgnaatgt 

SEQ ID NO: 213 1 acctggatgaagcatacccagggaagaagctgttgccggatgaccc ctat ga 

GAAAGCTTGCCAGAAGATGATCTTAGAGTTGTTTTCTAAGGTGCCATCCTTGGTAGGAAGCm 

TAGAAGCCAAAATAAAGAAGACTATGATGGCCrAAAAGAAGAATTTCCGTAAAGAATTTACCi^ 

CTAGAGGAGGTTCTGACTAATAAGAAGACACCTTCnTrGGTGGCAATTCTATCTCTATGArrGA^ 

ACCTCATCTGGCCCTGGTTTTGAACGGCTGGAAGCAATGAAGrrAAATGAGTGGTGTAGACC^^ 

CTCCAAAACTGANACTGTGGATGGCAGCPCATGAAAGGAAGATCCCCACATGCTCAGCCCTGOT 

ACTAGTGAGAAAGACTGGCAAGGGTTTCCTAGAGCTCTNCTTACAGAACAGCC CCTGA GGCCTGT 

GACTATGGGGCTNTGAAGGGGGGCANGAOTCAACATTANAACrTNNGCTOATA 

NCAAA 

SEQ ID NO: 2 1 32 ACGCGGGTATCTCTGTTGCTAAAATAGATCCTTTAGCACCTTTGGATAAAGTC 

tgccrrcraggttgtggcatttcaaccggttatggtgctgctgtgaacactgccaagttggagcct 
ggctctgtttgtgccgtctitggtctgggaggagtcggattggcagttatcatgggctgtaaagtg 

GCTGGTGCTTCCCGGATCATTGGTOTGGACATCAATAAAGATAAATTTGCAAGGGCCAAAGAGTT 

tggagccactgaatgtattaaccctcaggattttaotaaacccatccaggaagtgctcattgagat 
gaccgatggaggagtggactarrccttlgaatgtattggtaatgtgaaggtcatgaaancagcac 
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TTGAGGCATGTCACAAGGGCTGGGGGCGTCAACCGTCNNGGTTGGAGTANCTGCTTCAGGTGAAN 
AAATTNGCCACTCGTCCATTCCAGNTGGTACAGGTCCACATGGAAAAGGCACTGCCTTTGGAGGA 
TGGAAAAGTGTANAAAGTGNCCCAAAACWGGNGTCTGAATATOTGCCAAAAAGNAAAAAAl^ 
GAGAAKTNGGGACTCACAATCTGGCTTTTGAAGAAATNAA 

SEQ ID NO: 2 1 33 AGCCGOCCGCCGGGCAGGACATATTAGGCCrnTATGAACACTAAAACAATG 
AGGAAATGTTGGTCATGGGGCAAAGTATCACTrAAAATTGAATTCATCC Ariii iAAAAAACAOT 
CATGAAAGCArrCTGGTGTGAATTGCCATTTTTTTCTTACTGGOT 

CCCTACCTAAAACATrCrCCrCGGAAATTACATGGGTGCTGACCACAAAAGTTTCn'GGATG^^ 
rrAAATATTGNACATTTTGGCTAGTTmCCCGAATTTCTGTOT 

TGAAGGACATCAATGACCATTTGTTCCTCTGAAGTAGTCGOTTGGTCATGAACTTTCTAGTC 
TAGTTACGGNGTCGTTCATGATGGCGATCTATCTTCAGACAGCCAAGGAGGAAAAGAGCCCCCGC 

GT 

SEQ ID NO: 2 1 34 ACAAATAAAATCAAAAAGAGCAGTGTTCTGTTGTATTCA TTTCT GCATGTATA 
GCnrrArrAATrGCTAATGAAAATTAGAACnTITCTGGGATOT 
TAAAATGCCTmCTTCAGTGAAGCCATCmGGAGTTAGTCATTACT^ 
GACTTCAACCTGATATTCCTCtTCrmGGTCCAGACCCTCAAATTrrAA^ 
GGAAAGGCATTTTTCCACAGTTCAGTTCCCTGAAAAACTTCCATCTCCCACTG 
AGGAGTGAAGTAATCACATGCTAGAACATCAGGGCCAATTGGAAAGTCATTATGAACACTTGCAT 
TGGTCGATCTTATTTATCACCACAAGCCTGAAAATGCAATGTCCTGAAAAAGGTGACCT 
CACACGTAArnTTAAAAAGGAGAGGGTAATATGAAGGGGACTGAGGCITGATCACCAAAAATCA 
GCACAATGAAAACAAACCATAATGAATAATGACACTAGAATTCAAATTACCAGATGTTTCAAAGA 
GAAGGGGGTGCCAGTTTTTArrCCGrrTGAACACCACTTTCAAj^ 
GGTTGAGGG 

SEQ ID NO: 2 1 35 ACTTAATrCTTCTGCTAGGATATTCTTrTTCACTTGTAGTTCACAGA 

TCTTAATGGATTCACTAGCTCATAGAAGCGAACAGATACATATTCAGGAATAGAAAGCTGATOT 

AGGAAAAGCTTTTAATTTAATATATTGCTCTTCTGTCAACTCAAAGTCACGCAGGTT^ 

ATCTCCAGCTnrrCTCTTAGCTGAAGATITGTCTCTTCTAGTTGm 

CCArrrCTTGTTTCATTAATTCrrGATArn'GCrGGCATCTrrCTG™ 

ATCTCAATGTTAGTAGCrGCTTCTGGTGAAGTGCATCATTAAGTTrCTCCTCCAAT T(^ 

TGTAAGATAATCCACriTCAAArrGTCGATCATCATAGtriTCTGGGATAGCTC/^ 

TGAATATTATGAAGTAGTrarn-CGTCAATTANCTGCCrGGNGATTCTGACn^ 

TTCTGATGAGGAAATATCATCCGTAGGAACTGTGNTTCTAAACTAATATCTTCAGATTCX^ 

CTAGAGATGGTCACrTTTTTrGGACTCCTTTGAAATm 

SEQ ID NO- 2 1 36 gtgggtcgcggcgagotacgcggggagcacttccttctgagtgggcttctct 

GGGAGCTCTCCAGTGGCACTGCTGGACCTGCCCACGTTTCTGTAAAATCAGGATACGTGGC^ 

taagtagaccaagcgcttcotggcagggaaagcagcgtgcggggaagtcactgaaaagtgctgc 

CTAAGGAAGrnGGAAATAGTCCCCGTrCCAGATTGCCrmGAATTTrAA^ 

AAGTANGTCAGCANCACCTAAGATCAAGGATNGC^r^TCCATTTTTACAC^ 

CTGAAAANACTGTCrmCAGCGtGAACTAAAGTCACAGGCAAATCACrG>mCCANA^ 

GANCTTT^^^GAAACANGTCWATAAGTCTTmGAATGNGTACCNTGCCCNGGC^ 

ANAGNTNAATTCCTANTCACNGNCGGGCNGTANTATTGGATTANGACTTGGTAC 

SEQ ID NO: 2137 ACGCGGGGGGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCG 
AGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAG 
AGCAGCCTACGGCAACTAAAATGCCATrrCACCTGGAACTTGATGGAGGGAGAAAACTCOT 
TGATTTTGAAGACAAAGTATTTTACCGGACTGAGTITCAGAATCGTGAAT^^ 
CAACCTACTGCCTATCrAAAGCACCTCAAAGGGCAAAACGAGGCAGCCCTGGAATGCTrAC^^ 
AGCTGAAOAGTTAATCCAGCAAGAGCATGCTGACCAGGCAGAAATCAGAAGTCTGGTCACCTGGG 
GAAACTATGCCTGGGTCTACTATCACATGGGCCCGCrCTCAGACGTTCAGATTTATGTAGACAAGO 
TGAAACATGTCTGTGAGAAGTTTCCAGTCCCTATAGAATrGAAAGTCCAGAACTTGCTG^^ 
AAGGGTGGNCACGGTTAAAGTGTGGGGAACCAAAATGAAAGAACGAAGGTGTGCTTTGANAAGG 
CTmTGNAAANAANCCAAAAACCCAAAArrCNCCWGGACTGNCATANC^^ 
NAANTGGCCACCATTT 

SEQ ID NO: 2138 ACTCTAGTTCTATGAGGTTCCTCATAATTGTAAGGCACATGGAAAAGCAGAT 
GAGGTGCTGACTCAAGAAGTGAATACCTTGGGGCCrCTGAACCCCAG GCCTGCCTGA CTACCTC^ 
GGGGCTGAAGGATCAATACCTTCAGTGTTGATGAGTGAAACTGGGATCl-ii ri 1 1 1 1 1 u 1 1 iTAGCT 
TTAGAAAAAGGCCTTTGACGGGCGGATCACGAGGTCAAGAGATTGGGACCATCCrAGCCAACATG 
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GTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGATGGCACACGCCTTAGTC^ 

AGCTACTCAGGTGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCTG 

AmTGGCCCACrGCACTGCAC(nX}GGCAACAGANACTCX:ATCTCAAAAATAAA^ 

ACTCrrCCrCAAGArmAAGAGCATTCTAAAATTATGATCANAAAAAGCAATCCTACAACTACT^ 

GGGCCAAAACGTGGArrCCAATGACCTGCCTTGACCCGNGGTTGCCAGAATTTGGCCT^ 

TNGGAAGCTCACGGNCTAATCC 

SEQ ID NO: 2 1 39 ACAGGTGGAACAATCCAAAGTTTTAATCAAAGAAGGTGGTGTTCAGTTGCTG 
CTCACAATAGTTGATACCCCAQGATTTGGAGATGCAGTGGATAATAGTAATTGCTGGCAGCCTGTT 
ATCGACTACATTGATAGTAAATTTGAGGACTACCTAAATGCAGAATCACGAGTGAACAGACGTCA 
GATGCCrGATAACAGGGTGC^GTGTrGTTTATACTTCATTGCrCCTTCAGGACATGGAC^ 
ATTGGATATTGAGTTTATGAAGCGTTTGCATGAAAAAGTGAATATCATCCCACTTAT^^ 
AGACACACTCACACCAGAGGAATGCCAACAGTTTAAAAAACAGATAATGAAAGAAATCCAAGAA 
CATAAAATTAAAATATACGAATTTCCAGAAACAGATOATGAAGAAGAAAATAAACTTGTTAA^^ 
GATAAAGGACCGTTTCCT^^ITGC^GNGGTAGGTAGTAATACTATCATTGAAGTTAATGGCAAAAG 
GGTCAGAGGAAGGCAGim'CrrGGGGNGTTGCTrGAANTTGAAATGGTNAACAmGNGAT^^ 
NAATCCTAAGAAAATATGTTGGNTNAGAACANACATGCAGGACITGGAAGATGTTCT^^ 
CCCCNTNTNAAACTC 

SEQ ID NO: 2140 ACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAA 
AAAACAAAAACAGCAAGrTTACATGATCCCTGTAACAGCCATGGTCTCAAACTCAGATGCT^ 
CATCTGCCAAGTGTGTTCTGGATACAGAGCACATCGTGGCITCrGGGGTCACACTCAGCT^ 
GTGGGTCCACAGAGCACrCATCTGGCTGGGTTATQGTGGTGGTOGCTCTACTCAAGAAGCAAAGC 
AGTTACCAGCACATTCAAACAGTGTATTGAACATCTTTTAAATATCAAAGTGAGAAACAAGAAGG 
CAACATAATAATGTTATCAGAAAGATGTTAGGAAGTAAGGACAGCTGTGTAAAGCTTGAGGCTGA 
AAAGTAGCTTGCCAGCTrCATTTCrTTGGTTTCTTGGGTAm 

AGGTTCTGGTTCATGGATCATATAATGGACCCATCCCTGACTTTGCTGACGCCAGATTCCT^ 

AGATTCAGACATCANATGGGTTTTAGGGCCAGCTTGG NTNTG TCCmGGGCA^ 

ATCTCAAACTCCTCGNCGTCTATTTGTCCGAATAGTAATTTTGTTGTGCNAC 

SEQ ID NO: 2141 GTCGCGGCGAGGTACn'GCCCCrrCCCCAGAAAAGCGGGACTTGCTGCrAAG 
GGTGAAGGACCAAGGCAGTTGTCCCTGCGTGGTCTGACACCCTTGAAACGTGGGTGTATAATCAG 
AGAGGCATCCCTGCAATGATTAAACACCAAGGGAAGGCTGCCTTCCCAGTCTGTGACCAGCGCCG 
GAGTTTTGGGTCCACGGATAAAACGTGTCTCrmGTCTCTACCAGAAAATGA^ 
TAAGAGAAGGGAGAGATTGAAGTGTAGTGCCAAGATTGAAAGGAGAAAGTGGTTAAGGGATAGT 
GAGGGAAGTTGGAGAAGAGAGTAAAAAGAGGCTGCITACCAGArrTGAAATTGGTGAGATGTTTC 
TTGGGCTCGTCGGTCTGAGGACCTGAGGTCGTANGTGGATCTTTCTCAGGGAGCAAAAGAGCAGG 
AGGACGGATGATTGATCrCCCAAGGGAGGTCCCCCGATCCNAGTCATGGNACCAAATTTCATGTG 
CGTNCNATGTGAAGAGACCACCAAACAGG^OmTGTGTGAGCAACATGGTT^^T^ATTTANCT 
GTGCANGOjGNCTGANTCCAAAAAAGAGTTANTCCCCTNGTTCrrGCCC 

SEQ ID NO: 2142 ACAAGACTCTTGACAGTTGTGCTTCTCTAGGAGGTTGGGTT^^ 
GAATTATCrGTGAACCATACGTGArrAATAAAGATTTCCTTTAAGGCAGAGGCTGGTC 
GCTGTTATCrTCTGCCTCAGACAGACAGTATAAGTGGTCTTGTTrCTAAGATTCCrACCACC^ 
CTTTGGGCCAAGTATCCACATCCCCTTGCGTATGGGAGGTGGGTGAAGAGTGTTGGATGCAAAGT 
GGTTATTATGGGAAGTAGCTCGATGGTAAAAGGACAAACACCn'ATCTATCTTAGAGCTTAAGCCT 
GTATGTGCrrATTCCCAAGGGAGATAGAGGTGTTTAATCACAAGGACAGCATGAGTTAGAGGACA 
CTGGCATCAACAGCTGCCACAGCCXJTGCnVCACCAGGGCCAGAGCAGCCCACrGACATCTGC^ 
GTCTTGAGATCAAATGCATCCCATTCTTCATACATTAGAAGGTCGACCTCCTTGAAGCANACCAAG 
TATAGCAAGCCTTTAAAAGGACTACTGAGAAACAGAATCAGAAACTCTTAGAACTCT^ 
CCCTTNAAGNAGGGCTGCAGANNCrCCCTTGGATACCCAGCCCTGGGGAAAGCCTGTOT 
GTCACCCCAG 

SEQ ID NO: 2 143 CGGCCGCCGGGCAGGACGCGGGGGGAGATGATCAGAAATCCTTTCACTGTCA 
AATTCAGAGAAGAAACATGCTGAGAAGAGGCACrrCCTATGCCAGGCCTACCTGTGCCTTGAGGA 
ATACATGTAGGAGACACTATTTGAAAAGGTCGCCACCTTGCTGGAACAACCAATGCAGAGATAAT 
GCTGGGCCCTCTAATCAGTTGAACAGTGGCTTCAAAAAGATAGGTCAAAGTTATAATCTCCTGCAG 
CrGTGCATGTGA(nTrCTrrGGAAATAGGATCTTGCAGGTATGATGAAGGCCTATOT 
CTGCCAAAGGGATCCTGCAAAAAAGCAAATCTGATCATATCCCAGCTCTCAGACCTCrCCACTCG 
ATGTCCATCACTCCAGGGAGGTGCCTTCCAAGGCCTGAAGGTTGACAGATCCCCCAGGATCCCCA 
GGGCCAGCCTACTCrrGNAGCCTNACCTNCNNTGCTCCCTTATGCCAATTCT^ 
CCTGCTTCATTCCTAAATGTTOCCCTCCAGTGATNTTATTTGATCACCT^ 
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CTTCCTGTGACnrmCCACCANTTTCCCTGAGTCAGGAAATGTTNTT^ 
>OTCTGGTTAGNTTNNNATr(>INNTA 

SEQ ID NO: 2144 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTTTATITGGAGAAGGAACGAGAGAAGGAAATTAGTGATGATGAGGCAG 
AGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATAAAGATGATGAAGAAAAACCCAAGATCGAAG 
ATGTGGGTTCAGATGAGGAGGATGACAGCGGTAAGGATAAGAAGAAGAAAACTAANAAGATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGACCAAGCCTATTTGGACCAGAAACCCTGATG 
ACATCACCCAAOAGGAGTATQGAGAATTCTACAAGAGCCTCACTAATGAC TGGGAA GACCACTTG 
GCAGTCAAGCACTTTTCTGTAGAAGGTCAGTTGGAATTCAGGGCATTGCTNTTT^ 
TCCCTTTGCCTTTTTGNGANCATGAAGAAAANNGAA(>rCTTNTTC^^ 
TCTTTGGCCAGCTGTGNTTGAGTTG 

SEQ ID NO; 2145 GTCGCGGCGAGGTACCGCTTCCTTAGAACTTCTACAGAAGCCAAGCTCCCTG 
GAGCCCTGTTGGCAGCTCTAGCTTTGCAGTCGTGTAATTGGCCCAAGTCATTGTITT^ 
CTTTCCACCAAGTGTCTAGAGTCATGTGAGCCTCGTGTCATCTCCGGGGTGGCCACAGGCTAGATC- 
CCCGGTGGTTTTGTGCTCAAAATAAAAAGCCTCAGTGACCCATGACAAAAAAAAAAAAA 

SEQ ED NO: 2146 ACTACGACATTrcrGCCAAAAGTAACTACAACITrGAAAAGCCCT^ 
CTTGCTAGGAAGCTCATTGGAGACCCTAACITGGAATTTGTTGCCATGCCTGCTCTCC^ 
GAAGCTGTCATGGACCCAGCmGGCAGCACAGTATGAGCACGACTTAGAGGTTGCTCAGACAAC 
TGCTCTCCCGGATGAGGATGATGACCTGTGAGAATGAAGCTGGAGCCCACGTCAGAAAGTCTAGT 
mTATAGGCAGCTGTCCTGTGATGTCAGCGGGTGCAGCAGTGTGTGCCANCTCATTATTATOTAN^ 
TAAGCNGGAACOTGTGCTTCATCTGTGGGATGCTNGAANGAAATGANTGGGC^ 
TGGTGGCAGGTOTAAAAAATTNCrTACTOGTOCNGAACATGNAT^^ 
NNACGATGGNTrGAGTTTATATTT 

SEQ ID NO: 2 147 ACATTTTCATAAAATATGAAGGGATAACTACAAACTGGAGTAAAAATGATGG 
TAATTAAAAAAAATCCTCAGTATCCCTAGCrTGTCTATTAACTGTGATAATCTGACTTGA GTC 
TTGAATATTTGGAGTGCTTCCCCAGAATAACCACTTATTmGAAGCTATCATGTGAAG 
TAAAACAAAACAAAAATTATGGTCATTAAAAAACTAGAGAATTAGCCATATTAAGGATTT T^ 
GACTGCAAATTACrrCTAAAGAATCATCAGTGTATAGATrAGAAGTGCTCATTACCTGCAAC'l'l"!"!' 
AAAAAAAATTCAGTTATAGCTGCTTTTGAAGAGGTTTCCATTm 
AAGAACAATrGTTTATTTTTTCTCTTTGGTTTTAGATATTAATG 
CAAAGAAAATATTTTTATAATTAAATAATTTAATGTTTTTCTTCCT^^ 

GTGTTAGGGTATCTGTTACCTTTAAAATGATAAGCTCCTCAAGATTTTTATGTATGTATAAAT^ 

GGGNGTGCTACAAAAGCCTTGNAAATTATCAGNAGTAGTTITm^^ 

NCCG 

SEQ ID NO: 2148 gtcgcggcgaggtactggcititcaaaaaaacagaacaaaaaaaacccaaa 
aaaaagtctctcacacataagccaggtggacctgaaagagccaaggggcaggaacaatggctgt 
ttcagcagctactcaatagcgaagggtcagcgtcccactcccatgccttggcaccgcatcagatgt 
cctggagcccttgataccatcctggaaggctttgtttgaggctccttaataaaagagggcaaggc 
gaagagaaagaagaccaggacagccagaagcaaggaccactggacgccttggctcaggtgcctg 

CATCTCCACAGGCANGACCTNATGGTGGTGGGTCGGGTTCTANAGGAAGGTTCTGCNTGTCCTGG 

GGCCTTTGATGTAGGCAAGCTGGGAGTCTCACCGCTCAGGTTTCCTGGCCAGG^ 

TTTGTCTTAACAATCCCNCGC^W^CCTNCCCGGGCNGGCCGC'ITCGAA 

SEQ ID NO: 2149 A CTl - llTrni - I - ri - lU iU ' l il ' l ' ll ' i ril^ - riN CCNCAGGGGAAAATAACTm 
TGAGACCCCACCAACTGCAAAATCTGTTCCTGGCATTAAGCTCCTTCTTCCTTT 
TCrrCAGNGGNCCCATGAATGCTTTCTTCTCCTCCATGGNCTGGAAGCGGCCATGGCCAAACTTGG 
AGGNGGGGTCAATGAACTTAAGGT^^AAT^^^mCCAAAGCCC3^CCGNTTCTTO 
GACTTGCGGNGGGTGAGC>^CCCGNTTTTTGGTTTCCCACCACACANCCmC/^ 
TTTGGTCCCTTCACCATNGGGGACAAAGCCNCCCAGAGGG 

SEQ ID NO: 21 50 ACCAAAACATTTATGACCTTATAATTTTATAGTGCAAGAAAAAGGACAAAGA 
CAGGAATACAAATAAATTATAATCTAAAGAGTTACATATAAAATGTCCrrGArrAm 
CTGCTAGAAAAGTAACAGGAATGTTATCAAGCCTTrcrAGACATTTTGAACA^ 
TGCATTrCCTQGTTGTATCCTCAOCTTAACAGGCACrcGAGACAGCCAGCTTCTrCAAATC^ 
TGAACACCTGCAGGCTAACATCATCAGTTAGGATGGGTGCTCCAGTTTCCTGTCCCCAAGCATACA 
GGTTATTGTGTGTCTGAGATGGGrrCACTTTGGACAAAAGGAATCGAGCCTGACTGCCTCCATG^ 
CCGTGTTGATGTAACGTGGCATCGGGAAGCGTGCrrTGCAGAATTTCTTGAGCATCATCCAGTGGTG 
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CCTGCAGAAGGTGCTTGAAGTTTTCATACTCGGGCATTGTCCTGGTAGCCAGCm 

GCTATGGNCTCACCAAGATAAATGACAATTTGAAAGAAAGTNTCCTCANCAAAATTCTGCAGCT^ 

GAATGCTGNTGCTATCCAANATCCTNGGCCGNGACCACGCTNAGGGCGAATT 

SEO ro NO: 21 5 1 ACAATCAATAAGTCTTAAATCTCTCrrCCATGGATTTCCCCCATCTCCCCACTT 
AGCAGTAAAAGGCATTTTTAGGTTrATATAAACACATTCTTTACAACTGCOT 
TACTGTAGTATATAAATAGCACTGAGCATTTAGTAAATAAGrrACAGGCCAATAAGTTCAATTGTG 
CAGTTGTGAAATCTAATGTCAAAGAGTTTTTGAAACCITAAGACTGGGTGATCT^ 
ACATGTCrrCGACAGCCTTAAGAGAATAlTrGGrrrCCTTCTTACTrCGACCTGC/^ 
CTCTGTrTATTTGACTAGCTGCCCAACTAGCATATTTCATGTTGNCCTTTT^ 
GrrGGATTANNAAGCAATATGACTTGGCGGANNTATAANAGTCCANACAANGCTCTCATATTTC 

GGNGTTCAGTTTC 

SEQ ID NO- 2 1 52 ACGCGGGGACTGTCAGGTGACGCTTCCGGCGCAGAAAAATGGCAGCCGCCG 
CTCCGGACTCACGTGTGAGTGAGGAAGAAAACCTGAAAAAGACCCCAAAGAAGAAGATGAAAAT 
GGTAACTGGAGCCGTAGCGTCGGTGCrrGGAAGACGAGGCCACAGACACTTCTGATAGTGAAGGAA 
GCTGTGGATCGGAAAAGGACCACTTTTATTCTGATGATGACGCAATAGAAGCTGACAGTGAGGGT 
GATGCTGAGCCCTCTGACAAAGAAAATGAAAATGATGGAGAATCAAGTGTTGGGACTAATATGGG 
CTGGGCAGATGCTATGGCTAAAGTCCTCAACAAGAAAACTCCTAAAAGTAAACCTACTATTCTGG 
TCAAAAATAAGAAGCTGGAAAAGGAAAAAGAAAAGTTAAAGCAAGAAAGACTAGAGAAAATAA 
AACAGCGTGATAAGAGGCTGGAGTGGGAAATGATGTGCAGAGTAAAGCCAGATG1TGTCCAAGA 
CAAAGAGACAGAGAGAAATCnrCAGAGAArrGCACAAGGGGTGTGGNGCAArrATTTAATGCTGG 
TCAGAAACATCAAAAGAATGTTTGATGAAAAGGTTAAGGAAGCTGGAAGTTTATGANAAANC^^ 
CTAAGTTGATATCAACTGTTTCC 

SEO ID NO: 2 1 53 ACGCGGGGGGGGCTCTGCGTTCTGTAGTGGCGCTGCT TGGGC CCGTGGCGGA 
TTGTAAGCTGCTGGTTTTGCGGCTGGGAAGAGCGGCGAGAGGGTTCGGCATTm 
CCGCAAGGATGAGTCCTGCCAGAGAGTCTCACCCGCATGGGGTGAAGCGTTCAGCCTCCCCAGAC 
GACGATCTGGGATCTAGCAATTGGGAGGCAGCAGACTTGGGTAATGAAGAGAGAAAACAAAAGT 
TCrrcAGACTTATGGGTGCAOGAAAGAAAGAACATACTGGTCGTCrTGTTATAGGAGATCACA^ 
TCAACATCTCACTTCCGAACCGGGGAAGAAGACAAGAAAATTAATGAAGAACTGOAGTCT^ 
TCAGCAAAGTATGGACAGTAAATTATCAGGAAGATATCGGCGACATTGTGGACTTGCTTCAGTGA 
GGTAGAAGACCATAATGGANAAGGTGATGTGGCTGGANATGATGATGATGACNATGATGATTCAC 
CTGATCCTGAAAGTCCANATGATTCTGAAAGCNATTCAAAGTCANAAGAAAGAAGAATCTGCTTG 
AANAACTCCAACCnTGCTTAACACCCTGATGAAGTGGAGGATCCCAAAAACAAAAAAANATGCAA 
AAGCAATTA1TAAAATG 

SEQ ID NO: 2154 ACAAAGGAGGTTATAATGGACITAACCAGTGTTTGACAACTACTGACAGCAA 
GATArrTCAGTGTGATAAATGTGTGAAAGTOTrCATAAATTTCCAAATGTAAATAGA^ 
AAGACATACrGGAAAGAAACCmCAAATGTAAAAACCGTGGCAAATCATTrTGCATC 
AArrAACTCAACATAAGAAAATTCATACTAGAGAGTATTCTTACAAATGTGAAGAATGTGGTAAA 
GCCTTTAACTGGTCCTCAACCCTTACTAAACATAAGATAATTCATACTGGAGAAAAACCCTACAAA 
TGTCAAQAATGTGGCAAAGCTTTTAACCGGTCCTCAAATCTTACTAAACATAAAATAA 
GGAGAGAAACCCTACAAATGTGAAGAATGTGGCAAAGCTTTrAACCGGTCCTCACCCTTACT 
CATAAAAGAATTCATACAGAAGAGAAACCCTACAAATGTGAAGAATGTGGCAAGGCCTTTAACCA 
GTTCTCGATTCTTAATANACATAAGAGAATTCATATGGAAGATAAACCCrCCAAATGTGAAGA^^ 
GTGGCAAAGCCrrAAAGTTTCTCAATTTTAAAAAAACATAAAATAATCC^^ 
ATACAAATGTGAAA 

SEQ ID NO: 2155 GTCGCGGCGAGGTACAmCATTTTCTACCCTATCAATTTAAAAAATAAATGT 
CTGTGCTCTGTCATrrCCATCTTTACCTGAGTAATATGAAAATACCTAAATTTAAGTG^^ 
AAAAGGGTTATGACTAATTTTTCCTCTCACAGTTTTTAAA^ 

GGAAAATGGGAAAGAAGAATGAGTGGAAGAGTTCAGTATCTCAATCATOACrrAATTAAAAAGTG 

AClTCTGTGACATCCTrmACAAGTTTACCTrACCTATCCCTATCCTCAAGGAGA^ 

TTAATATAAACTGCTCITAACrCTATCTGAGCTTGAAAGATAATTTATAAAAAAT^ 

TCACTCTirAATTTCTCCACACCTATTCCCACAAAGCCAAACTTACAAACATAAGAA^ 

TAAGCATCAGATGGTATGAATTCTCAACAGGTATTACAGCGTAArrCTCCAGTGACATATTACCAA 

AGCTGTAATCTATGTAAAATATCAAGTGTCTACAGGTAATCATAAATTTAAGCAGCCTGTAGCAAA 

AArrCAGAACACCATTirrATGAAAATATCCATTTTTAACAGCAAAA^ 

GTTGGTTGGCATGGCAAGTGC 
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SEQ ID NO: 2156 ACGCGGGGGCTGCCGTCGCCGCCGCCATTtTGATGGCAGGAAG AGTC CGGTT 
CTGGGACAGCTGGAGACAGTGGTGGTGACTGAAATAACTTTACCAAAGGAAAGCTATm 
CTATCTTCTCCAGCGGAGATGGCCAATGTGCrrrGTAACAGA^CCAGACTGGm 
GGATTrrGCTCTTTAGTTAAAAGGGTTGCAATCCCAAAGCCTTTTCGACT^ 

CCGGATGAGTCTCATGTGGCTGCTGTCCTCCAGATATATGCTCTCGAACAGTGTGG CCTGA TGAAA 

CTATGGGACC>rrrrGNACCrCAAAATCANAAGrrCCANArrCCTGGGAACATANGGT^^ 

TACT 

SEQ ID NO: 2157 accttgaaaagacactgaaagcatittggggtgtgaagtaagggtgggcaga 
ggaggtagaaaataattcaattgtcgcatcattcatgottctitaatactgatgcrcag 
gccttagaatatcccagcctctcttctggtttggtgagtgctgtgtaaataagcatggtagaattg 

TTTGGAGACATATATAGTGATCCTTGGTCACTGGTGTTTCAAACATTCT GGAAA GTCACATCGATC 

AAGAATATTTTrrATITrrAAGAAAGCATAACCAGCAATAAAAATACTATrrrrGAGTC^ 

AAAAAA 

SEQ ID NO: 2158 acctgaactcagcagtgacatagacagcagcaatttcgatgacattgaagat 

GACAAAGGAGATGTAGAAACCTTCCCAATrCCTAAAGCITrrGTrGGAAATCAGCTGCCTTTCA^ 

ggatttacctactatagagaaaatttattattaagtgactctccatcttgtagagaaactgattcc 
atacaatcaaggaaaaatgaagaaagtcaagagattcagaaaaaactgtatacattaqaagaac 
atcttagcaatgagatgcaagccaaagaggaactggaacagaagtgcaaatctgttaatactcgc 

CTAGAAAAAACAGCAAAGGAGCTAGAAGAGGAGATTACCTTACGGAAAAGTGTGGAATCAGCAT 

taagacagttagaaagagaaaaggcgcttcttcagcacaaaaatgcagaattcagaggaaagct 

gatcatgaagcagacaaaaaacgaaarrtggaaaatgatgttaacagnttaaaagatcaacttga 

agarmgaaaaaagaaatcaaaccmaaattccctgagaaaaogggantcacctcc^^ 

ACTGG 

SEQ ID NO: 2159 A Crrri - ini ' ri4 ' r Jl Ul ' 1 7T f T TGGTACCTGAATAATCAGGNCTTTATTCAAAA 
NAAGCTGTCCAAAATGATITGACCTn'ATGGAATAATCAAATTTAANAGrmrrGCA^ 
CTThrrCCTNTrGTAGTAGGTrrcmCTOGCAGGOT 
TCCnTTITCCNACCAGAGGCTTCTTmjCTTNT^ 

SEQ ID NO: 2 160 ggaagccggcgccgggcaggtactcttgatgaaagaccgtgaaaccaacaa 
atcaagaggatttgcttttgtcacctrrgaaagcccagcagacgctaaggatgcagccagagaca 
tgaatggaaagtcattagatggaaaagccatcaaggtggaacaagccaccaaaccatcatttgaa 
agtggtagacgtggaccgcctccacctccaagaagtagaggccctccaagaggtcttanaggtgg 
aagaggaggaagtggaggaaccaggggacctccctcacggggaggacacatggatgacggtgga 
tarrccatgaattttacatgagttcrtccaggggaccactcccataaaaagaggaccccnccaaga 

AGNGGGGGTCCTCCTCCTAANAATCTGCCCTTCAGGACCAGTTCCNATTANCANTGGA^ 
GGAAACCrCTGTTCACNTGGAA 

SEQ ID NO: 2 1 6 1 A CrnTni ' lTni - l riTl 'l J 1 1 1 ll I GACACACCTGCCCrrTATTGGTCTNTTCT 
ANCAAAGNGGCTCCAGGCCCTTCACGCCTTTNAAACACCACCCATGAGGGTrrAGGAAGGNGCCA 
TCArr(n'GNGAAGGCCCAAAGCnTACCCAAKrNTTGGAGCCX:AAGTTGAATCACCAACCA>^ 
TTGGGAAAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCrCTGCATTGAAGGGGACTGATN 
TCAAGGGA^r^GCTGAGGTCCACCAGTGT^^'CCTGAAGGCATGCTGCATCCTAAGGCTCCrmAGGA 
CTGGATGGAOTAGNAAATCTGNNTGTTTAACAKITCACATmTNTATNGGCAAC^ 
CTTGATGTCAGGCTCAATTGNTTATGGTTGGNAAGGGGCX5GCTGTANCNA 

SEQ ID NO: 2162 ACATAAGCTATTCAAGArrTCTCCAGCACTGACTGATACAAAGCACAATTGA 
GATGGCACTTCTAGAGACAGCAGCTTCAAACCCAGAAAAGGGTAATGAGATGAGTTTCACATGGC 
TAAATCAGTGGCAAAAACACAGTCTTCTTTCTrrcmCTTTCAAGGAGGC^^ 
TGGTCACCTCAACATAAGGGGGACATGATCCATTCTGTAAGCAGTTGTGAGGGGGTAGAGATGGG 
ACAAAArmGGTCTCAGAGGTCrTACCATCTTAATITGGTAACTTCTAArrGAi^^ 
ATAGAAATAACATTATCCAAAGATATCTTAAAGCTGAAAACTTGAACAGCACATTTrrrGTTTTTG 
rrGTTGTTTGGCTAACTCCTCCTGGAATCACCTTTNTGGTTTAGCrAGT 

SEQ ID NO: 2 163 ACAGAGAGGTGGGCCTrGAAGCCAATAAATACAAAGCTTCCTCTGCCTTGTA 

acaaaggcaggttggcatagggccctggagcctgaatgcx:catgccx;actttgggatggggagca 

GCTATCACnrCCTCTGCTGGCITCACTrGCGTGGaSGCCTCrAGCTGGCAAAGTANCT^ 

GCACAAATTGCTTXaGAGAGAAACATTTGTAGTCrrGTTGTArrCCTCCAGNAAGCCTrGGACTCCT 

AAGNTGATTTCCACACACrGGTCTTGCTGOTGAATGTGGATCTGGCCAANNCGCTTO 
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SEQ ID NO: 2164 ACATTmGTTACAGACAGAAGGCTGATTTTGGAAAGAAAGAAACAATAOGA 
TGTATAGTGCTrrCrATCAGCAGACTAGTATGTTTAAAAATAGTCTCATCAAGGGrrCT^ 
AAATATAAATGTTGCCAGGCAGTCCCAAACTCACATTTGATATTAACTGCAGACTCATTTA^ 
GAAAACTGCTCCAGCCTCTCTCAATCTTTACTAAGGACrGGGAGATTATCA^ 
AATAATGCrmGAATTAAGATTATTGAAAAAAGGAATTTCTGTTCAGTGCAAAGAACAT^ 
AGAAATTCATGTCTACAAAGGAGTTAAATACAGATGTCCCITCATTACAGATACrrATTCTTGCT 
AAAAAGGTAGAATTATCAAGACATTGTAAATAATATGGGTCAAAAATATGTGTACAGAGTTCTAC 
AGGGCACCTTATATAAAGNTCAATCCTCAAAGTATGThrmGACACTCTCCAAGNGAGGAW 
TTTrTTGNAAAAACATTANACACACAGTNGCCrmATTArrCCA^ 

SEQ E) NO: 2 1 65 ACGCGGGGGACGGCGGCAGTGCGAGAAAGCCGAAGATGGCGGTCCCCGCGG 
CGCTGATCCTACGGGAGAGCCCCAGCATGAAGAAAGCAGTGTCACTGATAAATGCAATAGATACA 
GGAAGATTTCCACGGTTGCTCACTCGGATTCTTCAAAAACrrCACCTGAAGGCTGAGAGCAGm 
AGTGAAGAAGAGGAAGAAAAACTTCAAGCGGCATmcrrCTAGAGAAACAAGATCTrCACCTAGT 
TCTTGAAACAATATCATITATTTrAGAACAGGCANTGTATCACAATGTGAAGCCAGCAGC^^ 
GCANCAATTAGAGAACATTCATCnTAGACAAGACAAAACTGAAGCATTTGCAATACGTGGTC^ 
TATGGKTCAAAGAAACAGTTGAAAAGTrCCGGCANAAAATTCTGGCTCCTGTANCTAGAACCGGT 
TGATTGGCANCTTACCTTCAANTGGCTCACTCTGTTCANCNAAACrAAAATCTCCTO 
TACAACTCGAGTGAACAATGAANATrmAAAGACCTGGANAAANTCTTNGTGGTA™ 
AAGGAGTrrmATTTTTTNTACA 

SEQ ID NO: 2 1 66 acatcttgaccttcttggtctctgaagtattcagccacagtcagcccagtcag 

AGCTACCCGGGCACGAGCACCAGGTGGTTCATTCAmGACCATATACCAGCGCTACCTAANAGG 

tggcatctnataagttgataacaccagattcaatcatttcatggtataaatnattgccttc^^ 

GTCCNrrTCACCAACACCAGCATAACACATAGTAACCCC™GGCNTTTC 

SEQ ID NO: 2 1 67 ACCAGCTGTGGGATTTCGTCTTCGGATTCATTTGTTGCmAACT^ 
CCTCTGCCACGTTTTTTCTTCCCATGCTCTGGGG^ 

AGTATCACCACTTTCAGGTGCCACATCATCTTTACTAANAACTGATGCA^ 

CCTCrirri'CU'rCCTCTCCnTITGTTTTTCAAAATTrcriTCT 1 1 11 iC 

TTCTTAlTAAGCAAAAAGATCTTTTGGTGGCnTCATCCCAATrGCTGANAA^ 
GGGAANGTGCANCCCnSfATTTCrCANTThriTAAAAAAAGGGAGTTTCACGCTOT 
GGCAAATNTTTNTGAAAANTCTTTNGGGGAAAANCAANAATGNTT 

SEQ ID NO: 2168 Acr ri ' i - iTn ' mTiTi 1 1 1 1 1 'ii'ii iittcggaaactttttatttatatttnggt 

CTTACAAATGATCACTTITAAATGGACTmCTGTAAAAATGTAAAACTCAAAAAm 

TGTATm-GATCCACACAAATCCCTAAAAAGGTTTTCTGTGTAGTCTTCATTAACN^ 

GAATGTITCACTCTTACTGTAGGATCTTGAATATGrmACAATAATOAACTCAAAGTT^ 

GGGCATTAATTGTAAACTATAAATAACATTTGTTTAAAAANAAANCrGGGTAATANAAAAATO^ 

AAANACTCTNAGGAGCNGGCANTCTGTTGNGGCTCAANATAThrrrAm 

NCATCCTHTSIAAAGAGCACACWGGACANGNGGCNTNGGAATa^TCTGNTTGTAANC™ 

AAAAATCNNGTNGTNGTAAAACNGGACCATTATCA 

SEQ ID NO: 2169 ACTCCATGGCAGACTTCACAGGGCCTTGTCCATTGGAAATAGAACGCATCAA 
AATCGAGAGTCTTCTCTACAGAArrGCCTCATTTTTGCAACTGAAAAATTATGTGCAAGCT^ 
AGATTTTAGACATGTGCTGGGAGAAGGACTGGCCAAGGGAGAAGATGCCTTTCGGGCAG^^ 
GCTGCATGCAGCTGAAAGGGAAGCTCCAACCTGTATCCACCATTCTTGCCAAGTCACTCACAGGA 
GAGTCCCTGAATGGGATGGTAACAAAGGATTTGACAAGACTAAAAAACACTTCTCTCAGA^ 
AGGTAAGGCCATTTCAAGTGGATCCGmGGAAOAGTITGACTCAATCTGTCTAGTAAGCATTATG 
TATTAAAATTTGGAAAACCATTGTGGCATCANCCAAAANGCNAGGCTGTGAAAGNGGGGCATTGG 
GACTTGCCAACCCTTCCTTGTACCCCATAACCCACATTCCCATCGTGGGAATGCTGCTT>n' 
GTCTCCTTTGGGGGCCANGGCTTATCTThTOTCCCTATTGGAC 
NCTCTGGANACCANATGGGAAAAATGAATTGGAAAATrrANTTrrrAAO^ 
AAANGAAAT 

SEQ ID NO: 2170 ACCGTAAGATGGGAGAACCGGACCATGAACTGTATTGTCAACGTTTTTGCCA 
TrGCTATTGTTCTTTAACTGACTAAAAATGTTGGGCTAAAGCCATTAACTTAAGAAm 
ATCCTTTCCAAAAAGAGTAATAGTTGTTTACTAGTGTGCTANATGAAAAGCGTGCAATATGCm 
AAGCTTCAACAAAAAACTGAATATTATAAGCAAAGCATTATCNTAGTAATTG^^ 

tattctnntcancatc>™aaataggaaaaatttaotgctan^ 
ggnntgacttgaaaaatnccatcttatnttgacaccancttm 
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GGGCGGGCGNTCNAAAGGGGCGAATTTCC 

SEQ ID NO: 2171 ACTGTCrCTTrTGGAAAAGTTCTTGATCCCCAATGCnTCACAAGCA^^^ 

AAGTCTTCTATTTGAAAATGAAAGGAGATTACTACXGTTACTTGGCT GAGG TTGCCGCTGGTGATG 

ACAAGAAAGGGATTGTCGATCAGTCACAACAAGCATACCAAGAAGCTTTTGAAATCAGCAAA/^ 

GGAAATACAACCAACNCATCCTATCAGACTGGGTCTGGCCCTTA ACrTC rCT^ 

GATTCTGACTCCCCAGAGAAAGCCTGCTCTCTTTGCAAAGACAGCTTTTGA^^ 

ACITTGATACATTAAGTGAAAGAGTCATACAAAAGACAGCACGCTISriTAAT 

ACAACTTGACATTGTGGACATCGGATACCCAAGGAGACNAANCTGAAGCCTGGANAAGGAGGGG 

GAA^^mAACCGGCCITCCACTTTTGT^m3CCTCAT^OT 

^^^GCCTGCCCANAAATAGNT^Tr^GTTTCAr^ATGA^INNGGr^OT 

ATTTCCNAThrrGGTTTTTANGTTAATATTNAGGGGAGTAGA 

SEQ ID NO: 2172 A Cn TlTrn' i ^ i ' r J' nT J-iTl J l'l l T N AACAAAAGCAACAATnTTATTATCrrG 
CTITATATTTAATGGATTAGAACTATAAAGArrCTrAACTITGAAAGC/^ 
GTAGTrGCAGATCTrrAATACCATmCAATTTCATTTATGAGCTGCTCAT^^ 
TAAAATAATAATCGCTTTTGTTGTTGTTGTTATAAAACAATGAAAATrCCTGTrCGGAACAC^^ 
TGCT>mTATATTTGCTTGTCCTCTTAAATA>rrATTGANAAAAAG 
AGCCCCTTC 

SEQ ID NO: 2173 ACGAGTCTGAGGCGGAGGGAGTAATGGCAGGACAAGCGTTrAGAAAGTTTCT 
TCCACrcmGACCGAGTATTGGTTGAAAGGAGTGCTGCTGAAACTGTAACCAAAGGAGGCATTAT 
GCTTCCANAAAAAATCTCAATNGAAAAGTArrGCANGCAACANTAGTCCrcrc 
TNNTAAAGGAAAGGGGTNGANAGATTC 

SEQ ID NO: 2174 ACAGCnTITAGCAAAACTG(nTrCCCAGAAAAGCAAATTAAAATAATGCA^ 
CAGCAGATCAAGGAGACTACAGCTAGACATCAGAGCTACAACTTCTCA^ 
CCTATTTACCCAGAATAGACTAAAATTTCAGGACAAAACAGGGACTITITAACAr 
CCCATTTTAAGTTCTrmATCrCCCCCACCCCCCAAGCCACTAATr^ 
GAGGGCTACrTTTAGAATrGGCTACnTrATTrrrCTT^ 

TITCAGGCTCATATGCATCCATGACCAGTGTCCACTGTGGCAAAAAACTTAGCAATGGACm 

AACTCGGCTTCCGCTGTGCTTGTTCTCCCACTGATGAAGGCACrCAAGNGTriT^ 

AANACTGGAGGACAGCTGCCAAATTCATCAGGATCCCATTCTGGANGACnTrCCr^^ 

TTCCAANAATTTTITACANTGGACCATCCNCTTGNNAATT^ 

ATCNACAGCCCGTACGGGCCnTCTGNCCG 

SEQ ID NO- 2 1 75 ACGCGGGTGAAGATAAGAAGTTTCrTGACAAATACATGCCCCAGTTCATGAA 
ACATCirCATTATAGAATAATTGATGTGAGCACTGTTAAAGAACTGTGCNGACGCTGGT^^ 

agaatatgaatttgcaccaaaagaaggcttgcttctcataggggcacttgatgaca™ 

AGCATCAAANGAGCriTCANTTTTACCGAAATAACATCTTCAAGAAAAAAAATTGNTGAT^^ 
GNAGGAAAATTATNGAAAAATGGGGNAANATGAAAAACCGGT GAGTTNNTG CCANN^^ 
GCTGCCKCTTCNATITGTArrbn'GGAGGCAACTTTNT^ 1 TCTNNCNCCTGATG 

GCTrTGGCAAAGCACCCTTCGGGTTATCCTTGCATCTNAA 

SEQ ID NO: 2 1 76 ACTG AGAATGCCGTrCGGGGGCCTTTATGOCGACGTAAGAACGGGCTTGGAC 
TOGTCTGTGAATCCAGAATCCAGAGGTGCAGGTAGCACTATGGATCAGGGTTAGCCTCGGGGGG 
(XAAAAACACGGCTTCAGTTrcrCCCCACTCTCACrrAGTGTTAAAGAGTGGCAGAGGTGGGGTO^ 
GGGGAGCTTCCCNAAAGACCTGCT 

SEQ ID NO: 2 1 77 Acr rr ri- i "i- nT i"i-n"i' i i'rn 1 1 1 1 lAAAAGTrAACGCATATTTGrrnTATTTA 

TAGGTAACTACCACATGAATTATAAAGACAACAAAGGATGTCAGAATGAACATGGATAGGTGTAT 

GCATACTACGGCTAAGGAGAAACAATGTTCCTACATATTATGGGTAGTGAGAACATTATCTGTATA 

ACAGGGAACTGTOATTATTTAAAAATATGCAGAACTTATTTCATCTGTGCnTr^ 

TCAGTGrrATAAGTTGAAAAGAACTCAAAATACTAATACCAAATOTACACCTTOTNrrAN/^ 

AAAAAGCTGCTTTCTGTGAAGTCAATCAGCTTTATTAAAAAATGACACAAATC^ 

CATCTTOTATATAAAGGGGACATTGTAAGTTCCrrGCTGCANTTAAACCCAT^ 

AATTCCCTTTTAATTATCATTTAAACAGAANCrcCCAATAGTC^ 

CAAATCCACTNNTrCACTCCCCTTrmAAGAACTAAACAGGTATTCGGGTA^ 

cataan™gnttgti^aatttttncaagggttngggcaaac^^ 

SEQ ID NO: 2178 GTACAAACAGCACirrTACCmGCCATAACCTCAGGATCAGGATCTTCT^ 
GGATCAGCCCATTCAACAGrrCCAACATTCCCCCANACCTTGACmACCACTCAT^ 
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CTTGCCTGGGCAGCTGTTrrGTGATCTTCATATTCAAGAAAGCAAAAGCC^^ 

CATCCGGTGGTGGGTATAAAAATGACGTCTGGAAAGAACCCTCTGTTACTTTGCTAAANTCm 

GA^^^Cr^GTNCCTTGGTTTNC^CNTTAGGAANTAGAAGCCCACAAAAAG 

SEQ ID NO: 2179 Aa" l - Il -i4 1 - ll " rilU ' l i i m TrGGGTTTGGNATGCNCTATT^ 

TATTTACTAGTCTCAGTAATACATTAGTAAAAATCATGTCACTTAATTAATTGTGTTANAATC 
GAAACATAGAGTTGGGCAATATACTTNATCCCTACCCATCCNACCCAAATCrrACTCTO 
TNATTCTCANTTAATTTTGGGAAAANOTCCANAAAGATGTGTT^^ 
. AAATAAGCmTTGANCCCTNCCAANCCCCCATCCCCA 

SEQ ED NO: 2 1 80 ACGCGGGTGCATGCCTGTAATCCCAGCTACTCAGGACACTGAGGCANGAGAA 
TTGCCTGAACCCGGGAGGCAGAGGTTGCGOTGAGCrGAGATGGCACCACTGCACrCCTGCCTGGG 
CAACAGCATGAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAATGGAGTTTCAGGATCGTGGAG 
AAATGATGAACCATATCATGTCCGCTGGGTCAAAAGCATTGACATGAAATGACACCCACTGTGTT 
ATTTCAATTTCTAAAATGCCTrAGAAAAAAGACCAGAAAGAAATCTGCCAGGATNTCCACAAGGG 
TTGCCTTTAATGGTGAGATTATNGGAAATACTGTTGGTNTANACTTTGCCAAAT^^ 
OTGCAAATCCTTITNGCACNATTNCANCrcCCAGNAAAAAAAAANAATTSrAANCC^ 
AAGNACCGCTCTT 

SEQ ID NO: 2181 ACCCCCmCCATAGAAGGGGGAAGCCCTCTTrCTTGTCTGCCAACC€GATCG 
CGACCACrrGAGTAGAGATCACrrCGGCTGCTTGAGTAACTGTCTCGACTTCCACCATATCCGTCA 
CGTGAGCTGCTGTAATCATCATAGCGACTGCTTCCACCATAAGATGGCGGGGGCCCTCGTGTAGGT 
GGAGCACrACGTNAGTACCATAACTCrTATATGAATCTCTGTAGGAACCTCCACrrGGATGAA^^ 
GAATAGTCNCGATCACGACCATATCCATCTTTATCCGCTATATNCrrCTTGAATGGATAGNCATNAC 
CGTOAACTGGGAANTGACCATTAATNACGGTAAAGTATNAATCTCGNGGNGGGTGGNGCATAATC 
CTCTANTTTTACGAAGAACCTTNGGGNAATTCTTTNGATTGAATNGCTGTh^ 
CATNATNTCnTrGGGGACAAANTAAANATTTCrAACAANAAGGCAANTNGGTTCC>^ 

SEQ ID NO: 2 1 82 ACTrCAGTrGGTGCACAAAATACTGTCATTTGCTCAAAGCTGGTTGCCAAATG 
TTTGGTGATGAAGGCAGAAATGAATGGCTCAAAACTTGGGAGAAGAG(> lAAAC CTGAAGGGGCC 

ctccagacaatgatgggcttratgatcctgctgcgatgnagagccggctnttttaaggccaagca 

gtgcacggcccttcattgtgctggtgtgtgaacacrgctggggtcagaagaacatacaaggnccc 

tgganntaacctgcrcrgagcgagtgaaaacctctggatcatcattgaactaaaacacaa^ 

gagaaaaaccrmgatagtnaaaagttrga^gactgnacttcanaaggagt^ 

aactggatccaaaatttatccngagtttttgtttaaaatatgttttca 

tcntcccaaaaactnaaatnatgtggacattnctgatgtgggcttat>mtmaa 

aggggaatccttgtmnttctaaaaaaatgnncctgacattnn^ 

atcctgncaaactmaatttattttntt 

seq id no: 2 1 83 actggtccaggagttatccaggatagattttcacccaccatgggacgtcatc 
gttcaaatcaactcitcaatggccatgggggacacatcatgcctcccacacaatcgcagtttggag 
agatgggaggcaaagtttatgaaaAagccaggggctaagccanctctaccataaccagagtcaa 
gggactcttatcccagctgcaanoacagtccaaggatatgccacctcgggtttctaaaaaaggac 
agciraatgcanatgaaattagcctgngcctgctcaatccgtcctaatgaataaaaatcaagtgc 

CAAAOCTTCAGCCCCAGATAACTATGATTCCTTCTAGTGCACAACCACCCACGCACTTAAACACCA 
CCTTTTGGGACAGACACCTCAy^CTrGGTCTNAAACTTATCCACCACTTAOT 
AAGACCCAGAAAAAGCCCCCACCGNCNAANGGAAAAACTCCTTAAACTAACTGAAC™ 
CT^fNATTTCTAA^^^ANNGGAAATNCNAATGAAGCTTGTAATGG^^^GTJ^ 

AAAACiUM'Cl'i' l 'CTGATATNNTAACCAAGGTATNATOCNGNCCNNGATANAANCCATAANATAAA 
NAAAAAGCANNTTNTT 

SEQ ID NO: 2184 ACAGACATGAGATGCTTCCAGCCAGCCTNATCCAGGCTCATCGGGATTACTT 

cngggcrcacacctatgaacrcttggccaaaccagggcagtttatccacaccaact^ 
catggtggcaccgngttatcctcatcatacaaatgcctgatcatgctgctcctgtcaccctccacg 
atttccacanaaccaggacattccatgtgcctcatggcnctgccancctggcccttrgncctot^ 
tntgttcagttnttttaaancggttggtaaganacltcrrgnagaaaacacaca^ 

SEQ ID NO: 2 1 85 ACGCGGGCACTTGATTTAGAAGAATGACACCAAACACATCGCTGAAAAAATT 
AAGTCAGCTCAGCACGAGTTGAAATTGACTACATrAATTTCTTrCCACCn'AGAATCAACAGGATG^ 
TTATTTCTATGCTGATTCrGGAGGGAGTrAACCTCCTGCAAAAAAAGGCATOT 
CTITTCTGACTTTTGGCTTCATCTTAATAGTAAGTTCAAGAAGTAGTO 

ATNATGGNCATTTGGAAAAAAATGArmATGmGAACCTGNNCACCCCAAGTAAGAAAGTGGAT 
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CTGCcrrTCCANCirmcGNTTTCArrmGCK} 

AAATACChnrrCATTTTGACANAGTrmAATGAGNGATTNATTN 
AGAA 

SEQ ID NO: 2186 acgcgggatggcaatgtggagaaggtgaaattcatgaaaagcaagccgggg 

GCCGCCATGGTGGAGATGGCTGATGGCTACGCTGTAGACCGGGCCATTACCCACCTCAACAACAA 

CrrCATGTTTGGGCAGAAGCTGAATGTCTGTGTCrCCAAACANCCACCATrATGCCTGOTCAGTCA 

TACGGGTrGGAAGACGGNTCTTGCAGTTCAAAAACTTCANTGAATCCCGGAACAATNCGGT^ 

CACCCCANAGNATGCAGCCAANAACCGCATCCAGCACCCCANCAACGTGCrrGCAuriNiiiAAC 

GCCCCmrGGAAGTGACCCGANGAAAACnTKITmAAATCTGCN A 

C^^nnTC^OTGAAAATATTCTNANGCCAAAATTGAGCG 

TCNAAANCCAATGCCCT 

SEO ID NO- 2187 acgcggggattttctcccggaacctctgctcagcctggtga accac acaggc 

CAGCCGCTCTGACATGCAGAAGGTGACCCrGGGCCTGCTTGTGrrCCTGGCAGGCT^ 
TGGACGCCAATGACCTANAAGATAAAAACAGTCtnTmACTATGACNGGCACANCCTNC^^ 
GGCAGGCTNATNTGCGCTGGGGTTCrGTGCGCCATNGGCATCATThm'CNTNATGATTGCTAAA^ 
GCAAATGCANTTTTTNGCCANAANCTTCGOTAACCATNn^AGGGNNAATO 

SEO ID NO- 2 1 88 aataaagaacctctatcagtgagacttctcattttatagcaaatacattttgc 

AGCTTAAATTITCTNGAATTCATATACGCTrCTGTCATTTAAACAAAC^ 

TCTATATATTTAAATTACAAATrrGGCCAAATACATATTTNTCCATATT TA 

TATTAAATTTGAAAAAATCAAANGTGAAAGCANAAACTG>™TTCAAGTTm 

ATTmATNATTAAAGTTrrGGrrGAATATNCCrCAATAGGGKTTCTAANA^ 

NOTCrrATGNAATTGGGGAI^NTmrrcNAAAGNATGGTGAANCGG^ 

NT 

SEO ID NO- 2 1 89 acgcggggctcttcctgctctccatcatggcgcaggatcaaggtgaaaagga 

GAACCCCATGCGGGAACTTCGCATCCGCAAACrcrGTCTCAACATCTGTGTTGGGGAGAGTGGAG 

ACAGACTGACGCCNAGCAGCCAAANGTGTTGGAGCACTCACANGGCNTACCCCTGTNTrrTCAAA 

ANCTAGATACACTTGTCAAANTCTTTTGGCNTTCCGNANAAATGAAAANATTGCTNGCCANTGCC 

AATTCGANGGGCCAAGCAAAANAAAArmT TGGAAN ANNGGTCNAAAAGNNCTGNAATTm 

AANAAAAAACAACTNCCOTATTTCrGGNTAACrriirmr 

SEO ID NO: 2 1 90 ACnr nmrnri iLliilin i GNGCTTCTTATGTTTCTCTGTGCTGTArrCT 
GGAAGTGGTCGGATACCATCGTCTOAGCrGGGACTATTCTGATAAAATITCrCTGCTATGGAGGA 
CCTGACTCACrrTCACrGCCAAATTCrGGGGGGTATATGCTGGTGACTTACTATGGCTGGGAAANC 
CACCCTCATGCTTTGGAGTGGAACCACTGATGAGTGGANAACCATAGTTTTTAAAAAAAACCATT^ 
GAGGCCTAAACCCTTCTrCACrACTTTCCCCANGTTTNTrGCAAAGNCACTTTGGC^ 
GGGACCTCCCATCATGC^^^ACTGA^WATAATTTTGCCACACC^TrTCTCCCa^ 
TAAGNCCTNTTAAAAAAANTCCCCTAACTCCCGGGGGNNGAAAACTTTGTTn'GNNT^ 
TTirmANCCTTCCCTGAAAATCTTCNCCCrGAAATTTTTAT^ 
GGAAAAATTTKNNCTNANC CCCATT TNGGCCGTCANTNGNTITr^ 
NAANCCCATNCCTTAATTTArnTTAAANA 

SEO ID NO- 2191 ACCrGCATCAGCATTAGTAATCAACCTGTTAATCCAAGGTCnTAGAAAAACT 
TGAAATTATTCCTGCAAGCCAATTn'GTCGACGTGTTGAGATCATTGCTACAATGAAAAAGAAGGG 
GTGAGAAGAGATGTCTGAATCCAGAATCGAAGGCCATCAAGAATTTACTGAAAGCAGTTAGCAAG 
GAAAGGTCTAAAAGATCTCCTTAAAACCAGAGGGGAGCAAAATCGATGCAGTGCTTCCAAGGATG 
GACCACACAGAGGCTGCCTCTCCCATCACTTCCCTACATGGAGTATATGTCAAGCCATAATTGTCT 
TA>nTTGCAGrrCACTAAAAGGNGACCAATGATTGGTCACCAAATCANCTGCTACTACTCCTGTAG 
GAAGGTTATGTTCANCATCCTAGCTATTNAGTAATAACTCTACCCTGCCCCr^ 
TACTGGAGGNGCTATGGTCTTAANGGATGGTCTGGACCCTGCTTNAATTTrrCCCTNACCTTTCCA 

NTTTTCCAGGGTNCCT 

SEO ID NO- 2 1 92 ACCCATGATTrGGACACTTTGTCGGGCCCATTTACTTATACTGCAAAGCGTCC 
TTCAGATATCAAATTCAAGCCTCTAAATAAGACCAAGGAGTATACAGCCTGTGAACTGATGAACA 
TATACAAGACTGACAATCACCTGAAACATTATTTACATATCATTGAAAACAAACCCCTGTATCCAG 
TTATCTATGATAGCAATGGTGTCGTCCTTTCAATGCCTCCCATCATCAATGGGGATCATTCCAGAA 
TAACAGTAAATACTAGAAATATTTTTArrGAATGCACGGGAACTGACTITACTAAGGGAAAAATA 
GTTCrrGATATTATTGTCACCATGTTCAGTGAATATTGTGAGAATCAATTTACGGTCGAAGCTGCT 
GAAGTGGTTTTrCCTAATGGAAAATCACATACCTTTCCAGAATTAGCITACCGAAAGGAGATGGTG 
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AGAGCTGACCTAATTAACAAAAAAGTTGGAATCAGAGAAACTCCAGAAAATCITGCCAAAC^^ 
GACCAGGATGTATTTAAAATCAGAAGTCATAGGTGATGGGAATCAGATTGAGATTGAAATCC^^ 
CAACCAGAGCrcCATTATCCATGCATGTOATATTGTAGAAGATGCANCTATTGGCrrATGGAT^^^ 

ACAACATTC 

SEO ID NO- 2 1 93 ACTGGATGGCCCCACAAGATGCTGCCACnTAATAAGGCTGCAATACACTGT 
GTATCTTACAGGAGTATTCrrATCCATCCCGTGGAAAAGGTrGCriTAACAACTGCAGTCT^ 
CGGGCGTrCACCTTCGCGAAATTTGACCAGCrrrrCACATAGGCm 
TCTGGTrCCAGGATCAAGAGTAGGGATACCACACTGTTCATCACACTITCAACATCr^^ 
TCCTTCANACACACATCACAGGCTTCAATAATITGAGCTAAATCAACATGAAGTCCACCT^^ 
TTCTCTTCTGAAATCTCAGCTCCnTrAGATTTCAGATAAGCACGAAGCTC^^ 
CACTGATGTCGATGAAGGCCGGGACGCTCATGGTGCAGGCCCNGGCACAGCGGACACTCCACTCG 

CAAGACCACGCCGACCGGAAAAGGGCCCCGCG 

SEO ID NO- 2194 ACAGTrrAAACAACAGCTGAAAGAACTAAAGAAGCAATGTGGTCTTCA^^^ 
GACAGAGAAGCTGACGGAACAGAAGGAGTGGATGAAGATATAATTGTGACCCAAAGTCA^^ 
ACTTCACCTGCCCCATTACAAAGGAGGAAATGAAGAAGCCAGTGAAAAATAAAGTGTGTGGCCA£ 
ACCTATGAAGAGGACGCCATTGGTCCAGGCAAAAGCGGAAGAAAAAGGCCrATTGCCCTC/^ 
GGCTGTAGCCACACGGATATAAGAAAGTCAGATCTTATCCAGGATGAAGCACTTAGAAGGGCAAT 
TGAGAACCATAACAAGAAAGGACATCGTCATTCCGAGTAGGAAAAGCCACCTGCTTO 
ACCAGCAGCCTACCTCCTACCCCACTGTCTGTTGAGAGCAGTGCTGACCCCAGCAGTTA^^ 
GCTGCATAGCATACTrGrrGGGGGTAAAACrrGrrGCrrTTATGTGTGCrr 
GTirCACAACAGAAAATGCCATTCATATTGTrrArrmAAGTGTIXn'ATAATGTA^ 
TTGATCTTCTGCAANAAAATAANTT 

SEO ID NO* 2195 Ac n - n ' ri 'n r iTn -iTn 1 1 1 r rrrrrn-) 1 1 aaaaacaagtntcacaatgttt 

Ar^GATAGATACAAGT^m^AAAATCAGGGCATGAACATGGCTTGATAAATTAAGTANAm 

rrCAATACrATAATAGGAGGGACCAATrCAAATTCTCACCATTTGrrrCACACCCACAAAA^ 

TTCAAGGGCATTAACGATCT^^X:AAAACTGATCAGTTT^GTGCAAGTAAACCATC 

AAGACrrGTGCACTTGCCCAGGCTCAAGGATATTAAAATCTANCACATAAAGCCCATTACTAGAG 

GTANAAATACAGGCAATATACTATTACGGCAACAACCATCAATTACAGTTAAAAATT^^ 

CAACCAAATGGATAATCAAATATTGCANCAACTCAAGTbm'ACTGAGCAAAGTGCATrrCTAC^^ 

TArrCAAGNGTTGCrATTCAAGTmCTAACTTAAAACANCCTATGATAACrGGCACAAANA^ 

Cn-GCAATAAACTGCCTCTGCTTGANAACTTATGATGGAArrATTGCTGCTGCTO 

AACATTAAANATCCTCCTAAAATArrTGGATGGNAGACTNTGATTAAACm 

CTT 

SEO ID NO- 2196 ACGCGGGGTATrTATATACrrGGTTTTAAATrAGGTCATATGTrCACATGGTT 
CAAAATTCACAAAGAACAAAAGAGCATATAGCTAAAAGTATTCTTACCrm 
CTTCACmGTGTTGTCAATTTCTTGTCAAATCTTrCCAGGGAGATm 

GAGTTGTAATATATNCATGCATACACNCACACACACNCACNCACTCKCNCACACTCACACAC^^ 

AANTAAAAAACCCNTrAAANAAAGACAT>rrrATCCTATCACTANAAGAAAATNTC 

NGAAGACANGGNAATTAAGNGAAGGAAAAGANNATTATAAACCTAAANNTNACCA-m^ 

TT^GCKGGAGTAAGTNTTTAATTATCATTNGNTAGCNrrCGAOTATAANTGGACT 

AAAAGAAACAGACTGNCTGAATA 

SEO ID NO: 2197 acgaagaaagcatttcccaagcaatgagtctcttaatggaaaaaataaa^^ 

GCAATGTAArrAAACTTTCCTTCAAAGAAAGGAAAGAAACCTAAATCTGTTATGGC^ 
/iGATAAATATrrCCTAGTCCCATAATATG^^ 

CTATOAAGCAGGGATCTGTGGCAATCCAAATTCATCCACCAGAACTCCATCCT^^^^ 

AGTGGGAACACCTTCTGGAATTGCAGGTGCAGATGCTGCCTCATCCAAATAAGAACTOT^ 

AGCCAGAAGCTCATCACCTAGTGCATCCAACTCTGCTTCTAAATCATCrrCATCCAGTTCT^ 

^^ATA^GOTACTC^^^ 

TGTAAATCCrCAATCTX3GTCNGAKrrrCACrrGCTTGTATGCCTTOT^ 

TCATAGCATCAACCCGTGGTCTTGGTGTCCrrCAAAGACTGGATGGTATAATTGGGCrrGT^^ 
GTrGAATGACTGTGGGGCAANATTGNCCCGNTGCTTGCTCATACTTCCTCTTTO 

A 

SEO ID NO- 2 198 ACmAGATACTrGGCATArrCTGGGTCTTTCCAGTAAAGCAAGTATTTi^ 
TAArrAACAAAAGCTTTGTCm 

GCTAAACATTGCACAAATTCCAACTCCAACTGAAACCGAAGTCGATTNCCAGCATCATCTGA^ 

atagcaacagcatcnggccataacaaacnaagacnccaaatcgcx:gccancctgacagagcaaa 
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ACCCANAAACGCCNGCCCCNAGTTTCAANATAAACCTGACTNACTTANAATGAAACATANTAAT^ 
AANhrrGTTGCATGTCNNATTACTAAANTAT 

SEQ ID NO: 2 1 99 ACAAACTGGTGOGTCANATCGTCTCCTCTAACATGACGCTACACTGTCGCTGA 
GGAACACATTTAATAACACTTCAGAACTGAACTGAAACGTGGCACAAACATGACAACTTCCAGGC 
ATGCCTTCAAGCATCTNCCTCANGAGGGATTCGCAACCCCTIWACTTC^ 
AATTCTTrGGAAGAATAAATNAGTTCACACTTTAGNNACTTOT 
CATAGGTOGGTTCATANCTCITAACnTATNCAGGAACAAATTTTGATNCT^ 
TAATNCNTTCAATGTTGNAAT 

SEQ ID NO: 2200 ACGCGGGATCAGACACTGGATCAGACACTAAACGAACTTAACTGTATATAAG 
CAAAACAGAAGAGTCTTGTTCCAACAGAAACTCTGGAGCTCCGTGGGTCmCTCr^ 
GAAGTrCCrrTTGTTATTGCCATCTTCGCTTTGCTGGAAATGTCAAGC^^ 
AAATATmGTNTNGGAGAAGCmGACCACCAATAAATTTNATNCCTTNCC^ 
GCACCAGNTTTT 

SEQ ID NO: 220 1 ACCCCTTAACCCCTTCTCCTTCACCCTTAG CAGCA AGTCCCACTTTTCTAGGG 
GGCAAGAAACCCCAAACCCCTTCCCTCCGTGTaTTACGCTCTCTTrTCTCT^ 
ACTATGGGCAACCTTCCATCCTCCATTCCTCCTTCTCCOTANCCTGTGTGCTCAANAAOT 
CTCnrCAACmACACCrGACCTAAAACCTAAATGCCTCATTOTC^ 

CAATACAAAOTGACAATGGCTCTAAATGGCCANAAAATGGCACTNTCNATTTCTCCATC^ 
GGACCTAAGTAmTNNTTGTCAAAAAAATGGNCAAATGGACTGANGTGCCNAGATG 

SEQ ID NO: 2202 A Lr ir iT i TnTr rrrr iTi - mr y i -i- v ncTTCANAA 

GATGATGANCCACTITGTAmCCTTAGTATTrCTCTTTGAAOT 

TATTCTTTTCAGTCTCTTATCANAANAGTTACAACCATCTTNTTTCATGGAA TA 

NATGATGAACAATCrrGTCTCTTCCTTGAACTCTITCCAAGCAACTTGCACCi^ 

CATATGCTCCATTCITACrCTrmATCCrrCTGAANAGTCACAACTATCT^ 

TCAGCATAATCAGATAATTCATCCrrCTTTTTANAAGTTTNATCTOT 

TCCATCAGTTGNNCCATTOTAAmGTTTTATGCCCTTAGGAAAATGACAAA 

GGTAACriTrCAGTGCCATCAGATTGAAAGATTCATACTGNNGGTTCCATTrrAAT^^ 

TAAAGTCTGGAAGG^^^TTTTTTTTT^m'CAG™ 

NTTNAATCGCTNTOGACTNThnTAANGAAATATNTTTGCANCCCCNGTACCTGCCC 

SEQ ID NO: 2203 ACTCCACAGAGAGATGCAGACAAAGTAAACAATGAAGGTTGTnTATAAAGG 
TGATGACCATTATAGAGTCAGAAATGGGAGTCGTTGCAGGAATrrCCTTTGGAGTrGOT 
AACTGATTGGAATCirTNTCGCCTACTGCCTCTCrCGTGCOT 
NGTAACCCAATGTATCTGTGNGCCTATTCCTCTNTACCTTT 

SEQ ED NO: 2204 ACCrTCATCTXTACTTCCAAGTAAACCCGTGGATGATTTGATGAGGGATAAAT 
GAACCTA iUlCU -ii'lACACACATACCAAGGACATGCTTGTGGCrAAAGTGAGTTGATAATG'rrGTG 
CAAAGGATAGTTGTCACCAACTCATTNCTITATGGTCCATAATGAAATAAAAATTTTGT^^ 
TAATTCTGTAAACAGATGCATGTNCAAAAGATCTATGATGGTC^GNAATCTrANTb^ 
TTAGATATTNTANANTTTTmCNTCTTGAGGAACACATn^^^ 

SEQ ID NO: 2205 ACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCAGTGGTTTAAAC 
GGCACGTTTTTGCTTATAAAAAAAGTGCAAAAAAGATGTGGTTTACAAGTTAAAGCT^ 
CCnrmGCrGTAATTGCACCAGTrrTAAAGCCTCCGGACAGAGCAGTATTTCGm 
TTTTCTTAAAAGCTTACAGTGTTTGGCTAATTCTCCTCCCCTTm 

GGACACTGGTGGCAGGTTAAGGGATACTGTCACmAAGAAGCCTGCAGATTGAAGTGTAAACAT 
GGAGAAATTAGGGGCTGATTrmAAACrGTGTGAGATArrAACCAGCCGCCCTGTTATAA^ 
GGAAATCCAAACAGCGAmACACCGATTAACACCCCCTTTATATATTTTTrACAAAAATA^^ 
AGAAAATAATCAAACGTTirCATaCrCrTGTCirr^ 

TTTAAArATOAAAAATTAAAAGTAAAACTCTAGCCCTTCAGTOAAGGAGACGTAAAATGGCGTGA 

GTAACAACAACTCCAAAAAAATAAAAANNGAAAAAAAAAGAAAAGGTCKrTGGCCGCGACCA 

NCTAGGGG 

SEQ ID NO: 2206 ACATrCTGGGAGAATATCACTGACGCTCAAACCATTTTTATTTCCAATATGTA 
TTTCAATACATGTTTGTTTCCACTTTTCCCAGTGCCACACACACACACACACAAAAACAAAACAAA 
ACAAAAAAAAACAOTCACAAGTTGGATTACATTAGAATTGGTGCCACAGTTAACrrm 
ATmAATAACCACCCAACTCTTAGATTTTGCAGTrrAGGGACTTO 
GAGAATCGTTrCATGTGACATGATGTTTCTATAGACCTCTTGCTCTCTAGGTGACAATG 
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AGGGCAAAGGTGTTTAANCTrrGTCACCTTCCACAGNTrrGTGGCCACATTCATAm^ 

TCGGACCCTGCTCCCNTCTTCTGGAGATGGGGAGAGGCAAGNGTGTATGTGTTGNCATACriTACT^ 

CArrAArrCTNACTTCGAACCGTmCATCAGCAGAATAAACCNAArrGTT^^ 

SEO ID NO- 2207 ACGCGGOGOTAACGGAGTGGTGCACCAACGTGAGAGGAAACCCGTGCGCGG 
CTGCGCriTNCTGTCCCCAAGCCGTTCTAGACGCGGGAAAAATGCriTCT^ 
TGAAGGGTGTGATGCirGGAAGCATTTTCTGTGCTTTGATCACTATGCTAGGACACAT^ 
GTCATGGAAATANANTGCNCCACCATGAGCATNATCACCTACAAGCrCCNAACANT 

SEO ID NO- 2208 actgtctcttttcgaaaagttcttgatccccaatgcttcacaagcagaoagca 

AAGTCTTCTATITGAAAATGAAAGGAGATTACTACCGTTACrrGGCTGAGG™ 
ACAAGAAAGGGATTGTCGATCAGTCACAACAAGCATACCAAGAAGCirrrGAAATCAGCANA^ 
GGAAATGCAACCAACACATCCTATCAGACTGGGTCTGGCCCTTAACTTCTCTGTGrrCTATTATC^ 
GArrCTGAACTCCCCANAGAAAGCCTGCTCTCITOCAAAGACAGCTTTTG 

ACTTGATACATTAANTGAANAGTCATACAAAAGACAGNACCGCTAATAATGGCAATTACTGAGAG 
ACAACTTGACATTGTGGACATCGNATCCCNAGGGANACTANACTGAAGCNNGTrAAGGAGGNGG 
AANNTAAACNGGCCITCCAACriTrGAhrrGNCTAATNCTAAANTrrAC^^ 
ATTCATGCTGCCCANANATAGTTrmGTTNCGATTTATCGACA 

SEO ID NO- 2209 ACAAAGGCATGGGGCTGTCCATGGGCACCATGATCTGTGGCTGGGATAAGAG 
AGGCCCTGCCTCTACTACGTGGACAGTGAAGGGAACCGGATTTCAGGGGCCACCrrCTCTGTAGGT 

tctggcictgtgtatgcatatggggtcatggatcggggctattcctatgacctggaagtggagcag 

GCCTATGATCTGGCCCXJTCGAGCCATCTACCAAGCCACCTACAGAGATGCCTACTCAGGAGGTGC 

agtcaacctctaccacgtgcgggaggatggctggatccgagtctccagtgacaatgtggctgatc 

TACATGAGAAGTATAGTGGCCCTACCCCCTGAAAGAGGGTGGATGCAGCTGCTTTGTTTOT 
TGACTGTCATTGGTAATACGGACACAGTGACCCATCCTCCATCCTArrTATAGTGGAAGGGCCTTC 

AATTGTATCAAT 

SEO ID NO- 2210 ACTACAAGAAAGAACTTTTTATATGAAGGATrCTTTATGTAGAGTATCT^ 
TGAAAAATCAGATTrrCITATCCTATArrACACrGGTTTT 
GCCrCATTACAATGTCTCrmGTGTTAAGAATTAACTTACAAAAGCATT^^ 
AAATGGGATAGAGAGTAAGAAGACAGGAGAGAGAGGAGAAACX^ATGTITmGriTrcAGTCA^ 
GAGGGTCTCACTCrGTCACTCAAGCTGAAGCACAGTGGCACGATCTCGGCTCACTGCAACCTTAG 
ACTCCCAGGCTCAAGCGATrrCTGGCTAATnTrGTArrrrTAGTANAGA 

ACCCANGCTGGTCTCGAACTCCTGCCTCGGCCTCCCAAAGTGCTAGGGTTCAGGCATGANCCACC 

ACGCCCGGCCCTGACACATATATTrAGTGCCATCTmCCTGCATAAANTGTATAATTATTAGGGT 

AACCGTGTNCAKn-CNCTGTCTTATTCCCAGTGCTTAACAGNNG^ 

ANT 

SEO ID NO- 221 1 ACGCGGGGGACnTTACCCCAGTGTGCACCACAGAGCTTGGCAGAGCTGCAAA 
GCTGGCACCAGAATTTGCCAAGAGGAATGTTAAGTTGATTGCCCmCAATAGAC^^ 
ACCATXnTGCCrGGAGCAAGGATATCAATGOTACAATTGTGAAGAGCCCACAGAAAAGTTACCT 
TTrCCCATCATCGATGATAGGAATCGGGAGCTTGCCATCCTGT TGGGCA TGCTGGATCCAGCAGAG 
AAGGATGAAAAGGGCATGCCTGTGACAGCTCGTGTGGTGTTTGTTTTTGGTCCTGATAA^ 
AAGCTGTCTATCCrCTACCCACTACCACTGGCAGGAACTrrGATGAAATTCTCAGGGTAGTC^^ 
TCrCCAGCTGACAGCAAAAAAAAAGGTTGCCACCCCAATTGATrGGAAGGATGGGGATAGTGT^ 
TGGTCCrrCCACCATCCTTGAAAANAANCCAAAAACTTTTCCNAAANGGATCnTCAC 
TCCATmGGCAANAAATTCCCTCCGTTCACACCCAANCCTTAAATTCTTTTr 
GCrGTNAACCCAAANGATGTCACTCCNATT>mTmTI^OT 

SEO ID NO* 22 1 2 acaccggotggcattaaggggtaaagatgtcccccttacggagcagaccgtg 

TCTCAGGTGCTGCAGTCAGCCAAAGAACAGATCAAGTGGTCACTCCTTCGGTGAAGACCT 

TCCTGGCTCTTCATCCTCTTCAAAAAATnGCATGTCTGCTGTGAATTTTCA^ 

GATGCrCrCAGGGTCATCTCGGGGATCACAGGGATCOTAAATCTCCATTCTGrrTGTGGTTGCC^ 

CCTCAACXrrCCCCTACACCCrTCCTATTCTrmCATT^ 
AGCATArrTAGATAATAGGGCAGGGGAAGCACCCTOTrCTITCTAGACTG 
CTCCCrrGCCCTGACATTITrrGTAAATTCTGTGCCCmGCT^ 
GGANAAAGAATGTGCTGAATGTTTTCCTCCTrrTGCTCT 

AGGGGGAAAAAAGAAAGAANAAmCTTTTCTATAAAACAAl^GGGGGCCGGGGATGGG^ 
GAmATCCAATCTAANCCCTAACCCCAKmAGNGACCTCANGGTTTNCrCCATO 
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SEO IDNO: 2213 ACTCTATGCATrCTATCTATACAGAAACACCTATrTATrATrACAGTTGArrrA 
TGCAGGATTAACATTTTTGGTGrrAAAATTGTTAAACATCATGCTTACT^ 
ATGAATCroAACrrCACAATGATTTACCAAACACTTrACn"AAATTACT 

ATGCACCGAGCCAAACAAAACTTAACTGGGCTATAAAGAAAAAAACCCTTCAAAACAGTCCAT^^ 
ACACATCTATTTCAAAACTACCTITGTTAGCTAGCCCCTTrCCCAAAGrm 

GTAAAATCTACACAAAATTTTATTGCAATCATACAAGGGTTACATTAGGTCAACAAATACTATGAT 

GCAArmACATTTArrAAACTACAGrrCAAAGCACAAATTTACACATTCTAAATACACTAAAACG 

rrATCTAATGAAGTCACACTGGTCTTCTAACATTTGATATATCTGGGTGAAAGACATGAACm 

AAANACrrrAAACACAAATCCTTAGTATAAAAACTGTGTCTTGTTTGTAACCGATTTAATGG^ 

CCCAATTAATGGGCTTCCATCTCCCTTAAGTCATNCATmGGTGCATATTTATm 

GGGG 

SEO ID NO- 2214 A Ci - J fi 1 J i l I ' l ill m i 1 1 1 ll 1 1 1 1 I CCACTGCTGCCACCACCATGAAANAN 
TGGCCACCANATTTTTATTGCNTACTCAGGTNAATAACTTATTATACAAT^ 
GGANACCATGCCCACTTACANAATGCAGCCGTAANTGCGGTAAATm'ATTTACAGAGGTNGGGGT 

gcaanatnaaagaantttcaccxcnagnantttgaagtganaatgatctacaaa™ 

AGGACCAACCGGGCTNGTGCTNNTGAGGTCmAAAAAATTCCTGGCAAANCGTAGGGGGAGATTA 
NATm'CGGAATTGACAGCAAKTTTGGGGACANTGCAAANAANANAGGGGTGACCT 

SEQ E) NO- 22 1 5 ACCCTGGGATAGGGAGCGATCTCCGAGCGAGGCGGCAAGATGGACGCGGGA 

tttttccgcggaacaagtgcagaacaggataatcggttcagcaacaaacagaagaaactact^ 
gcagctgaaatttgcagaatgcctagaaaaaaaggtggacatgagcaaagtaaatttggaggtta 

TAAAGCCTTGGATAACAAAAAGAGTAACGGAAATCCrrGGGTTTGAAGATGATGTTGTGATTGAG 

TTTATArrcAACCAGCTGGAAGTGAAGAATCCAGACTCCAAAATGATGCAAATCAACCTGACTGG 

ArmTGAATGGAAAAAATGCTCGAGAATTTATGGGAGAACTGTGGCCCCTGCTGCT 

AAGAAAACATCGCGGGAATCCCTTCTGCTTTCCTAGAACTGAAGAAAGAAGAAATAAAACAAAG 

ACAGA1TGAACAAGAAAAACTGGCATCTATGAAAAAGCAAGATGAAGACAAAGATAAAAGAAGA 

TTTGGGAAAAAATANAAAmrATNAATNN>rrNTT 

SEO m NO: 2216 AcnTTT irm r irm r i T nTirirvm 

TATAAGT^r^TGCAGCTmx:AAAATGTCATCAT^GCCACTAATGATTACT^ 

rrmrrCAGGCCTGTGGArrGGCATCCAAATACANAGTCTTACCCAGCGGGGACNTGGGTGCCCCC 
CCCGCNTGCCACGGAAAGCTTACNTAANTTTAACTTGAACANANCTTGGGAAATGGGG 
GGAAAGCATTTCCCACGCCAGGAACCAACGTGAAAGCNTTGGAATCANCACANCAGCCNTGGAA 
TCAGGCAGGCAGGGGAGGACGGGCrGTTCCTTCTNAGCTCTATAGT 

SEO ID NO' 22 1 7 ACGCGGGGCAGGGAAGAATGACAGCCACAGGGAGATGGTGGTGGGCAAGAA 
TGAGAGTCCCAGGATCCAGATTTAGCCTCAGATCTTCCCCATTCAGGAAGGGTriTCCAm 
AGAGCACrAGTATGAAAACATrAGGGACAAATCTCCCATGTCriTGAAArrCGGATTCTCCTOT^ 
AGATCCCCrrCCrCACCTGCCAATCAACirrATAAGGCCACAAGTGGTCACTGGrm 
AGGTTTGAGGTTCTCAGCTTTCCTrAAGCGACCCAGCAGCTCCGCTGTTTTCAG 
AAGCmGATGAGATTCTATTTTCAGTAATTTAGTGCTTCTGGGACAOT 

AGTCATTGTCTACGCAAAQAACAACGAAGCTGATCCTAAAAGTGA TCCAA TCTAAGAAAATGGTA 

AAACGAGCTCTGGCCCAGCACAAAATTTTATGTOAGGAACTCANATTm 

CAGAAAAAGGTTGCAGCCTGCACACCATAGCCCACCTNTCTGANCAACTTTGGT^ 

CGTGGCACATGnTGT 

SEO ID NO: 22 1 8 ACAACCTGAATTGAGGCTTCTCCTTCACTGGAGTGCACCTGCCTCTACCTCAT 
GGGTATAAAGTAGGAGAACTAAGAGACTTAAGAGGTCGTGGTTCCTATATCGTCCAAAAAATAGG 
CTGTTACATATCCTAAAGACTGCrCAACAGCTTCAAGTTOAAAGTGGCCAAGGACAGCCar^ 
GTrrGGGAAGGGACGAGCCTGAAGGATTCTGTCTTTACTGGGGTCAAATCTTAAAGCACACA^ 
CTGGACTCAAGACAGGAGGmGCOTCCTGATGGCTTTGCCACACArrCACAGGATAACT^ 
ATCCCTCGCTGCTGATTCACnTCTTACCATGCACTn'CCITrGATGCT^ 

GCNAAAAATCTCAAGGCTGOTCATGTGGACOTGTCAAGCTGCTCCTCCCCAGCGTNAAAm^ 
ATCANGGTGCCAAAACT 

SEO ID NO: 2219 ACTGTCAACATGACnTCAGATGCTCTTTGCCCCTTGCTGTCATCAOTGTGGT 
GAArrCATCATTGGCCGAGTTATCAAAGCCATGAATAACAGCTGGCATCCGGAGTGC^^ 
GACCTCTGCCAGGAAGTTCTGGCAGATATCGGGTTTGTCAAGAATGCTGGGAGACACCTGTGTC^ 
CCCCTGTCATAATCGTGAGAAAGCCAGAGGCCTTGGGAAATACATCTGCCAGAAATGCCATGCTA 
TCATCGATGAGCAGCCTCTGATATTCAAGAACGACCCCrACCATCCAGACCATrrC>^G^^ 
ACTGCGGGAAGGAGCTGACTGCCGATGCACGGGAGCTGAAAGGGGAGCTATACTGCCTCCCATGC 
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CATGATAAAATGOGGOTGGeei^TGTGTC3GTGCTOTCGACGGCCCATCGAAGGGCGCGTGGTC^ 

CGCTATGGGCAAGCAGTGGCATGTGGAGCArmGmGTGCC^^^ 

ACATCGCCATTATGAAAGGAAAGGCCTGCCGTATTGTGAAACTCACTATAACCAGCTm^ 

TGTrrGCTTCCCTGCAATCGNGTATANAAGGNGATGTGGTCTCTGTNTTATAAAGGCCTGGTGCGT 

GACTGNTTT 

SEO ID NO- 2220 acgcgggcgggtaactcccaggagagtgtcacagagcaggacagcaaggac 

AGCACCTACAGCCTCAGCAGCACCCTGACGCTGAGCAAAGCAGACTACGAGAAACAC;^^^ 
ACGCCTGCGAAGTCACCCATCAGGGCCTGAGCTCGCCCGTCACAAAGAGCrrCAACAG^ 

TG™GAGGGA0^Ci¥^^^^ 
TCTG^^CCCT^IIfexy^^ 
CCTCCTCCTCdftSSc^ 
CCCTGTAAAAAANAAAAA 

SEO ID NO- 2221 A CVm 11 1 1 1 rriTl 1 1 1 U n r rrAAGTCAATATAGTTTCCATTrrfACTGTGC 
ATACACATATATACAATGTATTTTAAAAATGGGCTTrACAATATGTAGm^ 
i^CT^^A^ATTGTGAACATmGl^^ 

CACAA-nTATrAAGCAATCTTGTTGGGGACATTGAGGTATAA'n-l-n 1 T'l rCTAAGGAGGCITCATT 

crrrn-ATAA^ 

SvcTNGCCxriTCCArrrANCcnTmAcrrGCTrcTCTAC^^^ 
CArmGTiTrTCAACcnx:TcrcTrcTAriTGCTrccTcrrT^ 

NATGGACrrCCATTCCITCAACACTCTGGTTCCTCCCCTTAAAGATGTTTC^ 
CTAmAATTGATTTTGATCAACICT 

aataSS^aa^^ 

SEO ID NO* 2222 ACAGTTTTCTCAGAAGACTCAAGAmCGCCCACATCCCmGAGCn^^^ 
AGATCTGCCGCCCGGCTGCATrrcTCCCACTCrrCAGGACAGAGTrAGCT^C^^ 
CATAGTCmGTAAGGGCrCGGCCAAGCGTGGGCCCGTGGGATGGAGAATT(XTm 
TGGrrCTGCAGCTGAAAATGTGTGGAATAGGGGGCATAGAGCGTGTCCCCTGTCrCTTCAAAACCT 

TACAGAAAGCATCCrCAACAGGATAATCCATGGGGTITCGTrGCTTTGTOAGCAGCCAA^ 
GGACTAACATAAGCCAACAGGAACACCCACCATTGGCAGCC^^ 

ATCCCACCTGCGGCAAAGTCAGGAACATGAGCAGGGTGATCCANGCCACCCANATGGCAATGGA 

AAGGANCATCGTGAGGTANATGTGGCCCCATGT^mmCCANCCCGNGAAGGACCACANAAGGTG 

AAGGAGGACATGAGGAAGGGCANCGCCTTCAAAAAAAAGGACTN 

SEO ID NO- 2223 ACTGCCmGGGCTrcrrCTCTCTCCTGTTTTCTCCTCT^^ 
AGCAGCACGTTrcGCTTGTATCTGrrCATATTCrrCrrGTGCTim 

GCTACACTGATCrrCAAATAAAGGCrCGTCAATGCTACACTGTrCTTCAAGCA^^^ 
S.TGmCAGATrATX:TGGaTATCGAT^^^ 

rTTCrOTAQCL^ATTTCITGTAACTrrGCTGTATTnCAAGTmcr^ 
TATrCCTGTACCTCGGCCX3AACCACNC 

SEO ID NO: 2224 ACCTAATGAAAAGATCTCCAAGAGGTITGTCTCATTCTCCTTG^^ 
AAGATTAATCXrrATATGTAATGATCATrATCGAAGTGTGTATCAAAAGA^CT^TO^^ 
TAAGATmGAAAAGCCTTCATCATCCAAACATTGTrGGTTATCGTGCrmACT^ 
TGGCAGTCTGTGTOTGCTATGGAATATGQAGGTGAAAAGTCTCTAAATGACT^ 
GATATAAAGCCAGCCXAGATCXnTITCCAGCAGCCATAATTrTAAAAGTTGTTIT^ 
GAGGGTTAAAGTATCrGCVVCCAAGAAAAGAAACTGCTTCATGGAGACATAAAGTCITCA^^ 
GTAATTAAAGGCGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCnCTCrACCACTGGA^^ 
AATATGACTOTGACTGCCCTGAGGCTTGTrCATTGGCACAGAGCCATGGGAACC^^ 
TGGAGGANAlTGGTGrrATrACTGACAAGGCNmCATATrrGCC^ 

ATOATGCTTTATCGATCaXACATTATCNTrCAAATGATGATGATGATGAANATAACriTrGATNA 
AAGTGTTT 

SEQ ID NO: 2225 ACTmrrTnTTrrriTrnTrraGNA^^^ 

mrrAAC^GAGTGTGATAGGTGAATTAAACATATTAAAAGAGTT^^ 

OATTATATACTATACAAACTATrCAAGTCATACAAAACCATTTITCATTGTCTrTCAA^ 

N/^CCTAnCAGCACGCTTCTCCTCCTGAGGNOGTTCATACT^^ 
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TATTTTOTCTACAATAGCrnCATACTTCTCTTTCATTrCGGCCAGT^ 
TTCTGGATTITCrrGGAGTAGCANCTCCrAAGTGATOCmAAAAAATCCAAAGCACT^ 
CTCTGGTTCITCATATAAGGCTACCAACACCITGGTCAGCGTGTCCACACCCCC^^ 
T 

SEO ID NO- 2226 ACATCACAAGTGATAACTTCAGCmACCACCANAAGCAGCTTCAATAGAAA 
CATCAATrmCCATCTGACrrCITGGTAGCACCAGTAACCTTrGTATTC>^ 
CTGTrrrTGAAGGATGCGTTGAAAGTTTTTAGATATCTCCATATCAATTCCAACT^ 
CCTAAAAATTCAACTGCTGTCACATCTGCACCAAGTCTTTGCCAAACrGAACCCAATO 
ATTACTCCTGCACCAATAACAACCATCTTrrCTGGAACTrrrm 
TGACACTATTGTATCTNCATCTATCGTGATTCCAGGAAAAGGAGTAACTTCT 
AAAGAATGTTCTTTGTATCAATAACCTGAGTGCCGNCATC 

SEO ID NO- 2227 ACGCGGGGCCATCAACCGCCAGATCAACCTGGAGCTCTACGCCTCCTACGIT 
TACCTGTCCATGTCTTACTACTTTGACCGCGATGATGTGGCTTTGAAGAAC^ 

ttcaccaatctcatgaggagagggaacatgctgagaaactgatgaagctgcagaaccaacgaggt 

GGCCGAATCTTCCTTCAGGATATCAAGAAACCAGACTGTGATGACTGGGAGAGCGGGCTGAATGC 

aatggagtgtgcattacatttggaaaaaaatgtgaatcagtcactactggaactgcacaaactgg 

CCACTGACAAAAATGACCCCAmGTGTGACTTCATTGAGACACArrACCTGAATGAGCAGGTGA 

aagccatcaaagaattgggtgaccacgtgaccaacttgcgcaagatgggagcgcccgaatctggc 

TTGCGGAATATCTCTTTGCAAGCACACCCTGGGAGACANNGrrATGAAAAGCTAAACCrCNGGCT 
AATTTNCCATAGCNCGTGGGGTGACTTACCTGGTCACNAAANGCArrGCATGCATGTO^ 

TT 

SEO ID NO- 2228 actttgtttgctgatacaaggtgagccaaaggggtggtgaaaagaacacaca 

IjVAAAGGATCrriTGTCTAACCCAGAAAGGTTGAGAAATITGCACTGAT 

AGAGGGCTGCTAATCAGAGCTGGGATCATrrTGCAAGACAGCCTrACTnTTCCAAAGTAATTTAT 

ATTTAGTTGTTGATAACCAAAAAAATGTCACAAGAGTAATACCTACTGTAAAAGTGAG^ 

GArrcCCTGGGTCTTGTGACCTAGGTrAAATGAAGTATATAACTACAATAGATriTGTrmGTTrT 

GCrmGGAAGTTCTGTGCAAAAGCTACCATTCACAGGATrAACGAATACCTTTACATATGCTAGT 

riTITAAGTGTCTGTTrGATTTGCAAAArrANTrrANCTACATAAATCCTGTCCCTTAA^ 

TCACAKATGTAAAACTGCAGTTnTATANATTCA 

SEO ID NO: 2229 ACTCA1TAATAATATTAATAGGCGCTTGACCCCACAGGCTGTCAAAATTCGA 
GCAGATATTGAAGTGGCTTGTTATGGTTATGAAGGCATTGATGCTGTAAAAGAAGCCCTAAGAGC 
AGGrrTGAArrGTTCrACAGAAAACATGCCCATTAAGATTAATCTAATAGCTCCTCCTCGGTATGT 
AATGACTACGACAACCCTGGAGAGAACAGAAGGCCmCTGTCCTCAGTCAAGCTATGGCTGTTAT 
CAAAGAGAAGATTGAGGAAAAGAGGGGTGTGTTCAATGTTCAAATGGAGCCCAAAGTGGTCACA 
GATACANATGAGACTGAACrrcCNAGGCATATGGANAGGCTTTGNAAGAGAAAATGCCNAAKrr 
GGATGGAAATGATGATCTNAAGANATGGAACCAAAGCTGAAGATTTACrmGTGGGGAAj|££^ 
CNNATTAANGAACACANANCAlWCTrNOTGGCTGTAATCCTANACTTTAN 
ANAACrTNAAAGCTGAATATTATTOATTTTCTAAGTATTNAAANGrrC^ 
GAAATGCXCTCCTAAATTNCAATTGTGNACACATATNCTTNimWATTrrTNCiri 

GCCTCTTGTCCNN 

SEO ID NO: 2230 ACAGAGATAACAGAGGTAACTAAAATACAATCCTTGTTCITGAGGGGCCAAT 
CTAGTGGGGAAATACATITGTAAGGAGAGATACTCCAATGAAATTTGGTGTCTCrAGAGAAGAAT 
GCACAACCTGCCATGAGAAACCAGAGGAGGAAGCAGCTGATATTCTAATAAAATTAAAGCTGGA 
GACArmAAAAGTAGTATAGCTCAGCTTCCTCrrTTACAGATGAGAGAATCCAGGTCCAGATAAG 
TCATGTGACTTATTCAAGGTCATGGCACATTTGTGGCAGAGCTGGGTATAAAGCTCAGAACTCGAT 
TTCCAGCTTCTTGTITTGTCACACAGAATATGCAACAGGGATTGAAACAGGAAGGAATAAAGATTT 
TGCATCAAACAGGACACAAAGCTGGAATGAGAGTACTGCATCTAGATGCTGGAAGAGAGTCCTGG 
ACTGCANAAATCCAGGGATGGCCTGNTGCANGCAGACOTGTGTCTGCTCTGTAATTGGGAGOT 
ATTCCATTXrATGCTOTGACNGNCCCCTNCCCTTCANAAAGTANGrmCTGTCCAA^^ 

TAGAGGGATTTNTG 

SEO ID NO- 223 1 ACrCCTCAACAGTCACAATCCATCCTAGTATCTTAAATAGTGAi ri i ii 1 1 lAA 
mACAAAAGAGGTTTATTGGACTTACAGTTCCACGTGGCTGGGAAGGCCTCACAATCATGGCGG 
AAGGTGAAAGCCACATCTCACATGGCAGCAGATAAGAGAAAAAAAGGTAGTGATCnTTAGTAAA 
GAAACCrGAGACTGGTAGTGGGCrrGGAGCCAGAGGATCGCTTAAGTCCGGGAGTTCGAGATCAG 
CCAGGATAACAAATTGAGACCCCCCCCAACTTTAAAATTAAAAAACGAAAGAAAAAATAGCTGG 
GTGTGGTGGCTCATACCTGTAATCCTAGP^CTTTGGGAGGCCAAGGCGGGTGGATTGCCTGACTCA 
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GGAGTTTGAGACCAACCTGGGCAACATGGNTGAAACCCTGTCTCTACTAAAACATAAAAA^ 
ACCAGTGTGGTNGCNTGCACCTGTNNNCCCATTACTTGGGAGGCTGANGCACAAAGA^ 

GANG 

SEO ID NO- 2232 ACGCGGGGGTTTTTAAAAGGAGAGCCnTTCIGATGCCACTm 
GGCCGGACAGTGGCTGTGGA 

TGCrATAGGTGAGGAAAAGAGCCATGAATCTAAACATCAGGAATCAGTTAAAAAGAAGGGCAGA 

GAGGAAGAGGATATGGAAGAGGAAGAAAACGATGATGATGACGATGATGATGATGA^ 

GGGrmTGATGATGAAGATGAAGAGGAAGAGAATATATAATCAAAGGTGACCAAGCCTGTGCAN 

ArrCAGAAGAGGTAAGCANCrCATCCTGTTCTGAGAC^fGCTNAGGAAGArrCATGATTGT^^ 

ANAGGAAAAATTCTNCrrrTTNTGCCTGCATCArrGCAGAATAGACATC^ 

AGGCCCTNTTGNCANCCCACCTCNTTNNTTOTGATCTGTGCCATTGCAT^ 

Gan'CrGCAAAA>rrcrrNrrATNNTTCTGNAGGANACATrGACCT^^ 

TNGATATNAACrGGCTCNTNTTTNTAACT 

SEO ID NO- 2233 ACrAAATATTGCTGAGAGCATCCACCCCAGGAAGGACTTTACCTTCCAGGAG 
CTCCAAACTGGCACCACCCCCAGTGCTCACATGGCTGACTTTATCCTCCGTGrrCCAm 
GCAAGTGGCAGTGTCTCCACCACCTATGATGGTGATGCAGCCCCTANAAGTGGCTTTCACCACCTC 
ATCCATGAGAGC-mGGTTCCCCGGGCAAAAGCTTCCCArrCAAATACCCCCACAGGACCATTCCA 
CACNATCTGCTTAGCCCGAGTGACATGCCTCANNATACrmTTGCTGCTrmAGGAC^^ 
GCCCATCCANCCAGCANGTT^GCCANAAGCCACAGTGGCTTGGCAGTlmGGCATT^r^CAT^ 
CTTrGTCANCAGTGGCAAAA>rrCAANAGGCAANGGTATNTCITNACNCCT^^ 
NNCAITAGGTATTmTNlACAArrTGGNNTCCnTnAATAT/^^ 

N^^GAA^n^^CmAANGNAAGGTAAAAACCrrCCACCCACNATNATTCATCACAnGA^ 

ANCATATT^m>^GATAANCTGGGAT^^TGT^r^GAANTr^TANAT^^ 

TTTATATG 

SEO ID NO- 2234 ACGAGTCCCACTATGCGCTGCCCCTGGGCCGCAAGAAGGGAGCCAAGCTGAC 
TCCTGAGGAAGAAGAGATmAAACAAAAAACGATCTAAAAAAArrCAGAAGAAATA 

aggaaaaagaatgccaaaatcagcagtctcctggaggagcagttccagcagtgcaagot 

GTGCATCGCTTCAAGGCCGGGACAGTGTGGCCGAGCAGATGGCTATGTGCTAGAGGGCAAAGAGT 
TGGAGTTCTATCTTAGGAAAATCAAGGCCCGCAAAGGCAAATAAATCCTTGTTTTGTOT^ 
TGTAAlNAAGGTCTTrArrGTTrrGrrCCCCCCTmANTA>rrriT^ 
>n'ATrATTTNCTTmAGTAATATATAAAGNTCATGTGTN>rm 

SEO ID NO: 2235 ACCTGCACGTCTCATCGTmCTGCCGAAGCAAACACTCTACGAAATCATCA^ 
ATTCTATCrrGCACTCTlTCTCTGCCCGAGTATAACCGATTCCATGCGCACATrCrATCCA'^ 
TCAAAAGCATGGCATCGACCAGCCATCTTGTAGGGCTGTTCACCACTCTGGAT^^^ 
TCTATGrrAAGGCCGAACCnTITCTGGATGTCCAAOAAAGGCATGGCCGATGCTCANTGCCCTTG 
CTT^rnrrrGGCCGCCXNTrCAGAANAGACTNGCTAGCGACNGCrrGQACCGTCT^ 
CCCCG>rrNTACCTGCCCNGGGCAGGNCGCTNTAAANGGCNAATTNCAAACAC 

SEO ID NO- 2236 ACrrATACCCCCTAAATATATAAAACATTTTTAAAAGAAAAAAAGGAAGAAA 
CTATTCATACATGCAACAACTTGGATGGATITCAAGGGAATTATGCTGAATGAAAA^ 
CTTGTAAGATTACArrCTGTATGArrCCATTCATACAACATTCTTGAAATGACAAAATTACAGAGA 
TGGAGGACAGAACAGTGGTAGCCACAGGTTGGGGTOAGGGTATAAGAAAGGGATGTGGCTGC^^^^ 
CTGTAAAAGGGCAGTGCAAGGGATCCATGTGACAGAACTGTTCTGTCTCTTGTGATGGTGG^^^^ 
TGAATCTACACATGTGATAATArrGCATAGAATTAAATACACATACACGAAAAAAGrrCAAGCAG 
TTGAGCACAAATATTTTAATrGTCTAAAATGACATrrTCTTTAAGAG™^ 
ACrmATGAGGTGTCACATCCATCACCAT^ 

ANCTTGTAAACATTmACTAAGGGTAANANAAAGTTAAGGGTGTTTCC 

SEO ID NO: 2237 ACTAGGACAGTCAGTAArrAATGCATCATTCAGAGGATTATGGCTG^ 

AGAAGTGCAAGrrCAAACCrGTCAACACCAGAGGTAATCATTrrATATTAATTTATACGTAA^^^ 
ATTTAAAATCmATCTGAGTATAACATATGAAAACAGTCnTrCCACAAC^^^ 
TTTAAAAAATAAGGAGTCATTrmAAAGTAACTGATCAGATTCCACAGGCT^^ 
TCrrGCrGGATAGAATCCCTrCATTTGGTGGCrrTTrGCATGCAOT 

GTGTTGrrCTAAGAGCrCACCAAAACATAGATCATGCCATATAGGTAGTAAAAAAATGCTAGA^^ 

TGTAGGGTCATAGAGTCCTGGCCCTCATCACTGGTCTACTCATATACCTCCAAATATGAT^^^^ 
AGAGGGGCATATTGAGACCCAGTGTAAGCCACTCTGCTGCACAAAGAAACCNTGACACAGAAAG 



334 



wo 02/29086 



PCT/USOl/30732 



AAAGCGTGGATGANGTACCTGC 

SEQ ID NO: 223 8 AC iTrn - i - i " n - i - i - n rn ri i rn 1 1 1 1 m 1 1 rAATccACACCTGcccmTATTG 

GTCTNr^^^'ANCAAAGNGGCTCCAGGCCCr^^CAOTCCmNA^ 
AGGTGCCATCA^^NTGTGAAGGCCCAAANCTTACCCAANT^^^TGGANCCCAAGTO 
CCAAAGGGTTGGGANAGGAAAAGGAAACAGGCANAGGGGAAAGGCAAGGCTCTNAANTNAAGG 
GGACTGATNTNAAGGGAATGCTGAGGTCCAGCAGTGT 

SEQ ID NO: 2239 ACCTGCTATTITrGGTAATGATCTrCAGGCAGATTTGGCCCCATGGAGTTTTT 
GTCAGTCATGAGGGTTTGAAAGGATGGCGTATCTTTCCAGCTGAAAAAATCCTGGCAAACT ACAA 
GCTGTCCACAAGAAAGCAAAGCAACATGTTTGAAGGGGTATGGAGTGCTAACTCTOTOCCACTTT 
TGCAAGCAATmGCTCAGTCTGTCATTGTArrAGCCTTAACAAAACTCCrrGTAATTATAGACG^ 
TTTArrCAAAATAAGATCTTACTATTATACAGGTTTCTCTCnTrTGACTATATAC^ 
AGTCAACCTTCTCCCrAGACGTTCCAACATGCTAAATTGCAGC^TAATCAACCCTCAAGAGm 
ACACACGCTATAATTTTCATrACGTAAACmGAGTATTTGGArrATCAACAAA^ 
TATTTATTGATTATGCAGCTTATTTATAACAAGGCCTGATACrCAGGGAm 
ACAATATTTTAACATTTTGAAATGGATACAGTAANAAAATAAATTTTArrCTAATATCTAGGATCT 
AAATCTTCTTATNArrACCTAATTACAAACAAAATAAATTATCAAAATNTTACrC^ 
G 

SEQ ID NO: 2240 ACrrrGACCCTGGAAAGGTATGGGTCTGCTTAAAAGAAAGAAGAAACATACA 
CGTAATCACAATAAAGCTTAACATrATGCAGGGCTTATAATCATTTTCAGCAACGGACTGCAAGCT 
GCACTGTGAAGAAAATGCATAGCAGAGGAGAAAGCTGGGGATCTGAGGAAATAGGTAAGGAAAA 
CAGTGTCAACACACAGTGGAAGAAGTGATGAAGACATCTATTCCGGAGCTCACGTGCCATGCCCT 
GCTAGCCGTTCCTTAACAAGCCACCTGCTCCAGAAGGCCACAGCCTGACCCTCCCAAGTGGAATA 
TAAATGCCCAAGTGCCACATGAAGCCACCTCCTCCACTAGCTAAAAAG CTGTC TGGGAACTGAGC 
TACANAACACACACACrrrCTGGTCTAACAAACATTAAAGTGAAAGAArmTCTTATATATCTAT 
TTTTTAATACAACTTAAACGCAACTTTrATATGAATrrGGGCTTCTATTCAGNCCCT^ 
CCrrAGGANGAACTCAATATTGGAANCNANNAAAAACAAAGATTCTAACCCAA AATG ACTTGCCT 
TTTAACCTTATTATCATTTITGNNGGACANNAGTmT>JATNT^ 
NTNTTAT 

SEQ ID NO: 224 1 ACCTCACCCATATGCTGAAGATCTTTGGGGCCGTAGAAGAGGACAGCTCCCT 
GGGATTCCCGGTCGGAGGGCCTGGAACCAGCCTCAGTCTCGAGGCTACAGTCATGCCCTACCTTC 
AGGTOTTATCAGAATTCCGAGAAGGAGTGCGGAAGATTGCCCGAGAGCAAAAAGTCCCTGAGATr 
CTGCAGCTCAGCGATGCCCTGCGGGACAACATCCTGCCTGAGCTTGGGGTGCGGTTTGAAGACCA 
CGAAGGACTGCCCACAGTGGTGAAACTGGTAGACAGAAACACCTTATTAAAAGAGAGAGAAGAA 
AAGAGACGGGTTGAAGAGGAGAAGAGGAAGAAGAAAGAGOAGGCGGCCGGAGGAAACAGGAAC 
AAGAAGCAGCAAAGCTGGCCAAGATGAAGATTCCCCCAGTGAGATGTTCTTGTCAGAAACCGACA 
AATACTCCAAGTmGATTAAAATGTAAGCATGGTCTGCCCACACATGACATGGAGGGCAAAGAG 
CTCAGCNAAGGGCAAGCCAAGAAGCTGAANAAGCXmCGAGGCTNAGGAGAA^^ 
ATATNTGCANATGGCCAAAATGGAANCTTCCAm'GAGGGGGCNCAGGACTGACi i i 1 1 1 AAACAT 
TGGGGACTATTGCCTT 

SEQ ID NO: 2242 ACGCGGGGGGAGTCGTTGCrGrTGCTGTTTGTGAGCCTGTGGCGCGGCTTCTG 
TGGGCCGGAACCTTAAAGATAGCCGCAATGGCTGAAAATGGTGATAATGAAAAGATGGCTGCCCT 
GGAGGCCAAAATCTGTCATCAAATTGANTATTATTTNGGCNACTrCAATTTGCCAa^GN^ 
TCrAAAGGAACAGATAAAACTGGATGAAGGCTGGGTACCTCGNCCNGCTACCAC 

SEQ ID NO: 2243 AC n - J ' i - i r ri TrJ -n'l' ni - J 'l 1 1 J I NGGTrrCGTTGTTTTCANAGGCrnTGAAC 
CrrGATCCTCACTGTTATTCTCmAGGACTGTTACirCCrrGrrCATCATCATCACrGAAATTCA^ 
TGCTCTGTATAATCTACTTCCATCrGAGCACCTGCCCAACCTTCATCAGCTTCAGCATCTAGGTTAT 
CAAATTTATCAAGCTCCTTCAGTrCTGATGCACTAAAAATGGATGGGCGTTCAGGCTCAAAGGCCC 
ATGAAGGAGGTGGGCCTCTTCCTCNAAGGCCTTTGTn'GTTTCAAATAAANAAGGTGGGAATCrc 
TGGGACCATGTANAGGAGGATATGTCATCCTCGGATACTGTTGGAACATATAAGGAGGCATCATA 
GCTCTATACTGGGAAGCGAGAGCAGCCTGCTGTCCATTCAGTTrAGCCTGTGGAGGACCACAAQC 
TATCCTCTTrrCCACCACTTTGAGGATATCATTTTGCTNTGATGrrCCAACTGNGCrrCATO 
AGGGAGCTTTTrCATCTTGATCAAATGACNAAAGGNGAGCCATCANCCTTACCACATT^^ 
AAGCAAACATTTGGNGGACGTAACTGGTCCAGGTCCATAN^^^G^ns^^^^Am 

SEQ ID NO: 2244 acacaaagagggggtgggtgtcggatgcagagtgtgtggcctgatgctccac 

GGCGTGCAGGACGGGGGGCTAATAGTAGGTTTCCTTCTCCACCCAGCTGCCAGGGCGTCGCCTGA 
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TGATGAGTmCTGACrrCGTCATATACGAAGATGAGAANAGAGTAGGGGAAGGCACANAACCA^ 
CAGGTAGGTTTGAGGGGATACATCCTAANACAACACCCATTCCANGGCAGTANGAAAGGAAAGC 

SEO ID NO* 2245 ACGCGGGGAAACAATGAATCAGAAAATTTCAGTICTAGTrCACCATTGACIT 
TACCAGCAGATrrGAAGAACATCrrGGAGAAACAGTmCTAAATCrrCCAGAGCTG^^ 
GArmGCTAATATAATGAAAATGCTGAGAAGCTTAATTCAAGATGGCTATATGGCCTTATTGGAG 
CAGCGTTGCCGCAGCGCTGCACAGGCCTTTACAGAGrrGCTGAACGGTTTAGATCCTCA^^ 
AAAGCAArrGAACCTGGCCATGATTAACTATGTmGGTCGTCTATGGACTTGCCAm 
GGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACCAGTITAAGAGGArrATTGAACACTA 
CCCCAGTGAGGGCCTTGArrGaTGGCCTACTGTGGAATTGGAAAAGTATATTTGAAA^^ 
GATrrCTAGAAGCTCTCAATCACTTTGAGAAAGCAAGAACCJITGAmATCGTOT^ 
TAACTrGGCCCACGAGTAATGTGArrATTGAAAATCTCANCCCAAAAAATAAAGATGC^^ 
AAAATTTGrrGAAGAATGCAAGTCCCTCCANTGCCAGATGCCATITNTTGCT^ 
TGGATTTTCT 

SEO ID NO- 2246 aCTQGGTCCTTCCCAAAGGGAGAGAGTCTCCTGCTGGCTCTGAAGAAGTGAA 
CTGCCrrGTTGAGAGAGGGCCTGTGAGGGGCCATGTCACAAGCATCCAAGGGTCATGCCTAGAA^ 
CXAGAGCAGCCCrCAGCTGACAGCCAGCAAGAAAACGGGCCTTCANACCTATGGTTACM 
TGAAAGCCACTAACAGCCATCTGAGCTCGGAAGAGGACCCCAAGCCTGGCCAACCTOT^ 
GTCTTGTGAGACTCTGAACAGTGGACrrCTCCCCAGACTCCTGCTCCGTGGAAACT^^ 
AAATGAATGTTGTGTTAAGGTGCCAAGTTTGTGGTAATTGGrrGTGCAGCAGTANATAACT^ 
ACTCACAAAAGCCTTTTTTCCTGCATCAAAGCTGGATCTCTGATGGCTTGCT^ 
ATGGAGCAATANCAGNGCAGGGGGCATTCAGGCTCAATTGGCCCATGTTTATCACTGGGCTTGTG 

GTCACATGTA1TAAATTCCTGNGGAGT 

SEO ID NO: 2247 ACACTGTATACATCTTGTCATGATGGTCTTTACCAATGGCCXrAATGTTOT 
CTTCCACAGCACGCTTCCCCrCTAAAAATCGGCTCCTATCATTTCCAAACAT^^ 
GCAGATCACArrCACCTCCCTGGTCACAAATAGGACAGTCCAATGGGTGATTTGCTAATAAGAACT 

CCATCACACCTTCCCTGGCirnTrGGATTTTTCTGAGTT^ 

GGCATGGCACAAGCAGCTACAACCrrAGGGGCTrrCTCAATTrcAACAAGGCACATCCT^^ 

CCAGCAACAGACAACCTTTCATGATAACAGAATCGAGGGATCTGCATGCCAACCTTCTCACAA^ 

TrGGAGGACGGTCOTrCCCGGTTCrACCATGACAGACTGACCATCAACAAATACTrCAATCAAGT^ 

GC^TGCTTGCTGTGNGCA^^^^GT^CX}AACACATCCTTANGAAACTTAGAAAGACCT^ 

TCrrACAGGTATCCmAACATATTTGCron-CCCCGGACCANGNANGCTG™ 

GNACANNAANACCCCTTAGAAGGCCGG 

SEO ID NO: 2248 ACTrGATGATAACGGTmAAAATCCTrCACTCGTTCmCTCAAATOT 
OTCrmCGAATCGrnTAGATATCTGTTCAAAATCTC^ 
TCTCTTATITCArriTrAGCTrGCTCTATTTrATCTG^^ 
rrCACGrmTTGAGCAAAGTAAmGAGCATCTTCCCATTTCTGCCAGC^ 
AACACACCrXTCACTGCAGCAATAAGACGAATGTAGTCACTAAGTAGrrCTGAAAACA^ 
GTCAGCAAAAGCrrGrrcrrGATGTAACrGGTCTATCTTCrCCTCAACCTCTG^^ 
AGCTCrAGATAAAGCAGTATGATCCTCAGAATTACCTAACATGGCAGCAC^ 
CrGTGTOGCTGAAACnTCTmCTATGACAGACCAAGGCTTCAACACrGACATG^^ 
GTrGCTGATCCAGATmCAAATTGCTGCTGCnTrrCTTCAAACCATGCATCra^ 
GGATTGCCATTTrGTTGACAGCGThn^GGCAGCCrrGTTCACCATCCTCAATAm 

SEO ID NO* 2249 A Cll - l l l 1 1 1 1 1 1 i U 1 1 1 U 1 1 i TmGGANANACACTTCTTTTATrrAGGAAG 
GAGGTCriTCCTACrrGGCNCCGGCTCCrcCCAAAGGTCGGGGTGACAGCAGGATCAAA^ 
C^CTCACACACAGTGGGAGTGGAGGATGGCrr^CCCGCANCCT^^T^AAAAAGCCCAGN^^ 
riTATTTCAAAAAAGGGCATTTCTATACAAAACATCATTCCATTACAGTCGTCACTr 
GGCTCAGGGANACGACTCTTGATATTCATGGGTATATArrrACTITAATCATATI^^ 
CTCTTCCACACTAANATrrACAATAAAAATAAAATAATTTGTTCTGTAAACTATAGTAA^ 
CAAATGTCrrAAAAACTATOTCCATACAATGGAAAAAGGCnTITGTGTTCT^ 
ACCCAAATCNCTGANANAAAACCCGTGGGCCGTGTTGCTGGAATGATGCTGGAANCACn^^ 
CTACANACGGTTGCTAAAAAAACAAATCACTTTCTCCCCTTCACCCTGNAAG^ 
OTAAAAATCTACCTOGGGATCITCCAANAAGCWAAGGNTGCTC^ 

NCA 

SEO ID NO: 2250 AC ITn - i iTriTnTl 1 1 1 1 1 1 1 1 i l U IN ACANAAGGGAGGGANAmAATO^ 
GTCCATANAAAGACATTAAATGTCANACTGACAAmANTTATGGTTA^^ 
CCATrrGGAGTAAIGGAArn-GAAGTrACTANAAAACGACAACAGATTTrr 
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AGQCTTATCTATAACACGTCAACGTmCNACNACTOTGAAATn-ATAAATAGCTGGGG^^ 

comNTrmrrcACC^ 

cSA^AAXcA^TTVciTCTCrCGTO^ 

my^LCTATTCGCTTAAGTCCAGTCTGArrGGTAACTCCrrrCACTT^ 

ATCCTGGGOrri^ 

AASS/lCTTOCTTnTGGGN^^ 

ANTCCACCANAAGGNCTTNGAnTANCTT 

SEO ID NO- 225 1 ACAAGATGTGTTACTATCGCnTrGGACAGGTrr ACACAGAAGCCAAGTOTO 
T^AGGCHTTXiACCG^^ 
GGAAGCATATACCA^^ 

gg^^^S ^ATAAATGTCACGTCCAG CTCrGAiy.TC^^ 

GACGITGAAGA nTlTn 1 i 1 1 1 1 1 1 1 1 1 1 1 1 AATATGCAGTTTOTAANAACAAAACn^GATG<BC>a^ 
^^^S^nCTCTGGAAGm 

^^^SGGTTCCAAATCAAATGTCATOACmA^^ 

aaaaaaaaaaaaaaaaaaaaaaaaa 

^gtcottcaS^t^^^ 
ggaaacaaa 

SEO ID NO- 2253 ACTCGTCCAGGAGTTATCCAGGATAGATmCACCCACCATGGGACGTCATC 

GrrcAAXTCX^crc^ 

y^A^STS^GTWCAAAGCrrCAGCCCCAGATAACTAT^^ 
rA^^/^ACACXACCICT 

ggSS^ct^aISCgc^^ 

GAAACTOTTOTOACTO^TATCTAAATAGTGGAAAA 

ATTOAANCCOATGAAAAATAAAGAAAAAANCATGTTCTTTCGATCA 

SEO ID NO- 2254 ACGCGGOTAmTTCATCCAGCATATGGGGACCAACATGTGATGGCCTC^^ 
CTCATOTTGAGCGCTG^^^ 

SS-S^^^Tg^^SaTcc^^^ 
?GSCTlS^I^Tl?^™ATCGATrrTnATrcAcrOT 

TGCCCCrr^AAC^G^m^AAaVAGCA■ITTGTA^lCTTTGT 
m NO- 2255 ACGCGGGGCCTTTmCTmrrcCGGCCTTCAAGATGTCXlAAGCGAGGATO^ 

^G^GGTCorcreG-^G^^^ 

GSoSSGGyScCAAAAACCTGTATA-rcATCTCCGTGAAGGGGATC^^^ 
TCAGAAAAAAGGT 

SFO m NO- 2256 ACCAAGAGGCCAGTGTGCTCTGTGGACCrCAAAOTTCACCTGACTTCCTO^ 
AA^(^ATTTCC>^AACmCCAGATAT^^ 

om^^S^SoNA^?^^^ 

TANGAANTANCrTANTA>rrNNATANNTGTNTT 
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SEQ ID NO: 2257 ACGCGGGGACCTAGTGTCTGAGCGGCACAGACGAGATCTCGATCGAAGGCX5 
AGATGGCGGACGTGCTAGATCTTCACGAGGCTGGGGGCGAAGATTrCGCCATGGATGAGGATGGG 
GACGAGAGCATTCACAAACTGAAAGAAAAAGCGAAGAAACGGAAGGGTCGCGGCTTTGGCTCCG 
AAGAGGGGTCCCGAGCGCGGATGCNTGAGGATTATGACAGCGTGGAGCAGGATGGCGATGAACC 
CGGACCACAAOGCTCTGTTGAAGGCTGGATTCTCTTTGTAACTGGAGTCCATGAGGA^ 
AANANNACATACACGACAAATTCGCANAATATGGGGAAATTAAAAACATTCATCm^ 
CANGCGAACAGGATATCTGAAGGGGTOTACTCTAGTTNAATNTGAAACATACATTGAACCCCANG 
CTGCTATGGAGGGACTCNATNGGCCOTGNAmANTGGGNCACCCCATNANCNTTGAOT 
TTTTTNNGGGTNCAC 

SEQ ID NO: 2258 ACATGTTGAAAGCTTTAAATAAGGATCCrrGGAT ACAA AATATGGTCCACAT 
GGCTGAAAATTAArrCTATGACACAATATTATTCTAAAGTCATCCrTATTTTG^^ 
TrrCTTTTATGGCrATAAAAATACCACCAAGCrAGACATITrAATTCTGTTGAGAT^^ 
AATTACCACAGCAGCCTACACCTTCTATGATTTCTAAACACTTAAACATNAATGGT rTTGG GAAG 
TGCTGTTGTCCTTCAAAGTTCATTAAAAGTTCANATGCCTTTTGTNGAAANCra 
CCTGCTAAATTGGCTTNTAAAAAGTCA 

SEQ ID NO: 2259 ACGCGGGCTGAACGTGAGAAATTGACCCAGCAGATGATCAAGTATCAGAAA 
GAACTGAATGAAATGCAGGCACAAATAGCTGAAGAGAGCCAGATTCXjAATrGAACTGCAGATGA 
CATTGGACAGTAAAGACAGTGACATTGANCAGCTGCGGTCACAACrCCAAGCCTTGCATArrGGT 
CTGGATAGITCCAGTATAGGCAGTGGACCAGGGGATGCTGAGGCANATGATGGGTTTCCANAATC 
AANATTAGAAGGATGGCTTTCArrGCCrGTACCATTTGAATTTCACAGGCOTC^^ 
ATAGATrANCCACATAATANCAACTGTAGTGTAGAGTATTAACTACTTAACAAAAGAGGATTGAG 
GGTGGAAGTrrAACTGTTATTrCAATGTCCATTGNAATTGAGGATAATGGTTATTTGAAATrTA^^ 
AGAATTTAATAAGAAAAAGATTTCTTTACTCATGTGTATCACTAGCA^ 
AATTITATGTGGTNGGGATATTCAAAAAATGTTCCGTNTATAGAATTTTT/^^ 
GGNTTGGCGTGGTGGCTCATGCCTGTANTCCCANCACTTTGGGAGGCCGATCACNATCCGGGGGT 

GGATCCANGGTC 

SEQ ID NO: 2260 ACTTTnTTTT CTrrri - l - i ' i ri - i ' l ' l ' l ' l ITi ' i l AAAATTTATCGGTTCCX}ACnTAAA 
ACCATCAAGTCTGGTCANAATCAACTCAGTCTAGCTGATGCAAAATCATATGCATTCAAAAAGCA 
GTCTITACCGANATGCCTrTACAAACCTTGGAATCCAGCACCTO^ 
GCAGGGAAGTGAACTAATAArmCATITACCACATCITGGTGTCrTTGAAAAAAT^ 
ACAAACCrGTrrnGTCTCTCCTTATGCTCCAmCCTTCAAGTAGACTANAATGCTN CAGG 
TCTCATTTATGCCACCATGCAOTCTCAATCCArmAACrrGTSrrCrANATACCTA^ 
ACTACACANACCTCANACANANTGNAATCTrC>mANCATGTGTISOTC^ 
CNACTTCCACCTGGNCCTGGOTT 

SEQ ID NO: 2261 ACAGAAATTAAAAATCAGGAAAAAATAAGAAAAAAAGCATTACAGTAAGAT 
ATTTTGAATrAAGAAACAAGGTGTAAACTGTAGGAAAATATACAAATAAACACAACTGAAATAA^ 
CATGGTATAAAGAGAAACTTTCCATTAAAAAGCACATCTCTATCTGGAAATACAAAGTOT 
AGAATGTTAATAAAAATCTAAAAAATAAAAAGAGAGAAAGGGTAAACAGATGTATACACCGTAT 
CTATAATTAAAArrCAGCAACATGCTTTAGAGTATGCAATATATAAAAATGGTCAGGTGTGGTGGC 
TCATGCCTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGGATGACCTGAGGTCAGGAGTTNG 
ANACCANCCTGATCAAAATGGTGANACTCCGTCTCTCTA 

SEQ ID NO: 2262 ACGCGGGGGGTGTGTTACCTGCCCACAGCATAATGCGAGGCAATGTCCAGCC 
GTTCCACCCGGCATACAAGCTTATGGAGCAGCCCCCTTTGAAGATCTCCAGGTGGACTTCACAGA 
GATGTCAAAGTGTAGAGGTGATCGAGTGTGGATCAAGAACTGGAACGTAGCCTCnTGTGTCCAC 
TGTGGAAAGGACCCCAGACTGTCG1TCTGAGCCCTCCCACCGCTGTGAAGGTAGAAQGAATCCCA 
GCCTGGATCCACCACAGCCATGTAAAACXTGCAGCGCGTGAAACCTGGGAGGCAAGACCAAGCCC 
AGACAACCCmCAGAGTGACCCTGAAGAAGACGACAAGCOCTGCTCCAGT^ 
GACTGGTCCACGCACGGCCGAAGCCTGAGGAAGCTCATCGTGAGATTCA'inU-riCrrAAATTm 
GACTrATACAAGTAAAGGGCTrCAACTGATCTTACrCAAACTGGGGGACTGTTCCCAGT^ 
TCANGTCACCXJAAGTANGACAGCAAATTAAAAACAATCirrCTGOTCTATAGTrA™ 
GTGGAAACAATAAAAGAAC^^GT^m}TrmGGANANANAANTAN^^ 
ATAnSfTTNT 

SEQ ID NO: 2263 ACATCCTCCCAAGTCTGGAATACAGAATTGATGGAGGACACTTAACTTGCrr 
AAAATGTATrrGATTATTCTGCATTTATGATAAAAAATATCATCCAGGGArrATATO 
AAATTTAGGATTACATGTTTCTAGAACATATAATATGTAACACCATCCAAAAACAACAACAACAT 
ANAGCACTGGAACCAAAGAACCACTTAAAAmAGAATAAATTAGGAAATTTCAATCTATAAGTG 
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TCAAACAACAAATGAGTTATAATATTTTTCTAATAAGAAAAATATCACCTGN 

CTACATCCrrGATCTGGCTGGCCACCATTTTGAAGACCACCACANATCTCAAGGCATGATACT^ 

CACCAACAAAATCTATCCCTGCTATTGCACCTANTGTNATCrCAATNTGTGGCTGACACCA^ 

TCTNACrrTA>rrCTGATAGATGCTCTrATTGCATTrAAmGGGCTAGl^ 

AGC 

SEO ID NO- 2264 ACAAGATCTACCCCGGACACGGGAGGCGCTACGCCAGGACCGACGGGAAGG 
TTITCCAGTTTOTAATGCGAAATGCGAGTCGGCTrTCCriTCCAAGAG^^ 

actggactgtcctctacagaaggaagcacaaaaagggacagtcggaagaaattcaaaagaaaag 

AACCCGCCGAGCAGTCAAArrCCAGAGGGCCATTACrGGTGCATCTCTTGCTGATATAATGGCCAA 

GAGGAATCAGAAACCTGAAGTrAGAAAGGCTCAACGAGAACAAGCTATCAGGGCTGCTAAGGAA 

GCAAAAAAGGCTAAGCAAGCATCTAAAAAGACTGCAATGGCTGCTGCTAAGGCACCTACAAAGG 

CAGCACCTAAGCAAAAGATTGTGAAGCCTGTGAAAGTTTCAGCTCCCCGAGTTGGTGGAAAACGC 

TAAACTGGCAGATTANATTTTTAAATAAAGATTGGATTATAACTCTAAAAAAAAAANA^^ 

SEO K) NO' 2265 ACnTGGCTTGGAGACTGGCGCGGCGTTCGTGTCCGAGGTCACTANTTTCCCGG 
TAGTrCAGCTGCACATGAATAGAACAGCAATGAGAGCCANTCANAAGGACTITGAAAATO 
AATCAAGTGAAACTCirGAAAAAGGATCCAGGAAAO^AAGTGAAGCTAAAACTCTACGCGCTA^^ 
TAANCAOGCCACTGAATGACCTTGTAACATGCCCAAACCAGNTGTATTmACTTGATCAACAAGG 
CAAATGGGACGCV^TGGAATGCCCTTGGCAmCTGCCCAAGGAAGNTACCAGGCAAAACrrrG 
A^^IT^IWTACATATNGAGTCr^ATNTTNAATCmATAATA^^r^^ 
TTAAATGTATTNGGTNAKCTAAATNTrmANANATArrCNAAT^^ 

A 

SEO ID NO: 2266 CGCNGGGGANACATCACCGNCAACCTGGGCA'mGGGGANATGGCCGANACT 
GACCCCAAGACCGTGCAOGACCTCACCTGGGTGGTGCANACACTCCTGCNNCANATGCANTATAA 
ATTTCACACCATGTCTGACCAGATCATTGGGAGAATTGATGATATGAGTATGTCNCATTGATGATC 
TGGAAAAGAATATCCNCGGACCTCATGACACAGGCTGGGGTGGAAGAACTGNAAAGTGA>^ 
AGATACCTGCCACGCAAAANAGTTGAAGGTTTGCTAATAATTTATACTGNAATCTGGCATrm 
AAGCCATAANATATCNAATGGCTTTTTTTGCAGCTAACTACTATrGTGTAAACAGGTm 
AAAAGTTGTGCANTCrrATCACCrANTATATAGTTGGGTITNGANANNGATNTOC^^ 
TTGAACATGGT^^NTTCACAT^^TGGACCTTGGTNATNTGNNCT^ 
CrrrGGNNGTTCTTGCATAACATNNGNCAATTrmANNGNAT^^ 

G 

SEO ID NO- 2267 ACTAAATATrGCTGAGAGCATCCACCCCAGGAAGGACrTTACCTTCCAGGAG 
CTCCAAACTGGCACCACCCCCAGTGCTCACATGGCTGACTTTATCCrCCGTGTTCCATT^ 
GCAAGTGGCAGTGTCTCCACCACCTATGATGGTGATGCAGCCCCTAGAAGTGGCrrTCAC^^ 
ATCCATGAGAGCrrrGGTrCCCCGGGCAAAAGCTTCCCATTCAAATACCCCCACAGGACCATTCCA 
CACAATCTGCTTAGCCCGAGTGACAGCCTCAGCATACTTCTTGCTGCTTTCAGGACCACAGTCC^ 
GCCCATCCAGCCAGCAGGTATGCCAGAAGCCACAGTGGCTTGGCCAGTCTTGGCATTCTCATCAA 
ACTTGTCAGCAGTGACAAAGTCAACAGGCAAGGTAATCTTCACACCATTCTTCTCAGCTTTGGACA 
rrAGGTCrrrGACAATCTTGGCTCCCTCTTCATCAAACAGAGAAGTGCCAATCTCCATGTTG™ 
GCACCITAAGGGAAGGTAAAAAGCCATrCCACCACCAATAATCATCrrATTrGACm 
TATTArrGATGAGCTGGATCrTGTCTTGCAACTTTAGCTrCCGCACAGGATGGCCAGGAAGGGTCN 

CTCTGGG 

SEO ID NO- 2268 ACATATTACATACTCACAAACGTTCnTGAAATGTCAGACTCCTAACCGTATC 
O^ACAAAGCCCCrGCTCAATAGGGCCCAGCCTGTCTCTrCAACCTCCTCAGATGCCTTTCCC^^ 
ACCACGGACCAGCCACACTTCCTCAAATGTGCCACGCTCArrCCCACCTCAGGGCCm 
CTCTCTCCCCrrGGAGCATTCCTrCCCCAGTTCTCTGAATrGGAGGCAGCrrCrrAGCTGTTGGG 
TCAGCrCAAACGCCAACCCCTCCATGTGGAAATAACCTGTATCTTGAGCTGAGTGCTGGCTACATG 
GCATATGCCAANTGGCriTCAACAGTGGCGATTTCGCCTCCCAGAA GATA TATGGCAATGTCTGGG 
GACATTirrGATGGTCACAACTGTGTGGGGAGGAATGrrTACTGACArnTNGTGGGTAGAGACCAN 
TNTTGCTGCrrAACATCCTATAATACACAAGACNGGGCCGAGCGCGATNGGCTCACGCCTNGAA^ 
TCCCAACACrmGGGGANGCTGAGGCGGGCAAATTACAAGGTNATGGAATTCTANACCNCCCTG 
CCCTANATNGTTGATACCNCTGTNTTCTAATAANAAATACANAAATTTTGCCATGGOT 

NNCACNTNAT 

SEO ID NO: 2269 ACATGAATTAGAAGCGTGCATCTAGGATTATGGCCAAACTGTTTTAAAAATG 
CAGAAATGTAAAATTACATCTTGAAAATATGAAGAGATGGTCTACACACTTCAAAAA^ 
TGCTTATACCAGAGATCTATGACAATCACGGGATTCAAGTGACAAGCAGTAAGATCrCAAAAATT 
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AATACrGGTCAAAGATAATCGGAATATrrrrrCK^ATTTCACrGAAAATACAT^^ 

CGAAATCTAGCAGGAACTCAGGGAAAAAAATTCAAAATCTAAAGCCAATTACrrAATAmOT 

rrACCTAAACAACAGCATGACATTAACAGAAAACTGCACCTGCATTTCAATTGCCAATCTCACG^ 

AATNAAGrrCTCCAAAAATGAAATGCAACCAAAACGGGTTCAAACTCCAAATGAAAGTCTGC 

GCTCANATTAAAACATGGAATGTrrCAGATGAAAGTAATAAAAACACTATTGGTT TTGT^ 

TGGATGGAATAAAATGrrGACATTTCTTITGAAACrrGGGAACCTACTAA^ 

ATGGGGTCTCATTTITTTCTTAAGNGGGGCAAGTTGNCTTAAGTAGCCTrAGGAA^ 

NCAAANTNCCTGC 

SEQ ID NO: 2270 Ac nTiUTi rnTi ' i 11 1 vi 1 nrirvi ri i n 1 1 1 1 1 i aagttaaaaaaagctga 

ATTTATITAACTTATTGGATATGTTTATGTATACTAGGNAGNGACTTAAArrTCTm 

NGClTCTGTTTNGGGGCNCTTrGATTGAANAAAAATTCAATTCACAGGAT^ 

ArrCX:TTANCATTGGNCCAAACTTGAATTCCCTTTTTGCACATACTGATCTGCATO^ 

TATGAGGGCCTCCTTrAAGTCTNGGTTCrmGrrCTTTGAANCGTTCAATATC^ 

™ANAAATTCTNTGCCTrCCAAATrmAAACTTTAGCN^^ 

CTAGCACCTTrATTCTGGCTTC™TCTGCT^^^GGAGmCTTGACCAAAJ^ 

CCTNNAAAANAGAATGTNTCACAGTCCCAGTTACCAGTTCCTNACACTGCTGOT 

ANNCCTGACAGCCATCTCCAAGTTATACTGCATAAGrmATGTTTCCTGCACACACCCC^^ 

TCTGGCNAAAAAAAGATACNCrmAACNGNACCTGCATAAAGTTCTTC 

SEQ ID NO: 2271 ACTGTTAAATTATTGCTAGCCATATCTTTAAAAATGGTTITCAGGAATATTrC 
TACCAACCATCTTTCrGACTAAACCAAGGTCATGTCACGGAGGTGCCAGAAATCCTGCTGCATCAT 
GAAGGGGGTCTATGGCAmGGCATTAGAAGTATGAGATmTGCAAATTGTAGTGGAAAAGAAA 
CAAAATTGCAAATATAIGGAaCCAGGAGrrrGTCTATAGAGAATTCTACCGAATOTACAGOT 
ATAGAAAGGATAGAGGTTITCXATAAAmGGCAAGAGATTAAATGTTTACATGA AATTA CCAGT 
AATAGGnGTGAAACTGAAAGGAACATTTCTAATCCATAAACAATACAAGACAAATm 
CCATACTAAAGGAAAGGCCAArmrmCCTCTTTGTAGACAAAGTTATGA^ 
GAAGAAGCAATCAAGAGTGTGCANCCAAAACATACAGGCAGACAGGGCrCCATGGCTCACAGCT 
GTAATCCCAGCACTTTGGGAGGCCAAOGTGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCACC 
OTGGCCAACATGACAAAACCXNTGTTTCTACTAAAAATACCAAAAArmGCCCGAGTGTGTO 
GGGCATCTTGAAATC 

SEQ ID NO: 2272 A Cmrim ri - 11 7' r! - | 4 U l- rU ' Tl GGCTGTCCTAAATTGTTTATTAAGTATGA 
ATmACAAACTTrACTTATATTANCGGTAACGGTGGAGCTGGANAGTATTGCNCCTTCT^ 
TGCCa}GCGAGAGCCACCAATAGTGTGGTGGAACrrGTGGCCCTTrCCAAGGCCACGGNTm^ 
GCCTGCANATGTCAGCCCACNCATCTCCCrGTGCTTGTGGACTGGTTrGGTGATCCACTGGGTGTC 
AGGATirCTTCTGATAGCTTTATGGAATGGATCAATGAGGATAACCTCAAAAAATTTGWGTC^ 
ATCTrCACCAACCCA>rrAANAATTCAGGACTCTCAAAGCCCCACAGTGGCGTCCAGCTCGCTCC^ 
■ TGCAACGGACTGAAGGCTTCGAGCAAACTrrAGCTGGTTAACACCATGATGGACAGGCTTGCCOT 
AAGTTGCACCCTTAGGAACTGGGCGTTTTCGGCCCCACGGCGAAmCNAATCCTATATATAAOT 
AACCTTGCTTGGCCTTGTACXCATrCGGCNCNCTrTATAGGCCGGGTGGGGCGGGNAACCCW 
AAACAAAAAACTTGGCGGGACITT 

SEQ ID NO: 2273 ACCTTGATACACATAATCAGCCITTTCAAAAATGCCTGACAAGAATTAGTCTT 
TCCTTTGTGCTGAAGTCTTCCCACCCATGGATGGAAGCAGGCTGACTCCCTGAGGGTCAGACAAGG 
GGTGGGAAAGGGAACACATTACrmGTGAAGGCAAAGCAGAAAGGTGTGTTTGCCAGACCAGCA 
TGGGCAGCTCAGAGGGAGCAAAGCATCCACCAGAAGAGGCTCTCCATTTTCTTTGTAGGGCCTGA 
CAGrrGAGAmGAGGCTTAGTTAACAATGGGACCACTGAACTTCTTTCCAATGGAAAACTC^^ 
CCCAGT(XCACAGGAACTOTOCGCATACCAAACAACAATGAGGAAGGAAGGGCCGGGTGGCr^ 
ACCAAACAGTTCAGGTCCACTGGGTGAATGAAGCCTGGTGGGAAAGCGGACTCCTGAAGTTGGCG 
CCCTCTGCTGGTCCCACTTCrCATCGTGGCGGGTCNCTGCCTGAGTAGAGGAAGAGCTAC^^ 
GGTTGACAAAAAAGGGAGTGAGGGGAACCCCAGGAAGATGAACCNAACTCCAAAACTmr^ 
GNTAATCATTTrrrGATCAAAAATTCAAAATOAAGGAACCCCAAAmTCCTTG^^^ 

SEQ ID NO- 2274 ACACAGCAGCCAGTnTCATCGGTGATCATGCACAGCAATGCCATTGCTGCC 
ATGACCAGCAGCAACCACAGAGCCrmCAGACCCAGCTGTCAGTCAGTCCCTGAAAGATGAC^^ 
TAAGCCCGAGCCAGATAAAGTGGGTAGGTTTGCAAGCAGACCCAAAAGCATTAAGGAGAAAAAG 
AAAACTACATCACATACCAGGGGAGAAATACCGGAGOAGTCAAACTATGTTGCTGATCCTGGAGG 
ATCACrGAGCAAAACCACAAATATTGCTGAAGAAACCAGCAAAATTGAAACCTACATTGCAAAAC 
CTGCTCTGCCGGGAACCTCCACAAATAOTAATOrrGCACCCCTT TGCCA AATAACAGTGAAAATTG 
GAAACGAAGCCATTGTGAAAAGGCACArrCTAGGATCTAAATTGTTTTATAAAAGAGGGAGAAGA 
CCCAAGTATCAGATGCAGGAGGAGCCTTTGCCACAGGGGAATGACCCAGAACCCAGTGGAGACA 
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GCCCACTCGGGCTTTGCCAATCCGAGTGCATGGAGATGAGTGAAGTGTTCGATGACGCAAGTGAC 
CAGGATTCCCTGACAAACCGTGGCGCCCTTACTACAACTACAACCCC 

SEQ ID NO: 2275 Ac rrniTi - i - i U i l i l U l i ll i gggatttttagtaaanacatggtttcgccat 
gttggctcggctggtctcgaactccrgacctcaagtgatctgtcctggcctcccaaagtgrtggga 
ttacaggcgaaagccaacgctcccggccagggaacaactttagaatgaaggaaatatgcaaaag 

AACATCACATCAAGGATCAATTAATTACCATCTATTAArrACTATATGTGGGTAATTATGACTATT 

TCCCAAGCATTCTACGTTGACTGCTTGANAAGATGTTTGTCCrGCATGGTGGAGAGTGGANAAGG 

GCCAGGATTCTTAGGrrGATCTATCTGTGGGTTATGACTTCCACAATAGCCACCCCCGGCCCCCAC 

CAGTCCITITArrGGCTCTGGATGGAAAATCCCTACCCATGTGATGGCCCTGGTCTCTCCT^ 

TGACCAACAGTTGACCCAAAAGGTTATGGTCTTCAGCGrmAATTATATCCACNCTAGATACTGG 

GGTCrGTm-CTTCAAAGTGTGGGGCTGCCTATTCTTCCANGAACCAAAAKKJCCCC^^ 

AAAAGTNTGCTTACrANGAAATACCCTGCa^CCTTANGAAATAAATGCTAOT 

TANAAACCTN 

SEQ ID NO: 2276 ACAAGATCTACCCCGGACACGGGAGGCGCTACGCCAGGACCGACGGGAAGG 

ttttccagtttcttaatgcgaaatgcgagtcggctttcctitccaagaggaatccrcggcaga^^ 
actggactgtcctctacagaaggaagcacaaaaagggacagtcggaagaaattcaaaagaaaag 
aacccgccoagcagtcaaattccagagggccattactggtgcatctcttgctgatataatggccaa 
gaggaatcagaaacctgaagttagaaaggctcaacgagaacaagctatcagggctgctaaggaa 

GCAAAAAAGGCTAAGCAAGCATCTAAAAAGACTGCAATGGCTGCTGCTAAGGCACCTACAAAGG 
CAGCACCTAAGCAAAAGATTGTGAAGCCTGTGAAAGmCAGCTCCCCGAGTTGGTGGAAAACGC 
TAAACTGGCAGATTAGArrmAAATAAAGATTGGATTGTNNAANAAAAAN 

SEO ID NO: 2277 kC rn ' mm i l I T ^ l Ul ' l - riUU i 'l'llNGGCTTNGAAATTACTTTAATTTANAAA 
TAGAAAACATCTTGAAAGGAAAAAAAAAAANCCCACAAAACm'ACAGGCAAA 
AG^ITC^CACACCCmACACTTGTTCACCAATCTOTAAAATNAAAAACT^^ 
CAGCCACCAACAATGATGTTATTAATAGGAATOTGGATCAATTTCAC AAAN AANCCAATGAATO 
CATTATAGCAAATCCTATTGCTGTTGCCATGGCAATCTTI^GGAATrCTn^ 
CATTmTTAACCAGCCOAATGGAGTCCTTTACAAACTGCCGACTTGGCTCAACAAACT 
TCATCCATGACTGCCTACCCAACCGACACCrAAAATGCCAGGGACACGTANCACTGGAGCTTGTT 
GATGGAGGCCCACCGNACCCCNTGT 

SEQ ID NO: 2278 ACGCGGGGGCCTCGGCGATGTCGTGGGTTCAAGCAGCCTCCTTGATCCAGGG 

ccctggagacaaaggggacgtgntgacxjaagaagcagacgagtcgctcctggcgcagcgggaa 
tggcagagtaacatgcaaagacgagtcaaagaaggttatagagatggaatagatgctggcaaag 
cagtractcttcaacagggcrrcaatcaaggrrataagaaaggtgcagaagtcattttaaa^^ 
gacgactccgaggaacattgagtgcritgctctcctggtgtcaccrrcataataataattcaaot 

TGATCAATAAAATAAACAATCTTCTGGATGCAGTTGaCCAGTGTGAAGAGTATGTGCTCAAACAT 
CTGAAATCAATCACTCCACCGCCCATGTTGTAGATTTATTGGACTCCATTGAGGATATGGACCm 
GTCATGTAGTTCCAGCTTGAGAAAAAGATTGATGAAGCTAAAGATGAAAGACTCTGTGAAAATAA 
TGCTGAGTTTAACAAAAACTGTAGCAAGAGCCATAGTGGGATANA TTGTN ATATGTAGAATGTTT 
GTAAACCAGGACCOTGCACATrCAAAAAACCCCATCCCCANATTGATTTTGGAACCAGACANCCA 
NTTTTAnTNAACANGCTG 

SEO ID NO: 2279 ACCAGCTGGCACAGGAGCAGGGGGCATGGCACCTCTGTTGTTTATGCCCATA 
GCACCTCCCATAGCCATCTGACCCATCCGAATCTCCTGCTCTCTCGCATCAGGGAAGGTTCCCTTG 
AATCCrTCCTGCTGTCGCCGCATCATITCTTCITGCTGCCGCCGCATCTCrrCTTCA^^^ 
GCTXriTCCTCCrGCCTGAGCrCCAGTTGCTrTCGTTTITGCACCT 
CrCCGANGTTCTTT™GGCGCCTAATCAAATCCTGTCTCATTAGCATrG 
TGCATCTTCATCTTNTTCT 

SEQ ID NO: 2280 ACTGTTAAArrATTCCTAGCCATATCTTrAAAAATGGTTTTCAGGAATATITC 
TACCAACCATCTITCTGACTAAACCAAGGTCATGTCAC GGAGG TGCCAGAAATCCTGCrGCATCAT 
GAAGGGGGTCTATGGCATrrGGCATTAGAAGTATGAGATTTTTGCAAATTGTAGTGGAAAAGAAA 
CAAAATrGCAAATATATGGAGCCAGGAGTTrGTCTATAGAGAATTCTACCGAATCTTACAGCT^ 
ATAGAAAGGATAGAGGrmCCATAAATTOGGCAAGAGATTAAATGTTTACATGAAA^ 
AATAGGTTGTGAAACTGAAAGGAACAmCTAATCCATAAACAATACAAGACAAATTTTGACCAA 

ccatactaaaggaaaggccaattttttttcctctttotagacaaag™ 

GAAGAAGCAATCAAGAGTGTGCAGCCAAAACATACAGGCAGACAGGGCCCCATGGCTCACAGNT 
TGTAATCCCAGCACnTGGGAGGCCAAGGNGGGTGGATCACCTGAGGCNGGAGnTGANACCAGC 
CTGGCNACATGACAAACCCTGTTrrACTAAAAATACAAAATTAANCCAAGTGNmTGNTGGGC^^ 
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CTNAAATCCATTTCCT 

SEO IDNO- 2281 ACTTGTTTTCTGTATGAATrCAGAACTTrTGACK^AATAAAAGTGCTCT 
AGTTTATAAAATATGCTTTCTGGAITTAAAATTTAACAGAAAAATTTCCA^ 
GATTCTAAAATATTATGTACGTGAACCACAAAGTGTATACCAGGCATATGACATGGATCCCCCTGG 
GGAACCAGGCTGATCTCrTTCCAGACkKK:ACTATCCGACCAGTGCATGATGATAT^ 
AGCTGCGGCCnX3GCCAAGAAATTGACCTGCTCATGCACTGTGTCAAGGGCATTGGCAAAGATCAT 
GCCAAGTmCACCAGTGACAACAGCCAGTTACAGGCrCCTGCCAGACATCACCCrGOT 
GTGGAAGGGGAGGCAGCTGAGGAGTTGAGCANGTGCrrCrCACCrGGTGTTATTTGAGGTGCAGG 
AAGTCCAAGGGAAAAAAGGTGGCCAGAGTrrGCCAACCCTCGGCTGGATACCTTNAGCAGAGAA 
NTCTTCCGGAATNGAGAAGCTAAAGAAGGTTGTGAANGCTTG 

SEO ID NO- 2282 ACTGTTAAATTATTGCTAGCCATATCmAAAAATGGTTTTCAGGAAT^ 
TACCAACCATCTTrCTGACTAAACCAAGQTCATOTCACGGAGGTGa:AGAAATCC^ 
GAAGGGGGTCTATGGCATTTGGCATTAGAAGTATGAGATTITTGCAAATTGTAGTGGAAAAGAAA 
CAAAATTGCAAATATATGGAGCCAGGAGTTTGTCTATAGAGAATTCTACCGAATCTTACAGCTTGT 
ATAGAAAGGATAGAGGTTTTCCATAAATTTGGCAAGAGATTAAATGriTACATGAA^ 
AATAGGTrGTGAAACTGAAAGGAACArrrCTAATCCATAAACATTACANGACAAArmGCCAA^ 
CATACrAAAGGAAAGGCCAA ri - n ril i i CCTCTTTGNANACAAANNTATGAAATNATTNGNNAAG 
TGAAANAAGCAANCAATAGNGTGCTGCCCAAAAANATACNANGCNAGACAGGGCCC^^•GNCT^ 
CAGCTGTNATCCCTCTCTTTGGGAGGCCAATCTGGGCNGNATCACCITGNGGTCAGG 
ANCTNCTTGNCCNTTTTANAANACCNCmT^TTTTACTAA 

SEO ED NO- 2283 ACGCGGGATACAAAGATATCCTAGAGACCCATCTGAGAGAGAAAATAACAG 
CACAGAGCATTGAGGAGCmGTGCCOTCAACTTGTATGGCCCTGACGCGCAAGTGGACAGGAGC 
AGGCTGGCTGCTGTTGTGTCTGCCTGTAAACAGCrrCACAGAGCTGGGCTTCTGCAT^ 
CCGTCTCAGTCCACAGATTTGCATCATTCTGTrGGCACAGAACTTCTTTCCCTGGm 
TTGCCCCTGGAGATGAGAGACAGTGTCTGCCTTCTCTAGACCTCAGrrGTAAGCAGCTGGCCAGCX^ 
GACTTCTGGAGrrAGCCTTTGCTTTTGGAGGACTGTGTGAGCGCCTTGTGAGTOT 
AGCGGTCCrGTCCACNGCGTCarGGGCAGCTTCACAGGGNAGCGTCATCCACTTm-C 
GAGTATTTTTTAGCThn'GTTCCTCAAAAACNATCAACACGGAAATTA™ 

SEO ID NO- 2284 ACTGCTAAGAGGTATTATTAGAAACAAGATTTAAAAATATGTAACAAAATCT 
TAAGTrCTTAAGTGAAAGCCATTTAACTAGTATTTAAAACCTCTGCAArrATTAGm^ 
ATCCAGGACTCAGATGTTCAGTATTCCTCCTGAAATTACATAAACAAATGCAAATGGAAAGAATC 
CAAGTCTAAATTATATAACAAAACAGCACTCCATCACAAAAGCGTGTAAAATTACAAGAACGCT^ 
TTTTAAAATACrAGCACTTTAAGAAAACGATAATCrCGAAAACCACAAAATTOCCA^ 
TAAACrCCTAAGCAGATAAACATGACTAATGAATGAGTTTGTTTTGTAAAGAAAAATCArr^^ 
AAATTGAATAATTCATACTGAGATGCAAAGTITGCrrCTTTCnTCC^ 
GGGCAAACCGTAhn"ATAGGA.\AATATACCCTATTITGAATGTGGCATCTTTGmGAA^ 
CCAArrAATNAAAAATOCrTGTAATTGGAAAGGTCTCGTTrCCTGAGCAmCCAT^^ 
' TGGGAAAANGCATNTTTNGCCAGCATCrrCTrAANCTTCCTNGNGAC^^ 

SEO ID NO- 2285 ACACCANATrACGAGACATCGTrrCATACTTCCCAAATAGTmATATTTTAG 
CmGAAGGTCAGrrACCAGAGCCAAACTTGTTCrrAACAAGCAGAATm 
GTCTCrrACACCmCTGGGCCTATTCACrrGCAGAGAGGAGTCGAAACTGTAACCAGOT 
TATCGNGCATTCATATGTNGATGTCCTGNTITCATATGCTGCCAATTTCTTT^^ 
GACTCGNCAGGAGGGCCACGNAGATGAGAANGTTCCCTATGGANGNGNATAGATGNCTNNCGCA 

TTTTNCGKTAAATNCTTCTGGGG 

SEO ID NO: 2286 A<XACACAATACrAACCCrrCCCCTCCTCTGATGTCTTACATCACCTrcC^^ 
AGATGAAGTGTATrCTTCACTGGrrrGCCAATTGGTCAGGTCCCCAGCGTOAACGTTTCCT^ 
GACCTGGTAGCTAANGCAAGTGCCAGAAAAATTACAACCACTGCTGGATAGTCrrGGAAGCAGOT 
ArrGTGTCTGGGGCAGACCGACCACCITCTATCTTrAAGTGCAGCTACATC^ 
TTOn-AGGCTGGGCTGACATGANCACNATNAATTTGTCNGACAGCTGGANTTCAGTC^ 

TNCGTGGNANAGTTTAC 

SEO ID NO: 2287 ACTAAATATTGCTGAGAGCATCCACCCCAGGAAGGACTTTACCTTCCAGGA^ 
CrCCAAACTGGCACCACCCCCAGTGCTCACATGGCTGACTTTATCCTCCGTGTT 
GCAAGTGGCAGTCTCTCCACCACCTATGATOGTGATGCAGCCCCTAGAAGTGGCTTTCAC^ 
ATCCATGAGAGCrrTGGrrCCCCGGGCAAAAGCTTTCCArrCNAATACCCCCACAGGACCA^ 
CACAATCTGCn-ANCCCTAOTOACANCCTCAGCATACrrCTrGCTGC^ 
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GCCCATCCAGCCAGCANGTATTC3CCAGAAGCCACCAGTGGCTTGGCCAGTCTTGOT^ 
AAACITGTCAGCAGTGACAAAGTCAACAGGCAAGGTAATCTTCACAACAri'rcrrr 
ACATTAGGTCTTGGACAAATCTTGGCrCCCrmCATTAAACAANANAAGTTGCCAATOT 
TGNACACCTTAAGGAATGOTAAAAGCCATTCCCCCACCNATAATCNATCTCATT 



SEO ID NO- 2288 ACNCGGGGGCAGTGAGTTCGACACACCNTGCCGACTGTCANCGTGAAGCGTG 
ATCTGCTCTTCCAAGNa^TGGGCCGCACCTACACTGACNAATAATTTGATGAACTATG™ 
^^^GGT^^^GGANCTTGATGAAATTACNTCTGAGAAGGAAATA^mWGTAAAGAACAAGGT^ 
AAGGCACTCAGGAGCCTCTGATGTTGNKrTNACAAAATTGACOTCCCTGGCCAATAAN^ 
TCCTOTGNCrGGAAGGArrGQTTCNAGGACTTCAG GTCTTT AAAAANGGGATAATGGCTC 
TAAACGGTTATGCCrGATGGAAAANGGNAGNAATrrTTnAANANANAAAACAGNTANATO 
CTTTTOCGGTACANGA 

SEO ID NO- 2289 ACGATGTCTAGTGATGAGnTGCTAATACAATGCCAGTCAGGCCACCTACGG 
TGAAAAGAAAGATGAATCCTAGGGCTCAGAGCACTGCAGCAGATCArrrCATATTGCTTCCGTGG 
AGTGTGGCGAGTCAGCTAAATACTTTGACGCCGGTGGGGATAGCGATGATrATGGTAGCGGAGGT 
GAAATATGCTCGTCTGTCTACGTCTA'rTCCTACTGTAAATATGTGGTGTGCTCACAC>^^ 
TAGGAAGCCAArrTGATm-CATANCTCAAACCATACCTATGTATCCAAATGGTTNrrr 
AGTTANTAANTTCAATNATGGGAAAATNATCCCGAAANCXrrGGNNGATAAAAAATAT^ 

ATGGGTGANC 

SEO ID NO- 2290 ACTTITnTITITnAAANAGNGGCCACCACATCm 

ATAACirArrATACAATGAACACTCCTCCATTAGGANACCATGCCCACTTACAN^ 

AATGCGGTAAATCTATTTACAGAGGrrGGGGTGCAANATGAGAAAAGTATCANCCCCAGGA^^ 

GAAAGTGAGAATGATCTACAAATTCTCCTGACAAGGAGCAACCGGGCnTGTGCTANTGA^ 

AAAGAArrCCTGGCANGAGCCGTAGGGGGANATTAGATCTCGGAATTGACNGCAAGTTrrGGGGG 

ACAGTGCAANAAAAGAGGGGTGACCTGTGAATTTGGTGCTAGGGGAGCTGCATGAGGCCCAATGT 

GANGNACCCCTAGANAGATGANTAAATTTAGGGTOAACTTrrACCCTCrCCTACCCAATa^A^ 

ANGOTANGGGTCCGGGNTATGCCANNAA>rmGNCIT(>IA>mT^AAArrC^^ 

AAAAAGGNTATTTCTAmT 

SEO ID NO* 229 1 ACTGTGTAGAATTAAGCAAACAGNGTGATGTTTCAGAAGTA ACCCA TTACrG 
OTAAAATAAAGCCTACATCAACACTTAACTCAAACGAGGACAArrGTrATTrAG 
TAATCATAAACTTAACTCTGCAATCCAGCTAGGCATTGGGAGGGAACAAGGAAAACATTGGAACC 
NAAAGGGAACTGCANCGAGAGCACAAAGArrNTTAGGATACTGCGAGCAAATGGGGTGGAGGGG 
TGCrrCTCCTGAACTNmGAANGAATGATCTGGGGNrrTANAATANAANACCA^^ 
TAGrmGCTNANAATCAANAANTGGTAATNr^CTTG^OTGGT^^^ 

SEO ID NO* 2292 acaaanatggctataaacaagatgcagccctcggtttccatgaacagcacac 

TATTACAGTAAACCAAGTITATArrCCACCATCAAGTGTGGCTCTCCCATGACTrCGC^ 
GATCATTAAGAATATCCTCAAATCCAATAGTCTCATCArrACCCCTCAAAACATCCAGTGAAA 
TTGANCTNGAAAGAAATGGAAGACNCrTGAACCrGCTGCACTGCOTGAATTrCC^^ 
TrAGCGGANCAAATANACCCTNGAATGTTTNTTArrNNTGNAAAAAT^ 

TTAGNAATrrArrrACTGATArmCCANAAGGGATACCTTGGNNAAAANTNAAi 1 1 1 iNAACNCGA 
AGAAAAACCTTTNANTNATAAA 

SEO ID NO- 2293 ACGCGGGGGATAATCACCCAGAOAAACAGTTCTCTACTGATGTTTTGAAGCA 
GGCCATTGAGCTGAGTCCTGATAGCCAATACGTCAAGGTrCTCrrGGGCCrGAAACTGCAOAAAO 
ATGAATAAAGAAGCTGAAGOAGAGCAOriTGTrGAAGAAACCTrGGAAAAGTCTCCITGCC^^ 
AAGATGTCCTCCGCAAGTGCAACCAA^^^mACAGAAAAAAAGGGTGACCCTAGACAAAGC^T^ 
GAACTGGrrrAACGGGTGTTGGAATTCCNNNCAAACATGGCTACCTmrCACAAGATO 
GCTCCAAGGCAAAAGTAAAACAAATGCANATTCAGGGANATTCTGAAGCThrrGGAAAT^^ 

• TGATrGAAGCACTAAACCANTTrGCTATGGACTATrCNAATAAAGCTOTGANAAG^^^ 

TOTGAATGCATACTCCNATCTCGCTGArrCCTGOAOACGGGAATGTmCAAACCCATTC^^^ 

GAAGTCCCTGATGCTGAAAAGCAACATCCArrAGCGCTTCTGNAANCrm 

AGNTITNAAACCTNTrmGCNANNTTGGmAAAGGGr^ 

NGGGNAA 

SEO ID NO- 2294 ACGCGGGGAGCGCCGCTCCCAGCCACAGCCTCCCGCGCCTCGCTCAGCTCCA 
ACATGGCAAAAATCTCCAGCCCTACAGAGACTGAGCGGTGCATCGAGTCCCTGATTGCTGTOT^ 
AAAGTATGCTGGAAAGGATGGrrATAACTACACTCTCrCCAAGACAGAGrrCCTAAGCTTAT^^^^ 
ACAGAACTAGCTGCCTTCACAAAGAACCAGAAGGACCCTGTGTCCTTGACCGATTGATAAAAACT 
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GGACACCAACAGTGATGGTCAGCTAGATITCrCAAATrrCTTAATCTGATTGCCATGACTNCTO^ 

TTAAGGCrGTa^GCTTCAANAANCNGACCmANGANCCNTTTOGCNCTO 

CCCTTTCCITCCAACCrTTTOTGTAATAATTIT^ 

ChrrCOTTATTCCAGGGCCCAACTGNCAATTATTAATNAAGCATTTGTGATT^^ 

SEQ ID NO: 2295 ACGCGGOGGCTGCCACCACCTCCGCTGCTGAGGTTGC AGATGGC TCT TCCCC 
CAGATCTGTTTGCATGCGGGTGGGGGAGAGATAGGAGACAGCACTCGCTCriU"ll"lUGTTGTTTTT 
ATTTNATTTTATTTTTTGTAAGTGCTGCGATCCTGGCATACTGTGTAGCTGC^^ 
AGCCAGTGTGGCCCATCAGTTAAAGCCTCCAAGTCAGCTCGAACTGGATTAATTCCATCAAGCTCA 
NTGGCTAAAAGCTAGCTGCGTGACCTTGTGCCAAAATTCCTTNACCTGCCCACACCTTO 
GTCTGTATAANGACATAATTCTACCCTGAGGTGGCTCTGAAGACTGTGTATGCACTTCAGGC AATG 
GATTCATGATTAATAATTNGGGGTrCTCTTGCTCCTTTNGGTNi^ 
GTTA^^^GACCTAGGNCT^r^G^^^ACTTGGGGCAAAAAAAACCCAGGCCAT^^ 
CCACCTTTNAACTTCCGlNriTCCCTrGGGGCCCCCANNNCAATTAN 
TGCCNGGGa3GGCCGTTTNNAAAGGGN>rrATTNNNAGCACANTTGNGGGCCA 
CCGAAC 

SEQ ID NO: 2296 A cn - nTn - i TnT ri - i 1 r iTn - i - n igggaatggtagtganaaccaacatttat 

TGAGTGATTATCGCATACCAGGCACTTTACAACATATTATCTACITCTCACAACTATCCTATGAAA 

TANATGGTGTAATGAAAAAATCAAGGACAAAANCAGTGAGCAAGTTAAGGAATAGAGAmrGAA 

CCCAGGTCATOTAACTCAAACCAGTGCCTNTCCCATCACACGTNAGGCTTNTCTGTTO 

TAACCACACrCAGGTTCCGGTGGTTTTAACCrTGTa^GGTAGAGGCCCAAATAATGACNAAT^ 

NCCNNAAAAAATTTNAACCCNAAACTGACAGGTTOTTANGGATGAACAAAAAGA ATTGA 

CAGTTAAATCCAANAAAAATCC^T^C^^^AACCCAAANTNTNAAGAATTAANGAGATTT^ 

AGAGCCAANATTTGaNriTSrrTTG>riTrGAATNAAAN^ 

TGG^^TTGGAAT^^'AATTTTCATNGACCATTTTT 

SEQ ID NO: 2297 ACi-riU"rri"lU"lU"14'l'llH"rrri''ri"rAAACANAGCCTrGCTCTGTrGCCCAGGCT 
GGAGTGCAGTGGTGCAATCTTAGCTCACTGTANCrrCTGCCTCCTGGATTCAAGCTATTCTCATGC 
CrCANCCTCCCAAGTAGCrGANATTATAGGNGGGCACAACCACACCCCGCTAATTTTTGTATTTCT 
AGTGGAAACAGGTTTACCATGTTGGCCAGGATGGTCCCAAACTCCTGACCTNAAGTGATCTACCTG 
CCTCGGTrTCCCAATCCCCTOjGhrrGG GArrAN AGGTGTGAGCCACCACACCCAGCCCKm 
TCTrTATGGCATTTATCANAGTTTGAAATTTTTGCCATTTATTTOATGTCCTTGCCT 
mTGTTTGCNCCAGGGAAAAANAGTTNAGGGAACCCTI^TTTNTG 

SEQ ID NO: 2298 ACrTTGCCTACXSGCAGCAACCTGCrGACAGAGAGGATCCACCTCCGAAACCC 
CTCGGCGGCGTTCTTCTGTGTGGCCXGCCTOCAGGATrTTAAGCTTGACTTTGGCAATTCCC^ 
CAAAACAAGTCAAACnTGGCATGGAGGGATAGCCACCATTTTTCANAGTCCTGGCGATGAAGTGT 
GGGGAGTAGTATGGAAAATGAACAAAAGCAATTTAAArrCTCTGGATGAGCAAGAAGGGGTTAA 
AAGTGGAATGTATGTTGTAATAGAAGTTAAATNTGCAACTCAAGAAGGAAAAAGANATAACCTGT 
CAANGTATCTGATGACAAATTACNANNGTNCTCC<XCATCCCCACAGTATAAAAAGATTNTrrGCA 
TGGNTGCAAAAAGAAAATTGhrmGCCChrrGGGAGTATTCNNGAGNATTTT^ 
TNATTGACTTATNCATGNAAAGGGTCTTNTATAAANTTGGA 

SEQ ID NO: 2299 ACGCGGGTTATCAGAAAAAAAmCCAGCTCAACCAAGATAAAATGAATTTT 
TCCACACTGAGAAACATrCAGGGTCTATrrGCTCCGCTAAAATTACAGATGGAATTCAAGGCAGTG 
CAGCAGGTTCAGCGTCTTCCATTTCTTTCAAGCrCAAATCrrTCACTGGATGrmGAGGGGTAATG 
ATGAGACTATTGGATTTGAOGATATTCTTAATQATCCATCACAAAGCGAAGTCATGGOAGAGCCA 
CACTTGATGGTGGAATATAAACITGGTTTACTGAATAGTGTGCTGTTCATGGAAACCGAW 
ATCTTGTTTATAGTCATCTTTGTACCTC 

SEQ ID NO: 2300 AClU'l"lU4nUU'ri'll"lU14'lUUGGGll'l'ri4 AANCCAAAATGTGTTTATTGANAT 
GGTTTCCCACTCA'rbrXTGACl'CAAAGNGCriTTAGNGCTGCTTCCTCCrG 

AAGCCTTGCrTTTCCTCCTGTAGGCTGGCAAAGGACAGTGGANCAACCAACACACAAAACTNCCG 

TITGNGCATGGCTAAANACCGNGGTGATTTTATAGCATCCTGGGCNTTTa^CATO^ 

GAArrGGGGCT^^^GCNCCAAGCGTTNNTT^^TGTGT^^ 

NGAAATCCT^^GCNAAAGGd^T^^TCTCmGTGa^TAGG^^IOTCACCGCC^ 

CTNCCNCGGGGGGATTTTTAAAAGGGNAATTCTA 

SEQ ID NO: 230 1 ACACAGTATGTCCCTCATTTAACCAAAAGTGATCCCAAGTnTGTTGGTCACT 
AGATTCCCTGTCCAGCTATGAGATATTTCATGTGCAATGACATTGGAGAGTGACTTGTCGCCTGCC 
AGTAGAGTAGGAGTTACAAAAGTAAGGCAAGGATTCTCCATGCCACCATAAGGGAAGGATGGTG 
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GCAGGACCAATAGGTCATACTGTCCCCATACATACGGTCCTCCCAGATCTTCTGCTATTTTAAGCA 

TANATTCAAGTCTCANAAAACTCATAAGCAGACrmCCCCTGNTCTTTOT 

TTCTTCGGCCAATTTGCCTGCTTTCTAAAGCrCCAACAACT 

GAACl l ri lGGATGAATTTGTATATTTTCCTGCTTGGGTCTTCTGGGTNA 

SEQ ID NO: 2302 ACCTTCAAAAGTGAGATTCCTGAAAGCATGAGTAAATTTGGGGOTGGTAATA 
rmCTCCATTTTCAAGAAATATCTTCAGCATATGTCCAAAGAGCAGGATl'CCATTCAAATAGGCA 
AGAGCAAAGTCTCGTTTTGTCTAAATAAAAAAGGAAAATATTAAAACGGGTCCTTCAAAAAANAA 
TCirGTCTTGAGCCTANAACAAAAGGTTGTTGGGATCANNAGGGTTACTGGAAAGTTAAAGGGAT 
ATCAGGGAGA^^^TNAANGGATGT^n^GTTAAANGAAACTTTTNAGAAATA 

SEQ ID NO: 2303 GGGGTACATTCAGGATCCCTCGGCCAAGGACTGGACCAGAAGAACACTTGGG 
AATCTTGGGTCCACrrATCAAAGGTGAAGTTGGTGATATCCTGACTGTGGTATTCAAGAATAATGC 
CAGCCGTCCCTACTCTGTGCATGCTCATGGAGTGCTAGAATCTACTACTGTCTGGCCACrGGC^ 
TGAGCCTGGTGAGGTGGTCACTTATCAGTGGAACATCCCAGAGAGGTCTGGCCCTGGGCCCAATG 
ACTCTGCTTGTGTITCCTGGATCTATTATTCTGCAGTTGGATCCCATCAAGGACATGTTAGTGGCCT 
Gm-GGGGCCCrrGGCTATCTGCCAAAAGGGCATCCrGGATCC 

SEQ ID NO: 2304 ACTGCGATTAAAAAAAAAGCACTTCTGCCAAAGGAACCATGTTCCAACACCG 
CAAACAAGGTGTTCTGCTTAAACAGAGTAAGATACACCACCCCCATCCATCCCTTCCTTCCCTGTT 
CCCXrrCCCAACTTGAGTTGTGTCATTCGCACCAGTGTCCTGGGTGOTAGGGATGCTACAGCCACCT 
AAGGCAAGGAGCCCTGGGAGGTGGGAGGGCTTGCATGGTTAAGCACACCAGAACTGAAGCGCAA 
AAGGGTCAGCTTGTCTTCATCTAGAATCTCTGGATGTTCCITCCAGAANGCATCCCCNATGATATC 
GCANTGCCAAGGACACTGNCrn'GGCCTGGTCCGGNTCACTTGCCAT 

SEQ ID NO: 2305 ACGATGACATCTCAAGGAGTCACTGGCCCTAGGTTTCTGCAGTAGGATCTTA 

agatctgatctctgccctcaagtgagtgcacgtgcagtaaagctgtcatcqtgccatgacagatgt 
ctgcaaggactttacagtggggtccaaaggatgccaccatcagctccacctggggctcaacaaag 
cctgaaagaagaaagggatgatgcttccagcagaaacctggaggcctgcagctctgccancccan 
acancaagojaggccatttgnccatcaggggcngaaggagggaactcacccaccatcttrtccta 
ca^^ngca^^^tccattatgagg^itccactttaaaatgctaaatnattaattt^^ 

seq id no: 2306 acgcggggctttttcgaggtaggagtcgactcctgtgaggtatggtgctggg 
tgcagatgcagtgtggcrctggatagcaccttatggacagttgtgtccccaaggaaggatgagaa 
tagctactgaagtcctaaagagcaagcctaactcaagccattggcacacaggcattanacagaaa 

GCTGG AAGTTGAAATGGTGGAGTCCAACTTGCCTGGACCAGCTTAATGGTTCTQCTCCTGGTAACG 

TTTTTATCCATGGATGACTTGCTTGGGTAAGGACATGAAGACAGTTCCTGTCATACCITTTAAAGG 

TATGGAGAGTCGGCTTGACTACACTGTGTGGAGCAAGTmAAAGAANCAAAGGACTCANAATTC 

ATGATTGAAANAAATNCANGCAGACCTGTTATCCTAAACTTGGGTTITTTAA 

AACKnCANCTTACTGNTTGAAAGGGTNTrGCCTCACCCAANCTAAAGNGCAATGGCCr^^ 

TrmNANCCTCAAACCnTnGGGGTTAAG>rrGATCCTNAANCNCCCAATC 

CTAAATGGAATmArrG>n^AAAAAANATAAAACNGNNGCTTCCNATTTrAATrAAA 

NCANAA 

SEQ ID NO: 2307 ACAGTCmCATTAAATAAGAATACnACACATACATmCANATATTTCTAC 
CITCCTGTATGTGTTTGGAATTGTATGTAGGTAGCCACTGAAAGAATTTGGGCCCCTTGGGAGGAT 
GGCATNTGGAAm'CCATGAAGTAAAGAGCATNCTmAAAAAGCANATTTGATNGCTN^ 
AANTATATGAANATTCNGAGAATCTCTNATTANACCNCANTNCATANAANATrCCT^ 
AGG 



SEQ ID NO: 2308 ACTATTTCATGGTCCAAACCTGTTGCCATAGTTGGTAAGGCTTTCCTTTAAGT 
OTGAAATATTTAANATGAAATTITCTCTTTTAAAGTT(nTrATANGGTTANGATTAN^ 
AAA 



SEQ ID NO: 2309 ACCCTTGGACAAATTGTTTCCAOCAAGAAGCTAACTCGACCACTGGTGATGA 
AAACTGGCAGACCTGCAGGAAAAGGGAGCATTACGATTTCAGCTGAAGAAATAAAAGATAATAG 
AGTGGTCTTGTTTGAAATGGAAGCCAGAAAACTGGATAATAAGGATCTATrTGGAAAOTCAGACC 
CATACCTQGAATTCCACAAGCAGACATCTGATGGAAACTGGCTAATGGTTCATCGGACAGAGGTT 
GTTAAAAACAACrrGAATCCTGmGGAGGCCTTTCAAGATCTCTCTTAACTCACTGTO 
GATATGGACAAAACCATTAAGGTGGAGTGTTATGATTATOACAATGATGGGTCACATGATCTCATr 
GGAACATTTKAGACCACCATGACANAACTCAAANAAGCCrCCAAAAANCTCACCTGTTGAATT^ 
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AATGCATAAATGAAAAAAAANGGCAAANGAAAAAANCrCAAAAATTCAGTGGTTATCAGTGNGA 

SEQ ID NO: 23 1 0 ACATTGAGAATATGTGTAAGTCATTTTTTAAAAGGCTTCTTGTGATTAAAAGA 
GAAAATTCTOAAAACCACAGCAACATATCTATGCTGTTTCCAAGCATACAAAGAGAATTAGAACA 
TCTGAGACAACTATGGCTCCAAACAATCAGAAGAAGGGTTAGTTTTCTTTTCTCCTATTGAT^ 
TCAAAATGATGTGTCATCTATTGAGCCATACTATGGAGTAGCAGGCTCTAGTTAGATGCCTTCCCC 
AGTTAACAGCACATATCCAAAGGACAGCTAGCCAAGTGGGAAGGTGGTAGGTAAATGCTCATCTG 
GGCTAGGCAACCACCACAGCAAGCAGGTCCCCTCTCAGCCTGCCTTGGCAATGAGCTGCTTCTGA 
GAAGCCCAGCTA TCTGT GGTTGAGAGCTCACTCCCTTGAGGCATTGCAGAGAACAAGAGACATGG 
GCTGNGGGGCNGCTTTTCAATAAAACTGAGAGGCACATCAACATGGCACTGTOTGTGTCCACTTA 
AGGATATATGATAAACATXSCATrrCAAAAGGTAATTArrAANATAATTGGCCATTTTTA^ 

SEQ ID NO: 23 1 1 ACGCGGGCAAACCCTGTTGAGATAAAGCTGGCTGrrATCTCAACATCTTCATC 
AGCrCCAGACTGAGACTCAGTGTCTAAGTCTTACAACAATTCATCATTTTATACCITCAATGGGA^ 
CTTAAACTGTTACATGTATCACATTCCAGmCAATACTTCCATTTATTAGAAGCACATTAACCAT^ 
TCTATAGCATGATTTCTTCAAGTAAAAGGCAAAAGATATAAATTTTATAATTGACTTGAGTAra 
GGGAGGCArr GAGGCA GCCAGCGCAGGGGCTTCTGCTGAGGQGGCAGGCGGAGCTTGAGGAAAC 
CGCAGATAAGTllliriCTCTTTGAAAGATAGAGATTAATACAACTACTTAAAAAATATAGTCAAT 
AGGTTACTAAGATATTGCTTANCGTTAAGTTTTTAACGTAATTTTAATAGOT 
AGATATGAANACITANAAGA^^'CC^^GAGGGAAGGAAAAAGATAAAACNGTT^^AAAACNTG 
NGGANGGTTGAGAANAAGCTNCTTNATGGAGNNAAAAANGT 

SEQ ID NO: 2312 ACAGAAAGAGCACAGGCCAGCTCAGCCTGCCCTGGCCATCTAGACTCAGCCT 
GGCTCCATGGGGGTTCrCAGTGCTGAGTCCATCCAGGAAAAGCrCACCTAGACCTTCTGAGGCTGA 
ATCnrCATCCTCACAGGCAGCTTCTGAGAGCCTGATATTCCTAGCCTTGATGGCCTGGAGTAAAGC 
CTCATTCTGATTCCTCTCCTTCTTTTCTTTCAAGTCGGCTTTCCTCACATCCCTCTGT^ 
TCAGCTTGTCTGCmrANCCKITSIATrmCANAAGCTTN^ 

ATNACACCXGTTTACGGCCTNGGAAAGTGmCAGACCANATGOCATAAGGCAChrrCTTTTATTGT 
TTTNAGAGGNCAGGGATATAATCnTCTNGGCAA 

SEQ ID NO: 2313 CONCOAGGTACTAGAAGTATACACCACCCAGCCCGGGGTCCAGnTTACACG 
GGCAACTTCCTGGATCGCACATTAAAGGGCAAGAATGGAGCTGTCTATCCCAAGCACTCCGGTTT 
CTGCCTGGAGACTCANAACTGGCCTGATGCAGTCAATCAGCCCCGCrrCCCTCCTGTGCTGCTGAG 
GCCTGGTGAGGAGTATGACCACACCACCTGGTTCAAGTTTTCTGTGGCTTAAGGAAGTGTGAAGAT 
ATGATCCAGT<XAGGGCTAGGCTCAGCCACCTGTCTCCTGTCCAGAAAAAAGGTGAAGATTAAGA 
AGCTTTCAGAATGATTCTATGGATTAAAATCATACAAATGGTGGCTGTTCTGAAGAATCAGTCTGG 
GTATTGATTrCCTTTTCCAGNGACTGGCTCCAGGCCATGTCTAATGACCAACTCGATrCCCTGTCAN 
GTTCANANAGCAAGTNAACCCAACCAACAAr^^^^NGTCTT^^'AAGCCCTNACC^^ 
TCCATNCTWrGGTGGGhrrCCATTTNTCAACANTGCCTTTTT^^ 

SEQ I D NO : 23 14 ACACCAACCTGAACAGATTTTGTGCCACAAmCArrCAAAGGGTTTGTCATT 
CACTTTATGATTGTGCTGGATAATCTAGATGTAAGCAAGTCTGGAGATTTTAAAATACGGGTCCCC 
TGTATGAGAGTAGCATAGTTTATAGCATACTTTTAAAAATGGCArrCGGTAATTTTGCCTTCTGGA 
CAGAAAGATATCTTCAAArrGCAAACACATCTATTTCCACAAAACAATTTGGTCAGGAAATTTTAT 
TTGAACATTCTAAAGCAATAATGCmAGATGTTCTTAAGTGTCCCAGACAGGATTAACAAAATTA 
AGTGTCTCTAAATTACAAATrrGGCTCCTGTAGGAGTCTCAGAAAATAAACAGAAGAAAACAACC 
CCCCTCCCAAAAGAAGTATGACACACACArnTGAAGAAACCCCAATGTTTCATGCAATGGTAGG 
CAAGATGTAAAAGCCCACCCAAATCACACAnTCTACACCAATCATCATAAGAAAAAAGT 

SEQ ID NO: 23 1 5 ACTGTGGCGCTCCGTGAAATTAOACGITATCAGAAGTCCACTGAACTTCTGAT 
TCGCAAACrrccCTTCCAGCGTCTGGTGCGAGAAATTGCTCAGGACTTTAAAACAGATCTGCGCTT 
CCAGAGCGTAGCTATCGGTGCTTTGCAGGAGGCAAGTGAGGCCTATCTGGrTGGCCTTTTTGAAGA 
CACCAACCTGTGTGCTATCCATGCCAAACGTGTAACAATTATGCCAAAAGACATCCAGCTAGCAC 
GCCGCATACGTGGAGAACGTGCTTAAGAATCCACTATGATGGGAAACATTTCATTCTCAAAAAAA 
AAAAAGTACCGCGTGGCATTCAAGCATAGCAGArrAGAAGGATTTTrrrTAAAGCAGTCTGAAAA 
TGGGACATCTGTAGAGAAATTCATTTCCrrCTIXTrCCTCCGGATGTGGAATGGAAGCm 
AGGAAAAGTANGAAAAGAGCGGGATGGGATGGGATGGOATGGGATGGGATAGGAAGANAGGCT 
GGGGA 

SEQ ID NO: 2316 ACTATTAAGCCATGGTCAA(XCCACCGTGTTCrrCGACATTGCCGTCGACGGN 
AAGCCCTTNNNCCCGCGTTTCCrrGAACCTGGTTGNAAAANCANGGCCCAAAA^ 
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TTTTT ' 

SEQ ID NO: 23 1 7 ACTTGATTTTGAACACAGCACGAAAACATTTTGGAGCTGGTGGAAATCAGCG 
GATTCGCTTCACACTGCX:ACCrrrGGTArrTGCAGCTrACCAGCTGGCTmCGATATAAAGAG^ 
TTCTAAAGTGGATGACAAATGGGAAAAGAAATGCCAGAAGATTTTTrCAmGCCCA(XA 
TCAGTGCTTTGATCAAAGCAGAGCTGGCAGAATTGCCrrTAAGACTTTITOT 
CTGCTGGGGAAATTGGTTTrGAAAATCATGAGACAGTCGCATATGAATTCATGTCCCAGGCATTTT 
CTCTGTATGAAGATGAAATCAGCGATTCCAAAGCACAGCTAGCTGCCATCACCTTGATCArrGGCA 
CrmGAAAGGATGAAGTGCTTCAGTGAAGAGAATCATGAACCTCTGAGGACTCANTGTGCCCTTG 
CTGCATCCAAACTTCTAAAGAAACCTGATCANGGCCGAGCTGTGAGCACCTGTGCACATCTNTCTG 
GTCTGGCAGAAACACGGGACAAAAATGGGGAGGAGCTTCACNGAGGCAAGAGGGTNATGGAGTG 
CCTAAAAAACTTTAAAATANCAA 

SEQ ID NO: 23 1 8 Aci"ri'ri'i"rri"ri-rrri'i"ri"i'rrrrrn'iTTGCTGCCNCCACCATGAAAGAGTGG 

CCNCCACATNTTTATTGCATACTCAGGNGAATAAOTATTATACAATGAACNCrCCTCCATTAGGA 

NACCATGCCCACTTNCAGAATGCAGCCGTAAATGCGGTAAATNTATTTACAGAGGTTGGGGTGCA 

AGATGANAAAAKmCNNCCCCAGGAATTTGAAGTGAGAATOAThrrACAAATTNTC^^ 

AGCAACCGGGCTTGNGCrANTGAGGTCTGAAANAATTCCTGGNANAGCGTAGGGGGAGArrANAT 

CrCGGAATNGACAGCAAGTTrGGGGACATTGCAANAAAANAGGGGNGACCTGTGAATTGGAGCT 

GGGGAGCTGCTGAGGCCCATTGTGAGGCAGAAChrrrGAGNATNA>riTAATTrAGGGTGATC^ 

TTTTC 

SEQ ED NO: 23 1 9 ACACAAAGAGGGGGTGGGTGTCGGATGCAGAGTGTGTGGCCTGATGCTCCAC 
GGCGTGCAGGACGGGGGGCTAATAGTAGGTTTCCTrCrCCACCCAGCCGCCAGGGCGTCGCCTGA 
TGATGAGTTTTCTGACTTCGCCATATACGAAGATGAGAAGAGAGTAGGGGAAGGCACAGAACCAC 
CAGGTAGGTTTGAGGGGATACATCCTAAGAGCAACACCCATTCCAGGGCAGTAGOAAAGGAAAG 
CAGCCAGGGCTGTCTCTrCAAAGAGGCCAAATATCAAGATCnTGTTCTrCATCCCCTGCTGGAAGA 
CCGAATTCCTCCTGGTCTTACAGATGACCAAGTCGGCCCACTGCACCACCACGATACTGACGAAG 
AAGGCTGTGTGGCAGGTGAACTCCACGATTn'(XTCTGCTCATAGGTCX:ACrrGCrrGCC^ 
TTCCACATCTTGATCCANCGGTCATCCCANCCACTCGGAGGCCCAACAGGTGAATTGGGANGAAG 
CCGTTCTCACCANAATCANAAATGTAAhrrAAAGAAACCTCCCAGGGCCTGATCATTCCATCTTC 
CTTANC 



SEQ ID NO: 2320 ACTGTGGAGGCTGAGGCAATTTTCTTCAGGCTAACCCAGATnTCTAAAGCCC 
AACTTAAAAAGTCTACACCTGCCAAACTAAAAAAAACGCAGCCACTTGAAAATAAACCAGCAGA 
GCATTGCCATCACrCCGATAAAGCTGCAGGTTTCATCACATGCACCAGACAAATCTACAGGGCTA 
GTTTCAGTTCTCTCCTTTTAAAGAATTTATTAAGCCTGTTATACCACACAGTATGTTn^ATACAC^^ 
ACATACAACTCCCTAATAAGATAAAGCAAAGACAAAAAAGrrTATCTTATTAGAAACAAGATACA 
CCACCACTTATTGTCTTCAAACATTATTGCACTTTAACTTTCrrAATTTGACAAA 
CATCTGCAGACTAGTmAACAGACAAATAACACCTGTAAGCAGACATGACTGCCTAAATNGm 
TTAAGTATGAArmACAAACrrTACTTATATTAGCGGTAACGGTGGAGCTGGAGAAGTTTGCGCC 
TTCTCCAAGCTGCCCGGCGAGAGCCACCATAGGTGGGNGGACTTGNGGCCTTTCAAAGGCCGGGT 
TTTTCGGCCTGCANATGTCAGCCCACG 

SEQ ID NO: 232 1 ACACCTAGGACCTCTAGTAAACCTCATAAACATCTGCCTCCTGCAGCCCTACA 
CCTCATTGCATACTACAAAGAAAACAAAGACAGGGAGGACAAGAGGAGCGCCCTGTCCTGTGTTA 
TCTCCAAAACAGCTCGTCTTCTCTCTAGTGAAGATAGAGCTCGTCrCCCAGAAGAATTGCGAAGTC 
TTGTTCAAAAACGCTATGAACTTCTAGAGCACAAAAAGAGGTQGGCTTCTATGTCTGAAGAACAA 
CGGAAAGAATATITGAAAAAGAAACGGGAGGAGCTGAAAAAGAAGTTGAAGGAAAAAGCCAAA 
GAACGAAGAGAGAAAGAAATGCTTGAGAGATNAGAAAAACAGAAGCGGTATGAGGACCAAGAG 
rrAACTGGCAAAAACCTTCCACATTCAGATTGGTXjGATACCCCTGAAGGGCTGCCCAACACGCT 
TTGGGGATGTGGCCATGGGTGGTGGAATOm-GAACTGTTrrCTGGGCTACTATTACCANATGCT^ 
ANTTCCTTTAACTG 

SEQ ID NO: 2322 ACCTTGTAGCATTCTGAGGACAGGCCTGATTTCTGAGAAGGGAAAGTGGTAA 
AAGTATTGTCXJAGTCCTTmAAGCTGGTGGCTGANCTTGGTGAGGTGTGTrmAA^ 
GTCCOTTCTACTTTTCTTGAAOAAGGAGGACCGTAAGGGATATAAAGGTTTCACTGAATACTAAGA 
GCCTGAAAAACTGCTTGGCrGATTTGACTAATAAAGGCTGGTCTGTTATCAGACTGTATAG 
GGAAGGCTAAACTOAGGAATrGTOTCrGACAGAAGGGAAGAAATGACTGTGGTGGCCTTCTCAGA 
CCCTGTAGGAAAGGCCTTTACTTTATrCAGTGAAAGTTGTCTArrrAGACrAAGAGGTTm 
CTGACTCGGGACATGTTGAhrrNAAGGTAATTTGNCAATCCTGGGTGGGGCAAATCCTCCAACCTG 
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ATGTGTACGGAAGGNA 

SEQ ID NO: 2323 ACTGTCTCrmGGAAMGTTCnOATCCCCAATGCITCACAAGCAGA 
AAGTCTTCTATn-GAAAATGAAAGGAGATTACTACCGTTACITGG CTGAGG rrGCC 
ACAAGAAAGGGArrGTCGATCAGTCACAACAAGCATACCAAGAAGCITTTGAAATCAGCAAAAA 
GGAAATGCAACCAACACATCCTATCAGACTGGGTCTGGCCCTTAACTTCTCTGTGTTCTATTATGA 
GATTCTGAACTCCCCAGAGAAAGCCTGCTCrCTTGCAAAGACAGCriTrTGATGAAGCCATTGCT 
AOTGATACATTAAGTGAAGAGTCATACAAAGACAGCACCGCTAATAATGCAATTACTGAGAGAC 
AACTTGACATrGTGGACATCGGATACCCAAGGAGACGAAGCTGAAGCAGGAGAAGGAGGGGAAA 
ATTAACCGGCCTTCCAACTriTGTCTGCCTCATTCTAAAATTTACACAGTAGACCATTTGTCA TC 
TGCTGCCCACAAATAGTTTTrGTTACGATTTATGACAGGrTTATGTCTTCTATTTGAAT^ 
CCCATGGTGGTnTATGnTAATATTNGGGAGT 

SEQ ID NO: 2324 ACTTTTTTTTriTITmTITI^^ 

TTGCTGATAAAATAGTAACATTCCGTCrrrGCAATCTGATACGGAATACAGGAANAT ACTG GTAAA 

GGCTTTGATGTGCATATNTATCATAACCAGGTNATGANAACNCCCNCGAACCACCTACrmAGTGT 

TACACCATACCAACrrhrrNAAAAAGQNCAGCTTCAANCTCTACTGGGTAGCTGAGTAAATAGCCG 

TCCAATGTTTAAAGACTGGGGAAGCGGGGAGAAAGATGAAAGGANTTAGCAGCACACACTrCAA 

AGCCCA>mrTrAANCNCACCGACNTTr™AATATTAAAAATAAAGGATGCAAACNAAAA^ 

TTCATChrTTTTCANAAAGAGCATTTGGCAATATTCAACCCATAAGGAGACTGTGC^ 

NC^CTCTCCANCTTCTNAAA^^'GTTTCCGGCA^TGCGAA^XJTTGATCACA^m3G 

TCNACGGA^rANN^r^GGGCACCNTNNNT^CCTGCCGG GNGG CCNTNAAAAAC^ 

AKrTGNGGCCGTNCTTATTGNATCCNACCXKSATCAAATrTTC 

SEQ ID NO: 2325 ACAACTAAAGGCAACTGGCATGGACTCAAATATTITGGGGAAGAAAAAGACT 
AAAAGTrCTAAGGAAGAAAATGCGAACCTTGATAGTTTGAAATAGTTAAAAAGACAGTGTAGAAA 
CTGCTTAGGCAGmGArrATGGACTATTAGATGATACTTGGGTCTGATAATGGTATAAGGAGAAT 
AAAGTATTTAGGGATCCAATATTACGCCrGCAGCTTTTrcCAAATAGTTCCTCGGGGAGGGG 
C 

SEQ ID NO: 2326 ACGCGGGGGCGGGAGAGAGGCCGAGATGGCAGATGAGATTGCCAAGGCTCA 
GGTCOCTCGGCCTGGTGGCGACACGATCnTGGGAAGATCATCCGCAAGGAAATACCAG CCAAA A 
TCATTTrrGAGGATGACCGGTGCCTTGCmCCATGACATTTCCCCTCAAGCACCAACACAT^ 
GGTGATACCCAAGAAACATATATCCCAGATTTCTGTGGCAGAAGATGATGATGAAAGTCTTCTTG 
GACACTTAATGATTGrrcGCAAGAAATGTGCTGCTGATCTGGGCCrGAATAAGGGTTATCGAAT^ 
GTGGTGAATGAAGGTTCAGATGGTGGACAGTCTGTCTATCACG TrCAT CTCC ATGTT CITGGAGGT 
CGGCAAATGCATTGGCCTCCTGGTTAAGCACGTTTTGGG GATAAT mCTCrrC'i'llAOGCAATGAT 
TAAGTTAGGCAATTrCCAGTATGTTAAGTAACACACrrATTTTTTGCCTNTGTATGGAGAGAT^^ 
AGAAATAATTTTAAACCGCATACNrmATAAAANACOTTGOTCmGGTCAAA^^ 
AAAAAAAAANNTTCCTTNGCCGGACCCGCTAAGGNC 

SEQ ID NO: 2327 GTTGGTGGAAGGAGTGCCCAGTCCCAGGGTGACACTGGACAAGAAAGAGGC 
CATCCAAGGTGGGATCGTGAGGGTCAACTGTTCTGTCCCAAGGAAAAGGCCCCAATACACTTCAC 
AATTGAAAAACTTGAACTAAATGAAAAAATGGTCAAGCTGAAAAG AGAG AAGAATTCTCGAGAC 
CAGAArmGTGATACTGGAATTCCCCGTTGAGGAACAGGACCGCGrnTATCCTTCCGATGTCAA 
GCTAGGATCATTTCTGGGATCCATATGCAGACCrcAAATCTACCAAGAGTGAACTGGTCACC^ 
CGGAATCCTTCTCTACACCCAAGTTCCGCATCAGCCCCACCGGAATGATCATGGAAGGAGCTCAG 
CTCCACATTAAGTGCACCATTCAAGTGACTCACCTGCCCAGGAGTTTCCAGAAATCATAATTCAGA 
AGGACAAGGCGATTGTGGCCCACACAGACATGGCAACAAGGCTGTGT 

SEQ ID NO: 2328 ACTCGTCAATGGGCTCGGTCATATATACCACCTCGAAGCCCCGTTTCCGCACT 
CGCTCCACAAAAGCTGAGTTGGCCACCTGCTCTTTGCTCTCACCAGTGATGTAATAGATGGACTTC 
TGTGTCTCCTTCATGCGAGAAACATACTCTGACAGAGATGTCATCTCATCTCCAGACrGGGAGGTA 
TGATAGCGCAGCAOCTCAGACAGGCGGCGGCGGTTAGTGGAGTCTTCGTGGATTCCAAGCTTGAG 
ATTTTTAGAGAATGCCTCATAGAATTTCTTGTAArrCrCCTTGTCTTCTG^ 
TCAAGGCACTTCTTAACAATGTTmGCGAATGACTITCAAGArm 

GGGAGATGTTCAGGGGCAGATCCTCAGAGTCAACCACACCACGGATAAAATTGAGATACTCTGGT 
NTCAACTCATCACAGCTGTCCATGATGAANACACGGCGGACATAGAGTTGATGTTGNCnTmnTC 
TGNTCTCAAAAAGQTCAAAaGGAGCCCG(>IAGQAATAA^^^AGCAATGCCTGAAT^CCACTGACCT 
TTTACAGAAAANGCTTGACTG 
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SEQ ID NO: 2329 ACTACG<>iGGCCTTGGCATCCCTGGGGTTCACCTGGCTGACTGGGATGTrGA 
GGCGGGCAGCAATGTCnrCCACGGTCTCATTGCCTrCTGAGATGATGCCCACACCTTTGGCAATAG 
CTTTAGCTGTGATTGGATGGTCTCCTGTGACCATGATGACCTTAATTCCAGCACTTC^ 
CACGGCATCAGGAACGGCCGCCCGTGQAGGGTCAATCNTGGAGATGAGCCCAACAAAAGCACAN 
ATTATCGATAGGGAAATTCCATCmCAGTGTCAAACTGGAACCXnTCATGAAACTTGrrCA 
GCAGAAAGAGGTGGCAGAAACCTmGACrCGTTCTCCGANGCCCCNAGCTCCAAATANNGCGNT^ 
CTGAAAGGCGNCTTTANCTCrCATCCAAGGGCTGNTCCTTGCCGNGNANGANGATAAAANTGANC 
NGTNTATGATNCTTTCTNGGGCn^CCCTTNATa^AAACAGGTGm 

SEQ ID NO: 2330 ACAnACGCACCATAACATGCGTCrTTAAAGOriTCX^CAA^TA™ 

TGACCAGCAATGACAAGAAAAAAGAGGAGCACCTTTACAAGCAGTTGATATCCAATATTAAAATA 

ATTGTGGCTTTAAAAATATITCrn'AAATIXnTGCATTACAOT 

AGATTAATCAATGAAArTTATAAGTnTATCAACGTATAAAATTTrmC^ 

GAATACAATCrGTGTTrCTGACCAGTTGAGGTAGTTAAAATAGGGAGGGCTTTTCTAAm 

TTGACTATTrcAOAAAGAAAGGrrATCITITACTGGTGAGCACAGTCATrGCTCTGC^ 

AGGATTCAAAGAATATAACACAGTGTTGTTATCATAAAGAGTGTGAAGTTTATTTATTATAGCACC 

ATTGAGACATTTNGAAATTGGAATTGGAAAAAAATAAAACAAAAAGCATTTGAATTGTATTTGG^ 

GGAACAGCAAAAAAAGAGAA.GTrCAl'lU'l'C4'lNGNCAATATACrGNGCCAACATTTGGAANTAAT 

AAC 

SEQ ID NO: 233 1 ACCTTCACTATCACTGGGCGTTTITGGAGAACATTTCTTCTGGGAGTTACCT^ 
ACATTTGGCCACTGTTTTCGGGTTCTCAGTTGAAGAACrGTTGGGTGGAGCTGACAATGAGGTC^ 
GGGAGATGGGGAACGGCCCATGGCTGTGGACATGCCGGCTCCTGCAG GAGAGGGAGGC TGAGTC 
TGTrCCTCGGACACCTTACTCCCmGGGTCCAGCCrrrrrCrTTGGTTGArrr^^ 
TCCAGGAGTTGCCTCTGCTCCnTCAGTTTTGACAGTGCrrGCTTGTGGAGGGTGCAGGTO 
ATCCACTTCCTTCTTCCTCTTTCTTGATANGOCTGGTCCm 
TTCANCCAG^r^C^CCAAGCGTTGGTAT^r^CITNCAAAAAT^^TNATTAC^^ 
TCAOCTCNGAACTTTTTTCCTC 

SEQ ID NO: 2332 ACTACTTCTCAAGGAGGATTCATGGTCCGTCCrn'GCTCACTACAGATrrCTC 
CTCTTCTCTGGGAAAAAATGGTCAATGCTTCTGCTTCCrmAATAAACT 
ACATTAnATTATTCCCCACTrGACACCTTCrrAGAAACTTGATTGTTGGATGTGT^^ 
CAAAAATrcAATGTGGCTTTGCGTGGGATOTCTGAAGTGTCAGAAACAGCACTGTTGATC 
TTTTGAGAGGGAAAATATATATAGAGATGCATACATTCCCTAGAAGAACAAATGTAAAAGACTGA 
AATAGGGCGTGCACAGAGTTAAGAGGGCTGACAGACACTACTNGGTTTGGAATTCTGTGGCTGTC 
AGCCATCAAGGACTGCTGTOTAGGCCANAATNTAAATCTCCTTAGGAAGAAT^ 
ACCACTGNAAATGNC 

SEQ ED NO: 2333 ACACATACACACCTAAAGAGTCATGGCCTTCTrAAACAGCrrrCTT^ 
TCTGGAAATATCCrrrGGTTCArrTrrATTGCCCCTCTCTAGGCAAAACA^ 
GTATCAGTGAGTTArrTCCrAGCACrTGTAAGCAAATATOriTACCAAGAGGAACC^ 
TATAATATCGTAAAGCGTGGAGTTAAGATGTGTTTTTAAAAAATATACAGGCTTTTTATATGOT 
GATCTGAAAAAGGTAACATCCACAATGACATTCTAITACAGAGTTCrrACAATCACCCTAGCCTAC 
TACACTCTGGTATAATACTCnTCTrCAATTCTGTTrAACAGAATAAAAGTAACCAAAG^ 
CAATGCATrrGAACTTAAAATATAACCTGCCCACAGGAATTAAGTAGTTTTATTC 
TACATAATTrCTCAGATCCCCrmACCTATTGTCrrAAGTGGATAAGCACATAGTCATGCACA^ 
GAmGATGTCrACCAAGTTAACTGTATGGATITATTrGNATCAATAAAAGGGNCATGAAAArrCTG 
GGTGGGAATCACATCCATGTTGCTTCACACGAA 

SEQ ID NO: 2334 ACTATGCCAAACACrrATAACTTGTATAAAAATTCCACATCCCCATATTGGCC 
ACCTCANGATGAAAACAGATAACrCCCTAAATGTTAACTGGCTCTACTCCCCTAATATTAAACATA 
AAAACCACATGGGAAATATAGAAATTCAAATAGAAGTAACATAAACCTGTCATAAATCGTAAACA 
AAAAACTATTTGTGGGACAGCATGGATGACAAATGGTCTACTGTGTAAATTTTAGAATGAGGCAG 
ACAAAAGTTGGAAOGCCGGTTAATmCCCCTCCTTCTCCTGCnTCAGCTrCGT CTCCr rG^ 
CGATGTCCACAATGTCAAAGTmTCrCTCAGTAArrGCATTATTAGCGTGCTGTCTTrGTATG ACTC 
TTCACTTAATGTATCAAGTTCAGCAATGGCTrCATCAAAAGCTGTCTTTGCAAGAAAGCAGGCT^ 
CTCTGGGGAGTTCAGAATCTCATAATAGAACACAGAGAAGTTAGGGCCAGACCCATCmGATAGG 
ATGTGTGGTTGCTTTCCrrrTrTGCTGmCAAAAGCTNCT^ 

SEQ ID NO: 2335 ACAGCCAACGGTTTCCCTTGGGGGOTrGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTXjGAAAGCGGTCTG 
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CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTTGCTCrCTGATGAAGATGATGAC 

GATGATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTTTGATGATGAGGAAGCTGA 

AGAAAAAGCGCCAGTGAAGAAATCTATACGAGATACTCCAGCCAAAAATGCCAAAAAGTCAAAT 

CAGAATGGAAAAGACTCAAAACCATCATCAACACCAAGATCAAAAGGCAANAATCCTTCAAGAA 

ACANGAAAAAACTCCTAAAACACCAAAAGGCCTAGTTCTOTNTAAGACATTAAAGCAAAAATGCA 

AGCAAGTrmGAAAAAGGTGGTTCTOTCCCNANTGGAAGCCAAATTCATCAATTT^ 

OTCCGGTTGACTGArcAATAGGCTTTrCAAATCTCTGCTNTGGA 

SEQ ID NO: 2336 ACGCGGGGGQAAGGGAGAGAGCTGTGCGAGCGTGGGGGAQAGrmTCGTT 
GGAATATACGTTGCACATTTATGGCGATTCrGAGTGTGAGGGCAGACTTCTGCCAGGCTCAGCACA 
GCATTITCGCTGACAAGTGAGCnTGGAGGTTCTATGTGCCATAATrAACATTGCCTTGA^ 
TGGACACCGAGACTGGCCTCAGAAATAGTrGGCJ'rilUUlU'llU-ri'lAATTGCAA GCATA lTTC'lTT 
TAATGACTCCAGTAAAArTAAGCATCAAGTAAACAAGTGGAAAGTGACCTAC ACTTT TAACTTGTC 
TCACTAGTGCCTAAATGTAGTAAAGGCTGCTTAAGTTrrGTATGTAGTTGGATTTITrGGAG TCCG 
AAGGTTCCATCTGCANAAATTGAGGCCCAAATTGAATTTGGATTCAAGTGNATTCTAAATACTrT 
CTTATCTTGAAGAGAGAAGCTrCATAANOAATAAACAAGTTOAATAOAOAAAACACTG NTTGATA 
AANAGGC^mT^AG^tKK3CTTT^TAATGTTT^rCTGOT^ 
TTTT 

SEQ ID NO: 2337 ACGCGGGGAGGAAGAACTAAATCCAAAGATACTAGCTTTGCAGAATGCTCAG 
AGAAAGCXjAAAAATGGAACATGATGGTTCACrrmCAAGCAGTAGGAATTGGAACATrATTACA 
GCAGCCAGACGATNATGCAGCTACTACATCACTTTCTTGGAAACGTGTAAAAGGATGCAAATCTA 
GTGAACAGAATGGAATGGAGCAAAAGACAATTATTrTAATACCCTCTGATTTAGCATGTAGACTG 
CTGGGGCAATCAATGGATGAAAGTGGATTACCACAGCTGACCAGTTATGATTGTGAAGTTAATGC 
TCCTATACAAGGCAGCAGAAACCTACTGCAGGGTGAAGAATTACTCAGAGCTTTGGATCAAGTTA 
ACTGAGC ITJ-n CTTAATTTCATTCCJ 1 11 INTAGGACACTGGTGGCTCCTACCTAAAGCNNGCTAT 
TTATATTTNCTACATCCTAATTCAGANCCCGGCTNCNATN(mX3CCAAACTTGGNT^ 
GATCCCCnTrCTANrTTAATTCNC>riTCAATGa^Cl'i-lll^l^ 

SEQ ID NO: 2338 ACCACTCACGATGCTGGTGACCCCAGCTGCTGTTGCCAAACCTTGACCAGCG 
GTCGAGAGCAGCAGGCTTCCTCCTCCTGrrGCrGGGGCAAGGGCTAAACCCAGGAGGCTCATCAC 

tccaaagatgacagcagtagaggcggccaccatgttaaccttggtnaatttcttgtgggtctttgt 

CAATATCTNTATGrrAGGGCA 

SEQ ID NO: 2339 CGCCGGGCAGGACTTACrTGGAGAGACATATGTCTGAATrTATGGAGTGTAA 
TrrAAATGAACTAGTTAAACATGGTCTGCGTGCCTTAAGAGAGACGCTTCCTGCAGAACAGGACCT 
GACrACAAAGAATGTTTCCATTGGAATTGTTGGTAAAGACTTGGAGTrrACAATCTATGATGATG^ 
TGATGTGTCTCCATrCCTOGAAGOTCTTGAAOAAAGACCACAOAGAAAGGCACAGCCTGCTCAAC 
CTGCTGATGAACCTGCAGAAAAGGCTGATGAACCAATGGAACATTAAGTGATAAGCCAGNCTATA 
TATTGTATTATCAAATATGTAN 

SEQ ID NO: 2340 ACGCGGGATTCACTAAAACCATGTGTCTGAACTGAAGAAGCTTGGGCTCACT 
TCCACAAATTAGGTAGCCrTGTCAGATGATATGCAAATGATTCACATCAGTGTrTGAAGTTCAAAC 
CAAGGAAAACTGTOGAAAAATGCTAATCACTGAAAGAAAACATTTTCGGTCAGGAAGAATTGCAC 
AAAGTATGTCTGAAGCAAATTTGATTGACATGGAAGCTGGAAAACTCTCAAAAAGTTGCAATATr 
ACAGAATGCCAGGACCCAGACTTGCnTCACAATTGGCCGGATGCTTTCACCCTrCGTGGT^^ 
GCTTCCAAAGTTGCAAATCCATIxrrGGAATCAACTGTCTGCTTCTAACCCATTmG GATGAC^ 
ACTCAACTAAGAAATAACAGGAAGAGAAATAATArrTCCATCTTAAAGGAAGATCCri'rrCl'rri'C 
TOTAGAGAAATAGAAAATOGAAATTCTmGATTCCTCCGGTGATNAACTTGATGCGCATCA^OT 
CTTAGGCAACTTCCTCAAGAAATTCTGGAANATCTAAAAGTGTrTCANAACT^^ 
ACNACCAGCCCATGCCCCTT 

SEQ ID NO: 2341 ACTACCTGATGOATGCTGCCTCCTTGCTGCCTGTTCTGGCCCTCGGCCTGCAO 
CCTGGGGACATCGTGCTTGACCrATGTGCAGCn'CCTGGGGGAAAGACACTAGCGTTGCTTCANACT 
GGCTGTTGCCGCAATOTGCTGCCAATGATCTCTCCCCGTCCCGAATAGCCAGACTACAGAAGATC 
CrrCACAGCTATGTGCCTGAAGAGATCAGGGATGGAAATCAAGrrCGAGTTACCTCATGGGATGG 
CAGGAAATGGGGAGAACTGGAGGGGGACACCTATGACCGGGTGCTGGTGGATGTGCCCTGTACTT 
TTTThrnTmrrnrNNGTATITCT^AAAAT^^ 
NGTAAGCAAAATATGGGGGTN 

SEQ ID NO: 2342 ACATTAGCACCATAACATGCGTCTTTAAAGCCTTCCCAAATATTAAGTAATCT 
TGACCAGCAATGACAAGAAAAAAOAGGAGCACCTTTACAAGCAGTTGATATCCAATATTAAAATA 
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ATTGTGGCTTTAAAAATAIU ICl UAAATTCTTGCATTACACrTTT 

AGATTAATCAATGAAATTrATAAGTTTTATCAACCGTATAAAATTTTmCATOT 

AGAATACAATCTGTGTrTCTGACX;AGTTGAGGTAGTTAAAATAGGGAGajCTTTTCTAAm 

TTTGACTATrrCAGAAAGAAAGGTTATCTTTTACTGGTGAGCACAGTCATTGCTCrGCAGATG 

TAGGATTCAAAGAATATAACACAGTGTTGTTATAATAAAGAGTGTTGAAGTTTATTTATTATAGCA 

CCATTGGAGACATTTNGAAATTGGAATNGGTAAAAAAATAAAACAAAAAGCATTTGAATNGGim 

TGGTGGAACAGCCAAAANAANGAGAAGm"CANTTrrCTTTGCAAATTNTACTG 

SEQ ID NO: 2343 ACGCGGGCTATTTGAAGATATGACTGATTCTGACTGTAGAGATAATGCACCC 
CGGACCATATTTATTATAAGTATGTATAAAGATAGCCAGCCTAGAGGTATGGCTGTAACTATCTCT 
GTGAAGTGTGAGAAAArirCAACrCTCTCCTGTGAGAACAAAATTATTTCCmAAGG 
CCTCCTGATAACATCAAGGATACAAAAAGTGACATCATATTCTTTCAGAGAAGTGTCCCAGGACA 
TGATAATAAGATGCAATTTGAATCTTCATCATACGAAGGATACTITCTAGCITGTGAAAAAG 
AGACCTTTITAAACTCATTTTGAAAAAAGAGGATGAATTGGGGGATAGATCTATAATGTO 
TCAAAACGAAGACTAGCTATTAAAATTTCATGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGC 
CCTTTGGGAGGCTGANGCGGGCAGATCACCAGAGGTCAGGTGTTCAAGACCAGCCTGCCAACATG 
GTGAAACCTmCN>rrACTAAAAATACAAAAANTTAGCTGAGTm7^GTG 
GGNTACTCANGAGCTGANGCAGGAAAATCACTTGCNCTCCCGG 

SEQ ID NO: 2344 ACNOGGGTGATCGACCANAGCTAACAGGTGCCAAAGTGGTGOTATCTGGTGG 
TCGAGGCTTGAAGAGTGGAGAGAACTTTAAGTTGTTATATGACTTGGCATATCAACrACATGCTG^ 
ANTTGGTGCTTCCCGTGCTGCTGTTGATGCTGGCTNTG 

SEQ ID NO: 2345 ACTAACCTCATTGTAATGAGGTTTAAGAOAAGGCCCAGCACTAAGCCAGGCT 
CrCAGGAAAATTCAAACAACAGTTCCTCmGCCCTrCAATATAGCCCCrrC^ 
TGAAGAACCACTGGACCTGTAAACCAAGCACACAGGTATAAGTCCACAGACCAGGTGAAGGCCT 
AOATGGCAAAACACATGGGCnTrGTGACTCCACTACTGACTTCCAGGCTAAGGAAGGACTGACTT 
AGTGAGCTGTTCCAAGACCACTGAGCTCATGGTTCCCTGTGGCTGGGACCTCCATCATGACCGGGG 
CTTGAAGAGGGT 

SEQ ID NO: 2346 ACTGTTCAAAAAGGAATGCCCCACAAGTGTTACCATGGCAAAACTGGAAGAG 
TCTACAATGTTACCCAGCATGCTGTTGGCArrGTrGTAAACAAACAAGTTAAGGGCAAGATTCTTG 
CCAAGAGAATTAATGTGCGTArrGAGCACATTAAGCACTCTAAGAGCCGAGATAGCTTCCTGAAA 
CGTGTGAAGGAAAATGATCAGAAAAAGAAAGAAGCCAAAGAGAAAGGTACTACCATGCCGGGCC 
AATITTrrTTTTGTTTGTAGAAATGAGGTCTTGCTATGTTGCCCAGGCTGQT^ 
TCAAGTGATCCTCCTGCCTCAGCCTCCCGAAGTGCTGGGATAAAAGGTGTGAACCACCATACCCA 
GCCAGTATTATCTnTCATTTCATmCCAGTTGAGTTTATATTGGCTACATTTGCATACCGCACAA 
TTGTrcAriTmAAAAACCATATTTTGTTTTGTTCTGTTGCT 
AACTTACasiCCANTCNGGCCAAGTCCACrrGAGGAA'mGCT 

SEQ ID NO: 2347 ACCTGCATCAGCATTAGTAATCAACCTGTTAATCCAAGGTCTTTAGAAAAACr 
TGAAATTATTCCrGCAAGCCAATTTrGTCCACGTGTTOAGATCATTGCTACAATGAAAAAGAAGG^ 
TGAGAAGAGATGTCTGAATCCAGAATCGAAGGCCATCAAGAATTTACTGAAAGCAGTTAGCAAGG 
AAAGGTCTAAAAGATCTCCrTAAAACCAGAGGGGAGCAAAATCGATGCAOTGCTTCCAAGGATGG 
ACCACACAAGAGGCTGCCTCrCCCATCACTTCCO-ACATGGAGTATATGTCAAGCCATAATTGTTC 
TTAGmTGCAGTTACACTAAAAGGNGACCAATCATGGTCACCAAATCAGCTGNTACrACTCCTGTA 
GGAAGGhrrAATGTTCATNATTCCTAAGCTATTCAGTAATAACTCTACCCTGCNATTTAATGTAAGC 
TCTCTGAGGGGCCTI^GGTCTTAANTGGATTGTTCGGCCCT 

SEQ ID NO: 2348 ACGCGGGGCTTGTCCAGTGAAACACCCTCGGCTGGGAAGTCAGTTCGTTCTCT 
CCTCTCCnrrCTTCrrGTTTGAACATGGTGCGGACTAAAGCAGACAGTGTTC 
AAAGTGGTGGCTGCTCGAGCCCCCAGAAAGGTGCrrGGTTCTTCCACCTCTGCCACTAATTCNACA 
TCAAGTTCATCGAGGAAAGCn'GAAAATAAATATGCAGGAGGGAACCX:CThrmGCGTGCX}CCCAA 
CrCCCAAGTGGCAAAAAGGAATTGGAGAATTCriTAGGTrGTCCCCTAAAGATTCTGAAAAAGAG 
AATCANATTCCTGAAGAGGCAGGAANCTGTGGCTTAGGAAAAAGCAAAANAGAAAAGCATTGTC 
CTTTGCAACCTGATCACNCAAATGATGAAAAAAGAATAGNACrrrCTCATTCA 
CTNCTTGTITACCCTGNTATTCTAGAATGTAAAATTACATAAATGTGTrTGOT 
GAACNAGGCTTTATTAAAAAAT^TAGGTTTAA^TAAAAAT^^ 
AAAKmGTAANACTAATmT^GGNACTTNNCTTAlWT^ 
TNACCAAAT 



351'^ 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 2349 ACGCGGGATTGCAGOTCAACTTTTCTCTTTAGTGTTCTGTrTGAAACTA^^ 
CTTACCGAGTCAGACTTTGTGrrCATTrCATTTCAGGGTCTTGGCTGCCTGTGGGCTTCCCC^ 
GCCTGGAGGTGGGCAAAGGGAAGTAACAGACACACGATGTTGTCAAGGATGGTTTTGGGACTAGA 
GGCTCAGTGGTGGGAGAOATCCCTGCAGAACCCACCAACCAGAACGTGGTTTGCCTGAGGCTGTA 
ACTGANANAAAGATTCTGGGGCTGTGTTATGAAAATATAGACATTCTCACATAAGCCCAGTTCATC 
ACCATTTCCTCCrrrACCTTTCAGTGCAGTTrCITITCACATrAGGCTGTTGG^ 
NACGGACTGTCAGrrCTCTGGGAAAGTGGTCANCGCATCTGCAGGGCTTCTCCTOCTCTGCTTTGN 
ANAACCAGGGCTCTTCTCAGGGGCTCTAGGGACTACCANGCTGTTTCAACCCAGGAAGGCCAAAA 
TCAAOAAGTTGAGATGTAGAAAANTTGTAAAATAGAAA AAGTG GAAGTTGGNGAATCGGTTGTTC 
TTTCCTNCATTTGGGATNATTGGCATAAAGGTITTTAACATriTCCrCC 
TTT 

SEQ ID NO: 2350 ACCTTGGCTTGGCrcTTGACGTGGACAGAATTAAAAAGGACCAAGAAGAGGA 
AGAAGACCAAGGCCCACCATGCCCCAGGCTCAGCAGGGAGCTGCTGGAGGTAGTAGAGCCTGAA 
GTCTTGCAGGACTCACTGGATAOATGTTATTCAACTCCTTCCAGTTGTCTTGAACAGCCTGACTCCT 
GCCAGCCCTATGGAAGTTCCTTTATGCATTGGAGGAAAAACATGTTGGCri'rjUriUriGGACGTG 
GGAGAAATTGAAAAGAAGGGGAAGGGGAAGAAAAGAAGGGGAAGAAAGATCCANTGAAGGAAA 
GANGAAGGGGAAGAAAAGAAGGGGAAGAAGATCANAACCCACCTTGCCCCANGCTTANCANGGA 
NCrGCTGGATNAAGAAAGGGCCTOAAATNTrGCAGGACTCACTTGGATAGAm^ 
TCANGGTGNC ^TGA A^TOCTGGCTCA^TCCACCCr^ANAAGAAN^^^GCT^^ 
ACCmKTTGGCnTG 

SEQ ID NO: 2351 ACGAAGAAAGCATTTCCNAAGCAATGAGTCTCTTAATGGAAAAAATAAAAG 
AGCAATGTNATTAAACTTTCCCTCAAAGAAAGGAAAGAAACCTAAATCTGTrATOGCAAATT^ 
AAAGATAAATA^T^CCTAGTCCCATAATATGTGTTTGTCTNACAAAGAT^mCCT^m 
NATTCTATGANGCAGGGGATCTTGTGGNAATTC 

SEQ ID NO: 2352 ACAAAAATACAGTTGATGACTTGACAAAATGGCTACACCTAGGGCTTGAAGG 
TTTGAGTTTCTCCAACAGTAACAAGGGAAAGCATGCTTCCACCTGGAGCCGAGTCCAAGCACACA 
GCCAGTCCTGCACACGCATGCGTGCAAACAGGGAAGCTCAAGCATGAGAAGAGGAAAGAGGCTO 
TAGAAATTTGGGAAGAAGCCCACAATTATTCCCAGGAGAAAAAAGGGAAAAAACAGGCTGATAT 
CCrTGGTAGGGGGTAGAATAACTGAriTACACTTAGGATTTATTGTTATTGTrGTTG>^^ 
TAATGATAANCrACTTCTGCAATTTTAACCGTTGTANAAAAGATGCTACTAGTCTCCT^ 
AAGGTGAGTAAAGTGGGAGGAAATGGGAAAAGACTCAANTCITAAGTGCGAGGAGTrAACATrC 
AAGTGGCrcAANTrrmNTGTCANAAGGTTTANTCm 
GTGNAAGGGAAAAGGAAAAANAuAATTCTTNCNCTNTCTCGTGGATCC>^ 
TTNTTGTTANGATAACrCTrCCAAA 

SEQ ID NO: 2353 ACATAGACAAGTTTCITGTNAGACAGAAAACAGAGAAATCCACAGTAACTCT 
AACACATCCCTTAAGGAATAAGCAl'GTATTTGTAGGAAGCAAACAAAGCTTTCCATAGAGA^ 
ACTTTCACAGGATGATTAGGTGGACCTQCAATGAAGAAAATACATrTCAAAAGATGGGTTCAGAC 
TTACACCAAGrTTrcACTGAAATACrrAAAAAAAAAAGACCCrrCTCTCT 

AAAATACATCACGGATAAAATAAATCTCAGGAAAGGTCCAAGTCCTACTCAGNAGACATACATTT 

GCAAATTAATATAAATTrr>^\AGriTGACAACAAAAATACrrATTTGGAACTACCT 

AAGCTAAGTTCGGTGNTCTGCTrrAGTNGCATTACATTGAAATGAATCTCATTCT^ 

TTTCirmACTrTAATCACra^GCGGGCIXiGGNCACCTGAAA 

TCAGAGAAATATGNNAACCTISfmTNATGlTrNCA 

SEQ ID NO: 2354 ACGCGGGGGCGGGAGAATCGCTTGAGCCCGGGAGGTGGAGGTTGCAGTAAG 
CCAACACCGTGCCACTGCACTACAGCCTGGGCGACAGGCTGATAGGAAGATGTCTTCAGGAAATG 
CTAAAATTGGGCACCCTGCCCCCAACTTCAAAGCCACAGCTGTTATGCCAGATGGTCAGTTTAA^ 
ATATCAGCCTGTCTGACTACAAAGGAAAATATGTrGTGTTCl'l'CU'riUACCCTCTTGACTTCACCrr 
TGTGTGCCCCACGGAGATCATTGCTTrCAGTGATAGGGCAAAAGAATTTAAGAAACTCAACrGCC 
AAGTGATTGGTGCTTCTGTGGATTCTCACrrCTGTCATCTACGCATGGGTCAATACACCTAANAAA 
CAAGGAGGACTGGGACCCATGAACATTCCTTTNGTATCAAACCCNGAAGCGCACCATTGNTCAGG 
ATTATGGGGTCTTAAAGGGCTGATGAAGGCATCTC>rn'CANGGGCCnTmATCATITGATGATAA 
QGGTATTTrCGNAAATCACTOTAAATGACCrCCTGOTTGCCNCTCTTGTNGATGAAACT^ 
TATTTTCAGCCTTNCAGTTCAhn-GACAAAACATGGGGAAArrGTGCC^ 
GNNNTGATAC 

SEQ ID NO: 2355 ACCCGACCTCCATCTTCACCAAGAAATGTTATCTCTAATATAAACGAGACCTC 
AGTrATCCTGGACTGGAGrrGGCCCCTGGACACAGGAGGCCGGAAAGATGTTACCTTCAACATCA 
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TATGTAAAAAATGTGGGTGGAATATAAAACAGTGTGAGCCATGCAGCCCAAATGTCCGCTTCCTC 

CCTCGACAGTTTGGACTCACCAACACCACGGTGACAGTGACAGACCTTCTGGCACATACTAACTA 

CACCTTTGAGATTGATGCCGTTAATGGGGTGTCAGAGCTCAGCTCCCCACCAAGACAGTTTGCTGC 

GGTCAGCATCACAACTAATCAGGCTGCTCCATCACCTGTCCTGACGATTAAGAAAAGATCGGACC 

TtXAGAAATAGCATCTCmGTCCTGGCANGACCTOAACATCCTAATGGGATCATATTGG^^ 

AGGTCAAATTNCTATGAAAAGCAGGAACAAGAAACAAGTTATACCATTCTTGAGGGCAAGAGGC 

ACAAArrGTTACCATCAGTAGCCTCAAGCCTGACACTATATACCGTNTTCTAAATCX: NANCC C^ 

CAGCNGCTGGATTTGGGACNAACAGCCX:CAN>nTTGAGTITGAAACrm 

TTTTTTGGNNA 

SEQ ID NO: 2356 ACGCGGGGAGTGTGAAATCTrCAGAGAAGAATTTCrrCmAGTTCTTTGCAAG 
AAGGTAGAGATAAAGACACTTmCAAAAATGGCAATGGTATCANAATTCCrCAAGCAGGCCT^ 
TTTATTGAAAATGAAGAGCAGGAATATGTTCAAACTGTGAAGTTATCCANAGGTGGTCCCGGATC 
ATCGGTGANCCNCTATNCTACCTT 

SEQ ID NO: 2357 ACTCTrGATCAAAGACCGTGAAACCAACAAATCAAGAGGATTTGCTTTTGTC 
ACCTTTGAAAGCCCAGCAGACGCTAAGGATGCAGCCAGAGACATGAATGGAAAGTCATTAGATGG 
AAAAGCCATCAAGGTGGAACAAGCCACCAAACCATCATTTGAAAGTGGTAGACGTGGACCGCCTC 
CACCTCCAAGAAGTAGAGGCCCTCCAAGAGGTCTTAGAGGTGGAAGAGGAGGA AGTGG AGGAAC 
CAGGGGACCTCCCTCACOGGGAGGACACATGGATOACGGTGGATATTCCATGAATTTTAACATGA 
GTTCTTCCAGGGGACCACTCCCAGTAAAAAGAGGACCACCACCAAGAAGTGGGGGTCCTCCTCCT 
AAGAGATCTGCACCTTCAGGACCAGrrCGCAGTACAGTGGAATGGGAGGAAGAGCTCCTGTTCAC 
GTTGGAAAAAATAGTTATGGAGGTCCCCTCGAAGGGAACCGCTGCCCTCTCGTAGAGATGTTTATT 
TGTCCCCAAAGAAATGATGGGTTTTCTACITAAAGACAGCTmTCA^ 

CTCOTGATCTAAGANATTATGCCCCCCCCCCACGAOArmCTmCGGNGATTNTOGT^mTCCAG 
TTTNCGNTGATG 

SEQ ID NO: 2358 ACGCGGGGCTCTrCCTGCTCTCCATCATGGCGCAGGATCAAGGTGAAAAGGA 
GAACCCCATGCGOGAACrTCOCATCCGCAAACTCTGTCTCAACATCTGTGTTOGGGA GGGTG OAO 
ACAGACTGACGCGAGCANCCAAGGTGTTGGANCNGCTCACAGGGCAGACCCCTGTG'i'rrrrCAAA 
GCTAGATACACTGTCAGATCCTTTGGCATCCGNAGAAATGAAAAGATTGCTGTCNCTGCACAGTTC 
NAGGGGCCAAGGCNGAAGAAATCTTTGN 

SEQ ID NO: 23 59 ACTGGCCCTCAGTGCTGGCAAAGGTGTAGTTCCACTGGCCGAGGGAATCAAG 
ACATAGTGGTCCTTCTGCTAAGCCAAGGGCTGCCACAATGACACAGTAGCCAGATCCTGCAATTC 
CAATGAGAGCAGCCAATACAGAAGAAAGCATCGCACATCGrrTGCCACAGrnTCATGGCCACAG 
CAGCCACAGCAGTCATCCTGTTCCAGCX:CAATGAAGACAAATGCTG GCAGG AGCATCAGC AGGC C 
ACCTCCTACGATGCCAGAAAAGAACCACACGAAGCGGCTGAGGTGGTITrCGGAGGCATACrTTG 
rrrCCCCATTGGGAAAGTAAAGCAAAATATl'AGCCGCGATGCACAGGAGGG CGAG CCCCACCAGA 
GAATGTCCGATGCATCGTGCACACnrCCCATAGCACATGGTGGTCTGCTAGGTTrrCTCCCOT 
TTTGTCTTCAGCTCAGTGATACCCCAAArrANATGAAAAGTGTGCCCTTCTGGTGGANAAAGCAAA 
CACCACrCCCCGCGT 

SEQ ID NO: 2360 ACATCAAGTCAGAATGATGTTGACATQAGTTOGATTCCTCAGGAAACATTOA 
ATCAAATCAATAAAGCTTCACCAAGAAGGTTGCCCAGGAAACGGGCACAGAAGAGATCAGTGGG 
ATCTGATGAGTAAATGTTCCTTTGTGCAACAATTCGGGCTTTACTTAACCCTGCCCTAAATAT^^ 
CGGCCTGATGGGATTGAGTGCTAGAGAANCCATGT 

SEQ ID NO: 2361 ACCGGAAAGGAAGCTCCCATTCAAAGGAAATTTATCTTAAGATACTGTAAAT 
GATACTAATTTTTTGTCCAmGAAATATATAAGTTGTGCTATAACAAATCATCCTGTCAAGTGTA^ 
CCACTGTCCAa3TAGTTGAACTrCTGGGATCAAGAAAGTCTATTTAAATTGATTCCCATCATAACT 
GGTGGGGCACATCTAACTCAACTGTGAAAAGACACATCACACAATCACCTTGCTGCTGATTACAC 
GGCCTGGGGTCTCTGCCTrCrCCCmAa:cnx:CCGCCTCCCACCCTNCCTGC^ 
AGCCTGGGGGGCTTGTTAGAGTANATGTGAAGGTTTCAGGTCGCAGCCTGTGGGACTACTGCTAG 
GTGTGTGGGGTGTTANTGGCAGCACCCCNATANCT 

SEQ ID NO: 2362 A CnTITl - lTn - nTri - lTl - l - l I T AATTNAAATCAQTAGmTATTACAAT 
ACNCACTGACACACAATTGGAAAAGGAATGTCCTGACAriTTCTGAGCATTTCAC GGAA AGCAAA 
TGTAACCATCX;ACGCAGTANAGT(>TCCACACCrnTCCTAACAAAAAGCAAAAGCmCAC^ 
GATAACTGAJVNAAAGCAAAGTTTTCTTATTCTTGTCTAAAAAACTTCAGTCT 
TGAAGTGAGCTGANAAGGAGCACATTTCCTTCACAATGTCCAAAATCCTGCCTCTAGATTATCCCC 
CTGACCTTCTAAATGACnTT^ATAATACTGCAGATGGT^r^^ATGTTCAGGGGCCAAATGGGA 
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CnTrAAGGCCAACTACTCTATTGTCACrATAATTGAAAGGAAAGAATAAATTAAGAAAATT^ 

CCTGCTAAAATGCACAAAGACTTGAANAAAATTCTGGGTGTATTAAAAGTGAGATGCTCTTTTO 

GTAACTCATTNGAAAGGNTTNCAATTACCATAAACATTTTAACT^ 

AAOsriTGTTTCCTTTTOTGCAAAAGGGCTTTGNCm 

CGGGGA 

SEQ ID NO: 23 63 ACTTGCCCCTTCCCCAGAAAAGCGGG ACrrGCTGCTAAGGGTGAAGGACCAA 
GGCAGTTGTCCCTGCGTGGTCTGACACCCrTGAAACGTGGGTGTATAATCAGAGAGGCATCCCTGC 
AATGATTAAACACCAAGGGAAGOCrGCCTrCCCAGTCTGTGACCAGCGCCGGAGTTTTGGGTCCA 
CGGATAAAACGTGTCTCTTTTGTCTCTACCAGAAAATGAAAGGAATTGAAATTAAGAGAAGGG 

agattgaagtgtagtgcx:aanattgaaaggagaaagtggttgagggatagtgagggaagttggg 

GAAGAOAGTAAAAAGAGGCTGCTTCCAGATTTGAAATTGGTGAGATGTTrcrrGGGCT 

CTGAGGACCTGAGGTCGTAGGTGGATCTrTCTCAGGGAACAAAGAGCAGGAGGACGGAGGATTG 

ATCTCCCAAGGGAGGTCCCCCNATCCGATTCATGGCCCAAATTTCATGTGCXrrTCATGTGAAGANA 

CCACCAAACAGGCrrrGTGTXjANCAACATGGCTTGTlTArrmACCTGGGGTGCAGGC^ 

TTCCGAAAANANAGTTAANCCCCNNGTACmGGGCCGANACACGCTTAGGGGCC 

SEQ ID NO : 2364 ACACCTTGAAGGCGAGGTTAATTAAATCCTGTTGTGGAGTTTGAGGGCCGGA 
ATTTAATITITGGAGTTTTATTTAATATCGGGAGCAGATTGGGTAATAAAATGTATATTGAGAATA 
AGACGGCCrmGACCTTTAAGGGTCrrANAGGCTGTAAAGTGTCTCAGGGTTAmANCN^^ 
CCATGANCTGGNm'GGGArrrmTNACTNGANAAAAAhWAGGCTTG>m 

SEQ ID NO: 2365 ACGCGGGGGAGGAGACTATACAACTACAATAGAAGCA1TTATATCTGCTAGT 
GGAAGAGCTATCXIAGGGAGOAACATCACATCAmAGGGCAGAATrrTTCC^AAATGTrTGAAAT 
CGrnTTGAAGATrcAAAGATACCAGGAGAGAAGCAATTTGCCTATCAAAACTCCTGGGGCCTGA 
CAACTCGAACTATTGGTGTTATGACCATGGrrCATGGGGACAACATGGOmAGTATTACCACCCC 
GTOTAGCATOTGTTCAGGTGGTGATTArrCCTTGTGGCATrACCAATGCACrrrCTGAAGAAGACA 
AAGAAGCGCTGATTGCAAAATGCAATGATTATCGAAGGCGATTACTCAOTOTTAACATCCCGCGT 
TAGANCTTGATTTACGAGATAArrATTCTCCAGGTTGGGAAATTCAATCACTGGGAGCTCAAN 
GTTCCCATTAGACTTGAAGTTGGCCACGTTGATATGAAAGAGCTGTCAGTTTGTACCCNTCNACNA 
GATACnTGGAGAAAAGCTGACAGTTGCTTGAAAATGAGGCANAAACTAAACTTTCTAGm 
GAAGACATCCAGNCA(^CTTTANACAANGGCnTCTNAAAACCTTAAACTTATTG^ 

SEQ ID NO: 2366 ACACTAGAGGCTTTGGTAAAACATCTTCTCTCCAGAGGGTGAAGATAAATAA 
ACCTTACAGAGATTCAGAACTGGCCACTGCAGTGAAGTTTTACAGGTCTAGTGGTTAGGGGCATC 
CAGGGGTGTCCCTTCCAATCTGAAAGACAAACTGTrGCATCrrGCATCCTCATGCAAGGAAGGAA 
GCACACTGCCTGGTGAGCCTGTn'GAGTTCTGACATCACArrCTACATOTAGGTGTArrGCTTTGCC 
CACACTCTAGGTGACATANGAGGATGCCAGCrrrGANTGGAGCCTCNCAGGAAAGGACACTGCAG 
CCAGATCCAGGCCATGGCGCA>n"CACCATCCTTCAGACCCCCrGGTGCTGAAAGTGCCAhrrGGTG 
GGGTAAAATTTGGGGTGGACCTAACCAAACACCATmGGANAGTCCANTGGAGGGCCTGGGATTC 
TGGAmTAGGCCATGTTATNCCANAGCACAANAATTATGCCCATCCAAAAGCNGAAANANTATNT 
GGGGAAawCTTTTTGCCTTGTTNNTGGANGNCKrGATAAANNANAATGOT 
AAAACNACC^r^^GTAr^CC^^NG^TGCCCTANAATrNNNCT^^ 
CTGGCCANAACNAT 

SEQ ID NO: 2367 ACCCAAAGGAAGAAOGTCTGCCCTGCCTTCTATAAACACATGCATGTGGCTC 

cgtcctcccaagacatcattgcagacaagaattagtgcagagttgctcagaggctgtttgctcagc 

aactaaagccaatctgacggcagccaatccaggtctgggcttactccctggtttggta^ 

aatagaattcaacccaaatrcaacrcaacttaaaagcgtcrgttgggtctgtgtgccagg 

ctgatagaagacacaaacaaagagtgtgacaacnatccctoccctcaaggaaacgctctgcgtct 

tcnccttaagcnacaagacaacccacccgcaaaaaaarracttgntrcccangnatt^^ 

ggcaaggnttgtaaaatgtgcatgccacacttgttgagtcrcaanaagctccagatgcgtgtgtcc 

tggncnagcaatctgntcatnaangggcaatgnggctgatcchrrrgn^^ 

SEQ ID NO: 2368 ACGCGOGGGTTGCTGTAAGGGGTCCTCCCTGCGCCACACGGCCGTCGCCATG 
GTGAAGCTGAGCAAAGAGGCCAAGCAGAGACTACAGCAGCTCTTCAAGGGGAGCCAGTrTGCCAT 
TCGCTGGGGCTrrATCCCTCTTGTOAmACCTGGGA'nTAAGAGGGOTGCAGATCCCGGAATGCC 
TGAACCAACTGTTTTGAGCCTACTTTGGGGATAAAGGATTATrrGGTCrrCTGGAm 
TCAGCNGGACAGCATGG AAGA TGTGTGCTCTGOCTCQGATAAGANATGGGACATCATCATTCACT 
AhrrrGGATGGCACAAGGCTTTCAAAAACNCATCTGTAG<K;AAAAhrrGGAACT^ 
GCCGGCCCCTCNAAAGGGGCGAATTTCANCCNAOTGGCGGCCGTTNCTAAGTG 
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SEQ ID NO: 2369 ACACTAACTTTTAAGTGTGTGGCACAAATGGAATAGAGACATGAAGTrTAAT 
GAGATCCAAGCITCCAACTAAATGATTTCACATTroAAAAGCrrATGGAAAGATAC^ 
GTTTTACAAATCAAATGCTTAAACCAAGTTTAAAAGTTGAGACCGAAAAAAATTGATGAANAA^^ 
AATGGCCAAAAAAATTAAACAAAATCTmTGGmCCTTTACAGGrrAOT^ 
TTrmCAAATTTGCATTTTACANTTANAAKTGCANANCACTITGGA™ 
TnTGGGGTTCCX:CTGCOTCTAAA 

SEQ ID NO: 2370 ACACTGAAAACTGGACL\TTNTAACATTAATTTTATTAGCTCTCTGGGAGTGAG 
CTACATQATGTTGTGCACTGAAAATTACCCAAATGTTCTCGCCnTCTCrTTrCCTGGATGAGOT 
AAGGAGTTCATTACTACTTATAACATGATGAAGACAAATACTGCTGTCAGACCATACTGTrTCATT 
GAAmGATAACTTCATTCAGAGGACCAAGCAGCGATATAATAATCCCAGGTCTCTTTCAACAAAG 
ATAAATCTTTCTGACATGCANACGGGAATCAAGCTGAAGGCCTCCTATCNAATTTCCANNGCCAA 
CNNGGGG>rrCACCCATTGNANTCN>rrCACCNTTTmGTTGACTGAAAAGGN^ 
TTCTGTCACCAGCNACTGGACCAGCA^CTCTGTCAGGGATTGTAGGATTTATCCITAGTCrri'lATG 
T GGAG CTCTGAATTrAArrCGAGGCTTTCATGCTATAGAAAGTCTCCTGCANAGNGATQGGTGATO 
ATTTTAATTTACATCATTGCnTTTTTCCTTGGAACAACAGCCTGCCIT^ 
GTCTACTACACCGGCTrGGNGGAATGGCAAATNTTmGACCriTOGGCTTAATCTGCrATO 
NATGG 

SEQ ID NO: 2371 ACTAGGCTAACTAGAAGGATCTCATCCCCATATGTGGTCTCATTTCAAGTCTA 
TGGATGACTACCTTCA'n'GCTGTGTGCGAGATGG'nTCACCCCTTGAAAATATGGTCACTrCAGCA 
TAAAATAGTTAAATCTTrATAATGATCAATTCATCCTACCrCCrrTTACATGCAGCTGAAA 
CAGGCTAGGGACATAGAATArrGTGAACTTTATACTGTTAGAATCACTGTCCArrAAATGATCACT 
AGCTAATGGTCACTAAATTTACAAATTAAGGAAATTATATATAGAATACTGCAAAACACNAGTAA 
AAAGACTGAAOrrCGCCCAmCTGCTCANGGAAGTCT 
CTTCTGGCTNCAAAAATTCTGCTATTATTACrGNTTTTCCTCCTTT^ 
GTGCCAGAACTTCCANANCCTTOTCGCTCAAATGCCATCTTmGTNTCCATTTC^^ 
AAGTGATGCCITGTGGAANAAAAGGATGCTrCCCTGTCTAANATTTNCTNCI^ 

SEQ ID NO: 2372 ACGCGGGGCTTACAAGTCOTCTTGATCCTGAACTGGGTTAGGTGCCGCTGTT 
GCTGCTCGTGTTGAATCTAGAACCGTAGCCAGACATGGGACTGGAGGACGAGCAAAAGATGCTTA 
CCGAATCCGGAGATCCTGAGGAGGAGGAAGAGGAAGAGGAGGAATTAGTGGATCCCCTAACAAC 
AGTGAGAGAGCAATGCGAGCAGTTGGAGAAATGTGTAAAGGCCCGGGAGCGGCTAGAGCTCTGT 
GATGAGCGTGTATCCTCTCGATCACATCANAAGAGGATTGCACGGAGGAGCTCTTTGACTrCTTGC 
ATGCAAGGGACCATTGCGTGGCCCACAAACTCTITACAACTTGAAATAAATGTGTGGACTTAATTC 
ACCCCAGTCTrCATCATCTGGGCATCAOAATATTTCCTTATGGrnTGGATGT 

SEQ ID NO: 2373 ACAATGCCTGCCATCATGGGTCAGAAAnTGAAGGATGAAGAAATCTACTGT 
TTGAAATCCTCAC(nTrCAGACGTATmcmATTCACATCCCAGGAGCATCCATmAAGGAACT 
ATTCTTTGGAAAAAAACAAAAAACAAAAAAAACAACAAAAAAAGCTAAGTTATAAGTGAACTG 
rrGGCTGCACTGTATGTCACrriTGCTTGTTGTCATGTGAACTTGGAAACrAAGG 
ATAAAAATTCTAAATGAAAGGGTGTGGGTTTCCATCAATCTGATGCTGCCCATCGCTTGCACTGGG 
GTCmGTGGATCGGGCAGGANTTNTCAGTGTGCTGGGGTGTTGCTCCTTCCTATGTGTCnTTTGAA 
TCTGAGGCTGACArmGCTTGGAAGGCCAACCCTTGCTCCATCANAGAGGGCNAGTGGCNAAAG 
GCCAATGAGGCAGCTGTGANTTGGACAGGGTTCA 

SEQ ID NO: 2374 ACAGAAACTGGTATTrn'GGTGCTGATACAAGAGAAATGTATTTTTAAATATC 
CCACATCCTGGATCmGTTGGGTATTTAGTATATTGACATATATTTrrATAAGGTGAGGTAACTC^ 
GAACrrAATTTAAAAGTCTTAAATATTCTGATACAATTCAGCTGTCTTCTCNACOT 
AGTTGCTTTCATTITAAACCAAAGCAAGTAACATATTNAGTGACTTGAATCrrC^^ 
GTAAAAACAGCNAAANAACCTAhTKnTrGTCTTTTNAACNCNGA^ 

SEQ ID NO : 2375 ACCTGTTATTGAATCAACAGAGACTATAGAGGCTAAGGCTGCCCTTAAACAG 
TTGCAGGAAATTTTTGAGAACTACAAAAAAGAAAAAGCAGAAAATGAAAAAATAC^AAATGAGC 
AGCTrGAGAAACTTCAAGAACAAGTTACAGAmGCGATCACAAAATACCAAAArrTCT^ 
CTANATTrTGCITCTAAACGTTATGAAATGCTGCAANGATAATTGTTGAAGGATATCGTCGATAAA 
TAACATCOTCTTGANANGAAATTAANAAACTCOCTGCNACAACTNANAAAGCANGAACNGATTA 
TCAATNCGATGACTCAAGATrrGANAGGGNCAAATG 

SEQ ID NO: 2376 ACACTCGAGATGAAGATCACATTGCATTGATCATAGAACTTCTGGGGAAGGT 
GCCTCGCAAGCTCATTGTGGCAGGAAAATATTCCAAGGAATTTTTCACCAAAAAAGGTGACCTO 
AACATATCAGNAATCTGAAACCTTGGGGCCTrTTTNANGTrNTAGTGGANAAGN^ 
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NANGAAGAAGCNATNTGGCTTCACACATrANTG 

SEQ ID NO: 2377 ACATTAGCACCATAACATGCGTCTTTAAAGCCTTCCCAAATATTAGTAATCTT 
GACCAGCAATGACAAGAAAAAAGAGGAGCACCTTTACAAGCAGTTGATATCCAATATTAAAATAA 
TTGTGGCTTTAAAAATAlTl'Cl I' 1 AAATTCCTGCATTACACri I"! CVVi ri AAACCAATCTTCCAGG 
AGATTAATCAATGAAAriTATAAGTTrrATCAACGTATAAAATTTTTTTCATOT 
GAATACAATCTGTGTTTCTGACCAGrrGAGGTAGlTAAAATAGGGGAGGGCTTTTCTAATTTCGT^ 
TTIXjACTAriTCAGAAAAGAAAGGTTATCTmACTGGTGAGCACAGTNATTGGCrCT^ 
GCTANGGTTTAAANAATATAGCACAGTGG 

SEQ ID NO: 2378 ACATGGCTACACTGTGCTCTAAAATTACTTGCATTAATGAGGACATTAATTTT 
AGCATTTAAAAAAAGAGATTTAAAAATAACCATAGAAAAACTTGAAAACX:CTGACAAGCCGA^^ 
CAAAGAGAATTCCTCTCTGAAAATTTTCAGATGCGACGGTATGAGGGAGTTGGGTGGGCTGCrOA 
CrCTGAGAGAAGTCCCGGTGACCTGAACCTATTCCAGTGAAGCCAGTCAGCAGTGAATGTGGGGC 
AAGTGCATGCGGCTGGTCTACAGCTCCCATTGCCTGGTGACCAAGGAGCATCCTGAAGTTTCAGA 
GAAGAGACTTGTGTGACAGCAGCATTGTTCCAGCTACTTGAGCAAGATCCTTCAGCAGGTAAGAA 
GTGGCAGAGACAGTCCCTGCTGAAGGAGCGGGCAGGGAGTTGGCCGCCNCNCNGANCCTGGCCTT 
TCCGTTAGGGNGACCCT 

SEQ ID NO: 2379 ACGGATGTGGCAGCGAGAGGACTAGATATTCCTGAAGTCGACTGGATTGTTC 
AGTATGACCCrCCGGATGACCCTAAGGAATATATTCATCGTGTGGGTAGAACAGCCAGAGGCCTA 
AATGGGAGAGOGCATGCCTTGCTCArmGCGCCCAGAAGAATTGGGTTTTCTTCGTTA^ 
CAATCCAAGGTTCCATTAAGTGAATTTGACTTTTCCTGGTCTAAAATTTCTGACATTC^ 
TTGAGAAATTGATTGAAAAGAATNACTTTCTTCATAANTCAGCCCAGNAAGCATATAAAGTCATA 
CATACGAGCCTATGATTCCCATTCTCTGAAACAAGATCTTTANTGTTAATAACCTAAATT^ 
AGGTTGCTCTGNCAT 

SEQ ID NO: 2380 ACACAGCTGTCAGGGAAAGTCCTGATGGCCACAGTGAAAAAGGTCATGGGTG 
GAGAGAAGCAAAGTAGGAAGGATCATrTGAAGCACAAACAAATGGGGAAACTGAGCANACAATC 
TCANTATCACCACATCTGCTTCAAAAATAGCACACCAACTCrcrrCCAAAGTGCATCGTrACACTG 
CACCATCGTGGAANAAATGGAAGAGCAGGATGGATTTGGCTGGCTGGAGTCACATCTTGGGGAAG 
CTGGCCAGGTTGGCAATGCCACAGGCGTTGnCnTArrTCNAGCOT 
TC<>JCAGTTTTNTCCCCANCTTGTTTTTAATTTATCCAGTTGCn^ 

CCCACTGCAAANCCCGNATGGTTCAANATTATCGCTmTNCAGbrmCATTATAATNCANACCrm* 
NCmTAA 

SEQ ID NO: 238 1 ACGCGGGGCAGGCAAACCTGAGGTCCTCAGAATGGCGGGCACAGGTTTGGTG 
GCTGGAGAGGTTGTGGTGGATGCGCTGCCGTATTTTGATCAAGGTTATGAAGCCCCTGGTGTGCGG 
GAAGCGGCTGCAGCGCTGGTGGAGGAGGAAACTCGCAGATACCGACCTACTAAAAACTACCTGA 
GCTACCTGACAGCCCCGGATTATTCTGCCTTTGAAACTGACATAATGAGAAATGAATITGAAAGAC 
TGGCTGCTCNACNACCAATTGAArrGCTCATATGAAACCANTATGAGCTTCCTNCCCCCTTNm 
GNNAAAAAAAATGACATTACTGGNATGCAAGANATGTOTAAAACAATTTCTTATG^ 
AGCATTAAGCANTrAANATTNNANAAATCTTGGAACTAATGTAANA>nm'GGATTGTA^ 
NAAAT^m'ACAATNAAATTTTNTTmATAr^ATTTGN^r^AC^ 
ANAAAAT 

SEQ ID NO: 2382 ACCATGAAATATCCAGAACATACTrATATGTAAAGTATTATITATTTGAATCT 
ACAAAAAACAACAAATAATTTTTAAATATAAGGArmCCTAGATATTGCACGGGAGAATATACA 
AATAGCAAAATTGAGGCCAAGGGCCAAGAGAATATCCGAACTTTAATTTCAGGAATTGAATGGGT 
TTGCTAGAATGTGATATTTGAAGCATCACATAAAAATGATGGGACAATAAATnTGCCATAAANTC 
AAATTTAGCTGGAAATCCTGGGATTTTTTTCTGTTAAATTCTGGCAACCCTA 
GATCCACAAAGTCClTGTTCCACTGGGNCmGGTTNCTCCCTTrATTGCTANGGTG^ 
GNTTTGCCCACCATN1TACCTTACAGTGAATG 

seq id no: 23 83 acgcggggagctctacgcctcctacgtttacctgtccatgtcttacracntg 
accgcgatgatgtggctttgaagaactttgccaaatactttcntcaccaatctcatg^ 
aacatgctgagaaactgatoaagctotanaaccaacgaggtggccoaatctttcttc 
aagaaacx:agactgtgatgactgggagagcgggctgaatgcaatggagtgtgcattcatttggaa 
aaaaatgtgaatcagtcacractggaactgcacaaaactggccactgacaaaaatgacccccatt 
tgtgtgacttcntttgagacacattaccrgaatgagccaggtgaaancnatcaaaagaattgggt 
garcacgtgaaccaacttgcncaangatggnagcgccwnantctggctttggnggaaatattcr 
ctorrgacaagnnacaccccnggggagnacagtngkraatgaaangcrraa>rnccttggggctat 
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rmSfCCCATAACCGTGGGGGT 

SEQ ID NO: 2384 ACTGTTCAAAAAGGAATGCCCCACAAGTGTTACCATGGCAAAACrGGAAGAG 
TCTACAATGTTACCCAGCATGCTGTTGGCATTGTTGTAAACAAACAAGTTAAGGGCAAGArrCTTO 
CCAAGAGAATTAATOTGCGTATTGAGCACATTAAGCACTCTAANAGCCGAGATAGCTTCCTGAAA 
CGTGTG^GGAAAATGATCAGAAAAAGAAAGANCCAAAGAGAAAGGTACGGAAAGTGGAAATTC 
AACAGAGACTArrAATTGCACAGGATTCANAAATCATGGAAAAGCAGCTATAAGGAAAGACCAG 
ATGGATGAGAAACTTATGGAACCTCTGAAATATCrrrGAACAACTTCCTGTAGCTTCAGATAA^^^ 
CCAGAAACCGAAGCTGGAAAGACTGGCATTCCTCCTGGGGGGGCTTTTATTCTGGGATTN^ 
CACTNANTTATCATGGGTT^CIT^mAAAACTTATGGGAATTAAGCC^^ 

NACNTGAATGATGAATGNAAGAATGCATGGAOGGACTNGCAAACrrrGGATAATAATmATTGT 

>nTATNTTmTITAAAAAGNGTGGTTimTrGGNATNGAAm 

AANTNNGG 

SEQ ID NO: 2385 ACGCGGGGCTCACTCTGCGCTTCACCATGGCTTTCATTGCCAAGTCOTCTAT 
GACCTCAGTGCCATCAGTCTGGATGGGGAGAAGGTAGATTTCAATACGTTCCGGGGCAGGGCCGT 
GCn'GATTGANAATGTGGCITCGCTCTGAGGCACAACCACCCGGGACTTCAC<XAGCTCAACGA^ 
TGCAATGCCQCrrrcCCAGGCGCCTGGTGGTCCTTGGCTTCCCTrGCAACCAATTTGGACATCAGG 
AGAACTGTCANAATGAGGAGATCCTGAACAGTCTCAAGTATGTCCGTCCTGGGGGTGGATACCAG 
CCCACCrrCACCCrrGTCCAAAAATGTGAGGTGAATGGGCAGAACGAGCATCCTGTCTTCGCCTAC 
CTGAAGGACAAGCTCCCCTACCCTTATGATGACCCATTTTCCCTCATTGACCCGATCCCAAGCTCA 
TCATTTTGNAGCCCTGTGCGCCGNTCAGATOTGGCCTGGAACTmGAAAANTTCCTCATAGGGCC 
GGGAGGGAGAACCCirrCCGACGCTACAGCCCGCACCnTCNAACCATTAAACATTGGAGCCTGA 
NTCNAACCG<XnTmT^AAAGNTTGCCATATAGATGTNAACTGCrrAAACACACAAAATCTrOT 
TCCATCC 

SEQ ID NO: 2386 ACTTCTAATTCCTCTATTACTGTGCAAATCTAGAATTTCCTATGTAAAAGACA 
GTGCTGGCAGCATCAAAGCCCTGTAATAGTATGAAATAGTGATTAGGATTTOAAATGAAGAGATA 
TTATATGAAGACATGTGAATGTCTTAATGATGGTATCATAGCAACGGCCTCAGAATTCCCrrACTG 
AAACAGAAACTAAGACAAATCATTArrCAATAATCTTCTGAGCCAAAAGCCTAAGTTGGAAGCAA 
GTGATAAAAACACAAAACCAAAGATGCATrrrCTATOACCATATGCAAGAATnTAAGCTTTGTAT 
CAACATTCTGTACCAGCACCAGCCCCTCTGAAAGGAAAAAGGNGTATTCm'GGACTGTCCATCT 

SEQ ID NO: 2387 ACGCGGGCAAACTGTCAGTGAACCCGCCGTTATTAAACGATTGATTAGTGTC 
TTAAACAAAAGCACGGGTGAAOTCACAAAGAAAAAGCCTAAGTTTTTGACTAAAGGCCAGAATGC 
ATTGGTAGAGCrACAGACACAAAGACCAATAGCTCTTGAGCTATATAAAGACTTCAAAGAGCTGG 
GGAGGrrCATGCTACGTTACGGTGGTTCTACAATAOCTGCTGGTGTTGTCACNTGANATAAAAGAA 
TGATGGGTCAGAAATTTCTACCACGrrrCTGGATACAGTGNAAATAGCTACCTCrGrrTCAAGAAT 
TCAGTTmAAGTCNAAGGAACAATGTGNCA>m'GATATGNrrTTmGATGNAGAGaAGAAj^^ 
TTAAAGC>rrAAATTNGCCTGCAAAhn^AATTTTTAATAAT>^ 
GCCAA TTNG NGAATAANAAGrn^AANTGGTAAANAACANCirmCCTTTC 
TGGGATTT^CCTIm^CTTA^^NTlTGNTGGGGT^WONTGNCAAAGNA^ 
CTCAGTNANGNATTGTATCTCTNTrrTGTGNCCACNTWCCAATGANATbnsrr^ 
TANTN 

SEQ ID NO: 2388 ACCAGGOCAAGAAGCCGGATOTCTGCCCrTCCrCAACCAGCTCCCTCAGGAG 
TGTTTGCTTCAAGTGATGGCCGGTGAGCTGCGGAGAGCTCATGGAAGGCGAGTGGGAACCCGGCT 
GCCTGCCTTTITrrCTGATCCAGACCXTCGGCACCTGCTaCTrAarA^ 

CCATGAAGCCCAGATACACAAAATTCCACCCCATGATCAAGAATCCTGCTCCACTAAGAACGGTG 
CTAAAGTAAAACTAGTITAATAAAAAAAAAAAAAAAGT 

SEQ ID NO: 2389 ACACATCCAAGCCTAAAGAAAAATTAGTCTATGCCATTrTTCTTAAATGGCCC 
ACATCAGGACAGCTGrrCCTTGGCCATCCCAAAGCTArrcrGGGGGCAACAGAGGTGAAACTACT 
GGGCCATGGACAGCCACTTAACTGGATrrcrrTGGAGCAAAATGGCATTATGGTAGAACTGCCAC 
AGCTAACCATTCATCAGATX3CCGTGTAAATGGGGCTGGGCTCTAGCCCnX3ACrrAATGTGATCTAAA 
GTGCACAGAGTGGCTGATGCTCCAAGTTATGTCTAAGGCTAGGAACTATCAGGGTGTCTATAATTG 
TAGCACATGGAGAAAGCAAATGTAAAACTGGArrAAGAAAATTT ATTTTG GCNAGTTCAGCCOT 
TCCCTTTTTTCCANTTAAATTTTmnTAAAAr^ 

SEQ ID NO: 2390 ACACAAAGAGGGGGTGGGTGTCGGATGCAGAGTGTGTGGCCTGATOCTCCAC 
GGCXSTGCAGGACGGGOGGCTAATAOTAGGTTTCCTTCTCCACCCAGCCNCCAGGGCGTCGCCTGA 
TGATGAGTTITCTGACnTCGTCATATACGAATATNATAAGAGAGTANGGNAAGGCTCATAACCAC 
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CAGGTTNGTTTNAGGGGATACATTCTAAGANCAACACCNfNTTCNATrCATTATT^ 
ANCCANAGCTGTTT^TmAAAGATGNCTCAAATT^CATNAT^m^ 
ANCTATTTCATCCTNGTTNATANAAATTGANTANArmATCTANTNNT^ 
G 

SEQ ID NO: 2391 ACTCATTTACAATAAAATAACCAAGTGAAGTTACAAAAGGCATATATTACTG 
TGAAAAGAACATACACrCCACATTnGCCGATTAATAATGGCAATCATAATTTAACATAATAAAA 
GAATATATATCTATTGCTTrrCATCATACTTGATAAATACAGTATGAACAAAATTITCAT^ 
TTTTCACAAGATAATAAATAAGTTAAATAGTTTTCATATTGAGTTGTGGTGCAGTGGTGCNAATCA 
ACTCAAAACAGCTAAAAAATrCACAGTTATTCTNCAACAATTACAANGTAAGCrCTGCGCAAGGC 
TTNCAGA 

SEQ ID NO: 2392 ACAOAGGGGTCTGTTrCTAAGTCTGGAACCTCAACACGAGGCTGGGATGCTT 
CACAACGTGCrCTCTCCCACTGTCCAGCCATCrrTGGTGGTCTTCCCACAGTGCrrCCCCCATCCTC 
TCACACrCCTGTGGAGGGACCATGGGACGGGCCAGGAGGAAACCTGGGATCACTCTGACAGGAA 
ACGGACAAGCACAGCTGCCAGGAGCCAGTTGCTCTGGCCTCAGTCTCCAATrGTTAGTCTCAGGAT 
CAACAGAGAACTGGAAAGCAGCAGAATCTGGAGGAGCAGGAAGAGAGCCCAGAGGGAGTGTTGT 
GCTGAGTGACGGTTAACAGATGAAACAGAATTTCATGGAGATTCITTGrrACAAGGAGAAA^ 
TCAGTCTCAACTCX:CAAAACGTGAAAGTTGCCTGGTTTAAGANACCTATGTmCTrmC^ 
TCAAACCTCCGGGAAAGATCATCAAAGTCAATGTCrrCANATGCTGAGGTGCTGGCACCANCANA 
TCAGTTGGTANTGTGTCnXjGCCAAATGGCAACNTCTGGTANGGACAAAANTNGTCATANTTNTNTT 
GCAGGTCrrGGAAGGAAACmGCANAAGCTThrrGGNTTGGGTNCCAGAACCA^ 
ANAAAAAAT 

SEQ ID NO: 2393 ACAGAAAACATAAATCATCGAGGTGGATACCATGGTGGAAGTTCCCGTTCTC 
GTAGCAGTATTTTCCATOCAGGAAAAAGCCAAGGACTACATOAAAACAACATACCTQACAATGAA 
ACCXjGGAGGAAAGAAGACAAGAGAGAACGCAAACAGmGAAGCTGAGGATTTTCCGTCrrTAA 
ATCCTGAGTATGAGAGAGAACCAAATCACAATAAGTCTTTAGCTGCAGGTGTGTGGGGCCTACAC 
GCCCANACACACACATCCCAACCAAAAAAATCTCCCAAGCrCCTCTCTTANAATATCCTTNCGAAT 
CCTAAAThrrANAGCTCCAAGGATGCTNGGNCATTAANAAAAGGTAATACA 

SEQ ID NO: 2394 ACAAAACCATCGCCATCAAAAAAACGCTGTTCTGACAACACTGAAGTAGAAG 
TTTCrAACTTGGAAAATAAACAACCAGTTGAGTCGACATCTGCAAAATCTTGTTCnrCAAGTCr 
TGTCTCCTCAGGTGCAGCCACAAGCAGCAGATACCATCAGTGATTCTGTTGCTGTCCCGGCATCAC 
TGCTOGGCATGAGGAGAGGGCTGAACTCAAGATTGGAAGCAACTGCAGCCTCCTCAGTTAAAACA 
CGTATGCAAAAACTTGCAGAGCAACGGCGCCGTTGGGATAATGATGATATGACAGATGACArrCC 
TGAAAGCTCACrcrrCTCACCAATGCCATCAGAGGAAAAGGCTGCrTCCCCTCCAOA^ 
TCAAATGCCATGCANCCGTTTAAAATAATTCTGCAGGTGAATATTTATTAATGAOATGTTGCGACT 
ATTTGTTAGGAGACATAGTTACAAAACAGTATATATCCGGTTm 
CTGCAAGATGTATACTGATTGNGATCTTTCCCTNCTTATTTTTGCTTAGC^ 
TTNTACAATAAACAAATGTTITTGGAATAAAN^^^T^AC 
CT 

SEQ ID NO: 2395 ACACnTCTAAGTGTGCAGTGCAAGAGCTTGTTTATATTTCATACl l l'l l ATACT 
TTGAGGAAAAAAAAGTCAAAGAAAAATTGTATITGAGGGAAAAAACCATGACCAAGTAAAGGAT 
AAATTCAAAAAATAGOTCATGAGACTTGGCATACACACTCATGGGATTCCAGTTATrATGGAGTG 
CTTCCATCCCTCTCCACCCCTrCCCCCCAAA AGGTT TTCTTTGCAAGTGCTm 
AGTATOTGGATTAACTGATGCCTGCTAGTGCTTTCraATTACTCGCArr^ 
AGAAGAGTAAAGACAAGAGTGTTGGACCAGTATTGCAGTTCTGTAGTGTCATTTCTTTAAAAAAC 
AAAACACCNNCAATNATTTATCAAAATTNGGCATATTWAAGCCTACATTCTA^ 
ATTCrrTTTTAATACTrarnWACCXrrCTTrAATCTCm 
NCANACTTNTGCAATAGNTTCTTTAAAATCNCAACAAGTT 

SEQ ID NO: 2396 ACGCGGGGAAGCCAGCTGCTCGGAAAGCTCTGGACAGATGCAGTGAAGGCT 
CCTTCCTGCTAACCACATrrCCrrCGTCCTGTGACTGTGGAGCCCATGGACCAGTTAGATCATC 
AGGGACTTCCAGAGAAGCTGGTrATAAAAAACCAGCAATTTCACAAGGAACGAGAGCAGCCACC 
CANATTTGCACAGCCTGCTCCTTTGAGTATGAATATGCCATGCGCTGQAAGGCACTCATrGAGATG 
GAGAAGCAGCAGCAGGACCAAAGTGGACCGCAACATCAAGGAGGCrCGTGAGAAGCTGGANATG 
GANATGGAANCTGCACGCCATOAGCACCAGGTCATGCTAATGAGACAOGATITNATTGAGGCNCC 
AAGAAAAACTTNGGAGGATGGAAGAGCTTNCACAACCAANAGGTGCAAAAACGAAAGCTNCTGG 
GA 
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SEQ ID NO; 2397 ACGCGGGACCATGGCGGCGGCGGCGGACNANCCjGAGTCCANAGGACGGANA 
AOACGANGAAGAGGAGGAGCAGrrGOTTCTGGTGGAATTATCAGGAATrATTGATTCANACrrCC 
TCTCAAAATGTGAAAATAAATGCAAGGTTTTGGGCATTGACACTGANAGGCCCATTCTGCAAGTG 
GACAGCTGTGTCirrGCTGGQGAGTATOAAGACACmrAGGGACCTGTAGTTATATTTGAANA 
ATGTTNAACATGCTNATTCAANAGGCAATAATNAAACAGTGCTAAAATATAAATGCCCm'CCAAT 
GAAGAAGCTTATCTTGAC 

SEQ ID NO: 2398 ACGCGGGCAGGAGAAGAGAAGGAATTTGAAGAGAAAAAGGAAAATTAAAAT 
TACTAATTAATrrrTAGATTCAATATTTATATGGAGTTTTGAAAAATAATAGTGGCCCTGA^ 
TAAATTCCAGCTTTAAAAACCAAGTCTGAGGAAATATTTGGCrrCATAAAGTAAAOAGACGGm 
GGCATTTATTATTACTTTTTCCTGTATTTrATGCCCATAAAATAAGC^^ 

GATGGACTATTAAATrCATCTTAGAATAAATTAGTGAAGAATTTAATTTTAGAATAAATAATCCA^ 

TCTGAAATAATTATACCrrcmCCTTGTTAGGTAGTTATGAGTAAATCTGCAAAAGGC^ 

ATGCCTrAAATTTTATCAATAACAGAATTATTGTATTrAAAAAAAAACTAATACTTATC^ 

TAGTAAATAGGATrrrAAACAGAGAATTrrATCAGTAATAGGTGTCAGTTTTTAAAAAAA^ 

GTAGGCTGAGCGCGGGGGGCrCACGCCTGTAATCrcANCCTTTGGGAGGCCAAGGTGGGTGGACC 

ACATTGAGG 

SEQ ID NO: 2399 CGGCCGAGGACGCACTGAAGGAGACAAGAAAGCAGCAAAGGTTCAAAAGCT 
GTCTAAGAATGAAGTGCTCATGGTGAACATAGGATCCCTGTCAACAGGAGGGAGAGTTAGTGCTG 
TCAAGGCCGATrrcGGTAAAATTGTTTTGACCAATCCAGTGTGCACAGAGGTAQGAOAAAAAATT 
GCCCTTAGCCGAAGAGTTGAAAAACACTGGCGTTTAATTGGTTGGGGTCAGATAAGAAGAGGAGT 
GACAm'CAAGCCAACAGTAGATGATGACTGAANAATACCAGTTAAATAATACATTCGGATGGATT 
TGGAAGTTGGAATrCCTTTrAACAACCAAGGGGTTTArnTCANAGC 

SEQ ID NO: 2400 ACGCGGGGATAGGGAGCGATCTCCGAGCGAGGCGGCAAGATGGACGCGGGA 
TTTTTCCGCGGAACAAGTGCAGAACAGGATAATCGGTTCAGCAACAAACAGAAGAAACTACTGAA 
GCAGCTGAAATTTGCAGAATGCCTAGAAAAAAAGGTGGACATGAGCAAAGTAAATTTGGAGGTTA 
TAAAGCCrrGGATAACAAAAAGAGTAACGGAAATCCrrGGGTTTGAAGATGATGTTGTGATTGAG 
TTTATATTCAACX:AOCTGOAAGTGAAGAATCCAGACrCCAAAATGATGCAAATCAACCTGACTGG 
ATTTTTGAATGGAAAAAATGCTCGAGAATTTATGGGAGAACTGTGGCCCCTGCTGCTAAGTGCAC 
AAGAAAACATCGCGGGAATCCCTTCTGCmcXTAGAACTGAAGAAAGAAGAAATAAAACAAAG 
ACAGATTGAACAAGAAAAACTGGCATCTATGAAAAAGCANGATGAAGACCAT 

SEQ ID NO: 2401 ACCCTTGGAGATACTGGAGCGCTTCTGCATTCAGGCTGGTGCTCACCATTGAT 
GGAACCCTTCCTGGACAGGCGGTAGACAAOTGTGAAGTGACTGTGCCAGGTGAATTTTGGGGTr 
ATTCACCAmGACCTACAGGATCATGCTCTrrmCCCAGCAAATGCCAACTGTGAGAAGGCAGT 
CTGATATCCTGGTGTATCTTCTATGTCAATAAAATGTTOCTCATCAGGAATGGTATCATCTrCGGGT 
AACTCAAAAAGACCAATCAAAGACTGTAATAATGGAGTCCACAGTrrGGTATACTCAGTGTCCAT 
CATTGGGGGACATTCTGTTAGTAATTTGGTrATGCCAACCGCACAGATCTTmCTCT^ 
GATACCrrCTGAATTTCAGOAATAATAATTTTTTCCAAAACCATTOCA^ 

CATCAAATATTTCnTGTAGTGCTAGTGCCCCATATTTTATGCAATACAAArrAATAAAGACTAAAA 

AACTCTTGATAAAACTTGGTOGTm}GAATTCTGAAATCT^TO 

^^CCTATATGGGNAACTGGAT^ANGANGGCATGTGCT^^■ATmCTG^^ITAAA 

SEQ ID NO: 2402 ACGGTAAATTCACTGGCAGATACAAAAGCITArrTGGGAGACAGCAGATCTC 
TCTAAGCACCGAAGAGTGTAGGTTGGrrmCAATTGCCTCAAAAAGTGCTCTGTTGTATTATACC 
TCAGGACWTCGAGTTCAGGATGTAGTTTAGAGAGCAGGACATAAAGTAGTAAATTCCTCATATAT 
ATCACAGCAAGAAAATTAACCTAAAAAATTTAAAGACTAAATAATCTCITAAGAGGTAA^ 
AAAAATTAAArrGGTATTCATTTAAAAATATACAAACTGTGCCCTTTATACTAAAGTGCATTCAN^ 
CGTAATGTTTAAGCCCCTAAATTCCTTCTGTATGTTCTCTAA'ITCCAAAATAGTGTGTAACAATATA 
CTACTAATCATCTTGTCrACCTTAATATTNCCn^'in'GGOAATACAGTANAATCTGGOCTATTTATCA 
AGGTGGAATNTTTCC 

SEQ ID NO: 2403 ACCCATAAATTCTACmCOU^AAACAGGAGCTTTrTAAAAGAAAACCACAT 
AACAACTTTTAAAAGGCGCTGGGATTCCTCTGCTTCTAGATCAATGCTGGGCrAGAAAAGTA^ 
CrGTTCTATCAGGAATCACAAGTTGGAACTGAGTArrCTCCAAAGTGGAAATTCTAGAGTGTAGTG 
TCACrCCAGGCAAAGATTATrCAGrrCTCATCCCCAGCATCCACAACTACCTATCAGAAGGGTTAA 
ACCAGGTCAAAACAGTOZAGCATAATTAGGCTrCATCAAACX^TGTCATrATGCTCTrCTAAGATG 
CAAATAAACCAAAACAGGAAATACTAAAATAAAAATATCTGACACTGCCATACAAATTGrrAGTT 
CCTTTTTGTATCCCCCCTTCTATAACATTAACAAAGGGAATATTTTACTGCAAAGA^^ 
TATACATCACTAGCCATGAATITTTGCCATTAGTTACTATACAAATGCTGCCTAATGCCATTATCCA 
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AAATAGCACAACCATTTTACGTCCACAATTCACTTTCTATAGTTACAAGTAGAATTTTCATGAm 
CTTTAAGT 

SEQ ID NO : 2404 AC i - ri ' iuu JM ' iu - rriM ' iu ' rrJUU ' J ' i ' r GGGAACrATrACTTTAriTATArrACTATG 

CTrAAGTTACATGGAAAAAGACAACCCAGCAGTThTTGTCCCATI^CAAAAAAAGTTTCC^ 

CACCAATTATGTGGNGACAAATAACTAGGAATGGTGACATCTTTGGGCCAANACTGACAAGGAAN 

AATGGGCTNTGGTGCTACGGTTCATCTCCAACAANAATATGGCACACCGGCCAGCACAAGCCATG 

CTAACACTGAGCCTTTAGCGCrGGGCACAGATTCGGATCTCITNTCrGAAGCTAGCA^ 

AAATAACTGGGTITAAAAAAAAAAAGTITAAAAATGAAGCCCAAGTTTAAAAACCATACTCCTAA 

CATTITCCTTTCACAATTGACATAAAACACAClTITrCACrrCTAACAGCC^ 

ATAOCTCACCACATAGCTGCANACAGACTTrGGTGCCTNAANAAGTAAACATAGGGTAAAAAATT 
AACGGGCITCTGCATAATATimCTCTACCACTGCTGrrGGATGGAAAArrCCAAArmAAAA^ 
AAAAGAANGATAGAAAAGAAGGGNTTCCATnTA 

SEQ ID NO: 2405 ACTCnTGCTTATATCATCACAGAGCrGGATGAGAGAGAGCGAGAAGAGITCT 
ATAGGTTAAAGAAAATACAAGAGAAGAAAACGATTCTAAAGGAAAAATCTGAGAAGGACTTGGA 
GCAAAGGAGAGCAGCTGGAOAGGTGTTGGAGCCTGCTAATCrrCTGGCTGAAGAGAAGGACGAG 
GATCTTCTATTTGAATAATCTTTCCTGTTCTGGTTCrTTGAGAAACCCT 
ATTCACAGTGTGTAGGTTTGATTTGTGTGGCTATTTATTTTTTGGCCTAAGAATT^ 
AAATTTACCTAGATGTCTATTTATCGGATTACTTTTGCAGAATCATAATTTANC^ 
GGATGAAAGAGATCTGTNAAACCTGCCCAGGAACTTACNNANTrrACmGNAGAAGCCGT^^ 
ATACTCCATTTACAWGTGrrACACNGTGATCTNhriTACCANGCCAriTAGGGAAATACCTT^ 
AGGAAAGCATTAGCGGNCT 

SEQ ID NO: 2406 ACGCGGGCTGGGTGCAGTGGCTCATGCCTTTAATOTAGCACTTTGGGAGGCT 
GAGGTGGGAGAATTGCTTGAGTCCAGAAGTTCGAGACCAGCCAGGGGAACATAGCAAGACCCCA 
TCTCTACAAAAAATTAGCCAGGTGTGGTGGTGCATGCTTGTAGTTCCAGCTACTTTGGAGGCTGAG 
ACAGGAGGATCTCTTGAGCCCAGGAGGTCAAGTCTCACCTAGGTAATGCAGCAAGACCCCTATCT 
CTAAAAAGAGACAGACAGAGAGAGAGAGTGAGAGAGAGGACACAAAGCACTAATATCAGAAAT 
GA AACAGG TGATAATCTCTACAGATTCTATAGCCACTGAGTAGATAACAAGGGAATACTGCAAAC 
AACTTTrrACGCAGAAACITGACAACTrAATGAAATGGACCAATTCCTCAAAAACAACACT 
AAACTCATCCAAGACGAATTAGAAAATCTGCATAGTTCTAAAACCACTAAGGAAAATGAAGTGCA 
TAAmAAAAACTCCCAAAJ^GAGAAATCTTCAGGANAAATCTCCAAAAATrGCCTGGATATTrC^ 
riTGGTGAGCTTTCAAAATTATTCCNrmAACAAAACAACCGCTGGANAAATTTGGA 
AGGCCAAA 

SEQ ID NO: 2407 ACGCGGGGCGCACAGAGCTCTCAGCGCCGCTCCCAGCCACAGCCTCCCGCGC 
CTCGCTCAGCTCCAACATGGCAAAAATCTCXAGTCCTACAGAGACTGAGCGGTGCATCGAGTCCC 
TGATTGCTGTCTTCCANAAGTATGCTGGAAAGGATGGTTATAACTACACTCTCTCCAAOACAGAGT 
TCCTAAGCTTCATGAATACAGAACTAGCTGCCITCACAAAGAACrAGAAGGACCCTGGTGT(XTTG 
ACCGCATGATGAAGAAACTGGACACCAACAGTGATGGTCAGCTAGATTTCTCAGAArrrCTTAATC 
TGATTGGTGGCCTAGCTATGGCTTGCCATGACTCCTTCCTCAAGGCTGTCGCrrCCCT^G;^ 
CCTGAGGACCCCTTGGCCXTGGarrrcAAACCCACCCCCTTTCCTTCCA^ 

CCACAGCCCACCCATCCCCTGAGCACACTAAOCACCTCATGCAGGCCCCACCTGCCAATAGTAAT 
AAAGCAATGTCACi-lU'lU'l'AAAACACAAAAAAAAAAAAAAAAAAAAAAAAAAGT 

SEQ ID NO: 2408 ACAGCCAGTGTGGGGATGTGATGAGGGCCCTGGGCCAGAACCCTACCAACGC 
CGAGGTX}CTCAAGGTCCTGGGGAACCCCAAGAGTGATGAGATGAATGTGAAGGTGCTGGACTTTG 
AGCACTTTCTGCCCATGCTGCANACAGTGGCCAAGAACAAGGACCAGGGCACCTATGAGGATTAT 
GTCGAAGGACTTCGGGTGTrTGACAAGGAAGGAAATGGCACCGTCATGGGTGCTGAAATCCGGCA 
TGTTCTTGTCACACTGGGTGAGAAGATGACAGAGGAAGAAGTAGAGATGCTGGTGGCAGGGCATG 
AGGACAGCAATGGTIXjTATCAACTATGAAGAGCTCGTCCCATGGTGCTGAATGGCTGAGGACCTT 
CCCAGTCTCCCCAOAGTCCGTGCCTrrcCCTGTGTGAArmGTATNTAGCCTAAAGTrTCCCTA 
Cr 1 TCiri GTTTANCAACTTTCCA 1' 1 I'll GTCrrnnTGGGATGATNTT^ 
AAATTAACTTITrnTTGGGG 

SEQ ID NO: 2409 ACGCQGGAAGTCCCAATrATCCCAAACCATATCCAGAGAACTCAAGGTGTAA 
ATACCAGATCCGGTTGGAGAAAGGGTTCCAAGTGGTGGTGACCTTGCGGAGAGAAGATTTTGATG 
TGGAAGCAGCTGACTCAGCGGGAAACTGCCTTGACAGmAGTTTTTGTTGCAGGAGATCGGCAAT 
TTX}GTCCITACrrGTGGTCATGGATTCCCTGGGCCTCTAAATArrGAAACCAAGAGTAATGCTOT 
ATATCATCTTCCAAACTGATCTAACAGGGCAAAAAAAGGGCTGGAAACTTCGCTATCATGGAGAT 
CCAATGCCCTGCCCTAAGGAAGACACTCCCAATTCTGTITGGGAGCCTGCGAAGGCAAAATATGT 
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CTTTAGAGATGTGGTGCAAATAAOTGTCTGGATGGGTTTGAAGTTGTGGAGGGAC^^m^ 

AACATC^TT^^^Ar^CGAOTGTCAAAACAATGGAAAAGTGOANTAA^TCCAAACT^ 

CCCTGTGGACTGTGGCATTNCTrcAATCCATTCANAATGGTAAAAGTTNy^ 

TTTGNTTGGTrm'GNCATTCCGim'AACTmGGGAAmATCAOT 

GNTAATN 

SEQ ID NO: 2410 ACGCGGGGAGGCAITGAGGCAGCCAGCGCAGGGGCTTCrGCTGAGGGGGCA 
GG<XGAGCTTGAGGAAACCGCAGATAAGTTITmCTCTTTGAAAGATAGAGArrAATAC 
TTAAAAAATATAGTCAATAGGTTACTAAGATATTGCnTAGCGTTAAGTrTrrAACGTAATTTTAAT 
AGCTTAAGATTTTAAGAGAAAATATGAAGACTTAGAAGAGTACATGAGGAAGGAAAAGATAAAA 
■GGTTrCTAAAACATGACGOAGGTTGAGATGAANCTThm'CATGGAGTAAAAAATGT ATTTA AA^ 
AAAATTGANAGAAAGGACTACAGANCCCCGAA7TAATACCAATAGAAGGGCAATGCTTTTAGATT 
AAAATGAAGGGTGACTTAAACAGCTTAAAGTTTANTrrAAAAGTTGTAANGTGArrAAAATAAT^ 
TGAAAGGCNANCTTTTAAAATATANGATTAAACC 

SEQ ID NO: 241 1 ACCTTTGTCACAATCCTAACACATTATCGGGAGCAGTGTCTTCCATAATGTAT 
AAAGAACAAGGTAOTTTTTACCTACCACAGTGTCTGTATCG GAGA CAGTGATCTCCATATGTTACA 
CTAAGGGTGTAAGTAATTATCGGGAACAGTGTTTCCCATAArmCTTCATGCA ATGAC A TCTTC A 
AAGCTTGAAGATCGTrAGTATCTAACATGTATCrcAACTCCTATAATTOCCTATCrm 
TTGCAGAAACATTTTGTGGTCATTAAGCATTGGGTGGGTAAATTCAACCACTC 
CTACAAAATTTGAAATTTAGCTTGGGTTTTTGTTACCTTTATGGT^^ 
QAGATAOTAOCATACATTTATAATGTTTOCTATrGACAAGTCATTTTAACTT^ 
ATGTTACCTCXrrATAAACrrAGTGCGGACAAGTTTTAATCX^AGAATTGACC^^ 
GGGGGACTTTTGTATAGAAAGGTn'GGGGGCTGTGGGGGAANGAAAKmCCCTGNAGGTCnTO 
ACGTTNTGCCTCCCATTCNTGGTGATCAA 

SEQ ID NO: 2412 ACTGCTAAAGATAAAATACAGGGAGAAAATAACTI GITAGCAATAGATCCCC 
ATTGTTTATATATATAGGTCrTGTTCATAATATOTCAArTATGTATTOTTAAAAAGTCCTACTCACT 
TTTCAAATATGTGTTACATGGTAATGrrrGTCATTGTTGTTTTAAAGTrGCATTTGACAT^ 
CAAAGAGTGrrrGAACAGATTTTGATAACAGTGCATACACCrriTGTCTTTTTr^ 
T All ' l ' rr AGCACAGCAGCTGTGGAG Cl ' rri GCTGATAATTrrATTGTTGGTATCTTAGAATACTGTr 
CGTCTGACAGTTTATTCTTCCATACTTGTCTCCCAAGTTAAAACAGAGCAAAACAGATAAA^^ 
TTGAGAATCATCACAAAAGGGAGmGAACTTGTGAGACTAAAACTATAATTTCCTAAATGTGAC 
TAATGAACCCCAAACTGTCACTATTACATTCAGTGCCATATTTATCTTTCAAAACAGTATTTG'm 
ACCCTAACCATCTTTGGCrGAATGAAAGGCAGTTGAAAACCATCrrrrGGTCATACATAATATAAT 
TATATTAAAAGTAAAGATCGGGGCTGGGCGCAhrrGGCTCACACCTGCAATCCCACCANCACTT 

SEQ ID NO: 2413 ACGCGGGATGAAAATATAGACATTCTCACATAAGCCCAGTTCATCACCATTT 
CCTCCTTTACCTTTCAGTGCAGrriC'l'rri'CACATrAGGCTGrrGGTTC 
ACTGTCAGTTCrCTGGGAAGTGGTCAGCGCATCCTGCAGGGCTTCrCCTCCrCTGTCnT^ 
ACCAGGGCTCTrCTCAGGGGCTCTAGGGACTGCCAGGCrGTTTCAGCCAGG/^ 
NAGTGAGATGTAGAAAGTTGTAAAATAGAAAAAGTGGAGTTGGTGAATCGGTT GTTCTITC CTNA 
CATTTGGATGATTGTCATAAGGTrmAGCATGTTCCTCCTTTrCTTCACCCTCCC 
ATNAATCAAOAGAAACrrCAAAGTTAATGGGATGGTCGGATCTCACAGGCTAGAGANCTCGTTTC 
ACCrCCAAGCATTrOVTGAAAAAAGCTGCTTCTTNATTTAATCATACA^ 
AAGAGTTNCACAAATNCTTrCAAAATAAAAAGTAATraACTTATNAAAA 

SBQ ID NO: 2414 ACTITGCCTACGGCAGCAGCCTGCTGACAGAGAGGATCCACCTCCGAAACCC 
CTCGGCGGCGTrCTTCTGTCTGGCCCGCCTGCAGGATTTTAAGCTTGACTTrGGC^ 
CAAAACAAGTCAAACrrGGCATGOAGGGATAGCCACCATTrrmANAGTCCTGGCNATGAAOTGT 
GGGGAGTAGTATGGAAAAATGAACAAAAGCAATTTAAArrCrCTGNATGAGCAAGAAGGGGTTN 
AAAGTGGAATGTATGNCTT 

SEQ ID NO: 24 1 5 ACACGTTCnGTTOTCnXjGCTCGGCAACAAACACCACrrCCrGGCCAOTCTTC 
TGGTTATGGAGTCGGACAAATGTCTGGTGAGGAGTGAGTTCAGCACCAGTGTTCACATCTACCAG 
CTGGAAGAACAAGGCGAAGTTCTGGTGGCTGTCTGCGATGAATGTGCCCTTGGCrrrGGCTGGGTA 
TGTCACrCGGGTAGTTTTGGGTGCAATGCTCTGATCCTTATCCACGGTGGAAAGATCAACATTTGT 
GATGCCAACTTCAGTGGAGATCrrcACTCTGAGCTCTACGGTArrrGCAATATACCGGTTGTCACC 
TTCAACrrCGACAAGGAAGTCATAATAACCACTGGAAAATTrOACGTTCATGAAATTTAGTTCAAA 
AACATCCCCTACAGGGGTGAAGGATGTCTTCTGGAGGACAGTGGCTCTGGAAGCAACAGATTTAG 
CATGTTCTAGrrTAACAGTGGCCTGAGTCANANGCrOANACANAACATTGGTGAOT 
AANATANCCTGTTCATGAATGTCGGAAGCANAACCCTAAGCACAACCACAACTGGCACGTGGTAN 
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C 

SEQ ID NO: 24 1 6 ACGCGC-GAGAGACCCCGGACCCCAGCGCTGTCrCTTCCCGCCGCCCGAACCA 
CCATGACCCACTTCAACAAGGGCCCTTCCTATGGGCTCTCGGCCGAAGTCAAGAACAAGATTGCIT 
CCAAGTATGATCATCAGGCAGAAGAAGATCTTCGCAATTGGATAGAAGAGGTGACAGGCATGAGC 
ATTGGCCCCAACTTCCAGCTGGGCTTAAAGGATGGCATCATCXnrrGCGAACTTATAAACAA 
CAGCCAGGCTCAGTGAAGGAGGTCAACGAGTCCTCACTGAACTGGCCTCAGTTGGAGAATArrGG 
CAACTTTATTAAAGCTATTCAGGCrrATGGTATGAAGCCACATGACATATTCGAAGCAAATGATCT 
TTTTGAGAATGGAAACATGACCCAGGTTCAGACTACrCTGGTGGCTCTAGCAGGTCTGGCTAA^ 
AAAAGGATTCCATACAACCATTGACATTGGAGTTAAGTATGCANAAAAACAAACAAGACGTTTTG 
ATOAAGGAAAATTAAAAGCTGCCAAAOTGTAATTGGTCrrGCAGATGGGAACCAACAAATGTGCC 
ACXAGGCAGGTATGACAGCTTACGGGACTAGGAGGCmTmATGATCa;AAAATGCAAACTGAC 
AAACCTT 

SEQ ID NO: 24 17 ACGCGGGGGAGGTGCCGCCATTTCATCTGTCCTCATTCTCTGCGCCTTTCGCA 
NAGCTTCCAGCAGCGGTATGTTGGGCCAGAGCATCCGGAGGTTCACAACCTCTGTGGTCCGTAGG 
AGCCACTATGAGGAGGGCCCTGOGAAGAATTrGCCATTTTCAGTGGAAAACAAGTGGTCQTTACT 
AGCTAAGATGTGTTTGTACmGGATCTGCATTrGCTACACCCrrCCTTGTAGTAAGACACCAACTG 
CTTAAAACATAAGGATGmCAGTrCCrCCATTTAACAGATATCAAGAGCATTTTAAGAGGTGCAG 
CCTCTGGAAGTGGATCAAACTAGAACTCATATGCCATACTAGATATTGTTTGTCAATAAACTTATG 
ACGTGAAAAAA 

SEQ ID NO: 24 1 8 ACGCGGGCATCTGTTAOrCAGATCTACCATGCAGTTGCAGCTCTAAOTGGCTT 
TGGCCTTCCCTTGGCATCCCAAGAAGCACTCAGTGa:CITACTGCTa}TCTC^ 
TGTGCTGGCAACAGTCCAGGCTCTGCAGACAGCATCCCACCTGTCCCAGCAGGCTGACCTGAGGA 
GCATCGTOGAGGAGATTOAGGACCnGTTGCTCGCCTGGATGAACTCGGGGGCGTGTATCTCCAGT 
TTGAAGAAGGACTGGAAACAACAGCGTTAriTGTGGCTGCCACCTACAAGCTCATGGATCATGTG 
GGGACTGAGCCATCCATTAAGGAGGATCAGGTCATCCAGCTGATGAACGCGATCTTCAGCAAGAA 
GAACTITGAGTCCCTCTCCGAACCTTCAGCGTGGCCTCTGCAGCTGCTGTGCT 
ACCACGTGCCA^m'GTGGT^GTGCCTGAGGGCTCTGCTTCCGACACTCATGAACAGGCTATCTTGC 
OGNTGCAAGTCACCAATQTTCTGTCTNACCCTCTGACTCAGCCANTGTTNAACTANAACATCTAAA 
TrCTGTrGCTTCANAACCACTGrcCTCAAAAAACATCCTTACCCCTGTNGGGGATGr^^ 
ATT 

SEQ ID NO: 24 1 9 ACCCTCCAGGTCATGGTGATATTTACGCCAGTTTCTACAACTCTGGATrGCTT 
GATACCTTrATAGGAGAAGGCAAAGAGTATATTTTTGTGTCrAACATAGATAATCTGGGTGCCACA 
GTGGATCTGTATArrcrrAATCATCTAATGAACCCACCCAATGGAAAACGCTGTGAArrTGTCATG 
GAAGTCACAAATAAAACACGTGCAGATGTAAAGGGCGGGACACTCACTCAATATGAAGGCAAAC 
TGAGACTGGTGGAAATTGCTCAAGTGCCAAAAGCACATGTAGACGAGTTCAAGTCTQTATCAAAG 
TTCAAAATATrrAATACAAACAACCTATGGATTTCrCTTGCAGCAGrrAAAAGACTGCAGGAGCAA 
AATGCCATrGACATGGAAATCATTGTGAATGCAAAGACnTGGATGGAGGCXn'GAATGTCATTCA 
ATTAGAAACTGCAGTAGGGGCTGCCATCAAAAGTmGAQAATTCrCTAGGTATTAATGTGCCAAG 
GAACCGTTTrCTGCCTGTCAAAACCACATCANATCTCrrGCTGGTGATGTCAACCTTATAGTCTTi^ 
TGCAGGATCTCTGACAATGANTGAAAANCGGGAATTTCNTANAGNGCCCrTOGrrAAArrANON^ 
ATThmr 

SEQ ID NO: 2420 ACAGAAATTTCACAAGAAGTCAAACACAGTGATGCCATTTGCTATGTnTATT 
TTGCTAGTAGCTrATTAAACATAACATGCAAATAATCAAAQAGAAACATACATGACTTAGAGTGA 
AAAATAATTCTAGAAAAGTTrCACTAGGTAAGTATGCAAATTCTrATTCrAAAAATAClUClU-rA^ 
GTGCATGAAGCTrCATGTATTTTGCAATATrCTTGGCCTCAATATCTACCACCrATTTTTTAACC^ 
ACAAATTTGAATGTATCAAGATAATTTGGTGCAAGAGAGTAACATCCATATGTATTTAATCCAAGC 
TTTGAGGAACATtAAGATTTAAGGATTATAAAACTrGGCTGATTTCCATGCAACCAGTAAAAGGTT 
TrGCACATCATTTGACAGTAGAAATAAAAAAACACrAAATTrACAAATAAA GCATTGAG TTTGAT 
GTCTATTCGTGTATATGTGTGTGTCTTGTGATGAAATAGGCCTGCCTTTCATC'ri I'lVl'l i'AAAAAA 
AATAAATGTTrACAAAACATTCCCTCAGATTTTAAAATCATGGAAGTAATAAACAGTAATAAAAT 
ATGGATACrATGAAACCTGA(>ICNCAGAAAACATANCCTrAAATmTG>rrCO\GGATC^ 
TATTTA 

SEQ ID NO: 242 1 ACGCGGGGAAGATGAAGGTAAGTAGAAACCQTTGATGGOACTGAOAAACCA 
GAGTTAAAACCTCTrrGGAGCTTCTGAGGACTCAGCTGGAACCAACGGGCACAGTTGGCAACACC 
ATCATGACATCACAACCTGrrCCCAATGAGACCATCATAGTGOTCCATCAAATGTCATCAACITC 
TCCCAAGCAGAGAAACCCGAACCCACCAACCAGGGGCAGGATAGCCTGAAGAAACATCTACACG 
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CAGAAATCAAAGTTATTGGGACTAT(XAGATCnTGTGTGGCATGATGGTATTGAGCTTGGGGATCA 

TTTTGGCATCTGCrrCCTTCTCTCCAAArriTACCCAAGTGACTTCTACACTG^ 

CCATTCATAGGACC C - riirriUi ' l ATCATCrCrGGCTCTCTATCAATCGCCACAGAGAAAAGGTTGA 

CCAAGCTTTTGGTGCATAGCAGCCrGGTTGGAAGCATTCTGAGTGCTCTGTCTGCCCTGGTGGGTT 

TCATTATCCTGTCTGCAAACAGGCCGCCTTAAATCCTGCCTCACTGCA^r^3TGAGr^GGACAAAAA 

TAATATACCAACAAGAAGTTATGTTTOTACTTTTATCATGATTCACTTTATACCACGACT^ 

CA 

SEQ ID NO: 2422 ACTGGATTTCCATTTAAACAGGGTTAATTTGGAAGAATCrTCAGGAGTGGAA 
AACrCTCCAGCTGGTGCTAGACCAAAGAGAAAAAACAAGAAGTCTTATGATTTAACTCCGGTTGA 
TAAA^T^^rGGCAAAAACTTAAANGANGTCCTTTTNC^mAAT^ 
TTNTTACTNAT 

SEQ ID NO: 2423 A Crrrri lU ' lUUU ' ri ' l ' iU ri - in ' ril - riU - rriN GANACANAGTCTTGCTCCATCAC 
CCATGCTANAGTGCAGTGGAQTGATCTCGGCTCACTGCAACTTCCGCCTTCTGGGTTNAAG CTATT 
CTCCTGCCTNAGCCirCCAAGTAACTGGGATrACAGGAATGCCCCACCACNCCTGGCrAATT^^ 
TTITGTATrTTNAGTAAAGACGGGGmCTCCATGTTGOTCAGGCTGGTCrrCAAACTCCCAA 
ANATGAT^^■GCCrGCCTCCACCTCCCAAAGTGCTTGGATAACAGGTGTGAGCCGNTGNTCCCGGC 
CCCCrrTA' rri ' rcriU - rA GTGCTCAGCTCTCCC C ' lU ' i ' ri 'GGCTCATGAGGTCTAGATAGGGCATATG^ 
ACCACATCCACCTGTCCCCAGGGACTGGTAGGCACATGTNTTAANTTAGTCCAACCAGAGCCAAC 
AGACACCAAGCCCGGGGCmTTGCTGGACATACCCCGCGT 

SEQ ID NO: 2424 ACAGGCTTAAATCTATGTCArrrACACTCACTGAATCATCAACCTCATCACCA 
CCTGTTCCCTGGGTGTAGCGGGTATCATCTGCTrCCTCATCATCATCArrGACCAGTrCAGGACGA 
AArrCAAACACTTCACGACCACTG ATCACT AGT GCTTTC C CTGCm GAAGTCAGCrrrCOT 
CCATATCITGTTAAAGTTTATCAATCriTItriTGTCTITrCC^ 

AGAGTGATTTrGGTAACATTTGGACCTAGGGCAGAACGCTCTCTCTCAATrAGATCTTCTAATGAA 
ATTTCATCrrCTTTCTCTTCTTTCriTITATCTTTmCAA 

ATAAATGCAAATATCACCCCCTCCAGG GCATA CC CAAAA CCAGCCATACrrGTTGTITTCAATAGC 
TTCCAGGAAATGCTTGCACACTATTGAGTTTTNGGTTTTT 

SEQ ID NO: 2425 ACAAAAAAATTCCOTGCTTGCAGAGTTTTTOTTACTTrrrACTTAAATAm 
GAGTTAAGAAAAAAATCAAATTTAAAATCACTATTGATCGAAGCTTATATTCCTTATGAATATATA 

catgtatgcatatatacatctctgtatgaatcactcaaagcaattttaaacatcattcri'r 

tatttaaaattgcccttttaaacamagaagttccagaggtggaggctttaatggaat^^ 

agagtaaatttctrggattgaaattgttccagttcttctactgttagtttatttctgggtc 

acacattatctgttgcaatgatgctgcttoaggctgacagagaagtggatatgctgctatttccaa 

aagtgtcactggatggcttagaaaaagcagtgtgagaatgtgagccx;ggactaccaaaaccagcc 

acagaactgcaacctccaaaggctcccgcgtacaggtcagagtcttclhuluri"14'crrtttgagat 

ggagtctkkrrctgttgccagactggaatgcagtgggtgcgatctgggctcactgcaatctcccct 

ccgggtrcaagcgattctnctgcctcancctccgagtactxkxjactacaggtgcgcgccccaanc 

CC 

SEQ ID NO: 2426 TACGGTCTGAGACATCACCGCCAAGCTGGGCATCGGGGAGATGGCCGAGACT 

gaccccaagaccgtgcaggacctcacctcggtggtgcagacactcctgcagcagatgcaagataa 

atttcagaccatgtctgaccagatcattgggagaattoatgatatgagtagtcgcattgatgatct 

ggaaaagaatatcgcggacctcatgacacaggctggggtggaagaactggaaggt gaaaa caag 

atacctgccacgcaaaagagttgaaggttgctaataatttatactggaatctggcattm 

ccaagagaagatcgaatggctttitgcagctaacrractatgtgtagacaggtntatattataaag 

tatgcattctratcacctagtatatagmgmgtagagtgarrrccccccagtttcttgaacatg 

gtatcttcacatcnxigaccttggtcagttgtgctattcattatraaacactaaaac^ 

cttacataaca1tg 

seq id no: 2427 acctgtggtatgaagccgacaagcgccgcctgttcccaccctggattaagcx; 
cgcagacv^cagaaccacctccxsctgctrgtttacaagtggtgtcaaggcatcaataacctgcagg 
acgtgtgggagacgagtgaaggcgagtgcaatgtcatgctggaatcccgcntgagaagatgtat 
gagaagatcgacttgactctgctcaacaggctgctocgccccatcotgqaccacaacatagccga 
ctacatgacagccaagaacaacgtcgtcatcaactataagoacatgaaccatacgaattcatatg 
ggatcatcagaggcctgcagtttgcctcattcatcxitgcagtattatggarrggtgatggatrtgc 
rrgtattgggattgcaccgggccagtgagatggctgggcccccrcanatgccaaatgacnrctca 
gtttccaggacatagccacnx)aggctgccac(xcatccgtctcttctgcagatacattgatcgcat 
catattirritcaggtrcacagcanatgaggctcgggacctgattcaacgttcctganagancacc 
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CTGACCCCAATAATGAAAACATNGTTGGCTATAATAANAANAAATGCTGGCCCCNANATCCCCA^ 
TGCCCT 

SEQ ID NO: 2428 ACAGGTGGAACAATCCAAAGnTTAATCAAAGAAGGTGGTGTTCAGTTGCTG 
CTCACAATAGTTGATACCCCAGGATTTGGAGATGCAGTGGATAATAGTAATTGCTGGCAGCCTGTT 
ATCGACTACArrGATAGTAAArrTGAGGACTACCTAAATGCAGAATCACGAGTGAACAGACGTCA 
GATGCCTGATAACAGGGTGCAGTGTTGTTTATACnt^TTGCTCCTTCAGGACATGGAC^ 
ATTGGATATTGAGTTTATGAAGCGTTTGCATGAAAAAGTGAATATCATCCCACrTATTGCCAAAGT 
AQACACACrCACACCAGAGGAATGCXJAACAGTTTAAAAAACAGATAATGAAAGAAATCCAAGAA 
CATAAAATTAAAATATACGAATTTCCAGAAACAGATGATGAAGAAGAAAATAAACTTGTTAAAAA 
GATAAAGGACCGTrrACCTCrrGCTGTGOTAGGTAGTAATACTATCATTGAAGTTAATGGCAAAAG 
GGTCAGAGGAAGGCAGTATNCITGGGGTGTTGCTGAANTKiAAATGGTGAACATrGT^ 
ATCCTAANAAATATGTTGATAANAACACAC 

SEQ ID NO: 2429 CNCGGCCGACOTACmCCACTCrTNCmAAAANCTTGCCATTTGCTT 
KITCCTCTXiGGGCrcACCCACTCTAACATGACAAAGGArrAAGAACAAAAGATAGTCCT^ 
TTACAGGCrrGTANGGGCANATAGGATCTACNAACCTTGGAAGAAAAACAAGGTGCTCAOGAATT 
CATTCACCTAACATTTCACrmca^ACCCACCNCTrATNTGCTCCCACTTrGG 
TTGGCTTTAAANCANANGGGGGrWANTGTNCCTTGrmrrGAAATGTrrGCAAN 
GCGNATN1TCAA 

SEQ ID NO: 2430 ACTTTCCACTCTTCCrrrAAAAACTTGCCATTTGCTTATCAGTTCCT 

TGACCCACrcAAACAAGACAAAGGATAAAGAACAAAAGATAGTCCTCCGAGGTrACAGGCTTGG 

AAGGGCAGAOAOGAGCTACGAACCITGGAAGAAAAACAAGGTGCrCAGGAATTCATCGCCTAAC 

ATTTCACTTCCCCACCCACCCCrrAGTGCTCCX^ACTTTGGCAGTGATCTCTC^ 

AAAGGGGGAAATGTGCCTTGrmGCAGGTGTGCAACAACACAGCTCTGGCATCTCAAGCAGCAG 

GGGAGAACTCTAAGACAGAAGAATTTCnTCATGAAAATCACGGTATGTTATCACATACTGTCTCCA 

TGGa;CATACAAGGACTCCrrAAGGTTCTCTCTAACATACAACATATCCCCCACAACTCAGTAGAG 

AOGrmCTTCCCACTGGAATAGAAATCCmGCCTCATTTATTACAGTCTAAAAATCCACACTGGC 

TTrn-GATTCTTTCTAGCATGAGCrCANACACTATACGTCAAAAGAATGCCCAGAGGCATT^ 

AGGTAGAGACTTACCrGCGGGCTQAGAOTCGGCCCCACATAGCTCAAATCAATTCTGCANCTGCT 

CCTT 



SEQ ID NO: 243 1 ACCAATTAAAGTTGAACAAATTGAAGCAGGGACACCAGGCCGACTCAGAGT 
AGTAGCTCAGTCCACCAATAGTQAGGAAATCATTGAAGOAGAATATAATACGGTGATGCTGGCAA 
TAGGAAGAGATGCrraCACAAGAAAAATTGGCTTAGAAACCGTATGGGTGAAGATAAATGAAAA 
GACTGGAAAAATACCTGTCACAGATGAAGAACAGACCAATGTGCCnTACATCTATGCCATTGGCG 
ATATATTGGAGGATAAGGTGGAGCTCACCCCAGTTGCAATCCAGGCAGGAAGAITGCTGGCTCAN 
AGGCTCTATGCAGGTTCCACrcTCAAGTGTGACTATGAAAATGTTCCAACCACTGTATrrACTCCT 
TTGGAATATXKJTGCTTGTQGCCTrrCTGAGGAGAAAGCTGTGGAGAAGTTTGGGGAAGAAAATAT 
TGAGGTTTACAATAGTTACTTTTGGCCATTGGAATGGACNATrCCm'CAAGANNATAAC/^ 
GTTATGGAAAAATNATCTGTATTACTAAAGACAATGAACGTGTTTGTGGGNCTm^ACGT^^ 
CCGGGCGGNCNNTTAAAAGGGCT 

SEQ ED NO: 2432 AOGCGGGCCAGCAGrrACTCATGGAATATATTCTGCGTn'ATAAAACTAG'nT 
TTAAGAA0AAAl"lU"l"l"l1UGGCCTATGAAATTGrrAAACCTGGAACATGACATTGTTAATCATATA 
ATAATGATTCnTAAATGCTGTATGGTTTATTATTTAAATGGGTAAAGCrATTrACATAATATAGi^ 
AGATATGCATATATCTAGAAGGTATGTGGCATTTATTTGGATAAAATTCTCAATTCAGAGAAATCA 
TCTGATGTITCTATAGTCACnTrGCCAGCTCAAAAGAAAACAATACCCTATGTAGTrGTGGAAGTr 
TATGCTAATATTGTGTAACTGATATTAAACCTAAATGTTCTGCCTACCCTGnGGTATAAAGATATT 
TTGAGCAGACTGTAAACAAGA AAAA AAAA ATCAT GCATTCrrAGCAAAATTGCCTAGTATGTrAA 
TTTGCrCAAAATCAATGTTTGATTTTATGCACTITGTCGCTATTA^ 

TCAATAATrGAGTAATTTTAAAAGCATTArrTTAGOAATATAGTTGNCACAGTAAATATCT^ 
TTTCTATGT 

SEQ ID NO: 2433 ACTCTCCCCTACTTCCXnTAAACrCACTCGTATrrCTGAAGAACAGTAATAAC 
TCTTATGAGCXirrAATACATCCCrrCATrCTATTAGGTCrrTTCGTCC^ 

GGGCTTTATGAAGCCACCCCCACCACTTAGGCTGAGCCCCAAAAAACTAGTCATCCCTACTATCTT 

CTGTCCGGTCATACrCCTArrcrcCATTCTCAACTAOTATAAATGCCCTACTCC^ 

TGGTTTACACTGTTTCITCAAACCATCACAGCTGATATCTCTTGGTGCTATCCCCAAACT 

TTAACTCCXrrCTTAGAGTGGGTAGATGATCrrrGCTGGCAAGGCACTCTCCAATAC^ 

ATQAAGTTCTArrCTTrACTTTTTACTCACTCnrTATTCTCATO 
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CTCCCCAGCrATCTCCACCACACTATCAACCTTACCCATTCrcrCCTAGCCGCTrCrAAT 

TGGCGAACAACTGCTGGCmGCATGTCTCOTCrrCCAGTGCCTACACAGCTGCCCCGOT 

ACAGACTGGGCAACACCmCCGCTCCCTACACCTTTTOAACTCCrrTAACAGCCCTC^ 

SEQ ID NO: 2434 ACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCAGTGGTTTAAAC 
GGCAWTTTTTGCTTATAAAAAAAGTGCAAAAAAGATGTGGTTrACAA 
CCTTTTTGCTGTAATTGCACCAGTmAAAGCCnCTGGACAGAGCAGTATT^ 
TmCTTAAAAGCTTACAGTGTTTGGCTAArrCTCCTCCCCTTTTTACAAGACG^ 
GGACACTGGTGGCAGGTTAAGGGATACTGCACmAAGAAGCCTGCTGATTGAAGTGTAAACATK 
GGAGAAATNAGGGGCTGATTmTAAACrGTGTGAGATATT 

SEQ ID NO: 2435 ACTTGGCTTGGAGACTGGCGCGGCXJrrCGTGTCCGAGTTCTCTGCAGGTCACT 
AGTITCCCGGTAGTrCAGCTGCACATGAATAGAACAGCAATGAGAGCCAGTCAGAAGGACTTTGA 
AAATTCAATAAATCAAGTGAAACTCTTGAAAAAGGATCCAGGAAACGAAGTGAAGCTAAAACTCT 
ACGCGCTATATAAGCAGGCCACTGAAGGACCTTGTAACATGCCCAAACCAGGTGTATTrGACTTG 
ATCAACAAGGCCAAATGGGACGCATGGAATGCCCTTGCAACCTGCCCAANGAA 

SEQ ID NO: 2436 ACTTGTATTGATTATGTAGTTCAGTAAGATGTGCCCAAGTCATTTCAGAAAGA 
AAGACX;CTrCAGTTTTGATGCATTITGCTGAACACTTGGGTAGTGAGTGGQATCCTATCCAGTTC 
GGAATGCTTGCAATGCTCATTGAAGGGATTTGCTITGGGACnTTCTC^ 
TATTGTOTATTTAGGCCCATTGTNAT^G^m*GCmAT^r^TTGGTAANTATTA^m 
CTGANTrNATTGACTOm3ATThrrAAATCTTAATT 

SEQ ID NO: 2437 ACAGGATOAATTTAAATGTGTTTTTCCTGAGAGACAAGGAAGACTTGGGTAT 
TTCCCAAAACAGGTAAAAATCTTAAATGTGCACCAAGAGCAAAGGATCAACTTTTAGTCATGATG 
TTCTGTAAAGACAACAAATCCCrrTTTTTTTCTCAATTGACTT^ 

CCTCTAAAGCAAATCTGCAGTGTrcrAAAGACTTTGGTATGGATTAAGCGCTGTCCAGTAAC^^ 

TGAAATCTCAAAACAGAGCTCAGCTGCAAAAAAGCATATTITCTGTGTTTCTGGACTGCACT 

TCCrrGOCCTCACATANACACTCAGACAGCCTCACAAACACAGTAGTCTATAGTTAGGATTAAAAT 

AGGATCrGAACATTCAAAAGAAAGCmGGAAAAAAAAGAGCTGGCTGGCCTAAAAACCT 

AT ATGA TGAAGATTGTAGGACTGTCTTCCAAGCCCCATGTTCATGGTGGGGCAATGGTTA'nTGGT 

TATTTrACrCAATTGGTTCrCrCATITGAAATGAGGGAGGGACATACAGAATO 

GCn^CTAAANCCTTATGCNACCCCTNAACCACGAGGAACATCCTTGCCNGGaXK 

GOGC 

SEQ ID NO: 2438 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTTTATTTGGAGAAGGAACGAGAGAAGGAAATTAGTGATOATGAOGCAG 
AGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATAAAGATGATGAANAAAAGCCCAAGATCGAAG 
ATGTGGGTrNA 

SEQ ID NO: 2439 ACATTAGCACCATAACAGGCGTCTTTAAAGCCTTCCCAAATATTAGTAATCTr 
GACCAGCAATGACAAGAAAAAAGAGGAGCACCTTTACAAGCAGrrGATATCCAATArrAAAATAA 
TrGTCGCTTTAAAAATAmCTTTAAATrcnTGCATTACACTTnCT^^ 
OATrAATCAATGAAATTTATAAGTTTTATCAACGTATAAAATTnTTTCATCTTCT 
AATCAATCTGTGTrrCrGACANTTGAGGTAGTTAAAATANGGAGGGCrrr 

SEQ ID NO: 2440 ACGCGGGGGCACAGCCAGAGCCTAAAGGCTAGAGCCGOAGCTGCCGCGCCA 
GTCGCCTAGCAGGTCCTCTACCGGCTTATTCCrGTGCCGGATCTTCATCGGCACAGGGGCCACTGA 
GACGTTTCTGCCTCCCTCTTTCTTCCTCCGCTCTTTCTCTTCCCTCTCGm 

TGAAAGGAGAAAGCACGGGGTCGCCCCAAACCCCTTCTGCTTCTGCCCATCACAAGTGCCACTAC 

CGCATGGGCCTCACTATCTCCTCCTCTTCTCCCXJACTATTTGGCAAGAAGCAGATGCXjCAT^ 

GGTTGGATTGGATGCTGCrGGCAAGACAACCATTCTGTATAAACTGAAGrrAGGGGAGATAGTCA 

CCACCATTCCrACCATTGGTTTTAATGTGGAAACAAGTAGAATATAAAAACATTTGTTTCACAAGT 

ATGGGATGTTGGTGGGTCAAGATAGAATTAGGCCTCTCTGGAAGCATTACTTCAAAATACCCAAG 

GQTCrTATTTTTGTGGTANATAGCAACGATCGNGAAAGAATTCAGGAAGTANCANATGAACrGCA 

AAAAATGCTThrrGGTANATGAAATTGANAAATGCANNGCTNNTACI^^ 

TTGCA 

SEQ ID NO: 2441 ACATTTAOAATTTTTGGCCGGGTGCAGTGGCTCACACCCGTAATCrCAGCACr 
TTGGGAGGCCAAGGTXjAGi^ATGGCTCGAGGTCAGGAGTTrGAGAGGTCAGGAGTTCAAGATCA 
GarrGGACAACATGGTGAAACCCCGTCTCCACTAAAAAAATACAAAAATTAGCCAGGCATQAGGG 
TGCATGCCTGTANrcCACGCTArrCAAGAGGCTGAGGCAGGAGAATCNCTTGAACCTGGGAGGTG 
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GAGGTTG 

SEQ ID NO: 2442 ACACAATGTGTAGCTCATATGAAAACCTCATGACAAGTCATTGAGTTCTGTA 
AACTGCCATACAACITACATGTGTAATATTA ATTG CACAAAGTATATCAAGACATATTGAAGAAA 
CACAAAATTAAGTGCTATTTTAACXjGTTCATCmCAGTATGTGAATAC^ 

CACAGTTTTGTTGAAAATAAGTTTCAACAATAAGACTTCAGTTGTTAAAATTAACCCAAAATATAA 

CTTTAATTAAATTACATCACTACAATATTAATCTCTrATCTTGmACATT^^ 

TTTGTTAAAAAAAAATCCAAhrrrCrAGAATTGCAAAAACrrCATTGTrQCCACTGAGAGA^ 

ATATNGTNCGCGGGGGGGAGACCANANATCTANCGACTGAANCANATTGGCCAAACrGNGTGGG 

GT 

SEQ ID NO: 2443 GATCTACCATGCAGTTGCAGCTCTAAGTGGCmGGCCTTCCCTTGGCATCCC 
AAGAAGCACTCATTGCCCTTACrG>rr(XiTCTNAGCAAGGAGGANACn'GTGCTGGCAACAOT 
GCTCTGCAAACAGCATCa;ACCTTGTCCCAACAGGCTGACCrGATGAGCATCTTGGAGGAAATTG 
AGGACCTTGTTGCTCGGCTNNATGAACTCNGGGGGCGTN^ 

AAACAACATCCITITTTTGTGGGCn"GCCACCCTACAAAGCTCArrGNATCAT^^ 

CCCATCCATTAANGGNAGGATNAGGTCATCCAACTTOATNAACCaSfATCTTTAACAATAAm-AA^ 

rmOAATTCNCTCTTACTAANChrrTTAAACAGGGAGCCTCTrGCAAC^ 

SEQ ID NO: 2444 gtacatacacacacacacacancncagagagaanacagagagaaantcctg 

GTCCAAAAGATCACATGACCTTACTAGTOTTTCCCCANTGACrGTAATTTATAAACTAAAAATm 

TACAAATCCACTGCTATCTTCrrCTGTCCrcAGTrTNGGTNGACTITAATGGAT^^ 

CrAGANTCTAGGACATGCAAACTCACTGTGAGCGAGAGGCTAGGGATCTG(XCTAACATAGGAAC 

C^GTTTCTATCAAGCCTGAATGAGGCAGCT^r^G^^^AGA^^ITAATGACAAATCAATGCCAG 

TATTCTGCAAACAGGGTAGCrmOTGCTTTCTTTTNATTATTT^^ 

TGANCCATGGGTCTACTAAmATCACTAAAGGACTGGGACCACATTCTNrAGNAANAC^ 

TNNTTACTGTCCANGGAGGGAAAATCCC 

SEQ ID NO: 2445 ACOTAAAAGTGTCTCACCTAGAAGGCCTCTACCTGTAATCACATTAATTTTr 
CTAAAGACAATTTGGTGTITrGAAQATAAATGTCArrAGTCTATGATAATAGCATCATAGGACAAT 
TAGCCATTrrAGACTTGACCATATTTTCTCTTTrrAGC^^ 

ACTACTCCAATGGAGCAACAGTTTCATTTITACATGATTGGATTTAANAAATTrACA^ 

CTCATAAGAATTCTTAAATAATTTNAAAATNGAAACATTIWACCCANAGTCTANCANC^ 

hTITTNTAAAAATACTTCArrG 

SEQ ID NO: 2446 ACGGGTATCACTTTCCGGAGCTGGTGAAQATCATCAACGACAATGCCACATA 
CTGCCGTCTTGCCCAOTTTATTGGAAACCGAAGGGAACTGAATGAGGACAAGCTGGAGAAGCTGG 
AGGAGCTGACAATGGATGGGGCCAAGGCTAAGGCTATTCTGGATGCCTACGGTCCTCCATGGCAT 
GGACATATCTGCCATTGACTTQATAAACATCNNAAGCTrTTCAGTCGTGTGGGNGTCTTTrAT^^ 
AATACCGNCAGANCCTACACACTTACCTGCGCTNCAAGATGAGCCAAGTAGCX:CCCACCCTGGCA 
CCCTAATTGGGGAANNCGGTAGGGCAasnrtm^ATNGNACATGCTGGANNCTNACCAACCTGNCA^ 
TTATCNACATCCACAGTGCAAATCITNGGGCTGAAAAAGCCXrrG™ 

SEQ ID NO: 2447 ACCAAAGCTCACTACTGCGGTTTGCCTGTGCCTGGACAATGAGGCGGAGCCA 
CTGTTGGGGCACCCCCrrCCCrcCCCGGGTrTGCAAATAOAGGCTACCGGGTC^ 
CACCTGTTTTACTATTTGTTATTAAACTATCATCTCCACCTTCCTTITGATTAGC 
TGAAACAGCTAAGTCGAGCTAATTAATATGCACTGCCACACATACTTGTCACrrnTGGGGTGAGA 
CACTTAAATCTACTCTCTTAGTGGTmATrATTAriTITITGAGACGGAGTCnTGC^ 
GCTGGATGCATCGTGCAATCTCGGTTACTGCAACCTCCACCrCCAGGT 

SEQ ID NO: 2448 ACTTTTACTTmAAAAATTTCAAACTTTTATAGTATTCTGCTA^ 

TGTGAGGAACACTTAATACTGAATTTTCCCCTCAATAGCCAAGTCAGAATATACTTATTTC^ 
CACCCAAGTCirrGTTTCTAAACGCATTATGTAGAATAGATITGATCCGTCACAGCATOT 
GCTAATGAGGTA GGAG TATTCTTAAGTTCTTAGGCTTAJ^GAAGCAAAGCAAAAGACAATCAAAT 
TACAAAGArmCl i i 1 GNAGlXjGGTTGNNCTTTNAANNAAAAJ^GCTAAAAANCACTGGNAAm 
TGANCAAATGCTACTAACATTAAAAAN 

SEQ ID NO: 2449 ACAGCATCGTAGGGTTCCCCTAAACTTGCCCTGrmTG 1"1"J i l l 1 AGTTTGTT 
ATCCCCTTACTGAGCGGCCrCTACTAGQTGGCTGTGATTAAATGTCCCAAGCAAGGATAGGGAAG 
GGGAATGGGTTGACCTCTGGAGATCATTOTAACCAATCCTGCCAGACCTGTTTGGGCAGTGGGGG 
AGCAAACCTAGATAAGGACCTGTTGGGGCACAGGGAGCAAAATCTCnTAACAACrAANCAAGTT 
CCTATTCACATCAACAGANCGAGGCTGTGATAACTTAAGGAGOCACAATCCTAATAGTCCTTCAGT 
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GCATmAATCTGTCTCCAACTGGACACCAGTAGGTAGTGTCAAGCCANANATTCGGGGGCAGTA 

AATAAATOmCATTTTACTGATGCACm 

GCCTTTTTAGGCNGGGAGTT 

SEQ ID NO: 2450 AAATrTACAAAANAGGTTrATrGGACITACAGTTCCACGTGGCTGGGAAGGC 
CTCACAATCATGGCGGAAGGTGAAAGCCACATNTCACATGGCAGCAGATAANANAAAAAAAGGT 
AGTGATCrrrTAGTAAANAAACCrGANACTGGTAGTGGCTTGGAGCCANANGATCXjCTrAAGTCC 
GGGAGrrCGAACCAGCCNOGGTACAAATTGAACCCCCCCACTTTAAAATTAAAAAACGAAAGAAA 
AAATIXKnXKjOTGNOGNGGCTCACACCTGTAATCTTANCNTrrcG 
^X}Cm'GANCTCA^WATTTTAANACCACCTGGCAACATGGTTNAAACC^^ 
AAAAANTNNCCAGGCNTQGTANCGTGCACCTTmrCCAArTACTTGG^ 
TNGTTGAACCrrNGGAGGGAAAGGTTGNAAAAATATTGCN>riTNNOT 
NAAAA 

SEQ ID NO: 245 1 TACGGNCTGAGACATCACCGCCAAGCTGGGCATCGGGGAGATGGCCNAGAC 
TGACCCCAANACCGTGCAGGACCTCACCTCGGTGGTGCANACACTCCTGCAGCAGATGCAAGATA 
AAmCAGACCATCTCTGACCAGATCArrGGGAGAArrNATGATATGAGTAGTCNCATrGATGATC 
TGGAAAAGAATATCNCGGACCTCATGACACAGGCTGGGGTGGAAGAACTNGAANGTGAAAACAA 
GATACCTGCCACGCNNAANA>nTNAAGGNTGCrAATAATTTTATACTGGNAATCrrGC^^ 
AANCCNAANTAAAATCNAATGG^^^TT^^^^CANCn'AACTACTTAT^^ 
TAAAAATT 

SBQ ID NO: 2452 CCGGGCCATCATTTA>mATGGAATATATTCTGCGTTrATAAAACTANTTm 
AANAAGAAA'ril'l^lU'l'GGCCTATGAAATTGTTAAACCTGGAACATGACATTGTTAATCATATAAT 
AATGArrCTTAAATGCTGTATGGTTNATTATTTAAATGGGTAAAGCCATTTACATAAT^ 
ATATGCATATNTCTAKAAGGNATGTGGCNTrTArrTGOATAAAATTCTCAATTNAAANAAATC^ 
TGATGTTTCTATAhWCACTTTGCCAGCTCAAAAAAAANACAOTACCCTATNrrAGTNGNAG^ 
ATGCTNATAT^TGNGTAAC^WATA^'AAACCTAA^mm■C7NCCTACCm'G^TGGT^^ 
TTTTACCATAACTOTANACAANAAAAAAAAATCTGCam-CrrAGCA^ 
TGCTNAAAAATAAATTGTNTATTTTTNAT>mOTAGNaWATTAAAN/^ 
NTAAANAATTCGATTAhrn^IAAAACCTATTmAGGNAATTATTrGC^ 

SEQ ID NO: 2453 Aci"i'i"iTrrn 1 1 u u inn i i innnggaatgcgtttattttaacaaccaaaa 

AATTCTAACAGCCTAACAATGCACATAAGTTAAAAArrAATTATCACTTAGTGATAACAAAGATA 

GTTGATrrACATGGAAAAAAQAACATTTACAATATGTTAATCCTTArrCACArrGTTGATACCGCA 

ATAAAACACAATTTGTTTITnCATTTCACAAAAAAAAAAAAAAGGCGGGAAATTGTGC^^ 

ANNAGGCAJH'ACAANAAAAAGTTmCrrCCAAATAGATTTTTTGACANATNrrrOA^ 

CATNTTTGCATTGAAATNCGNAAAGATCrGGNAAAACCACAGGCrAAAATGCCrACAGATTCACT 

TANAACCCrGNAAACTNGGCA>rrGAAATTAAAAATAAAAGGCANGACTTANCNrrrGAAAA^ 

C^AATAA^^^mAGTT^^AAAGGGAAANAAANCCNAT^m3CNGATGCCT^^'AT^^^ 

TTNTNAAAACnTGGCNAAATCANNTTNCTTTANGTG 

SEQ ID NO: 2454 ACAGGATGAATTTAAATGNGTTTTTCCTGAGAGACAAGGAAGACTTGOGTAT 
TTCCCAAAACAGGTAAAAATCTTAAATGNGCACCAAGAGCAAAGGATCAACTTTTAGTCATGATG 
TTCTGTAAANACAACAAATCCCriTrTTTTrCTCAATrGACTT^ 

CCTCTAAAGCAAATCTGCAGTGTTCCAAAAACTTTGGTATGGArrAANCGCTGNCCATrAACAAAA 

TGAANTCTCAAAACAGANCTCANCTGNAAAAAACNTNTTTCTTC 

CCTmCCCTNANATAAACACTCAAACNGCCTCACAAACNCANGGNNTCT^^'AT™ 

NGGNTTCTGAACnTTCAAAAAAANGCTTTTGAAAAAAAAAGAGCNGGCT 

ATT^TGATGAAGA™GGGGGCTGCKrTCCNAAGCCCCAT^m'Cm'GG 

GQGTArrriTAa^CAATrrGGGTNCTNTCATTTGNAATNGGGGGNGGGCCm 

NGGNGTITGTTm>ICTTAGAGCCrnmGCACCCCCTTGNACC^ 

GGCG 

SEQ ID NO: 2455 ACTGAGGACATGGCTCTCAGCTGGTrrCTTATCTGCTCroAAGGCATGC^ 
CAAATGAGGACCAATCGGAGCATCTTCTCGAGTAGCATAATTCAAATCAGATCCCAAAACTCAGG 
GTCCGAGAAGTGTOATCAATACGAACCCTGGCAAGTCGCATTNCTGCTTGCArrCTACTATGG^ 
GTTCCAGTTTGGAAAAGCATCANCAAAGGGACCAAAGAAGTCAACNAAAAAACTCATGCCrCTGA 
TAATCIWGANACCTGCTGCAAAAAGCCGAGGATGOrrGTTGTTrTNCA>rrrGTG<^ 
GNTGCATTTCCGGTTCTTNTCAGGTTGrmCCTAACCa^ANTTrAANAACCm 
AGTTTTAATGGGGNTTAAAT 
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SEQ ID NO: 2456 ACTCATTTGTATrCAOTTCACTITITCrCATGTTCrAATTATAAATGACCAAA 
ATCAAGATrGCTCAAAAGGGTAAATGATAGCCACAGTATTGCTCCCTAAAATATGCATAAAGTAG 
AAATTCACTGCCTTCCCCTCCrGTCCATGACCTTGGGCACAGGGAAGTrCTGGGGTCATAAATATC 
CCGTTrOGGAAGGAAAANCTGGNCTTAAAACTGCACATGACTGGGACCNAANATGAGNGNCACCT 
CAAATGGGTGAAAATCTGGCATCATTTTTGGAAAGACCTGCIXiAATGGTTCAAATAACTAA 

• GTTAGGCCNAAGAAAAGTTGGAATCCGAATAATCTACTGGAGGTITGNCCTTCAl"ri-riCTCT^ 
TCCTCntjCCTGGCTGATATTATCTACTNrrAAATAGCTATTTCATCCAAGTGCA^^ 
AATCTITTTGGGACTTCTGCTGGCCTGTITArrrCTTrrATATA^ 
TNrrAAACACTATCTTATCTTbn'CCTGACTGTGATTTTAATTAAAAAT^ 
AAAAAAAAAAAAAAGN 

SEQ ID NO: 2457 ACGCTGGATAGCGTCCAGGCCAGAAAGAGAGAGTAGCGCGAGCACAOCTAA 
OGCCACGGAGCGAGACATCTCGGCCCGAATGCTGTCAGCTTCAGGAATCCCCGCGTACCAACAGA 
AAGCTGGCCAGGCCCCCAAGTTCCTCATCTATGATGCCTITATCAGGGCCTCTGGCATCCCGGACA 
GGTTCAGTGGCAGTGGOTCTGGOACAGACrrCACTCTCACCATCATCAGACTGGAGCCTGAAGAC 
rrTGCAGTGTATTACTGTCAGCAGTATGCTACCrCACCGCTCACTTTCGGCGGGGGGACCAAGGTG 
OAGATCAAGAGAACTGTGGCTGCACCATCTXjTCTTCATCTrCCCOCCATCTGATGAGCAGTTGAAA 
TCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACrTCTATCCCANAGAGGCCAAAGT 

SEQ ID NO: 2458 ACAGCTAAGCGAACTTTAAGTAAAAAGGAACAGGAAGAATTAAAGAAAAAG 
GAGGATGAAAAGGCAGCTGCTGAGAmATGAGGAGTTTCTTGCTGCTTTTOAAGGAAGTGATGG 
TAATAAAGTGAAAACATTTGTGCGAGGGGGTGTTGTTAATGCAGCTAAAGAAGAACATGAAACAG 
ATGAAAAAAGAGGTAAAATCTATAAGCCATCTTCAAGATTTGCAGATCAAAAAAATCCTCCAAAT 
CAGTCITCCAATGAAAGACCACCATCTCTTCTTGTGATNGAAAAAANANAAACANNNAAAAAAAA 
ANGGTC 

SEQ ID NO: 2459 ACGCGGGGGAAGCTTGGACCGCATCCTAGCCGCCGACTCACACAAGGCAGGT 
GGGTGAGOAAATCCAGAGTTGCCATGGAGAAAATTCCAGTGT(>GCATTCTTGCrCCTTGTGGCCC 
TCTCCTACACTCTGGCCAGAGATACCACAGTCAAACCrGGAGCCAAAAAGGACACAAAGGACTCT 
CGACCCAAACTGCCCCAGACCCTCTCCAGAGOTTGGGGTGACCAACTCATCTGGACTCAGACATA 
TGAAGAAGCTCTATATAAATCCAAGACAAGCAACAAACCCTTGATGATTATTCATCACTTGGATG 
AGTGCCCACACAGTCAAGCTTTAAAGAAAGTGTTTGCrrcAAAATAAAGAAATCCAGAAATTGGC^ 
GAGCAGTTrGTCCTCCTCAATCTGGTTTATGAAACAACTGACAAACACCTTTCTCCrrGAT^^ 
TATGTOCCAGGATTATGTTTGTTGACCCATCTCTGACAGTTAGAGCOGATTTCACTC^ 
TCAAACCGTCTCTATGOTACGAACCTGCAAATCAGCTNTGNTrGCTrcACAACATGAANAAA 
TNAGTTGNTrGAAACTGNATTGTAAAGAAAAAAAAATNrrCCAACCCCTTNTI^ 
TGAANTTTGAA 

SEQ ID NO: 2460 ACCTTATGGATTTGACCCACCTCATTCTGGACAAAGCCTC^GGAGGATCTCTT 

cagggacatgatgcagttitgagactggtagagattcgaacggtttrggaaaagcttcgtccot 
gactaaaagctgaagtatcaaattgacaagctgatcaagactgcagtgacaggcagccttagtga 
gaatgacccacttcgttttaagcctcatcccancaatatgatgagcaagttgagctctgaggatga 
ggaggaagatgaagcanananatgaccantctgaggcttcaagggaana 

seq id no: 2461 accatcctgtggctccitaaggaggcttctcrctttaattctcc^ 

ccagggtgatctgggctatgggaagaacccttcaacttgggagtagacaggtgctccaattcata 

GTGCCCATTCTCAGAGGCCTTGTGTGTGAGTTTCTCCTTCATGCCTTCCTTCTGGCTCT^ 

ccataatctgctggagctggtgcccagcatagtctggcttggtggtcagcgggccagccggcaca 

gctacaccaaggacatctgacaccatgtaggggcgcagccagcccaccaagggagtgcttccggg 

gctgtagtgggtctgtttgtggtagaaganaagtccatctacctcaaaagggaaatcxzatagata 

gcacatcacacaggcttrcgggagtgcaagggaagttctttanccccacaaatitaaaaggatta 

agcttggrmctctcccagtccttcttcttctggtaactttgaatgcatccagtanaatcg 

cagtctggcaatcataaaaagggtgtccccgccagcacatcacatccanaacgtantaggtctng 

gtttacctnattngtaaatgcaatctanaatgggg 

SEQ ID NO: 2462 ACAACCCATAATTTCrGGTCX:ATCAAAAGAAAAGGGATCAGAATCATCTGGT 
TCTGACCTCATCCTGTATGTCTrrGTCAGCACrrCATTrGTAAAATArrCATTGGGTTCAAAG 
ATTCTAAGACAAAACTCATAGGCrGGCCAGCATCTGAGAACTTCACTTTAATATCrm 
TCAGAATAGGTTCATCGTGTTCCTGAACCAT ATCAC TGAGCAAGTCAACATTCTTAAAAACAGTTA 
ACCAAAArrCAGGAATTCCTTTGGGGTCTrCTTTITCTrCATCTTTm 
TTTTCTTrCAATTCCTCCGAAATCTCATCTTCITCATCTGGTTTCCAT^ 

TTCATAAATTGCATTAATAATrrCAAATCGCTTATCAAATAGAGGCTGATAAGAGAACAGCATACT 
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TCCTTTCAAGATajTGAACTrCCrrCATAGAArnGGCTTCTATCTGTGCACAm 

TTTGAGACNATTCACTCGTCrmAACTACCCTAGGCAGGCTTTCAATGGTATCCTGGTNGG 

CTACCCAAACCATCAANTTCTTrCTTGGAAGGGGCTGCAAAAAATCTGA^rGATm 

SEQ ID NO: 2463 ACGTTCAATCCGGGGAATGATGACATGTrCAATGGCATTTACACGCCTGTTGG 
TTATCTTAATAGCTTCATCCAAAGTAACAAAAGAAGTCTGCAGAGAAGCTAGrrCCACCAGTAGTT 
CCACTGCITrGGCATAATTCCTCriTAATTrAGCX:AACTGTrCCCCACCTCTGGCTA^ 
TTCATAACrGTCAGTTCOTCATGGTAATGTrCAAATACTGGCAAAGTAACACCTGCTACATTATCT 
TrCTTCGCTCGAATCTrCACrrGCGCTTTATTGACATTTTGGATAACTGTAGT 
CTGTGAACTTCGCTTCAGTTAGTGAAAAGGCAGCTTCrCTCATCACrrCGCCCAT^ 
CTCTATTATCTTCTITAGGATCrOTCGAAATCGAAGAGTTAAGGCATCANATTT^ 
TTTCGACCTGTCTGTGCTCCCTITAAACGAGCCITCATGATGGTCTGTGCCATTa^ 
ATTTCAATTCGGCTTTGCCCGACATTNTGACNATAACT^^ 
CCGAGGCTGCAANTA 

SEQ ID NO: 2464 ACTACTGCrGrrTTCTGAAGACGCGAGGGCAAGTGCAGCCAGCCG ri-l'Cl"ri-r 
CCTTCTTTAAGCGTTTCTTCTCCTGTTTCTCCAGCTrCCTAGTAATCT 

TGAACCATTGCTTCCrrCATGACATCCAGATTCTTTCGTGGTATCTCTCCAGTCTCATAGAAGGACA 

GTa^CTCTTCAACrrGTTCTCGAAGCTTCTCCCCGAATACACTCOTGGGC^ 

CNATTCGTGAGGCAATACTGCATrTNGTTTGCCAhrrNTATCNGGANATGCGGNC^ 

NCT 

SEQ ID NO: 2465 ACGCGGGGGAACACCACCCAGTGTGGAGCATCCCACCCTGCTCACTGAGGCA 
CCCCTGAACCCCAAGGCCAACCGGGAGAAAATGACTCAAATTATGTTTGAGACriTTCAATC^ 
AGCCATGTATGTGGCTATCCAGGOGGTGCTGTCTCTCTATGCCTCTGGACGCACAACTGGCATCGT 
GCTGGACTCTGOAGATGOTOTCACCCACAATGTCCCCATCrATGAGGGCTATOCCTTGCCCCATGC 
CATCATGCGTCTGGATCrrGGCTGGCCGAGATCTCACrGACTACCTCATGAuAGATCCTGA(^ 
TGGCTATTCCTTCGTTACTACrGCTGAGCGTGAGATTGTCCQGGACATCAAGGAGAAACTGTGTTA 
TGTAGCTCTGGACTTTGAAAATGAGATGGCCACTGCCGCATCCTCATCCTCCCTTGAGAAGAGTTA 
CGAGrrGCCTGATGGGCAAGTGATCACCATCGGAAATGAA(XiTTTCCGCTGCCCANAAACCCTGTr 
CCANCCATCCTrCATCGGGATGGAGTCTGCTGOCATNCATGAAACCACCTACAACAGCATTATGA 
AAGTGTGATATTGACATCAGGAAGGACCTTTTTCTAACAATGTCTATCAGGGGGC 

SEQ ID NO: 2466 ACiU-lUUUMHU'lUUlUU'l'inni'in'lUU'GGCrrNCATrrGAACATTTAATAAAAAT 
GTTTAGGTTGATATCTTAAAGTTGTCAGTGAAATCTCANATTTACATAACAGTTTGCATCCAGTAG 
ILLlLi I GAGTAACACAAACATCAGTATAGCCAAATG7TAACTCTTATCACTTTTTGTGTTAATAAC 
AAAGTAATATTTGTAATATAGNGTCCCTGAATAACTATAAAAACnTCTrGGAATTAT^ 
AAATAAAACACTATTTAATAAATGGTATCATAGCCCTrCAAAANACACATT CTGAA ATGCTATCAG 
TCAGGGGGCAAAAGTrcTTTTAGTTCAGTTTATTTTTTAATATGTCCT 

AATTATNATCrrATOACAGCNGTAACrnTAATTAATAATATTAACAAATCATTATNGATATAGGC 

rmCAATITGCTCAANATTAGGAArrTGNAAAGNGGGAATGAANCAGCCCITCCNATITG^ 

AT^GGATCCAAAAGGTAATa^AA^^^GGCTTmAAAT^TAAGC^^^GGGGGANAA™ 

SEQ ID NO: 2467 ACTTGTTAGGGAAAAAAAAAAGTTTGCACCCCCAAAOGTCCTGTATCrrATG 
AAAAAAAAAACAAAACCAAAAAAACCCCAAAAACCCrCGGAACCAAAACCAAAAAAAAGTGCA 
GGTOATTTirCTACCAAACAGCGAANCACCCrmGhriTCCChrrGCAACTTC^^ 
TNCTATNCmNTATNTNCGTTCTGGTTGGCAANCCCTGNTGATCAAAAAAAGTCTCT 
AGTOTTAGTAACTAATTTrTATATAAGrrAATGTAGGATAAAGTAGAGTGCATTAAGACACAATAT 
TGTAATCCCTACTrrrAGGCCTTGCCTTTTAACTATTGrmCAGCCCr^ 
CCTATACAATCAAGTACTGAAATTCm'GGGAAAAAACTITGGCTCCTCAT^^ 
AGGGGGTTTGGGTTTGGTTTTTTTTCCTTAATTNGCACCA^ 
CCOTGCCCTGGAhrrANCGAATTTTTGTGGGAATnmNCACATGCNCCn^^ 
TTTTCCCTAATCAANCATrTGGAGACACTTTITGNAAATNGGGACTTTTATGTCACCCATTC 
NGTTCAACAT 

SEQ ID NO: 2468 ACTACAGACCTCCACATGTTGGACAGTAGGATTCAGCAGAGAATGAGTCCCA 
CATCCTCTTTTCATCCATAAGCCACTCrrCCCAACTACrCTTCCGCCTTCCATTGTCCCTATGT^ 
TTTACTGCTCAATCTGGATrnXjGGGAACCTCCTCCCAAAAAGGCATTAGAAGGAAATGCCAAGC 
ACCGAAATTTTGTCAAGAAQCGGAGGCTCTrAGAACGGAOAGGCTTTCTOAGTAAAAAGAACCAA 
CCCCCTAGCAAGGCGCCrAAGTTGCACTCTGAACCTrCAAAGAAAGGGGAAACTCCTACGGTCGA 
TGGCACTTGGAAGACCCCTTCOTCCCAAAAAAGAAGACAGCrGCTTCCAGCAATGGGTCAGGAC 
AGCCCCTGGACAAGAAAGCTGCAGTGTCTTGGTTGACCCCTGCCCCTTCAAAAAAGGCTGATTCTG 
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TTGCTGCTAAAGTAGATITGCTGGGCKjAGrrCCAGAGTGCCCTTCCAAAGATCAATAGCCACCCAA 
CCOTTTCTCAAAANAAGAGCTCCCAAAANAATCCTCTAAAAAAGAACCATCCTTANAAGAA 
CACANNAACTCCACXrCAAGCTCATTCAOAANAATAAATGCCTCCGGAANCrrCCAAAANhr^ 
ACNGGAANAAGGN 

SEQ ID NO: 2469 AC l 'rJ-n r i - l -lU- i -l'i- l U-l Ti U- i - r riU U 'GGAAAAAACrmAACAATTrATTAGTC 
TTATTrrcCAOTAAAATATTCAAATAATGTCAAAANAATGAAATGATAGCGAT^^*AGCCAACTACC 
mAArrAATTCCACATAAATATTTAAAATbTTAAAAACCriNANATCAGCAGACCGAhrrCG^^ 
GATTCTTCAAAGCAAGTATTGCTTTACCOTGTCCTGAATGCAGNCCGTCATATNACCACTAAOT 
GCATGTNACCAAATGTTTGNATAGTGTTTTTTAAATTTGCTT^CC^ 
CAAGAAAAANAT 

SEQ ID NO: 2470 ACnMU"lU4M"l"l'rri'ri"i riUM"lCCNAAA0TGGTrTATTTGCAATTTACATGGATT 
CCrCTACTTTGTGAGGTTTAGTATAATTGAAATGTATTGCAATrrrCAGGAACArrACCTACAAATT 
GATATTAGGTGGTCTACAGCTTGTAArrGATGTGCTrCATGATAAANATGGCAGTAATGATGTTTT 
AAATATTCTAGNGCTCACTGGATITCAriTITGCAGGCAATTrGGANCTCCTCTGA 
ATATTCAATTACAGAOTCTGATAAATAGGATTCCCGCCACCCCACCCCTTCTAATCACTGCANAGT 
ArrAATAGNGCTTrcrrATOGCTOArrTCTrOCAGGGAGCAAANATAANATrATAGGCAANTT^ 
irCAAAACCTCAGCAGCATCTCCTTCTGGTAGCTATmAAATATTTTCnTCCA 
AATGACACNACCTTGCCATCCACTACITCTTGCACGACTGTCITAATCrrCCTGGri-I'l 
CTCTCrCTCCAGGGGGCTTAACCTGATATTCTGTAGrrrrTNCGTCTTCTCCTTCCAN 
GTAAGTANCAATTrCCTGTTCXGTCNAGTCriTATGTNAANAAAGATrGGTATTCGTTh^ 

SEQ ID NO: 247 1 ACll l'l-l l l'lU riU ri"l-riM"nUTriTlU4AQGATAAATACTATGCTTTAATGAN 
CCCCTTAAATANAAATTCCACTACAAAAATACANAGGAGATAGGGTGnrCCTGTATCCGCCTCAT 
TCCCATANAAAACTATAAGGGAANAAATANAACTTGGAATTAAANCAGCAGCAAGGCGAGGTGA 
NAATGCNATTTCTAGGCCATCTTGTTGGGACTGATGAACAGCATCTNTGATCTCATGAT^ 
CTGGTTATCCAAAAGGGATGGGATTGGCCTAAAAAAACCGATCAATITCNGGATTGGTTTTGT^^ 
^^AAT(XT^CAAATGACAAGAAGCAAAAAT^^mGTANAAAC^AAG<::AAGACAANAGTO 
GTTAACACCACrm-CTGCAGCNAGAAACTCAGACCmrrrAGTGAACTTAANGGCT 
AAGCTGANGCTGCAAGTGCANATCAAAAAAAAAAANGATTTTACVVGACCCTCNTCTACCCC^ 
CCCCCCGNNTACCTGCCCGGGNGGGNCGTTCNAAAGGGC 

SEQ ID NO: 2472 ACn"n"n'l-n"rn"l'l'n'n-I'CTGGGACTCTGGAGATGCCGAAGCACACGCCTT 
CAAGAGTCCCAGCAAAGAAAATAAAAAGAAAGACAAAGAT^a'GCTTGAAGATAAGTTTAAAAGC 
AATAATTTANAOAGAGAGCAGGAGCAGCTTGACCGCATCGTGAAGGAATCTGGAGGAAAGCTGA 
CCAGGCGGCTTGTGAACAGTCAGTGCGAATTTGAAAGAAGAAAACCAGATGGAACAACGACGTT 
GGGACTTCTCCATCCTGTOGATOCCArroTAGGAGAGCCAGGCTACTGCCCTGTGAGACTO 
GACAACTGGAAGACTTCAGTCTGGAGTGAATACnTGCAGGGGTTCAAAGAGGATAAAAGGAACA 
AAGTCACTCCAGTGTTATATTTGAATTATGGGCCCTACAGTTCTTATGCACCGCATTATGACrCCAC 
ATTTGCAAATATCAGCAAGGATGATTCTGATTTAATCTATTCAACCTATGGGGAAGACTCTGATCT 
TCCAAGTNGATITCAGCATCCATGAGTTTTTGGCCACGTGCCAAGATrATCCCGTATGTCATGGGC 
AGATAGNTTACTGGATGTTTrAACAAAAGGGGGGGCNTTCCAQGACCCTACAAAGANATGGGAGA 
TGTCNTTCCTGAA 

SEQ ID NO: 2473 ACTGAAGAACATTCCCATGGATATCGGTTAACTTGCCTCCCACAGCATGTAA 
AATAACTTCTGGAGCACA/.OTATCCCACTTCTTACAACCAGGACTTGCAAATACATAAGCAGAGG 
CTTTGCCTTCAATCAGCTGAATAATCTrATrrcCTGCTCCTCCrACrCXjCAG 

CATAGCAGCAACACAGTCAGTAACCAACTTQTTGCTATGGQATCGAGTAGTrGTGATAATGTGTrr 

CCCAGCAGGGACrrCTTTCAGCTGAAACCCAAAGGCGCCTAAACCTAAAACTCCCCAGATTGTCCT 

CCCCAACACAGCATCTGGTCCTGCCTCATAGTTGTAATATGGCTGGTTAATAACTCCTGCTATGGC 

TTTTCCTTCATAAGCAATTCCAATAAGAACTGTTACATTOTCAAGAAGACC^ 

GTTCCATCCAGAGGATCAACCCAGACCACGAGATCTrCTTCTTTAATAGCACTGT 

SEQ ID NO: 2474 ACGCGGGCAACATGACTGTCCnTAAACTCCAGTGGCTGGCCAGGCACGGTA 
GCTCACGCCTGTAATCCCAACACTTTGGGAGGCCGAGGCAGGTGGATCACCTGAGGTCAGAAGTT 
CAAGACCANCCTGGCCAACATGGGGAAACCCTGTCTTTACTAAAAATATAAAAATTAGCTGGGTG 
TGGTGOCNO 

SEQ ID NO: 2475 ACinUnU"riM"riM1-I4"inM'riM'rjNGGriUU'l"lUU lU'lCCACACCTGCCCrTTATT 
GGTCrCTTCTAGCANAGTGGCTCCAGGCCCTrCACGCCTOTCANACACCACCCATGAGGGTrrAGG 
AAGGTGCCATCATTCTGTGAAGGCCCAAAGOTACX:CAAGTCrTGGAGCOCAAGTTGAATCACCA 
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ACCANAGGGTrGGGANAGGAAAAGGAAACA(KjCAGAGGGGAAAGGCAAGGCTCrrGCAGTGAAG 
GGGACTGATATCAAGGGA.\TGCTGAGGTCCAGCAGTGTCTCXrrGAAGGCATGCTGCATCCTAAGG 
CTCCTCAGGACTGGATGGAGTAGGAGATCTGGGTGTraACCTVNrrCACATNTATATGGCAACT^ 
AGGAGGCGCTTGATGTCAGGCTCAATGTTGATGGTTGGGAANGTGCAA 

SEQ ID NO: 247 6 AC rin'l-rriU'lll-^U'iU 1 lUU'l'l-llMnNGGNTTCAAAAOOGTGTTTACTArrTGG 
CCAAACAATATTTTTTAATTGTCAGTCATAAAGTGAAATACATACTAAAATATATATTAAATAT^ 
ACCAAATCTGCATTGCTGCTACATGAAAACATTTTrrcGTCTGTTGGAAAAT^ 
CATTGTTGOOCTTTGTCAATCATTTTCCTCACCATCAAATCACCCTAAGTGACTTGGGAGTGTGAAT 
CTAGGATGTTCAArrrTANACCAATTTTCTCTATCrrcTAAATO^ 

AAAAGGTANAAAAATAACCATGGTGTGCTAATTITTTTCAAGaTATACCATATGGAAAAGT^ 

GCTGAACACAAAGGAAGTCnTTrCTGAATGGCTCTCGATOVCACATAAGGAACATATGTTTrCCAG 

TTAATCTGCCTTGATGT 

SEQ ID NO: 2477 ACC GGACCCTGCAGCCGCAGAGATGTTQATGCCTAAGAAGAACCGGATTGCC 
ATTTATGAACTCCTTTTTAAGGAGGGAGTCATGGTGGCCAAGAAGGATGTCCACATGCCTAAGCA 
CCCGGAGCTGGCAGACAAGAATGTGCCCAACCTTCATGTCATGAAGGCCATGCAGTCTCTCAAGT 
CCrAAGGCTACOTGAAGGAACAGTTCGCCTGNAGACATITNTACrGGTACTATITT^^ 

SEQ ID NO: 2478 ACGGGTATCACnrCCGGAGCTGGTGAAGATCATCAACGACAATGCCACATA 
CTGCCGTCTTGCCCAGTTTATTGGAAACCGAAGGGAACTOAATGAGGACAAGCTGOAOAAGCTl^ 
AOGAGCTGACAATGGATGGGGCCAAGGCTAAGGCTATTCTGGATGCCTCACGGTCCTCCATGGGC 
ATGGACATATCTGCCATTGACTTGATAAACATCGAGAGCTTCTCCAGTCGTGTGGTGTCTTTATCT 
GAATACCXjCCAOAGCCTACACAOTACCTGCGCTCCAAGATGAGCCAAGTAGCCCCCAGCCTGTC 
AGCCCTAATTGGGGAAGCGGTAGGTGCACGTCTCATCGCACATGCTGGCAGCCTCACCAACCTGG 
CCAAGTATCCAGCATCCACAGTGCAGATCCrrGGOOCTOAAAAGGCCCTGTrCANAACCCTGAAN 
ACAAGOGGTAACACTCCAAAATATGGACTCATTTTCCACTCCACClTCATTGGCCGAGCANCrGCC 
AAGAACAAAGGCCGCATm-CCCGATCCraCAAACAAATGCATTNTTQCCTNACGAATTCOA 
TT IC^CT OAGGTGCCCACGAATGT^mCNGGGGANAAGCTTCNAAAACAAG^ 
TCCTTTNTATGAA 

SEQ ID NO: 2479 ACAGGAGATCTCATTTGGGACAACrAAGGATAAAATOCTGGTCATCOAGCAG 
TGTAAGAACTCCAGAOCTGTAACCATTTTTATTAGAGGAGGAAATAAGATGATCATTGAGGAGGC 
GAAACGATCXCrrCACGATGCTTTGTGTGTCATCCGGAACCTCATCCGCGATAATCGTGTGGTGTA 
TGGAGGAGGGGCTGCTGAGATATCCTGTGCCCTGGCAGTTAGCCAAGAGGCGGATAAGTGCCCCA 
CCTTAGAACAGTATGCCATGAGAGCGnrGCCGACGCACTGGAGGTCATCCCCATGGCCCTCTCTG 
AAAACAGTGGCATGAATCCCATCCAGACTATGACCGAAOTCCGAOCCAGACAGGTGAAGGAGAT 
GAACCCTGCTCTTGGCATCGACTGTTTGCACAAGGGGACAAATGATATGAAGCAACAGCATGTC 
TANAAACCTTGATTGGCAAAAAGCAACAGATATCTCTTGCAACACAAATGGTTAOAATGATT^ 
AAGATTGATQACATTCGTAAGCCKjGAGAATCTGAAGAATGAAACATTGAGAAACTATGTAGCAA 
GATCCACTTCTGTGATTAAGTAAATGGATGTCTCGTGATGCGTCTACAGTTATTTATTGTACATCCT 
TTTCCAGACCCT 

seq id no: 2480 accataggarmgoaagatggtatcatcaarrcttctagttagtgatggtgt 
tttctcagtagttcttaccagacactcctcaagtgaatgagttaaatgaatattgtttatatattct 
tgtcccttctghrrctaacncatamgcaccctggatacxiattrancatgttgttcccaagggtc^ 
cactgggcttkcaatcanatanccctcctnctcacrgaangcnctctactatanct^ 
aatta^aat^^^ccacgct^^tggcaanctgcttcnattgatatc^ 

seq id no: 248 1 acgcgggqtctntctcgggacgggagaggccgtgtagcgtcgccgtractc 
cxjaggagataccagtcggtagaggagaagtcxjaggttagagggaactgggaggcacntgctgt 

CTGCAATCGAGGTTGAGGGTGCAAAAATGCAGAGTAATAAAACTTTTAACTIXjQAOAAGCAAAAC 

catactccaagaaagcatcatcaacatcaccaccagcagcagcaccaccagcagcaacagcagc 

agccgccaccaccgccaatacctgcaaatgggcaacaggccagcagccaaaatgaaggcttgact 

attgacctoaaoaatmagaaaaccaggagagaagaccttcacccaacgaagccgtctrntgt 

GGGAAATCTTCCTCCCGACATCACTGAGGAAGAAATGAGGAAACTATTTGAGAAATATGGAAAGG 

caggcgaagtcttcattcataaggataaaggamgggctttatccqtttgoaaacccgaccctan 
cggagattgccaaagtggagctggacatatgccactcotggaaaagcanctgcggtgtgccctt 
rmctgccatagtgcatcccttanagttcngaaaccrrccttnagtatgtgtccancnaj^ 

TGGAAAAAACCCTTTT 
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SEQ ID NO: 2482 ACGCGGGGITCTCCCCAGATGACAAATACTCrCGACACCGAATCACCATCAA 
GAAACGCTTCAAGGTGCTCATGACCX:AGCAACCGCGCCCrGTCCTCrGAGGGTCCCrrAAAC^ 
GTCTmCTGCCACCTGTTACCCCTCGGAGACnx:CGTAACX;AAACTCTTCGGACTGTGAG 
TGOCTTTn'GCCAGCCATACrcmGGCATCCAGTCrcnx:GTGGCGATrGATTATGCrr 
CAATCATGGTCGCATCACCCATAAANGGAACACATT^nsrACTTTT^^^TTC^ 
ACAAGATTATTANAGAT 

SEQ ID NO: 2483 AC l 'J- J UU UU rri " ) - rJ ' ril ' i ' ri lN GGAATCCTAACTCTATTAATAGTGTTGATACA 
TTTCTTCTAAATTTGCTGCCITGCCTCAGTANATCITrACT^ 

ATTTAGGTGATGACTCTTCCAACAAACAATCTGTAATCCCCACATTATTANAATTGGGTAATACAT 
GTTTTrrGGCTTTTrGGAACTCTCCTGAATTGTANATAATACCCCTACATTCCAAACA 
TTGCTCCCATGACCGTAATGAGGAGCAGGCTAAATGTGTGCCACrGGAACCACAACACTGACAGC 
GCCrrATTTCCCATTTGCTATCAGGTCCATrATAGTCTCG(XOT 

ATCANAACGCTCATAGTGCTGCAAAAGCTCTTGATAAGCGTTTTCXrrCTNATTCCCAGGAAGCATC 

TTrrrCAGGAATTGAATTCCCATrCTCAACATCTCmcmAAAGATGTCA 

TACCTCGGG 

SEQ ID NO: 2484 AC I - i ' lTl l - I - l - lTl - lTi ' l ' n ' l ' lN GCCAAAGACAAACTANAGCAATGCCTATGTAA 
OAACAAGGACTCTCAAGATCCTGCTACACAGTATrCACAATCAAAAGGOCCCAAGATTCAGGACC 
ACCTAAAGACAACTGACAAAAAGTGTCANAGCCAGAGGCCAACCTNTGCTANATGAAGCAGCAG 
CACATGACTCTATTTCTATCTGAfAAGGAGACAGAGAAGAGGCATCTCGAACAGATGAAAAACCA 
AAGGCTGGTGTCCTAAAAAAAAACAGATTGGCTTCAAAGAAAACACTAAGGAAGACCCCAGAGC 
TGTATTAATTTTAGTAAAAATAATCATATGCCAACAGGGGAATTGAACCACTTTCTAAATC^ 
ATGAACTCATCrCTTCANATCTTGGTAAGTGGTCAAAGCTNOTTTTTATAATrAC m 
GGGCAAAAAGTCTITCTrATCnTGGTCCTTANGTGTGGTATCAGTTrcrrT 
CAAAAAAATCnTnrrrACTITGACATCACAACCAAGGNGCAGTm'AAACACN^ 
TGNTGGTTmATACANATAAAATAACCACATTKrCCCATACATTTTTATAGGCT^ 
NTAAAA 



SEQ ID NO: 2485 ACCAACCrGGCTACrGGAATCCCCAGTAGTAAAGTGAAATATrCAAGGCTCr 
CCAGCACAGACGATGGCTACATTGACCrTCAGTTTAAGAAAACCCCTCCAAAGATCCCrrATAAG 
GCCATCGCGCTrGCCACTGTGCTGTTTTTGATTGGCGCCTTTCTCATTATTATAGC^^ 
TGTCAGGCTACATCAGCAAAGGGGGGGCAGACCGGGCCGTTCCAGTGCTGATCATTGGCATTCTG 
GTGTTCCTACCCGGATTTTNCCAC^^'GCGC^^^^^m'ACTATGCATCCA^ 

SEQ ID NO: 2486 GGTACTTATATAAAATCTAGTCCAGTrCTCrCATTTAANAAAATGAAGACACT 
GAAATACAGACTTAAATAGCrCAGATAGCTAATTAGGAAATTTCAAGrrGGCCAATAATAGCATT 
CTCTCTGACATTTAAAAATAATTTCTATTCAAAATACATGCATAATTGATTTTACACCTC^^ 
GTGGATAATTTATGTOATGTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAOAATG 
ATGGAATGTTTTAGAATAAACTCCrGCTTATAGTATACTACACAG TTCAA AAGATGTITAAAATGC 
TTrroTATTTACrGCCATGTAArrGAAATATATAGATTATTONAACCTTTCAACCTGAA^ 
CAGTATGAGAGTTTAGTTATITGNATGNGGCACTAGTGGCTAATGAAGCnTrr/^ 
TCirCrnAAAAATATTATTAATGNGNATGGGATATNACAATrCACTrAATTrCCC^ 
GGGNGGNACCATGGTTTCCCAATTTTGAANGGNGGGGGnrTAACCTTTAAT 

SEQ ID NO: 2487 AC VVmrVl ' in i il l n 1 1 1 H I i GAGCTGAGACCAGGAGAAATAACnTATT 
TGAATAGGACCCAAGACAGCATATTGGGCrAAGGAGGAGAGGTAAGGTTCCAAAACCGCAGTCA 
AAGCTCATCAACCAAATGGACTCTACnTCCCAGCAACCrTGCAGTTAGTGC^ 
CTGCTGGGGAATGTATTTGCCACTAAATTCCCCAAGTATGCCAACATTACAAAAAAGATAGGTTT^ 
TCATCATAATTOAATTTCCACAAACCTCCCCAATCACAAGTATTATAAOTGGAAOTAAAAAATCAC 
ATTTTACAGATCTCAAACTTGTCTrCAACATTTAGTTCATCATCTTCA^^ 
AATTCATTANCTATATGATCTAACCAAGCAGCAACAAAGATGGCCAGGCCATGGCAATCCTNTTC 
CATTTCrCTACCACTNAGGGCTTAACAACANGQTGAGGCTTAAGTNGANGTANGGGGGTGGGGGG 
CAAAAGGGrrANTTrNCCCCATNCCrrGGGCGGNACCCCCCTAAGGGGAATT 

SEQ ID NO: 2488 GGTAC]- lM T J - lU - iM Ml'J-ll'r riU - in - ri - i - iU ' ] 'GGNTCCANNGATCCTTTACTGAGA 
TCCACITGAAACACTrCGGTCCTrAACTTGTTAACTGAGTTGACAGGCTGATGGCTGATCTAGGTA 
AAGGmCACGGTAGCAAAAACACTTCCTGCCTATAATACATCAAAACAGG'ITCCTGAATGTGTGT 
GAATATCATCAAGGNCrrCTGTCCCATCTOm€AGGGTTCGATTAAGCTCCrCAAATTTGT^ 
GACGGATGAAGAAATOXCATAACGCCIXKjCGACCTTGGTGTCCGCGATGTGGATCATGTCOT 
CAATGTAGAAGTCAACCTCTGAGGCTGACTQAAATTCACCATCCAGGGCTGAAAAACCCXiCTGGC 
CCCACGAGGGTCmATCTTGTGTITGAAGACAANAAGCTGGATCCCGAACITCCTGCTCT 
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GGCTAAGAAGCX:ACCCNCTTNGNCCX:AGGCATOGGNGGG>nTAAACTTT>JAGAAACC^ 
TTAAAACTTGGCCITNTGNTmCCTTCCCGGNNGGCCTTCN 

SEQ ID NO: 2489 ACAGGCCCnTGATGGCTTGGGTTACAGACAACCTCATAGCTGGTGCACCAC 
ACACACGAGATAAAACAGGAAGCCTAAAAACCCCAAGCCACACCAAGAAAAATGAGAGAGGGG 
AGGGCGGGGTAACAATGCAGCATCCCGCGGAGGGAACTTAATGCACAAGGAGGGAGAACAGAGG 
GTGGAAGGCAAGCCAGCTTCGTCTrCXjCCGCCGCAGCTGCTGTGTGGTGGTCAGGGGACTGAGTT 
CCCCGCGTACGCGGGCAAGTTTAGAACCCrrAGTGTTmCAGGGGGTAACTAAATTTAAAAAGCCT 
TAAATGGGCTGGGTGTGGTGGCTTCACACCroTAATTCCANCACTTTTGGGAGO GCCAAG GTGGG 
CCTGATCACCAAA>rrCATGAAGTTCAAGAACACCTNGCCAACATANTGAAACCCTrT™^ 
AAAATNCANAATAAAAAAAATTNTCCCNGTrGTTGTGGCNGGNNNCTrrAm^ 
NAGGCCNhfNGCTGOAAATTTATTTTAANCCNGAAGTGGNGmrrm 
N 

SEQ ID NO: 2490 GGTACCTCTTGGAAAACCTCAATGCAAGATAGGGTTTCAGTGCTGGCATATTT 
TGOAATTCTGCACATTCATGGAGTGCAATAATACTGTATAGCTTTCCCCACCT CCCACA AAGTCAC 
CCAGTTAATGTGTGTGTGTGTITTTTTTTAAGGTAAACATTACTACTTGTAACT^^ 
TATTTGAAAAAGTAGAATrGAGTTACAATrTGA'l"IllU"ll"iCCAAAGATGTCTGTTAAATCTOTra^ 
GCTTTTATATGAATATTrGTTTTTTATAGTTTAAAATTGATCCTTTG 

CAAATACTTTATAAGAAGTITATCAGACATCTCTAATrrGGCCATGTCAGTTTATACAGTTTCAAA 
ATATAGCANATGCNAGATTATGGGGGAAATCCrATATTCANAAGACCTTGCCCGGCGGNCGNTCN 
AANGGCCNAmTCANCCCNCTGGNGGCX:GGTTCTAANGGGANCCGAGCNCNGGNACCAACCTGG 
NNNAANANTGGCANNANTGNNNCrmGGGGAAATNGGNATCCCC 

SEQ ID NO: 249 1 GGTACAACTGATTTTTrAATGGAAATATArTANNATGACCTGTATAAATTATG 
AGATTCAAAACAGTGGCGCCACTATACTGCTAAACCTATGCATGAAGGTAGTGACTAGGATGGAA 
ATCTGTCAOTGCrACAAAAATATGTATGAACAAAATAATmCACCCnTGATAAAGCTACAAGAT 
ATAAAATTTAGAATACTTATATAATTTCATACTAGATATGTGAAAAATATGCCATGCTAGAACCAT 
CTTGTTCCAAAGTTTGAAACATATTCTGTCAAAAATACTCTrCGTACTTGATGAACCTGmC^ 
TTCTCGTCCrCCCTGAAAAGGGTTACXCCAAGCAGTCrGGCTATCAGTGCAGTGGGACCTCCATh^ 
ATGTTGGGTTATGTAGGTGGATGGATGAAACTAGGTGGGCCTATGTTCCCATCTITmCTCTCTTT 
TAAGNAGAAGACANGATTTNCCANGOGAGNNQGGGGCTACANAGTNCCNC^^^TCT^^ 
GGNNANNACTTTTNGCCAANTrGGCCNAATCTCrmGGTrAACCCT 



SEQ ID NO: 2492 GGGA LTi ' l lU H 1 14MlU ' lU ' lli OTA(>rGATACAATTGGCTTTTATTTGC 
GArrCATGAGTCAGGGCAGmCCATTCTGCAAAATATAGNGATAGCrCCTACTGGGCAATACAAC 
AGTANAACAGNGGGTTITGTAAAATGGGAATCCAGGAACAGAANAATATAAATAAATTGATTTAA 
ATAAACTGArrGGNTAATrrCANAATACTTCATAlTACriTmCTAAGAGTTAAAGCANAAA^^ 
CTTTCTTACTGNGCTGACTCANACAGCCTGGACrCTCATGTTTTTAGGAAAATm 
GATCTACCTGCTTCCTCATGTTTCAGGGGGGAGNATATGGCATTTAACATGACTGGCTCCAh^ 
GGAGNCACCmGCCTGNCNCXCTAAATGAGAGTTTGACTAANCATANGGCNTTAA^nsrCTO 
NGGTCCCNCmTTTrGNACTTCCCTCCTTCCNAGGNTATGANGCCCCCNNACCNTACCKr^ 
GNCnrrnGGCTTNCCCCCCNAAAACAAANrrNCACNTTGCC 

SEQ ID NO: 2493 A c n i ' iTrnu ' iwM ' rinM i ^lM ^ lUMM M ^ l ' ^ ^•AcT^MT^Tm TnT ^^^^ n iu 

ri mri l nnil A NGGGGCCAATTTTAAANAGTmATTTAAAANATTGCATTTTCCACTTACAA 

NACAGNGTTTATAAAGGGCAATGTTATTTCCTTCCCCTGGGCATATGTTCCATArrCAAGTATTQA 

NAATGCCCAGTAACITACTATAGCAGCTTAACTrTTTAAAACTGCCACAAAAriTGCTAC^ 

AGGNCCTTCAAATGTmAAATGGGNGGAACAATGCTACATOTACAChTIXjGNNG^ 

CnmAAATGGNGGGCCCTCGGGAAGCNCCNCCAGAGGGNGGNGCTCCtXNCCAGGA^ 

CAGGCATTTCTCTGGGATGCCTCTGGACTTTGGGNACCTmGGCX^GGGA 

TCCANCACACTGGCGGGCCTfTACTAAGGGGAACXWACCNNGGNCCAAACTTGGC^ 

GNAAAACNGNTNCCTGGGGGAAAAmrNANCC 

SEQ ID NO: 2494 GGTACACTrGAAACCAAATrrCTAAAACTTG'rri"l'l'CrTAAAAAATAGTTGTT 
GTAACATTAAACCATAACCTAATCAOTGTOTTCACTATGCTTCCACACT AGCCAGTCTTCT C^^ 
TTCTTCTGGrrTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTnrm 
CAGTCnCAGCAACTTGAGAGCTTTCTTCATOTTGTCAAGCAACAGAGCTGTATCTGCAGGTTCW^ 
AAGCATAGAGACGATTTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACA 
AACATAATCCTGGGGACATACTGGCCATCANGAGAAAGGTGTTTGTCAGTTGTTT CATAAACC AG 
ATTGAGGAOGACAAACTOCTCTrGCCAATTrcrrXjGATTTCTTTATTT^ 

GCTTGACTGT^GTGGGCWCTCATCCAAGNGATCAATCCCCGTANCTGGCCCGGCGGGCGGrrc;^ 
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AGOGGGAATTCCANCCACmGGNGGCCGTCT/u\.TCGATCCAACCCGGGNCCAACTT^ 

SEQ ID NO: 2495 A CTJ JTl 1 J J U 1 1 1 1 i 1 1 ] 1 i 1 1 li n GAGTGTGGGTTTAGTAATGGGGTTTGT 
GGGG'l-rilUriCTAAG CCTTC TCCTATITATGGGGGTTTAGTATTG ATrGTTA GCGGTGTGGTCGGG 
TGTGTTATTATTCTGAATTTTGGGGGAGOTTATATGGGTTTAATAGTTTTTTTAATTT 
GAATGATGGTTGTCTITGGATATACTACAGCGATGGCTATTGAGGAGTATCCTGAGGCATGGGGOT 
CAGGGGTTGAQGTCTTGGTOAGTGTTrrAKrGGGGTTAGCGATCGAGGTAGGATrGGTGCTGTGG 
GTGAAAGAGTATGATGGGGTGGTGGTTGTGGTAAACTITAATAGTGTAGGAAGCTGAATAATTTA 
TGAAAGGAAAAGGGTCANGGTTGATTCCNGGANGAACCCTATTGGTGCCGGGGCNTTGTATGAAT 
ATNGGCCTTNATTAANTATAAhrTACCTGGTTGAAAATGTTGGTTGGGGTATATAATGTAATTGAAA 
TGCn'CCX}GGGNATAGGTATITGAATAGGAATAGGmTGGATAATGGGAANAAAAAGAAGGAANT 
AAATTAAT^m:CCTT^XKK}TTAAGGGNNANCCCNNTN 
CTGNGGC^^^^AATGNANCA^^MTG^INCAACT^GGAAANTGNCTAACN 

SEQ ID NO: 2496 ACATrCTAGCTGAGAAGCAATGGGTCACTCATTAATGAATCACATTITmAT 
GCTOTGAAATATTCAOAAATTCrCCAGGATmAArrTCAGGAAAATGTATTGATTCAAC^ 
AGAAACITTCTGGTGCTGTCTrTTGTTCTCTGAATT^ 

GGTGACTGTGTAACmCTCTTAAOATTAATTITCTCTTTGTATGTCTGTTACC^^ 

AATACATGCAACAGAAGTGACTTCTGGAGAAAGCTCATGGCTGTGTCCACTGCAATTGGTGGTAA 

CAGTGGTAGAGTCATGTGTGCACTTGGCAAAAAGAATCCCAATGTTTGACAAAACACAGCCAAGG 

GGATAmACTGCTCTTTATTGCAGAATGTGGGTATTGGAGTGTGATTTGAATGA 11 CATTTGG 

CT^ANGGCAAGAA^mATGCCAAAAGNTCITCAT^^GAGNTTTO 

GATCTGArATGGTTTCCCTCANGGANCCCNNCTGGGAAAAAAAAAACCClT>^^ 

CAA 

SEQ ID NO: 2497 ACAGCCAGCAAAGGGCGCTATATrCCTCCTCATTTAAGGAACCGAGAAGCTA 
CTAAAGGTTTCrACGATAAAGACAOTTCAGOOTGGAGrrCTAGCAAAGATAAGGATGCGTATAGC 
AGTTTTGGATCTCGTAGTGATTCAAGAGGGAAGTCTAGCTTCTTCAGTGATCGTGGAAGTGGATCA 
AGGGGAAGGTTTXiATGATOGTGGACGGAGTGATTACGATGGCArrGGCAGCCGTGGTGACAGAAG 
TGGCITTGGCAAATTTGAACXjTGGTGGAAACAGTCGCrGGTGTGACAAATCAGATGAAGATGATT 
GGTC AAAAC CACrCCCACCAAGTGAACGCTTGGAACAGGAACTCirrrCTGGAGGCA^ 
ATTAATTTTGAGAAATACGATGACATTCCAGTTGAGGCAACAGGCAACAACTGTCCrTCACATAT^ 
GAAAGGTTCAGTGATGTTGANATGGGAGAAATTATCATGGGAAACATTGAGCrTACTCGTATCTC 
GCCAATTCAGTGCAAAACmrCCArmCTATTATCAAAAAAAAANhfhWl^ 
GCCGGACACCTTANGGCNArrCACAAATGGGGGCCGTCTANNGGNTCCACTNGGNCCAACTTGNG 
NAACTGGCAAACTGOTC>JGG>n^AATGmcmCAATCCCAATTTNGCOGACTAN>^ 

SEQ ID NO: 2498 CGAGGTACTGGCTGGQATGQCTCTGATATAGCAGCCTTGGTGTAGTTTCTGCA 
mCGGGAAGAGTGACTGGACrcGATIXnTCTAGCTCCTTCAATCCCATITrC^ 
TAAGTATAAGACCTGCTCrCTTCCTGAAGACCTATAAGCTCGAGGTGGACAACrCAATGTAAATTT 
CAAGGAAAAACCCTCATGCCTQAGATGTGGGCCACrCAGAGCTAACCAAAATGTTCAACACCATA 
ACTAGAGACACTCAAAlTGCCAACCAGGACAAGAAGTTGATGACTraVTGCTGTGGACAGTITIT 
CCAAGATGTCCCAAGCCTCATCGTGACGAOGCrCTTATCCCACTCCATTTTTCCTGCTCATGCCTGC 
CTCTTTAATTTGGTAAGATAATGCTGNAACTAGAATTTCACAATCAGCGCCTrGTGCAGGTAAr^ 
GCAGAATGGTGGATGTGCATGNCATCATGTCAAACCCAATATTTGACTAANGGATCCTTATTCTNG 
CCAATG^^ITACT^TACACATCC^AATACACTGGTTATTCAATCCCGGGGNCC^X;GTAAAGTANA 
TTAANTNACTGGTNTACNCCTGGTTNAATTANCCACTrrGGATCCCANAAANAAWGCCACT^ 
NAA>WGGNGAAmGCNATGTGAACCriTrcGTGCCANAANAAACCGGGGGCNGCCG 

SEQ ID NO: 2499 GGTACACTTGAAACCAAATrTCTAAAAC 1 1 G i 1 1 T 1 C n AAAAAATAGTTGTT 
GTAACArrAAACCATAACCTAATCAGTGTGTrCACTATGCTTCCACACTAGCCAGTCTTCrCACAC 
TTCTTCTGGTITCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTTT^^ 
CAGTCTTCAGCAACTrGAGAGCTrrcrrCATGTTGTCAAGCAACAGAGCTGTATCTGCAGGTTC 
AAGCATAGAGACGGTTTGAATATCITCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACA 
AACATAATCCTGGGGACATACTGGCCATCANGAGAAAGGNGTTTGTCAAGTTGTTTCATAAACCA 
GATrGAGGAGOACAAACTGCTCTGNCNATTrCTGGATrCTITATTTCAACAAACACrrrC^ 
ACTTGACTGNGTGGGCCCTCATCCAAGNGATGAATAATCATCAAGGGGTTTGGTGGTTG^ 
GGATTATATAAAACTTCTTCTATTGTTGAGGCCCAAAGAGTTGGCCNCCCCACCCCTGGGGANGGG 
GTNGGGN 

SEQ ID NO: 2500 0GTACn"lM"inU lUU-lU"lllUU41GTAGATACAATTGGCTrTTATrTGTGATTCA 
TGAGTCAGGGCAGTTTCCATTCTGCAAAATATAGTGATAGCTCCTACrGGGCAATACAACAGTAG 
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AACAAGTGGGXITTGTAAAATGGGAATCCCGGAACNGAAGAATATAAATAAATTGATTTAAATAA 

ACTGATTGGTTAATTTCAGAATACTTCATATTACnTITITCTAAGAGT^ 

TTACTGTGCTGACTCAGACAGCCTGGACTCTCATGTTTTTAGGAAAATTITGT^ 

ACCTGCirCCTCATGTTTCAGTGTGAGTATATGGCATTTAGCATGACrGGCTCCATrCTGGAGTCAC 

CAGGCTGTCACCTAAATGAGAGTTGACTAAACATAAGGCATTAACACTCTGGCAGGTACCCATCA 

KITTTGGACTTCCATCATTCATAAGGTATGAAGNCCa^CTAATCATACAATITAT^ 

NCATTCCCCCCCAAAANAANAAATTTGACATTTGCCNAAAGGGTGGT^CCAACC^r^ 

T 

SEQ ID NO: 250 1 ACCTGAAAAATAAAGAGAAAAAAGCACAGANGCCAGTGAAAGAAGGACAG 
CCQAGT(XAGCAOATGAGAAGGGGAATGACAGTGATGGGGAAGGAGAATCTGATGATCCTGAAA 
AAAAGAAACTACAGAATCAACTTCAAGGTGCCATTGTTATAGAACGACCAAATGTGAAATGGAGT 
GACGTTGCTGGACTTGAAGGAGCCAAAGAAGCACTNAAAGAGGCTGTGATACTGCCTATTAAATT 
TCCTCATCTTTTTACAGGCAAGAGAACACCTrGGAGGGGAATarrATTATTTGGGCCGCCTC 
AGGAAAGTCCTACrTATTCAAAGCTGTTGCACCAGAAGCCAACAGCCTCANCA'l'l'ri'l'rrCAATAT 
CTTCCTCTTGATCTTGTTTCTAAAKTGGCTAGGGTGAAAGTNGAAAACTGGT^ 
CAACTTGGCCAGAANAGAACAAGGCCTTTC^TTIT^^^TTATTGGATGAAAAAT^ 
TTTGGGTTChWAAATNGAAAATTAAATTTGANNCCNCTCCmn'AAAT^ 
TGCCAANTNCAANGNGGrrNGGGTT 

SEQ ID NO: 2502 GGTACTATAGAGACTCAGTTGCAAAA ATTAA CAAATATGCrGCTrGArTAAA 
ATGGGTAGGCTTCTCATGTGGCrCATTCmAATCTATTCTCTTTTAm 

TCTGCCTATGGATCATACTTCAAACTCTTGGTGTGATCCTCCTGATTGTCACAATATTAGrrACCCT 

GGTGTGCTGTATTCTCTAAAACCTrrAAATGTTTGCATGCAGCX;ATTCGTC:AA^ 

CTCmGGCTGGAATGACAAAAACrCAAATAAATGTATGATTAGGAGGACATCATAACCTATC 

TGATGGAAGTCCAAAATGATGGTAACTGACAGTAGTGGTAATGCXnTATGTTTAACTCAAACTCTC 

ATTTAGOTGACAGCCTGGTGACTCCAGAATGGGACCCAGTCATGCTAAATGCCATATACTACACTG 

GAACATGAAGAAACNAGGAGATCCAGAACAGACCAAATnTCTAAAAACNTGAGAAT(XAGCTG 

NCTNAGTCAGCiXAGTAGAAAGNCCrrCTGNTTTACTCTTANAAAAAAGTATN^ 

ATTAACCATNCGrrATTTAAATCAATTAATTANATCTCNGGTCCNGGATKCANTTACNAAACCm 

GTTAACGGTGmTGCCNATGGACTTTCnTrmGCCAAAGGAATKGCCT 

SEQ ID NO: 2503 ACCCACrGTATrTATTTATOTGCAACAAGAAGTOTCAGCAACrGCACAAACTC 
CTCCCTGTTCAGCTAGTAGGCAGCAATTCTGTTATCTGGCATCCCATAGCTGGGTTAAATTAAAAC 
AGGGCGTGAGAACAGGTGAGTCTAGAGGTCTAACTCTAACAGGGACCACCGTGCATTTOAATAAA 
CAGTTGTATTAGGGATGTTGTTAAAGTTAGCCACTGGGCAGAATAAAGGATCCCTTAGGTCAAAT 
AmGGGTTTGACATGATGACATGACACATCCAACACTCTGTCAAAlTACCTGCACAAGGCGCTGA 
rrGTGAAATTCrAGTTACAGTArrATCTTACCAAATTAAAGAGOCAGGCATGAGCAGGGAAAAAT 
GGAGTGGGATAAGAGC(nx;GTCACGATGAGGCTrGGGACATCTTGGGGAAAAACTGTCCACAGCA 
TGAAAGTCATCAACnTCrrGTCCTGGTTGGCAATTTGAGTGGCrCTAGTTATGGNGGTGAACATTT 
^^GG^^^AGCTTTGAGTGGNCCACAT^IT^AAGCATGANGG^rITITCOT 
CCCCTCCAC 

SEQ ID NO: 2504 GGTACrATAGAGACTCAGTTGCAAAAA TTAA CAAATATGCTGCTrGATTAAA 
ATGGGTAGGCTTCTCATGTGGCTCATTCTTTAATCTATTCTCTTTTAT^ 

tctgcctatggatcatacrrcaaacrcttggtgtcatcctcctgatrgtcaca^ 

gototgctgtatrctctaaaacctttaaatgtttgcatgcagccattcgtcaaatgtcaaat^ 

ctcirrggctggaatgacaaaaactcaaataaatgtatgattaggaggacatcataacctatgaa 

tgatggaagtccaaaatgatggtaactgacagtagtggtaatgccttatgtttaagtcaaactctc 

atttangtgacacctggtgactccagaatggaaccnggcatgctaaatgncctatctcacactgg 

aacatgaggaacaggtngatcccaaacagacx:aaaattttctaaaacatgaaaatccagctg^ 

gatcacncagnaaaaaagnccttttgcttacmrtaaaaaagtatitgan 

seq id no: 2505 ncnagcggccxicccngncnngnacgcgggcatgtgcncctatggcncaccc 
nctgcagaacaggangctccnngagcccttggtmctccncgaaaacccccaacarrao^aacc^ 
cx3^agtgtcctgnacrctgncctaaataatcccacccnca(ntcatgcrc^ 
ccaggcccggtgagoccaaggrrtcccagacgagcctctgcgoctctccactgnttnatgagccc 
aaacnccctcntggcacaacgct^n:accctgcagcr^tgganaac^•cc^ 
tgcangactrcnctgcngcctrtcnnangactctgcagata>n'gccnotgcaaa 

GGGCTAAATCCCAAACrTGTCTCTtiGACTG^Cn^AAGAAATAANGGCtiCCAACrTACrCACCCCCA 
TGGCCACANGGAAGCNCGGACCGGACCTTAATTTGGAAmTrTGGTTITGGGC^ 
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TTTTrnTGGNTTGATTAAGGANTTAAAAACNG 

SEQ ID NO: 2506 GGTA(XTGTTCCTCTGTTTTCAGATCCCCAAATTCT1GCATAAATGTTG^ 
GAAGGAAACCrrAAGGGTTCAAGAAAGTGNAGAAGACTAAATAGTKnrCAACTGTATmGGAG 
AGGGGTGCCAGTCAAAAGCACXnTGTGTTCCTACAAAAAGTGAGAACAArrCACATCAAACATC^ 
CAGTACAAAGGTCCCAGATGGGTCCATATGAAACAGCTGGGGTCCrrTCTCATCAACTCCTCCAAA 
TAATAATGCTACTCCAAAGGGACGAGACATGGCACCTGGATCTGCATCTTCrrCTCCAAACTGC^ 
AGCCAGATTGGACACAGCTTGGGTCACACrcrCCACTGTCATTGTCTCATTGTAGGTGAACCAAGT 
GGrrCTGTGTCTCCACTCTQacmATCAATTAAAGTCrn"AACATCAGCAATTAACCCACTCATGGC 
CAACCTATGTGAGCATCAATCTCTACAATrrrCTTCAATCCTGCTGGGCTTCATCA>n'G 
AATrCTmrrrrcACAAGCTANGCNCACANCCrrTGATGTCTGGNANCCCAAG 

SEQ ID NO: 2507 A ClU - ril - rrr AAGCAATGGGTCACGGNATTAATGAATCACATTTTTTTATGCTC 
TTGAAATATTCAGAAATTCTNCAGGATTTrAATrTNAGGAAAATGTATTGATO 
AACTTTITGGTGCTGTCTmGTTCrCTGAATmCAGAGACrrrrrr 
GACTGTGTAACTTTCTCTTAAGAATAATTTTCrCTTTGTATGTCTGTTACOT 
TNCATGCAACANAAGTGACTITCTrGAGAAAGCTCATGGCTGTGTCCCTGCAATTGGTGGTO^ 
TOGGAAAGTTTTGTNTraCNCTTrGGCAAAAAAGAATTTCCAAT^^ 
GGGGGATATTrrACTNCTCmATATGCAGAATnWGGGATTTAGTGNGGAATTN 
NATTTGGCrmAGGGGNNATTTTTCNTTGCNNAAAAGri^^ 

CTmATNATTCNGAAmGTmC^fNTNAGGANACNGT^^•CTGTAATAANANANAN^ 
GGNGTCTCAAACC 

SEQ ID NO: 2508 ACGCGGGGGCAGAGAGGCTGTTCGCAGAGCTGCGGAAGATGAATGCCAGAG 
GACTTGGATCTGAGCTAAAGGACAGTATTCCAGTTACTGAACTTTCAGCAAGTGGACCTr^ 
GTCATGATCITCrTCGGAAAGGTTTTTCTTGTGTGAAAAATGAACTTT^ 

ATTATCAGAAAAAAATITCCAGCTCAACCAAGATAAAATGAATTTTTCCACAaXrAGAAACATTC 

AGGGTCTATTTGCTCCGCTAAAATTACAGATGGAATTCAAGGCAGTGCAGCAGGTTCAGCG 

CATITCTITCAAGCTCAAATCTTTCACnjGATGTmGAGGGGTAATGATGAGA 

GGATATTCrrAATGATCCATCACAAAGCGAAmCATGGGAGAGCCCCACTTGATGGNGGAATATA 

ACCTGGGTTACTGNAATAAGTGNGCTGGTCATGGAAACCXJANGGCTGCATCTTGTTATAGCCATCT 

TTNACCTTGGCCQNANCACCTAAGGCGAATTCCACAACTGGCGGCG 

SEQ ID NO: 2509 GGTACTATACTCTGTATTCTCACAAATAGAGAAGACTAATATTGCAGACCTG 

gtgacagctctgattgtccttttggtrgtatccattgntnaagaaataaataagcgcttcaaaga^ 
aaacttccagtgcccattccaatcgaattcattatgaccgtgattgcagcaggtgtatcctacggc 
tgggacm-aaaaacaggtttaaagtggctgtggttggggacatgaatccnxjgatttcagccccct 
atracacctgacgtgoagacrrrcraaaacaccgtagoagattgcttcggcatcgc^atggttgca 
tttgcagtggccrmcagttgccagcgtctattccctcaaatacgaatatccactrgatc 

ANGAAGTTAATAGCCTTGGGACIXjGGTAACATAGTCIXjTGGAGTATTCAAAAGATTGCT^ 

TACCTGCCCGGCCGGCCGTCNAANGGCGAATTCCACCCACTGGCCGGCCGTACTAGTGGATCCAC 

Ta}GACCAACCTGGCGNAACATGGCATACTGTTCTGGGGGAAATGGTATCC 

SEQ ID NO: 25 10 ACGCGGGGACrCAGAAGCTTGGACCGCATCCT AGCCGCCGACTCACACAAGG 
CAGGTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGT 
GGCCCrCTCCTACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGG 
ACTCTCGACCCAAACTGCCCCAGACCXrTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAG 
ACATATGAAGAAGCTCTATATAAATCCAAGACAAGCAACAAACCCITGATGATrATTCATCACTTG 
GATGAGTGCCCACACAGTCAAGCTrTAAAQAAAGTGTTTGCTGAAAAA>WN^ 
NAGGTACC 

SEQ ID NO: 25 1 1 ACACrTGAAACCAAATTTCTAAAACirai'l'lUUUiu'AAAAAATAGTTGTTGTA 
ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCITCCACACTAGCCAQTCTTCTCACAOT 
TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGA'l 1 111 liCl 1 1 ACAATTCAGT 
CITCAGCAACTTGAGAGCTTTCTTCATGTTGTCAAGCAACAGAGCTCTATCTGCAGGTT^ 
ATAGAGACGGTTTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAACAT 
AATCCTGGGGACATACrGGCCATCAGGAGAAAGGTGTTTGTCAGTTG'ITTCATAAACCAGATTGA 
GGAGGACAAACTGCTCTGCCAAmCTGGAmCriTrATmCAGCAAACACCriMUllU' 
GACTGTGTGGGCCNCTCATCCAAGTGGATGAATAATCATrcAAGGGTITGGTTGCTTGGGCTTGGG 
AATITATm'AAAACNTITlm'CAT^ITGT^^TGA^mCCCAAA 
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SEQ ID NO: 25 1 2 GGTACITGAGTrCAAATGATAAAACTTGAAGTrGTAGGCTTGGAAGAGTATC 
AGCTCAGTATATCCTTCCTTGCATAAATACAAGGGAAAGGCCAAGGAATAATCAGCATTAACCTG 
CCAGGTCCAAGGGTCTTCTATCCCTGACTTCATCTGAGTCACAAGATTTCTCTAATCCCGCGTACCT 
GCAGAACCCAAArrGGCGTTTQTCATCAGAATCAGAGGTATCAATGGAGTGAGCCCAAAGGTTCG 
AAAGGTGTTGCAGCTTCTrCGCCTTCGTCAAATCTTCAATGGAACCTTTGTGAAGCTC^ 
TTCGATTAACATGCTGAGGATTGTAGAGCCATATATTGCATGGGGGTACCGGTAGAACTGCTATTA 
TTCATCCTATGTGGGTAATTGAGGAGTATGCrAAGATTTTGCGTAACrrGGGTTTGGN^ 
CCTCACTGNCTGCTATGATGGATAAAATGAAAAAAGTGAGGAAAAGCnTACNTCAGTGAGGGAAA 
AATirGGTATATGCCCGCGTACCTTGCCCGGCNGGCCGTCKAAGGCGAATTCCACCCCTTGCG 

SEQ ID NO: 25 13 ACGCGGGGGCCAAACACCTTCCTGACACCATGAGGGCCAGCAGCl-l'L'nGAT 
CGTGGTGGTGTTCCTCATCGCTGGGACGCrGGTrCTAGAGGCAGCTGTCACGGGAGrrCCTGrrAA 
AGGTCAAGACACTGTCAAAGGCCGTGTTCCATTCAATGGACAAGATCCCGTTAAAGGACAAGTrr 
CAG1TAAAGGTCAAGATAAAGTCAAAGCGCAAGAGCCAGTCAAAGGTCCAGTCTCCACTAAGCCT 
GGCTCCTGCCCCATrATCTTGATCCGOTGCGCCATOTTOAATCCCCCTAACCGCTGCrrGAAAGAT 
ACTGACTGCCCAGGAATCAAGAAGTGCTGTGAAGGCTCrrGCGGGATGGCCTGTTTCGTTCCCCAG 
TGAGAGGGAACCCG GTCCTGCrr GCACCrrOTGCCGTC CX:CA AGAGCrrACANOCCCCATCTTGGTCC 
TAAGTCCTTGCCGNCCTTTCCTTTCCACACrGTCCATTCTTTCTTCC^ 
ACTTGCTTTTTTATCACrrrcAATAAAANGTCCrTTTGTO 

SEQ ID NO: 25 1 4 ACGCGGGGGTATTCnTCCCCAAOTCTCTATGGTAGCOTCAGCGTCGGAGGC 
GGTAGTGACGGTGGCGTTrcCTTGAGGAAGAGTGAGGGTrCCAACrmCTGCTrATCT 
GTTGGGCGCGGACAGTCGAGATGTCAGAGAAAAAGCAGCCGGTAGACTTAGGTCTGTTAGAGGA 
AGACGACGAGTTTGAAGAGTTCCCTGCOjAAGACTGGGCTGGCTTAGATGAAGATGAAGATGCA^ 
ATGTCTGGGAGGATAATTGGGATGATGACAATGTAGAGGATGACTTCTCTAATCAGTTACGAGCT 
GAACTAGAGAAACATGGTTATAAGATGGAGACrrCATAGCATCCAGAAGAAGTGTTGAAGTAACC 
TAAACTTGACCTGCTTAATACATTCTANGGCAAAAACCCANGATGGGACCTAAAAAAATGNGTTA 
TTCATTATCTGCTTGGAATTATTGGGGTTTTGGACCCCAAAAATAATOGTTTOATGTNh^^ 
NN^WN^^^N^nWNN^mNNNGGTACC^TGGCCGG^ 

SEQ ID NO: 25 1 5 ACCT ACTGTATTTATTrAlGTGCAACAAGAAGTGTCAGCAACTGCACAAACTC 
CTCCCTGTTCAGCrAGTAGGCAGCAATTCTGTTATCTGGCATCCCATAGCTGGGTTAAATTAAAAC 
AGGGCGTGAGAACAGGTGAGTCTAGAGGTCTAACTCTAACAGGGACCACCGTGCATTTGAATAAA 
CAGTTGTArrAGGGATGTTGTrAAAGTrAGCCACTGGGCAGAATAAAGGATCCCTTAGGTCAAAT 
ATTTOGGrrrGACATGATGACATOACACATCCAACACTCTGTCAAATTACCTGCACAAGGCACTGA 
TTGTGAAATTCTAGTTACAGCArrATCTTACCAAATTAAAGAGGCAGGCATGAGCAGGGAAAAAT 
GGAGTGGGATAAAGAGCCTOTTCACGATGAGGCTTGGGGACATCTTGGGGAAAAACTGTCCACAG 
CATGAAAGTCATCAAOT 

CATTTTTGGGTAACTTTTGAGTGGNCCACATTTCAGGCATTGAAGGGTTTTTCC^ 
TGGGNTGNNCCNCCrCCANT 

SEQ ID NO: 25 1 6 GGTACATGTGCOVCATmCrrAATCTGGTCTATCATTGTTGGACATTTGGGTT 
GGrrCCAAGTCrrTGCTATTGTGAATAATQCCGCAATAAACATACOTOTGCATCTGTCTTTATAG 
AGCATGATTTATAGTCATTTGGGTATATAOCCAGTAATGGGATGGCTGGGTCAAATGGTATTTCTA 
GTTCTAGATCCCTGAGGAATCGCCACACTGACTrCCACAATGGTTGAACTAGTn'ACAGTCCCACC 
AACAGTGTAAAAGTGTTCCTATTTCTCCACATCCTCTCCAGCACCTGTTGTTrCCTGACTT^ 
CAAAACCACTATGAGATATCATCTCACACCAGTTAGAATGG CAATCA TT ATATGG TTAATTTTrTA 
AAGTTrATTITrAATAAACAGrrAATTTTTAAAAGTTTATGCCTTm 

AAAATCCTATAATAAATCAGAACTGCTTAAAAATAGAAATAAAAGACCTATTACTGGCCTCACAT 
CAGTGCACTCTGGCAGCTTTAAAGAAGTAGAATGAAG^rmAAAAANC^GCGGCCACATAAAGGG 
GTAA 

SEQ ID NO: 25 1 7 ACTGTGGTGTGTGAGTCTCAGCAGCCGCCXrACACGCTCCTAACTCTGCTGCAT 
GGCAGATGCCTAGGTGGAAATAGCAAAAACAAGGCCCAGGCTGGGGCCAGGGCCAGAGGGGAAG 
GCCCTGGATTCTCACTCATGTGAGATCTTGAATCTCl T 1 CI 1 1 GTrCTGTTTGTTTAGTTAGTATCAT 
CTGGTAAAATAGTTAAAAAACAACAAAAAACTCTGTATCTGmCTAGCATGTGCTGCATTGACTC 
TATTAATCACATTTCAAATTCACCCTACATrCCTCTCCTCrrCACrAGCCTCTCrrG^^ 
GCCAGCCCTGGAGAAGCAa'GGTGTCTGCAGCACCCCTCAGTTCCTGTGCCTCAGCCCACAGGCC 
ACTOTQATAATGOTCrGTTTAGCACTTCTGTA'nTATrGTAAGAATGATTATAATGAAQATACACA 
CTGTAACTACAAGAAATTATAAATGTTTTTCACACCCAAAAAAAAAAAAAAAAAAAAAAGACC^ 
T^GGTCCANCCNT^^T^^CCAAGGAAATGCNCCTT^GGGCrIT^rm^ 
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ATTTCCAAA 

SEQ ID NO: 25 1 8 ACGCGGGGACAGTAAAAGCAAAAGCAACAGCTCAAGCAGCCTCCTTGGAGA 
AAACCTGAAAATTCAACrrGTTCAAGAGAAGGTCTTGTACTATAGAGACTCAGTTOCAAAAAT^^ 
ACAAATATGCTGCITGATTAAAATGGGTAGGCTIXrrCATGTGGCTCATTCTTTAA 
TATTTGGTTTGGTrCATGGGGTCTCTGCCTATGGATCATACTTCAAACTCTTGGTGTGATCCT 
ATTGTCACAATArrAGTTACCCrGGTGTGCTGTATTCTCTAAAACCTTTAAATGTTTGCA 
ATTCGTCAAATGTCAAATATTCTCTCTTTGGCTGGAATGACAAAAACTCAAATAAATGTATGATTA 
GGAGGACATCATAACCTATGAATGATGGAAGTCCAAAATGATGGTAACTGACAGTAGTGTTAATG 
CCTrATGTTTAAGTCAAACTCTCATTTANGTGACAGCCTGGTGACTNCAGAATGGAGCCAGCATGC 
TNAATGCCATATCTCCCCTGNAACATGAGOAAGCAGGTTGATCCCAAACAAACAAANTTTCTAAA 
ACATGA 

SEQ ID NO: 25 19 GGTACTAACTCTGGAAGAATGGCAAGACAAGTGGGTGAACGGCAAGACTGC 
TTTTCATCAGGAACAAGGACATCAOCTATTAAAGAAGCATTTAGATACTTTCOTAAAGG 
GTGGACTGAGGGTATTTTTrCCTCTTTGCGGAAAAGCGGTrGAGATGAAATGGTTTGCAGACCGGG 
GACACAGTGTAGTTGGTGTGGAAATCAGTGAACTTGGGATACAAGAATTrTTTACAGAGCAGAAT 
CTTTCTTACrcAGAAGAACCAATCACCGAAATTCCTGGAACCAAAGTATTTAA^ 
AACATTTCATTGTACTTTNNl"ll'in'l"ll'lU'NlU'lU'lUU"ll"l'l'NTNANAAGGGNCCT 
TAAATTGAGTAGTAGGAATOCCGNAGTAGTTAGOATAATATAAATAGTTNAATTAAAAATGGGTN 
TGTTAGGGGTGGCCTGCCNGGCGGGCCGTTTAAAAGGCGAAATNCACCCACTGGGNGGCCGTCTA 
ATGGATNCCACCCGGGNCCAACCTGGGGGAACAAGGGCTAACTGNTTCCTGGGGAAATGGTATCC 
CN 

SEQ ID NO: 25 20 ACGCGGGCTGGACGCCAATGACCTAGAAGATAAAAACAGTCCTTTCTACrAT 
GACTGGCACAGCCTCCAGGrrGGCGGGCTCATCTGCGCrGGGGTrCTGTGCGCCATGGGCATCATC 
ATCGTCATGAGTGCAAAATGCAAATGCAAGTTTGGCCAGAAGTCCGGTCACCATCCAGGGGAGAC 
TCCACCTCTCATCACCCCAGGCTCAGCCCAAAGCTGATGAGGACAGACCAGCTGAAATTGGGTGG 
AGGACCGTTCTCTGTCCCCAGGTCCTGTCTCTGCACAGAAACTTGAACTCCAGGATGGAATTCTTC 
CTCCTCTGCTGGGACTCCnTrGCATGGCAGGGCCTCATCTCACCTCTCGCAAGAGGGTCTCTT^ 
CAArrTTTTTAATCTAAAATGATTGTGCCTCrGCCCAAAAAAAAAAAAAAAA^ 

SEQ ID NO: 252 1 ggtacaagatctaccccggacacgggaggcgctacgccaggaccgacggga 

AGGTTTTCCAGTTTCrTAATGCGAAATGCGAGTCGGCTTrCCnTCCAAGAGGAAT^ 

taaactggactgtcctctacagaaggaagcacaaaaagggacagtcgoaagaaattcaaaagaa 

aagaacccgccgagcagtcaaattccagagggccattactggtgcatcnxritgcixjatataat^ 

ccaagaggaatcagaaacctgaagttagaaaggctcaacgagaacaagctatcaggtgaggaat 

gcmcattatgagcatmacctattcggatggattttaggtttgggtcttcagt^ 

ccaaaaaaaaaaaaaaaaaaaaaagt 

SEQ ID NO: 2522 ACGCGGGCTGGACGCCAATGACCTAGAAGATAAAAACAGTCCrrrCTACTAT 

gactggcacagcctccaggttggcgggctcatctgcgctggggttctgtgcgccatgggcatcatc 
atcgtcatgagtgcaaaatgcaaatgcaagtttggccagaagtccggtcaccatccaggggagac 
tccaotctcatcaccccaggctcagcccaaagctgatgaggacagaccagctgaaattggotgg 

AGGACCGTTCTCTGTCCCCAGGTCCTGTCTCTGCACAGAAACTrGAACTa:AGGATGGA ATTCT rC 

CTCCTCTGCTGGGACTCCTTTGCATGGCAGGGCCTCATCTCACCTCTCGCAAGAGGGTCTCmGT^ 

CAA l ' l ' lll ' l ' l AATCTAAAATGATTGNGCCTCTGCCCAAAAAAAAAAAAAAAAAAGTACCTCGGNC 

GCGACCACCCTTAGGGCGAAATTCAACACACTGGGNGGCCGTACTAATGGATCCGACTTCGGACC 

CAACT^GGGGGAATNATGGGCATAA^^^GGTTCCTGGGGGAAATGGT^^'CCNTTNCAATO 

AATAANANG 

SEQ ID NO: 2523 GGTACCCTCAATTTTCTTCATCCAGTAAGACAAGTATGCATCTTAAAGGGAAA 
GCTTTTAGTTCATAACAATATOTTTGGTTGCTTTTCTCTGCT^ 

ATTGGCTITAGTGTTTGCTGCCAACCAGATAACATGTTAACATATAATTTACAACrCTCTCTAGGTA 

GGTGATTACATAGAAAACTAATGCCAGCTAAGATTTATGGATTCAAAAGATAACTGACACAAGTT 

CTTGTCCTCTACACTTGAGAGAACTAGCTGTITAACGCATAAATATATTGATAGTAACTACAAGTT 

AAATAACATTCAGTGTCACTATATAACATGCTGTCTTACrrGATTCTGTTmCAGACrr^ 

GTTTCTCTTTGTTGGAAGAOTCTCAGAGATAATGTGTGCTTTAATTrAAAC^ 

ATTTTGCCCAATGGTTGGTAATGNAAGGTAAAAAAATATATTGGTTGGATCAAGATTTAT AGATT A 

CACATACTATrCATCAAATATATGGTGATATrAATAAATOAATCCTAAATAAOTTCACCrmTTO 

AAAAAAAAANAAAAAAAAAGTCTGCCCCNGCGTNAAAGGGAATCNCCCTGGGGCGTATANGGNC 
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CNCCTGGGCAACTGGGAAATNGGN 

SEQ BONO: 2524 CGAGGTACGAAACCAACGCCCCGAGGGCTGGTCGCGACTACAGTGCATATTA 
CAGACAAATTGAAGAGCTGa}AAGTCAGATTAAGGATGCTCAACTGCAAAATGCTCGGTGTGTCC 
TCCAAATTGATAATGCTAAACTGGCTXKTGAGGACTTCAGACTGAAGTATGAGACTGAGAGAGGA 
ATACGTCTAACAGTGGAAGCTGATCTCCAAGGCCrrcAATAAGGTCTTTOATGACCTAACCCTACAT 
AAAACAGATrrGGAGATTCAAATTGAAGAACTGAATAAAGACCTAGCrCTCTCAAAAAGGAGCAT 
CAGGAGGAAAGTCGATGGCCTACACAAGCATCTGGGCAACACTGTCAATGTGGAGGTTGATGCTG 
CTrCAAGGCCTGAACCTTGGCGTCATCATGAATOAAATNAGCCANAAATATGAAAGCATNOGCCA 
GAANAACCrTTCANAGGCXX;AANAACAAhriTGAGAGACAGCCTGAAGTTrGCAGCACNGGTC 
TGGAATNCTGGAGAATrAA>O^GGACnjrGQTCACTACCGACTANCNNCCTTCC^ 
ANACTNAGCrCATITACATGAANACTTTGAGCCNCTTTAANGAACAAGGCCTTCATANCNGT^ 
AACTC^mW^TmNCCmGNGCCAC^GTNCAAATCGNNN 

SEQ ID NO: 2525 ggacgagtcaaocacaaactgctgcoccaggaaaagacaaogctaattggg 

CCCAACTGCCCTGGAGTCATCAATCCTGGGGAATGTAAAATTGGCATCATGCCIXjGCCATATTC^^ 
AAAAAAGGAAGGATTGGCATTGTGTCXAGATCTGGCACC(nt}ACTTATGAAGCAGTTCACCAAAC 
AACGCAAGTTGGATTGGGGCAGTCTTTGTGCGTTGGCATTGGAGGTGATCCTTT^ 
TTTTATTGACTGCCTCGAAATCTrrTTGAACGATTCraCC^ 

AATTQGTGOTAATOCAGAAOAGAATGCrGCAOAATTnTGAAGCAACATAATTCAGGTCCAAATr 

CCAAGCCTGTAGTGTCCTTCATTGCTGamAACTGCrcCTCCTGGGAGAAAAATGGGTCATGCC^ 

GGGCAA^^AT^GCTNGAGGNAAAACNGGAGCTTAAAGAAAAAAATCT^m'GCCTTCAAAATGCAA 

GAAGTTGGGGTCAGTATTGTCrrCTTGCACAGCTTGGGAANCrcCATCnTCCAAGGAAm 

GAAGGAANATCCTTTTNAA 

SEQ ID NO: 2526 GGTA Cn ' n I ' l ' LTVl rriM ' i ' lMn ' lUU ' riU AOKrnTACACAGTn'ATrCANA 
AGmAAAAGTTAAAATGNGGGCATTTTCCCCTGAAACAAATCATCCCCTGCCACTCCCC^ 
AAGACTCTCCCAACACCGCAGTrCCTICn^TITACCCAGGCnTACCACACATTOT 
CAGGAGTCAACACCCCAACTATTTGGTAATACAAAGAGACTACAAAGTCACACAAAAGAATCTCA 
TrrACTTCCrmCACTAGCAGGCTATGAATGAAAGACTGAGAGGCAAACACATCACCTTCATCCA 
GAGTTGACACTTCCCTAACCTTTmCTTCCTCTAAGAGGrrAAAATC^^ 
ACTGTAACTCTGATATTCCATTTrrCTCTTCTGAATGGCTAAGCTAGGACATA 
CTGATCTTCAGAACCTCAAGACTGGCAGAAAGCANGGCTCCCACTGCAGCTNGGCACAATACTrr 
TtKTAACANGGCAAAGrrCCCCATAGGNGCAAATGAGCmTAAGAAGTrrrCTAAACNri^ 
AAAATGN 

SEQ ID NO: 2527 ACAAATTCCAGTGTGCAGACCACAACCTCAAAACAAAAAAOGTTGOTCATGC 
AGCATTAAACTTrGACAAGGCCACTGACATCGTGACAGGTCTGAAACAGAAAACCrCCTTTG 
GArcAATGTGGGACAATGAGTmCTACAATAGCTACCTCCX^ACCCCAAGTCTGTAGTGGGAGTTT 
TCTTATGTGGCCCraMACnTTGGCAAAGAGCCTGCGCAAATGCTGTCACCGATATrCCAGTCTGG 
ATCCTAGAAAGGTTCAATTCTACTTCAACAAAGAAAATTmGAGTTATAGGAATAAGGAC GGTA 
ATCTGCATmGTCTCTrTGTATCrTCAGTAATITACTrGGTCTCGTCAGGm 
GGATAAGAATGTGCCTCTCAAGCCTTGACTCCCTGGTArrcrTTTm 
ACTTGAGCTTTAGCAACTTAAGAACTITGAAGTTCTmAAAGTCnTGAA^^ 
ATCCNTTTTNANAAAAAAAACTGTAAAATTTITITGGACAGChJT^ 
CAAAG 

SEQ ID NO: 2528 ACTGATGCTGAAAAATCAATAAGATTAACCCAGAAATTGGCTCAATAAGAAA 
CCAGGGACTCrCAAAGGGGGTTTrACAATTCAGAAACAGGAAAAAATTAGAAAGTGGTTTATrr 
CAATITACATGGATTCCIXn'ACTrrGTGAGGTn'AGTATAATrcAAATGTATTC 
CATTACCTACAAATrcATATTAGGTGGTCTACAGCTTCTAATraATGTGCTTCATGATAAA 
CAGTAATGATGTTTrAAATATTCTAGTGCTCACrGGATTTCATTTTTGCAGGCAATTr 
CTGAGTAAAACATn'ATATTCAATTACAGAGTCrrGATAAATAGQATrCXCGCCACCCCACCCCTTC 
TAATCACTGCAGAGTATTAATAGTGCrrrCTTATGGCTGATTTCTTGCNGGGAGCAAAGA'r^ 
TATAGCX:CAATTITmCAAACCTAACAGCATCnTSICTm'GGGAGCTAm 
TTTTGACmX:ANATGACACGACCITGCCTCCCTA>rr^ 

SEQ ID NO: 2529 GGTACGCGGGGATTTCTCCCXKjAACCTCTGCTCAGCCTGGTGAACCACACAG 
GCCAGCGCTCTGACATGCAGAAGGTGACCCTGGGCCTGCTrGTGTrCCTGGCAGGCrmCCTGTCC 
TGGACGCCAATGACCTAGAAGATAAAAACAGTCCnTICTACTATGACTGGCACAGCCTCCAGGTr 
GGCGGGCTCATCTGCGCTGGGGTTCTGTGCGCCATGGGCATCATCATCGTCATGAGTGCAAAATGC 
AAATGCAAGTTTGGCCAOAAGTCCGGTCACCATCCAGGGGAGACTCXIACCTCTC^ 
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CTCAGCCCAAAGCTGATGAGGACAGACCAGCTGAAATTGGGTGGAGGACCGTTCTCTGTCCCCAG 
GTCCTGTCTCTGCACAGAAACTTGAACTCCAGGATGGAATrCTTCCTCCTCTGCTOOOACTCCm 
CATGGCANGOCCn'CATCTCACCnrOTCGCAAGAAGGTCTCTTTGGTCAAl 1 1 1 i 1 1 1 AATCT AAAATG 
ATTGNGCCTCTTGCACAAAAAAAAAAAAAAAAAAAAAAAGTCCrrGCCCNGGCGGCCGTTCAAA 
NGGCGAAAT 

SEQ ID NO: 2530 CGAGGTACGCGGOAGCTCAGTGTGATTCAATAGTGATATGTAACrGCCCTCA 
AAAACTATGAAAAGCTTAGAGTTTCAGCAGAGGATGGTGTATCTGTTATGAGTGAGATGAGAAGT 
CTCTCTCTTCATGGCTGGTAGAATOAATCCAOAATTOTCAACTrCACATAAGTCT 
CAACCCTGCCTGTmAGGTGAAACTAATGCACCAATTATTTCATTCATrrACAGAGAAACTTAAG 
CTAATATGTACGCGGGCTTCGCAGGATTTTTCTGAGCCTTTTACCACTCCAGCCTAGCCCCTACCCC 
CCAATTAGGAGGGCACrGGCCCCAACAGGCATCACCCCGCTAAATCCCCTAGAAGTCCCACTCCT 
AAACACATCCGTATTACTCGCATCAGGAGTATCAATCACCTGAGCTCACCATAGTCTAATAGAAA 
ACAACCGAAACCNAATAATTTCAAGCCCTGNTTATrCAATTTTACTGGGGCTCTATm 
CAAG<XTTANAAGACCTGCCCXjGCNGGCGOTCNAAAGGCGAAATTCAACNCV\.CT 
CTASrrGGATCCANCTTGGACCAACCTGGGGTATCATGGCATACTGGTTCCTGGGGNAAATGTTTCC 
GTCCAATTCCCNCAN^^^ANAACCGGAACC^AAAN^™AAACNNGGNGCNATGAAG 

SEQ ID NO: 253 1 CGAGGTACTGGTTTTGGATTAGGAATTGTTTTCTCACTTACCI-1 Cn i AAAAG 
AAGAATGTGGCCATTAGCCTTCGGTTCTGGCATGGGATTAGGAATGGCTTATrCCAACTGTCAGCA 
TGATTTCCAGGCTCCATATCTTCTACATGGAAAATATGTCAAAGAGCANGAGCAGTGACTTCACCT 
GAGAACATCCCAGCGGGAGGACAAGAGAAATCATGnTATTCCrCAGGAATACTGAAGTGCCCTG 
GAGTAAGCTGCCArrCrTCrGTAACAATGTTATCAAGTAATGCTTTAAACrCCAGCACCTGGm 
GCATITGAAACCAAGTCTGGTTCTTNGhnmGNATTITCTCrrCT 
^TAAATAAATTAAAC^fNAAATAGTAATANNANATATTNNATCTT^f^^I^ 

CKGGCCGGNCGhTTCNAAAGGGNGAAmCAACNAACTGGCCGGCXOTACTATTGGATNCTANCT^ 
GNTCCCATCTTNCGTNATCATGGGCA>n^CTGTCCCTGTGGGAAAATGTaW 
ANmANANCNGGANCTTAANmAANCCnXKjGhrrcNAATNGNGGCNA^ 
NNTGNCCTTTTa^CNGNAACTITrrCNTCTrnMTA^ 

SEQ ID NO: 2532 ACATATTAGAAGTCTAAGGAGTAGCAAGTCAGTGGGAGGACTnTTCACCCC 
TGGCAlTAGCAGCTTCOACCTCATrrTCCAGATGCACCAGCTCCTATTAATAAGTTAGCAAGGAAA 
GTGTATGTCAOSTOCAGGAACAGTGAGGCAGGGACAGGGGTTCTGCTCCTTCTCACTrCACCACCG 
GCACACAGCITGCCCCTGTCTTTGCCCCCAAAGGTATriTGTGTCTAGTGTCAAATTGGAGCrATrC 
TTCACTGGTCCTTAACCTTGGGTTTTAAAAAGAAGGCTTCTCTGTTTGGGTAGCCGT^ 
GTATAGTAAGTCCT^m■CAAAGAGATGGCAATATGCTGGGCATCTACTTTAAACAAAGTQTCT 
TmGCAAGAAAAGrrAGOATTrrATTGNCTTATTTCCTTTACAGGTCTGC^ 
TTrmAAATAACTCANGGTGGATGANAANAAArrANAAATGAAAATTACTTATGGTGGACrGNA 
AATGTTTAATTGGAANAATCTTTTNATAAACCATTTTTGTN^ 

T^n^^N^^^^^^m^TOANTrANN^^ 

SEQ ID NO: 2533 GGTACTGGTTriTCTGAOAAACAGTCCCTCGTGAACTGACAGTAGCTCAGAG 
AGTCTAGTCTCAATCTGCTGTCATGGGCTGGTAACCACTGAGGCAACCGATTn'CCACTGmGT^ 
GAATArrGCATGATaTGGGTAGGTCTGGATGGTTTCAGATGGGAGTCTCTGGTCAAACATTCTAT 
CCCGCGTACTACTTCAGGTAATCArrGTTTTACITAAAGTrCAGATrCCAGCATATATTGAGATGA 
ATATTCCCrGGTrATACTTTGTCAATAGTTrTCTCATTGCTACAGTGTATTGGTrrAATO 
GCTrAATITAAAAGACArrG0ArrA(XnTIXK3ATCCATTTGTCAACTGGAAGTGCTGC^ 

cttacaatcctaatcttgagcnaatigaaaaagcctatatcaataatgatitgttaata™ 

TTAANAAGTACAGCTGGCATAAGAACATAATTITATGAACNGAAAGAACrCAGGACATArrAAAA 
AATTAACTGAACTAAACACITITGCCCCTNACrGATACATTTCAGAATGGGC^^ 
ATCCNGTTTTAAATAGNGT^A^T^AAAACAANTATTCCANA^INTTTAT^mT^ 
AATTCTITGTTTACNCNAAAGG^^^ATAGNAACTTNGGTTCNGO 

SEQ ID NO: 2534 CGAGGTACTTTGACTTACTAGGGTGATTCAAAGrrTCAGGAAAAAGAAAATT 
CCCAGTATCATTTTCrrAATCTrATTAAACCCAAACATAAGAATGCCAAAAAATACAOAGCTCACA 
TTITGCTOGCATACATTrCCAAAriTITAATGCXrrCCCrGACAGGTGAATTT^ 
GCAG AO^Cm CAAAACArrCCTTGTGATGAAGTAGAAAAAGCCCTGGATAAG TGGC CAGCTACA 
CTGGAi 11 llGTCrCAATrCITCATTAACTITACAGTCnTCAGCAAATCCCTTAACr^^ 
TATGTAGTGACATCATTATAAATGCTGNGGTATTAAAATGTGATAAACATAGATGAAAGCATTrrG 
AAATGGTTGAAATCTACATAAATACC^^^AATAACTAAATTCCACAGGATAGGATC^GAGAGAGCT 
AAGCTGCCAGTATTAITAAAAGGAGOTATTACAGAAAANCCAGCCTCATTCATTACAGGNTATGT 
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CrCATATCTCTTGGNCAATGGrnAATAGAATGAGTATNTTGAGAAAAAAGTNGGAATNANCTAG 
GAGACANACCTGGGCAGTNAAATGCrrCTCACGNTmAAGGGTAAGNGTAGGCACCCTCNGANA 
TCCTANTGGGGGTTTACCNCAANGCCAGGNCCNNNOTCCT 

SEQ ID NO: 253 5 CGAGGTACCCTCCAGGTCATGGTGATATrTACGCCXGTnCTACAACTCTGGA 
TTGCTTGATACCTTTATAGGAGAAGGCAAAGAGTATATTmGTGTCTAACATAGATAATCTG^ 
GCCACAGTGGATCTGTATATTCTTAATCATXrTAATGAACCCACCCAATGGAAAACGCTGTGAATTT 
GTCATGGAAGTCACAAATAAAACACGTGCAGATGTAAAGGGCGGGACACTCACTCAATATGAAG 
GCAAACTGAOACTGGTGGAAATTGCTCAAGTGCCAAAAGCACATGTAGACGAGTTCAAGTCTGTA 
TCAAAGTTCAAAATAmAATACAAACAACCTATGGATTTCrCrrGCAGCAAGTTAAAAGACTG^^ 
GGAGCAAAATGCCATTGACATGGAAATCATTGTGAATGCNAAGACnTGGATGGANCCTGAATGT 
CATTCAATTAGAAACTGCAGTANGGGCTGNCATCAAAGTTTTGAGAATTCTCTAAGNATTAATGTG 
CCAAGGAGCCCGTTTOTGCTGTCAAACCACNTTAGACTCTTGCTNGNGATGGCCAACCTCTATAG 
TChrrAATGCAGQATCTm"GACATGATGNAAACCGGAATTTCTANGTGCCXrrTO 
NTTTACNAAGGT^WGATTTTTNTAAAAT^^AAN^^TCCC^ 

SEQ ID NO: 2536 ACGCGGGGGGAGACGTGCTAGCGCGTCGAAGGTAGCTCTATGGTTTTCCTCG 
CGTTCTrGAGTCGGGAAATXKX:CGCrGTGTGGTrGCAACGOAGATAAATTCCCGGAACX: GCGAT^ 
CGGCGTCTCAGGAArrCGAATTTAGAGTrrAATITCTCAGAGCArrCTCTCCAGGAAGAAri-l'l'lA 
CAGTATCrCAAAGACTTCACrrGACTTCTrGATCCTGCATAAAACCAAGGAGAAAAaAAATGGGT 
CGCTCCAATTCTAGATCACATTCrrCAAGGTCAAAGTCTAGATCACAAGTCTAGTTCTCGATCAAG 
ATCAAGATCTCATTCTAGAAAGAAGCCGATACAGTTCrAGGTCTCGTrCCAGAACATArrCAAGGN 
CTCOTAOTAAAAATCGNATOTATTCTAAANAATATCGTCCGATTACNGAAATTATAGAAGAATGA 
GACCACCTTATGGGTACCTNGGNCGCGANCACOCTTAGGGCGAATTCCNCACACTGGCNGGCGTA 
CTANTGGATCCNA0CTCX3GNCCAANCTrGGCGTATCATGGGCATAACTGTI>IC^ 
TATCCCGTTACAATTTCCCNCACATCNAACCCGAAG^r^mANTGTAAACCTGGGNGCCAAATAGN 
GACThUCTNCATAATGNGTGGGCNAmrCCbrrTCNATNGGAACNNNNNCT 

SEQ ID NO: 2537 ACATGCAOAGGTAAAGCTGAAGCTGGCCAGGGGATGGCTACAGTTCATGATC 
CCCAA^TCTGGTGCTGATAGAGGCTCACACTGAATCACTTGACAGGTTGGTTCTGGAGATGACCA 
GTTrCCAAATGGTCCACAGGTGGTTTCrrCAATCCCAGTTAAGTTTGTTCCTTCAGAGCAGCTGAA 
GGCACACTGTGAGCTGAAGCTGAAGnrCCCAAAGGGTGAGTACATTACACACCAGTGAATAGTA 
ACAGATTGTTAGTATGTTCAAGCTTrGTGTTGCATGATGGTAACTATAGAGTTTGCGGGTCAGTTG 
nTGACAGAGGAATCCAGTACAAGAGATAGAAAOACCAOTCCTTGCTGAAAGACAAAGTCTOAAT 
GCTCCACTTTrrCAArrCTCTCrCCATTCTTCAGTAAAGTCAACTTCAATGTC^I 
CAAACACATAGCAATTCAGGAAAmGACrnCCm-CIWGCTGGATGACGTGAGTAACCCTGA^ 
NTTGGAGTACCTTGGCCGGACCCNCTANGGNGAATTCACACACTGGNGGCCGTTCTTrhW 
GCnTGGTCCAACnTGGGTANNT 

SEQ ID NO: 2538 ACCGTCCAGCGAGTTGCAGATCTACACTTGGATGGATGCAACCTTGAAAGAA 
CTGACAAGCriTAGTAAAAGAAGTCTACCCAGAAGCTAGAAAGAAGGGCACTCACTTCAATTTTGC 
AATCGOTrTrACAGATGTTAAAAGACCTGGCTATCGAGTTAAGGAGATTGGCAGCACCATGTCTC 
CAGAAAGGGGACTGATGATTCCATGACCCTGCAGTCGCAGAAGTTCrAGATAGGAGATTACTrGG 
ACATAGCAATTACCCCTCCAAATCGGGCACCACCTCCTTCAGGGaaCATGAGACCATATTAAATTC 
TAmACTATTTG>n'GAATTTATTma:CTCAGTTATaTAAAATAAACATACTCT^ 
ATTATTGGCATTAAGCCTTTAAATTCTAAACAAATTATAATGCATCATCTATTTANGAGTTAGATTT 
GGATGTGCTATTGGATGAATACNAATAAGCTGGATGGTTCAAGCCCTTCTGTAAAATATGAANAA 
AAOGCTCrrACATTCTGGGNAAAAATGTCCTTTGGCGGGAACACCCTANGGCGAATTCCACACAC 
TGGGGGGCGGTXrrTATGGATCCAACTTGGTa^^ACrrGGGGNAACATGGGCTAACTGGTCCT 
AAAATOTATCCGTTCAATTCCCCANNTrCAACCGGAGCTTAANG 

SEQ ID NO: 2539 ACGTGGGCACAACCTCCACCTCCTGGACTCAAGCAATCCI^CACC TCAG CCT 
CTCAAGTAGTrGGGACTCCAGGGTCACACCACCACATCCAGCTAATTTATAGAGACAGAGTTTCAC 
CATGTTG<XCAGGTTGGTCTCGAACTCCTGGACTCAAGCAATCCTC TCATC TCGGCCTCCCAGAOT 
GCTGGGATTACAGGTGTAAGCCACCACATCTGGTAACAACTAGCAGTrTTAGCTGACAAAGAAAT 
TCAGCCCTAAATGACCTGCCTGAAGrrATITrTGACATGTATCrrCCCCACTGCTGCACAAGGGGC 
TAAAGGAACTCTCTACCGTCATATCTATGTCAGTGTCTTATGCTAGGArTAAGAGTATT TATA TGC 
ACAAAGGGCTGCATTmCAAAAACCTITCmCTTNCTCTGNCAT^ 

CCTGTGCACAAGANACTGCCTOOTCAAAGANGGGAAGGAAAGAATCTGTAACAATGCCCQTAAN 

AAGCAAGGGCrrGTGAANTGATGGATGCKjTTNGAACAAAACAAAGTmr^GCAGGATG 

AAAAGTGGGAACCAGCCnTNGGANGGTGGGAAGGGTNAAAGNTCNNAATNAACNCTGAAANGAN 
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GGAAAAN^r^GGGCCCATGANGGGGGAANCCAGGNTNGGGGACCCCCTNAAGGG 

SEQ ID NO: 2540 ACCATTCCITCTGAAACTATTCCAATCAATAGAAAAAGAGGGAATCCTCCCT 
AACTCATITrATGAGGCXAGCATCATTCrGATACCAAAGCCVVGGCAGAGACACAACCAAAA 
GAATmAGACCAATATCCCTGATGAACATTGATGCAAAAATCCTCAATAAAATACTGGCAAACT 
GAATCCAGCAGCACATCAAAAAGCTTATCCACCATOATCAAGTGGGCrrCATCCCTGGGATGCAA 
GGCTGGTTCAACATATGCAAATCAATAAACGTAATCCAGCATAfACACAGAACCAAAGACAAAAA 
CCACATOATTATCTCAATAQATGCAGAAAAGGCCTITGACAAAATTCAACAACGCTTCATGCTAG 
AAACTCTCAATAAATGAGGTATTGATGGGATGTATCTCAAAATAATAAGAGCTTTCTATGACAAA 
NCCCCAGCCATTTCATCCCGAATGGGGCAAAAACTGGAAACCTTNCCTTTGAAACAGGGC^ 
NANGGGGGGCCmTmACCATTCTATTTAAAATATNGTTGGAAATTTTGGCCGGGCCN^ 
GGAAAGGAATAANGGGTTNCATTTGGAAAAAGGACCCAANTTGC 

SEQ ID NO: 254 1 GGTACCCTTAAACTGGCAGGACATTnTGAAATCACAAATTrGCACATAAAG 
AATGTCACGAACAGCCATGTATCCATATACAGCAATCAAATAAGGAACTTATGACCTAAAGCAAA 
GGTAAACTITCTTGAAACTTAACATTCTATACCAACTAGGCAACCTCTGCCCAGGATGAGAGTTGG 
ATTTTTCAAAAACCTCTAATTTAATAGTGCAGCATTrCOTTITCCCTGATGGCCT^^^ 
AGTTTITAAAGACTGCTrGTCAACTATAGCTGCAGCCTATATCCCACTATGGAAAAAAAAGTAAAT 
CITAGTTCAATTTTTGGCAGTTGGriXn'GTATTTAAATTrAAAAAAAAAC^^ 
GTTAAAAGGTTATTATCAATCTGTGCOTAACTAAAAAGTCAAGCAAATCCAATTTTGCTT^ 
CATTGTAAAGNACAATCTTGGNNTTOTGCCCTGGNTGAACCATTCAACCCTTAANAATTAC^ 
TTGOGTCCTrOTTCAA>mAAAAACC<>rANTTGACCGCTTAACCnT™ 
AGGT^^'AACCNAACK^^ATTTrn' 

seq id no: 2542 ggtacagtitcx:citctccagggtgacagatgagccttrrccgaagttcnx;ag 
cmctcrrctatcgaaacttcccatgtcggttaaagtgtttgtagagatag^ 
aggttgtgcaggtcatcagcgaacattctaotcx;aaccattttcctcit^ 
aattcattccxntrcc^\acctcgaagccatatgggcaxctoatcaagttctttggg 
aagtitcccaggatcccgatgttgtcatacactccgaacatggcccmtcrcgttccaacgatca 
actactttggggggcgggagagtgagccttataccgatgaatctaggcacaccaagaaaoaagct 
tctgcacgccagaagcacccagtcccacaggcgcctacctggcccggatnaganggtccctgcca 
tirmmagtotcccccmtsiaacaggacacccaaattagcccaaaaaaaatgacagg/^ 

CCCGGCCAACTTTANAACAATTCCCTGCCCGGGNGQGCCGTTTAAAGGGGNAATTCAACCACCTG 
GGGGGGCGTTNTATNGGGACrCGGCNCGGGNCCAACnTGGGGGNNN 

SEQ ID NO: 2543 ACTGGAAGTGCTATATAATGCAGTGTTAAAAAAAAAAAGAAATAAAATGCA 
CATAGAmQAAAGGATOAAATACAACmGTTCACAAGTAACATGATKjTCTTTGTAGAGAAT^ 
TAAATAACAAAAACCCTGCTGACATTAATAAGCAATTACAGCAGGATTGCAGGATACAGTCCTCA 
GGTAGATGATAAAAAGGAATTGGACATCAAGTCATTAAAAGACACGGAGAAACCCTAAATGCTTA 
TTGTTAAGTGAAAACCAATCTGAAAAGGCTTrCTATITGTATGATrGCAATrATATGACATT^ 
GAGAGGCAAAACTATTGACCAGTGAAAGGACCAATGGGTGCTGAAAGTTGAGGGGAAAAGGAGG 
GATGGCAGGTAAAAAGNGAATTTTAAGGAGTGNAAGTNCTNGGNCGGACCCCCCTAAGGCCGAA 
TTCANCCACIXjGCGGCCGTATTATGGATCCGNCTCGGTNCCANCCTTGGNGNATCATGGCCANACT 
GGTTCTTGNGGNAATITITCCCTTCAAriTCCCCACANCCAACCGGAGCTrAAAGGaAAACNCGGG 

ncccantaaaagaan 

SEQ ID NO: 2544 ggtacaggttagggggcggggtgggccaagaaggcaatcatttgggcagaa 

AAACAGGOTCAGATGTTITCACrTAGGGCTGGQGCTCCAGOCTTGAAGGTGGGAGCCCAOCCGTT 
CTGTGTCACTCATAAGAAGTGACAAGAGTGCAACCTGTCTCTCTCCXn'CX;CCTCATGCACACAGAG 
GCCACGGAAGCACACAGCAAGATGCAGGCCACOCAACAAGCCAAGAAAAGTGGCCTTAGAATGA 
AAACTATGTTACTGGCACCTTGATCrrGGAATTCCCAGGCTCTAGAATCATGAGCAATACATTTAT 
GTTGTTTAAGCCCCCACGGCCGGGCGCAGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCTG 
AOGTAGGAGaATTGArrGAOOCCAAGAGrrAAAGACCAGCCTGGTCAATATACAAGAACCCXrrNT 
CTTAANTAAAAAATAATAGGGCCGNCCGGGA 

SEQ ID NO: 2545 CGAGGTACTGATGCTGAAAAATCAATAAGATTAACCCAGAAATTGGCTCAAT 
AAOAAACCAGGGACTCCCAAAGGGGGTTTTACAATTCAGAAACAGGAAAAAATTAGAAAGTGGT 
TTATTTGCAATTTACATGGArrCCTCTACTITGTGAGGTnAGTATAATTQAAATGTA^^ 
TCAGGAACATTACCTACAAArrGATAmGGTGaTCTACAGCrTOTAATrQATOTGCrrrcATOAT^ 
AAGATGGCAGTAATGATGTTTTAAATATTCTAGTGCTCACTGGATITCATTmGCAGGCAATTI'GC 
AGCTCXrrCTGAGTAAAACATTTATATTCAATTACAGAGTCTGATAAATAGGATTCX;CGCCACCCCA 
CCCCTTCTAATCACTGCAGAGTATTAATAGTGCTTTCriTATGGCTGATTTOT 
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AGATAAGATT^™GCCCAATTTaTTCAAAACCTCAGCAGC^T^ 

AmrrCrmCNCCrNriTrGACTTAAAATGAACCAACCriTGCOT 

TAAACITCCNGGGTTCl'llUM'J"i'lCCTmTCCANGGGGNNTAACTGG 

SEQ ID NO: 2546 acgagcagctgtctgggaagtagggggttagcttggggacctgaactgtcct 

GGAGGCCCCAAGCCATGTTCCCCAGTTCAGTGTTGCATGTATAATAGATTTCrCCTCTTCCT 

CCTIXKjCATGGGTGAGACCTGACXIAGTCAGATCGTAGTTGAGGGTGACTTITCCT^ 

TTTATAATmACTCACTCACTCTGAmATGTmGATCAAAmGAACTTCATTTT^ 

TTOGTACTTThrrTrrTTTTTrrrTrrrrriT^ 

TTCAAAAACCAGAAAATCAAATTATTATGCTCTAGCCAAATTATGTAANGGTTTCini'lUUJ 

TGTCACCCAGGCTGGACTGTAGTGGAACAATCACAGTTCAANTGNAGCATCGGCCTCCTGGGCTC 

AAGGGATCTTACKlGCXrrAACCTrCCCGANTAACTTGGAACTATTGGNG;^ 

NCTAATTTmAAmriTrTKAAAAACNGGhriT™ 

GOTTAAANGAACCTTTCCCNCNC 

SEQ ID NO: 2547 GGTACTGAANGGCTGGAGACAACACrrrATCACGTTTTGACACrGACAGGAG 
TGGAACAGTAGACCCACAAGAATTGCANAAGGCCCTGACAACAATGGGATTTAGOTTGAGTCCCC 
ANOCTGTGAATTCAATTGCAAAACGATNCANCACCAATGGAAAGATCACCTTCGACGACT^ 
NCCTGCTGCGTCAAACTGAGGGCrCTTACNGACAGCTITCGAAGACGGGATACTGCrCACCAAGG 
TGT^GTGAATTTCCCATATGATGAT^^^CAT^C7^T0TGTCATGAGTGTTrAAATCAAGANGAAGCT 
GCATGAATGTNATCAACATTCCAACrGGAGCTNTCCTT^GC^^GTCCTC^^^•GCCT^ 
TAuAACITACATCACCACTTTCTCTTAACAGCTOGTOGCAAAGTnATTACTrr^ 
GGCCGNCCGTTCAAAANGGCGAArrCCACCCACTGGGNGGNCXiTTNCTAGGNGGTTTCCAACTCC 
GGNCCCAACCTOGGGGAANCA^TNGNCANTACCT^^mCK^TGGGGAAAATT^OT 
NTTTNNCCAAC^^^T<XAANCCGAAAACTAAAGGGTAAACCTNGGNNC^^ 

SEQ ID NO: 2548 ACTCAAGTTTATAATGT(XCCAAACCrTAAGACTAGAAAATCATCCCAAGAA 
AAAGGCCTATAGTnDGTTTAATTCCACCCTGAGAATACTGTGATAAAAATCAATATATTrCAGAGC 
TAGTAAGTATITAAAAATTAGTGTCTCAAAAAGGGGACATCATAAGGGAAATACAGGGTTTAGAG 
GTCTGAGCrCAAGTGGTGTAAGACAG>nrcrrrCTTCrrCCTCCmAAACrc™ACm 
CACGGAAGATGGGGGACAOTGATCCCGAAGGTITACTAAAATATTGCAGCTTTCAGTANTTATGA 
NGAAGCNCATATATNACT^mAAAAGAAAGCNATCATTTGGAGT^mCTCNGGCCGCNACCACGCT^ 
AA 

SEQ ID NO: 2549 acgcggoagctacgggcatgcagtggacctatgagcagaggaaaatcgtgg 
agttcacctgccacacagcctrcttcgtcagtatcgtggtggtgcagtgggccgacttggtcatct 
gtaagaccaggaggaattcggtcttccaocaggggatoaagaacaagatcngatatrrggccrc 
tttgaanagacagccctggctgctttcctttcctactgccctggaatgggtg^^ 
atcccctcaaacctacctggtggtnrrgtgcctixxcctactctctnc^^ 
aagtcngaaaactcatcatcaggcgacnccctggc^gg^^rggk^gganaangaaacctac^t^^ 

CCCTCCGTNCTGCACTWCNGTNGATCNTCATGGCACACACTCTTGCAT^^ 
TTmXjNTNCTTGGGCNGGlWNCCNCNCTTAGGGGGGAATO 

NGGAATCCNCNTNGGTACCCANCNNTGGCGAANTAATNNGANAANNNTmCCTGNGGNA^ 
TmrCCCCTTrWATTCTTNANCAAANANCANCCNGAATCAAAAANGTAAAACCCN^ 

SEQ ID NO: 2550 ACTOGAmAACTACCTTTGGCTTAATTCCAATCATTGTTAAAGTAAAAACAA 
TTCAAAGAATCACCTAATTAATTTCAGTAAGATCAAGCTCCATCTTATTTGTCAGTGTAGATCAAC 
TCATGnAATrGATAGAATAAAGOCTTGTGATCACTTTCTOAAATTCACAAAGTTAAACGTGA 
GCTCATCAGAAACAATTTCTGTGTCCTGTITITATTCCCTTCAATGCAAAATACATGATGAm 
AAACAAAGCATITGACTTrCrGTCTGTGGAGGTGGAGTAGGTGAAGGCCCNCCTGTAACTGTCCTr 
TTTCTTCCCTTAGCAATGGTGAACTGTCATTACAGAACCTANAGGCTCACAGNCTNCTGGAC^ 
CAGCCTCC^mTGGATCAAGAAATATTAAGGAAAACCATTGTTGGGGG^^^CCGG(^T^GCA>^ 
CTCAAACNAAA>rrGG00ACATNriTGTGGTrnX3KrGCCTNAGGAAm 

NCNGNC'l UCl ^NNAANT^ANGGCCNNAGG^4TNNGCTANT^GA^1 1'l ICCTCnTNCCTCri'l'iGGG 
GGGNGAGGGNTTTACTCNGTTTAAAANTGGAACCTCCN 

SEQ ID NO: 255 1 ACaCGQQAOGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAG 
GCGGAGCTTGAGGAAACCGCAGATAAGTTTTITrCTCTrrGAAAGATAGAGATTAATACAAC^^ 
TAAAAAATATAOTCAATAGOmcrAAGATATTGCTrAGCGrrAAOTTTTrAACGTAATTTTAATA 
GCTTAAGATnTAAGAGAAAATATGAAGACTTAGAAGAGTAGCATGAGGAAGGAAAAGATAAAA 
GGTTTCTAAAACATGACGGAGGTTGAGATGAAGCTTCTrCATGGAGTAAAAAATGTATTTAAAAG 
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AAAATrGAGAGAAAGGACTACAGAGCCCCGAATTAATCCAATTTGAANGGCXJAATG 

SEQ ID NO: 2552 ACGCNGGGAACGCTGGGAAACTCCCGGCCTCCXjCCACCATOTGCTTTCCTTT 
AATCCGGCAGTGACTGTGTGTCAGAACAATCn"GAATCATGAAGCTACTAACCAGAGCX:GGCTCTl* 
TCTCGAGATmATTCCCTCAAAGTTGCCCCCAAAGTTAAAGCCACAGCTGCGCCTGCAGGAGCAC 
CGCCACAACCTCAGQACCTrGAQTTTACCAAGTrACCAAATGGCTTGGTGATrGCTTCTTTGGAAA 
ACTATTCrCCTGTATCAAGAATTGGrrrcTTCATTAAAGCAGGCAGTAGATATGAGGACrTCAGCA 
ATITAGGAACCACCCATTTGCrcCGTCTTACATCCAGTCTQACGACAAAAOGAOCTTC^ 
AAGATAACCCGTGGAATrcAAGCAGrraGTTGGCAAATTAAGTGTGACCCGCAACCAAGGGAAA^ 
CATTGGCTTATACrGNGGAATGCCTNCCGGGGTGATGGTTGATTTNTAATGGGAGTCCTQGTrCAA 
TGNCNXCNCNACANCNAAAATTrrTTCCTTNGGGAANAAACKrACCCTTT>^ 
TTGAAAAAAANTTGGGGCmTAAAAACCNGNAACNl'ri I'l "J i 1"! NAAAATTG 

SEQ ID NO: 2553 ACCCCATGGAGCTGGGCTAAGTAAATAGGAATTGGnTCACGCCTGAGGCAA 
TrAGACACTTTOOAAGATGGCATAACCTGTCTCACCrGGACTTAAGCGTCTGGCTCTAATTCACAA 
GTGCTCITITCTCCTCACTGTATCCAGGrr CCXrrCCCA GAGGAGCCACCAGTTCTCATGGGTGGCA 
CTCAGTCTCTCTTCTCTCCAGCTGACTAAACTTTTTT^^ 

GGGGTTTTAAACTTTTTATTTGCACATTAAAAAAATrGTGCATrCCAATAArrA;^ 

CAAAAAAAAAAATGGCACTCTGATTAAACTGCATTACAGCCCTGAGOACACCTTGGGCCAGCTTG 

GTTTACTTTANATTrCACTGNCGTCCACCCCACTTTTTCXCCCCAC™ 

NGTmrmmTNCCTGCNAGCNAAAATAAOTAANACAAm'GGGNAAA 

TNGT^mN^rrAGT^T^ITNGNGGAAAOGGGGQQG^INAGC^^T^AAACTGG 

AAGNTNANCCGCNTAANAAATNAA 

SEQ ID NO: 2554 CGAGGACAAGGCAGCTGGCAACGTTCCCTTCAAGACACAGAGGAGAAATCC 
AGATCA'rTCTCAGCTTCCX:AGGCAQACCCACTCAGTGATCCTGATCAGATGAACGAGGACAAGCG 
CCATTCACAGGGCACATTCACCAGIGACTACAGCAAGTATCTGGACTCCAGGCGTGCCCAAGATT 
rrGTGCAGTGGTTGATGAATACX;AAOAGGAACAGOAATAACATTGCCAAACGTCACGATGAATTT 
GAGAGACATGCTGAAGGGACCnTTACCAGTGATOTAAGTTCrrTATTTGGAAGGCCAAGCTGCCAA 
GGAATTCATTG<nTGGCTGGTGAAAGGCCCGAGGAAGGCGAGATTTCCCAGAAGAGGTCG^ 
OTTGAAAAACnTrGGCCCGCAGACATGCrGATGGGTCTTTCTCTGATGAGATGAACACCATTOT 
ATAATCrrrCCCGNCAGGGACTT^^ 
TACTTTTTTACirrTCAAGAACATTTTTACACATTAKCT^ 
AANANC^^■GGAATT^AAAAGGGGT^^TNTNGGGa:C^T^GTTN^ 

SEQ ID NO: 2555 GGTACGGATGACGCTTTCrCCAGGATGACCTCTGCCAGTATATCACATCAGAT 
GACCTCACTCAGATGCTOGACAACXn'AGGGCTrAAGTATOAGTGCrATGACCTTTTGTCCACCVVT^ 
GATATATCrGACTGaTTATTGATGGTAATGAAAATGGAGACCTGCTTTGGGATTTm 
ACCTGCAACTTTAATGCCACAGCACCACCTGATCrCAGAGCAGAGCTTGGOAAAOATCrACAAOA 
GCCTGAATTTAGTGCTAAGAAAGAGGGGAAGGTTCTTTTTAATAATACrCTGAGTTCAT^ 
GAGGCATAACnWCAATCACAAAAGTATATTCAAAAATTATATTITGAACAACTCGAATCACT 
TTATrTCrATATTAAAATCACAAACTCATCCATAATGTAGATAANGCACrGTTrGGATATGAGATG 
TAGCAAATTCAATTCATTATTGGACTTNCATTTGGAATCATATGGGAACTGNTGGCTT 
CCTCNTCCAGGTAGAGAGACCACAAGCOGCTCACATACCTrAGCTNNAAANTITGATGACOGATrT 
TTTGGOTmGTAAAAAATTCATCCCTITC 
TGNTTCTTTTTGGGAGNTGCCNTACNA 

SEQ ID NO: 2556 GGTACnTGAAAATTGAAACTGGATCAGTGTOTGACGGGACrGTCAACTGAA 
AAATGAATTTGAGmKKjAAACCAAAACATGAACAGCAGCAGCAGTAGTGAAGCCACT 
GGACTCAGACAGGTATATCACTACAAATCCAATCCGCAGAATCCCAAAAGCCAACTGGATGATTC 
CAGAAAGCACTGTGACTGATGCCGCGCCGNCACCCrCACCCTCrCXn-CATCCAGTAGTGAAGAAT 
NATTCGAGTTGTTAGGCAATCCCAAAGTAGTTGCATTGCGATCTGGGACTGCrmGAAACTGCTC 
CTGAAACTGCrrNOT(XCACCATCATTACTCNNAATC>rGAAACNGGCCCCCGGATAT^ 
AAGTGCCGAAGAAAAAGTAGATTATGGCTGGGAAAAAANGATGCATNCACCCATAGACTGNGGG 
AATGTCGACCAGCAGANCAAATGCTTAACCCTTGTATACCTGCCCOGCGOCNINTTCNAAAGGCGA 
KTCCAACTCACTGNCGNCCGTTCCTANTGGNTCCNATCTTCNGTACCAAACTNTGGNGNTATCATN 
GTCTNTNCTGGTCCTTGNGGGAAATTNGThn'CCCCnTCCAATTCCCWIANANACN 
TAAATGGTAANNCTNGGGGGCCTTANNGG^INa^ACTTNATT^m■GNGGGNC^W 

SEQ ID NO: 2557 GGTACAGrTGATCCCACnrGGAATAAATGCCCAGAAGGTAATAAGCATTAT 
CAGTGAGTGAGAGCTCTAGGCACAAAATAAOTTCTCArrCAGAAAGTGGACAGAGATATGAAOCA 
GTGAAACATATAGCTTTAAAAACTGGAAATCATTCATGACATrTOTTrrCAAAGTA^ 



384 



wo 02/29086 



PCT/USOl/30732 



GCATTTCAAGAACrGTAATTTrCAAAAGTAGAATCAGGCCTGATrAAGTAATATITATGACT TACA 

GATAAAATTNCAAAAATAAAAATGAAAACTCrrCrGCCCTTGAAAGAGATNGAAAACTATAT^ 

mCCCTGTAATGTCACAGAGATrCATCAGTATrrCATCCTTACCGTCGCTTAAAAAATGTTATTTT 

TAATTCGNCTITCCTGm'CTGCCACrmACATAGCAAAACCGTGTrrAATOTGGCOT 

ACTGNAAGOATOTNTTATTTACNGNCATGAATGNNAAAAAACTOAAr^ 

m'ATTTACCCAC>m'AGGAAAAAAAAGATrANGAANCCGGTTmC>OTATCCN^^ 

CTTCCTAhnsrAAANTCCTTNCCCGGCGOGCCGTTNOANGGGCXSAmCONCCN^ 

ANNGNTCGCNNGG^^^CTAANTTGGNTA^^^ATGGCAT^CTT^^^C^^ 

SEQ ID NO: 2558 ACCACCACGCCCAGCTGArTTTGTATITrTAGTAGAGATGGGA'nTCACCATG 
ACGGCCAAGCTGGrrrCAQATrCCTGACCTCAAGTGATCTOCCCACCTCGGCCrCCCAAACTGCTA 
GGATTACAGGAGTTAGCCACCGTGCCCAGCXrrACTGTTGATAlTrTCrrAAGGCCAAAGGGCl'CTT 
NAAACAGCTTGTGGTGAATGCCACCTANTCAGGGATTCACCCrTAAGTGCAGNGGGCANCCCTCT 
CTNCCTXJGGCAAAGTTCANAAATCCCATNCAANGAGCCAAGTCITGGAATTGGAAACCrc 
GTCTGCTTAGTGCTCTACACCCTGTGGCTTANCTGCTATCTAAAAGTGAAGTCAANGTNCCCT^ 
ACTTTTCCCTCTCTTTTCTCACACAANAAGGNAGTTrfACCCCOT 

TGTGCTGANTCT^ANC^^'GANGCCNCATGTCCTAATGTTCCAACCCAAGGCCCCr^ANTGGTAT^^ 
TGGATATTGTACTNGCTTTTAAGGGCCXAAGOTriTANCTATCAAGOTOATGNAATC^ 
TTAGNTTATTTCTTCAAGGAGGNGGTANTCTTNCmCrCAANGATG^ 
AAGGmGAAAGGGGGKTGNNAANCTmrATTANCGTGTrNGTNNNAAGG 

SEQ ID NO: 2559 CGAGGTA Cl ' 141 ' rrri I in ' ri ' l ' t ' l ' I ' lU ' l GGTrTATGCAACTTTATTGAAGAAA 
AATAAATCAATTACTAGCAGAGCTATrAGTTGATCACTrATCCATTGACAACTNGCATCATTTATr 
CAGNGCTATATTAAAC^GTGTATTGGAAGATANATrAACTAATAGCTCCAAGCCTCCrAACAATTT 
AAATGAAAATTACAAAATGTTTGAGACCCTATTTTGGAATACAAAGGGNGTTTGACTTCCA^ 
CATrCTCTGTANAACAAGAACAGGTCATTCCmArrcACATGCATAAAATACATCACATm 
NTCTGCTGANGCTATAAAATCAOTACCAATOrrCCAGCCACATCANTTATGAGCATAAAGCATA 
ATGTGACCTCAAAATCnTTAACAGTGGAANGNTTAAACAAGAATGGATGCTGCnT^AANAATNGNG 
TATCCCITIX3ATGNGGCACCNGANArrCATNTCCCTCCCCTGAAAT0ArrT^ 
ACATGGNGGAATGGGTTTAGCCTNTTAtXAANTTAANGGGANACTTGGCCCCTANACCNACGCCC 

CCAAGNGGGNAACTAATNTTAANAAACCNGGANNAGG 

SEQ ID NO: 2560 ACTATACTCTGTATTCTCACAAATAGAGAAGACTAATATrGCAGACCTGGTG 
ACAGCTCTGATTGTCCTTTTGGTTGTATOCATTGTTAAAGAAATAAATCAGCGCT^ 
CTrCCAGTOCCCATTOlAATCOAATTCATTATGACCGTGATrGCAGCAGG 

GACmAAAAACAGGmAAAGTGGCTGTGGTTGGGGACATGAATCCTGGATTTCAGCCCCCTAT^ 
ACACCTGACOTGOAOACTTTCCAAAACACOTANOAGATTOCTTCGGCATCGCAATGOTTGCATTT 
GCAGTGGCCTTITCAAGrrGCCAGCGTCTATTCCCTCAAATACGATTATCCACTTGATC^ 
NGAGTTAATATCCnTGGGACrGGGTAACATATrCr 

SEQ ID NO: 2561 A C^ ^ ^] ^ ^l ^ rl ^ l ^ l ' ^l ^ l ^ ^^I ^ l ^ l ^ l ^ l • l ^ ^i ^ ^^rn ' n TACC AAACAA ACGA TGAAGTCTCA 
GGAGTAAAAGTTGATACACAAGTAAATTTTATTGGTAATarnTTGTGTGGTCTTTAAGCAGAGGG 
AAAATTAGTCTGCArrATGGTGTATCCAGACTAAATAACTGATATTAAAATGAAATTATCCTTAGG 
ATTTACAATCTTAGAGAAAACTTTrrCATITriTrrGAGTTACAAATTATOT 
AACAGTGAGTCACAGAGGGA^^AAGTAT^mACTCAAGATCTTGCAAGTGmGGT^^GAACCCAA 
TCTrITCACTCTGCAGAACTCAAGAGTCACTCTT^mTGGAAAC^^ 
AATATGGGCTTNCTArrATTNATTCCGGATTAGTCAGAAGTTTGCAAGCAGGCAGAATTCA 
CCAATACNQG Arn 1 l CNTCAGCCCCCGTC C^l ^ n ^ i ^ i ^ l ^ l ^ l I ' l 1 Tn -iTrn riTl ri'NANGAANGGNT 
NACTTGArmTTTTCCCTAAGGGGGGGNANANAANGAlTNCn-AATAAATO 
CAT>mTrTAAAAGCCC>rn'AAAAAAGTrTNTTTTNAANI^^ 
AAATTAAAAAAA 

SEQ ID NO: 2562 GGTA CiTi i rn iTri rni riTi 1 1 1 1 iggagcagttgattccagttcacgagc 

GCTCTCGGTAGCTCAGGAAAOCGACATAGTCTCTAGCACTTAGTCCCTCTCXrrACAATGCAAA^^ 

AAAAAGACrGTGGCTCCAGGACTNTCTGTGGGCGGAATCX)GCACTAAGGAGTTGGTGC^ 

TTGTTGCAAGGNAGGAAGCCAAAAAGCCTGCATGCAACAGACTGGCATGAATAAATGTATGTTTC 

CCCCTCCCTTCTOCAAOAAGGGAGCTGGTAATGOCAGGAGTTCCTCCAGTGCAGACrCTTAAAGA 

GCGTGCAAGAACTAGAGACCCGACAGCGAAACAGGCCTCTTCCCCTCACACACAGTTTCTCTTGCT 

GCAGCTTANAACCCCCGCGTCCTGCCOGOCGGGCGC 
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SEQ ID NO: 2563 GGTACGCGGGAACANAATCCCAGGATGTTCCATCTCTTATATAAATGTA'nTG 
CATATAACCTATOCACATOTXXTGTATACmAOAAGTCTACATGTTTGGTACCACGATGACCCKj 
CTAl 1 1 1 1 U 1 Icri-ICATAGATATGGGGGTCITGCTATGTTGCCCAAGCTGGTCTTAAACTCCrGG 
CCrCAAGCAATCCTTOXlCCITGGCCCCCCAAAGTGCTGGGATTGTGGGCATGANCTOC^^ 
AGCCTCCATGTTTrAATATCAACTCrCACTCCTGAATTCAGrrGCTN^ 

TCrcATGCAGAAATTATTGNGGCTCrmAGGGTAAGAAGTrCGTGTCTITGTCTGGCCACAT^^ 
OACTAGGTATNGTCTACrrCTGAAACCTTTAATGGCCrrCCCTCTTrCA 

CTTGCAATGGGCAGCTATCCAGTTGACTGNTNTGAKTAAAGNGTGTTCATTAATGTOTATTT^^^ 
TCTGAAACAAGAGTGATATNCTTCCAAGACNTAAAATNATNNCTAAAGTGCTGGNNGNCAAAAGA 
CAGAAGCGGANTITNAAAATNGGTCTTGGATAmXjCAGGANAATrrGGCTATGNACCTNG^ 
ANCAANTTTTCCT^WGANANTTGGAGNGGGTACCITm'CTTTNAA^ 

SEQ ID NO: 2564 GGTACTTOCAACTCTQGGTTGGCCCCAAATCCAACTAATGCCACCACCAAGG 
CGGCTGGTGGTGCCCTGCAGTCAACAGCCAGTCTCrrCGTGGTCTCACrCTCrCT^ 
CTCTTAAGAGACTCAGGCCAAGAAACGTCTICrAAATTrcCCCATCT^ 

GCGTCTGGAAGTCCAATGTGGCAAGGAAAAACAGGTCTrCATTGAATCrACTAATTCCACACC^ 

TATTGACACAGAAAATGTrcAGAATCCCAAArrTGATTGATrrGAAGAACATGTGAGAGGTTTGA 

CTAGATGATGGATGCCAATATrAAATCTGCTGGAOTTrCATGT 

SEQ ID NO: 2565 ACAGAAAGGGTATGTTAAGTAGTTCAGCCAGCAGCTCACCACAGGGATTAAC 
GGCATCTGCCAGAAGGACATCAAATTTTGACTCTTGTAGTTTTCTCATAAGTTTOT 
GCATCTTCACAGAGCTTTATATTATAGTCAGAATATTCCCAACACAATTCTTGTAGTTGTGAAA^ 
TATGACCAAAATOTATTTTTTGAAATACTATATGTCCATCrATCGAACATTTTCATAAAAAAATOT 
CCAAATCATTTTTAGrrAAAGATGTAGGATAAACITCTAATTTAATAGCAGATGATTTACTGGC^ 
TGACAAGAATAGAAGCCGAAGATGTCAACACAATCACCTCATGACrCCTCTGAACAAGCTCTTCC 
AGGATTGNCTTCATATTTATCCAATCGCTGNATTCrGTGGGCCACACCAGCACCTTTNCACA^ 
CCAAAGCTAAAGTACAACTGAGCTGCATCACAOAAAOACTGACATCCTITCAAANACATCCTGGT 
CNTATGCCAAGCl'l CllTl CCAGTimGGCCCCNGTACCTNGGCrGGGACCCCCTTANGGNGAATT 
CAACCACTGGCGGOTGTCTNATGGACCCGNCTCGGACCAAOTGGGGGAAAATGGCATACTTGTT 
CTTGGGNAATGOTTCCNTCCAATNrCC 

SEQ ID NO: 2566 ACCGCGGGCTrGCCACACCTrGAAGTGATACTGGCGGGGAGCTCTTCCCTGC 
TCCTGGG AAGAC TCCCCCAAAGCACATCACAGCCCmOCATGTCTrAGQCGCTTTCCACCTTCrC 
TGTGTCTGTTrrCTTTCCX;CCTATATGCCCrrCCCTTACCCACTTCCT 

COTAATAGACCTCAATGGATTAGAATCATAATCTCCAATTCrrTGTCAAGAGTAAGGTGGGCAAG 
AGAGGCCTTAAATAATCCATmGGTGTTAAAATGTTTCTnTTGGCIXjCC 

CCTGTAATCCCAGCACTCrGGGAGGCCCGTAGAGGGTGGNTCACCTGACCGTCAAAGAGTrTGAG 

ACCAGCCTGGTCAACATCGTGAAACCTCGTCrCTACTAAAAAATATAAAAATAACCAGGTGTGGN 

NGTGCACACCTGNTAGTTNCAGCTACTTAGGAAGGCAGGAGAATCGClTrGACCTO 

NGTrcATTGAOCAAAhrrTNGCCCTGAATTCAAACTATTCTACAGGGTTGAAGAATAA^^ 

GA^^GGNTNCT^WGCCCX}OAACCCCCTNAGGGCAATTCNAACCAT^n^GGGCGTT^ 

ACCItiGGACXZAAhrrTGGNGAATATGGCNANTTNTr 

SEQ ED NO: 2567 ACCTGGCCATCnTGGGCAGTGTGAajTTTCTGGCTGGCAATCGGATGCTGGCC 
CAQCAGGCAGTCAAGAGAACAGCACATTAGTTCCAQAAGAAAGATGGAAATTCTGAAAACTGAA 
TGTCAAGAAAAGGAGTCAAGAACAATTCACAGTATGAGAAGAAAAATGGAAAAAAAAACTTTAT 
rrAAAAAAGAAAAAAGTCCAGATTGTAGrrATACrriTGOTGTTTTTCAGTTTCCCC^ 
GCAGATACCTGGTGAG CrCAG ATAGTCTCnTrCTCn'GACACrGTGTAAGAAGCTGTGAATATTC^ 
AACTTACCCAGATGTTGCTrrrGAAAAGTTGAAATGTGTAATrGTTTrGGAA^ 
AACNNAAAAAANGNCm^f^^W^™AAAGTCCTCNGNCX3TO 

SEQ ID NO: 2568 ACTATAGAGACTCAGTrGCAAAAATTAACAAATATGCTCCTTGATTAAAATG 
GGTAGGCmrrCATGTGGCTCATTCITrAATCTATTCTC^^ 

GCCTATGGATCATACTTCAAACTCTTGGTGTGATCCTCCTGATTGTCACAATATTAGTTACCCT 

GTGCTGTATTCTCTAAAACCTTTAAATGTTTGCATGCAGCCATTCGTCAAATGTCAAATATTCTCT 

TTTGGCTGGAATGACAAAAA(nx:AAATAAATGTATGATTAGGAGGACATCATAACCTATGAATGA 

TGGAAGTCCAAAATGATGGTAACnxSACAGTAGTGTTAATGCCTTATGTTTAGTCAAACTCTCAm 

ANGTGACAGCCTGGTGACTCCAGAATGGAGCCAGTCATGCTAAATGCCATATACrCCACTGAAAC 

ATGANGAAGCAGGTAGATCCCAGAACAGACCAAAATTTTCTAAAAACATGANAGTCAAGCTGTCT 

GAGTCAACACAGTAAGAAATCCTTCTGG^T^ACT^^'AAAAAAAAGAATTTGAAG^^ 

TAACCATCAOTTTTTTAAATCAATTAiM'l'rilCrriGGTCCGGGATCCCTTTANAAAN 



386" 



wo 02/29086 



PCT/USO 1/30732 



GTTGTTTGCCCAAGGACTTCNTTT 

SEQ ID NO: 2569 ACGCCCTGCTGCITCIXXrrGATCTGCTnAACGTTGGAAGTGGAm^ 

CAGGTCTTAAGCACAAGAAATGAAAATAAGCTGCTTCCTAAACATCCTCATTTAGTOCGGCAAAA 

GCGCGCCTGGATCACCGCCCCCGTGGCTCTTCGGGAGGGAGAGGATCTGTCCAAGAAGAATCC^ 

TTGCCAAGATACATTCTGATCnTGCAGAAGAAAGAGGACTCAAAATTACrrACAAATACACrGGA 

AAAGGGATTACAGAaCCACCrmGGTATATTraTCTITAACAAAGATACTGGAGAACTG 

ACX:AGCATrCTTGATCGAGAAGAAACACCATTITrTCTGCTAACAGGTTACGCm 

GGAAACAATGTAGAGAAACCCTTAGAGCTACGCATrAAOGOTCTTGATATCAATOACAACCAACC 

AGTGTTCACACAGGATGTCTrrGrrGGGTCTGTrGAAGAGTTAGTGCACACATACTCTTGNGATGA 

AATCAATGCACAGATGCNGATGANCCCATACCTGATTCNAAAATTCCTATAAATCGTNTTNTGGNC 

CCNKmCCTCAGNOTTTCCTAATAAGACCGGANNTTTTTC^ 

CACNCTCCTTTGCCGTAAA 

SEQ ID NO: 2570 ggtactcx;cagcaaatcctctgaatactccacagactatgttaccx:aotccca 
aogctattaactccnxlatroccatcaagtggataatcgtarrtgagggaatagacgctggcaact 
aaaaggcx:actgcaaatgcaaccattgcgatgccgaagcaatctcctacggtgttttggaaagtc 
tccacgtcaggtgtaataggogoctgaaatccaggattcatgtccccaaccacagccactttaaa 

CCTGTTmAAAGTCACAGCOjTAGGATACACCrrGCTGCAATCAaKjTCATAATGAArrCGATTGG 
AATGGGCACTGGAAGTTTGTCTrrGAAGCGCTGATTTAmcmAACAATCGATACAACC/^^ 
GACAATCAGAGCTGTCACCAGGTCTGCAATArrAGTCrrCTCTATTTGTGAGAATACAGAGTATAG 
TACCTGCCCGGGCNGGCCGC 

SEQ ID NO: 257 1 GGTACTTTTTITITITITITITr^^ 

ArrGNGrmCCACATAGATAAAAAAATAAGGCTTTITGATGAAAAGAATCCATTACAAAGTCAA 

AAATCCATTACAATTATAATTOAATCAGTAACAAAATTTAGCTTTAAATGAGTCAAGTATT^ 

TTTGAAATTTAATATCACAAACArrCAAGArrAGTGAATmGGTAAGAAAAAAATACTAGAAGA 

AAGGAAAAGGACACCTTITCAACAGATAGTAAmATAAAAA'li'ril'riAAAAGTGCTTTGGGA 

AC^CACAGTATCATTACTrAAGAAAAGTCAmAAGGAAGACrTAAGTGCITCAAGTOGQAGTGT 

ATTCAGACTAAAAAATGTTTTAAAATTTGCCAAGAAATTTAAGTGTTAAAAANCT 

TCAGTTCATGTTTAANGGAACATITGACAACAAGTAACCAACCGCCAAAAAAAAGTCCCNGCTTT 

TNAACTAATAAATCTGGACTGGAAAACTCrTOGTTQNACCTGCCCGGCGGCCCTCGAAAGGGAAT 

TCCACCCCTGGGGCCGTCnrmGGACCCGCTCGGGCCCACCTGGNGAANAAGGGAAAACG'nTCr 

TGGGNAATGTTTCCCNCCAATNCCCCAN 

SEQ ID NO: 2572 acggagagggtcaccaaocgtggatcgttggcattgtggaaaagggaaacc 

GAAOKjCCCGGATCATTGACAAGCXXJCGAGTTATTGAAGTCXJKK^CTCGTGGGGCCACAGCTG 
GTTCTTCCTCCrGACAGrrcAAATGCCTCCTCTGAGOT 

GTTTGGACCITAGAGCa^TTGTCCACAATCACGGATGGTTCTCAAGAGrrGATTGTAAGAAAm 

CAAAGAAGGCTGCCTGCATAGTGGTTCCGGCTGCCCTTTCTAGGTGATTGGAATCAGCCCATC^ 

AGCAGTCTTTATATGCATTOCGAGGCCAGAGTAACATTTTGAACTTTGGGGGGATATTTGTT 

ACTTGGGTAGAAGAAGAGCAAAAATACCTCTGTTTTCTCnTGCCAAAGTAAGATGAAGCT^ 

NGTTGANGGATTTTTCTITTGCCCGGGGTTGATTAAmCTrCACAGGGAGTGAO^ 

ACNCNCCCCCCAAGTAAATTGCCAAAAAAAAAAAAAAAAAAAAAAGTCClTGGCCGGACNCCCr 

ANGGCAATTCCACCNCTGGCGGCCGTITraGACCAACTCGGNCCACTTGGNGAANATNGGANAC 

TTTINCTGNGAAArnTCCCTCAAATC 

SEQ ID NO: 2573 ACCCGAnTAAGTAGTGACATTGATACTAGTAAnTTGATGACTrGGAAGAA 
GATAAAGGAGAGGAAGAAACATTCCCTATTCXn'AAAGCnTTCGTTGGCAATCAACrACCT^ 
GGATTTACATATTATAGCAATCGTAGATACTTATCTTCAGCAAATCCTAATGATAACAGAACTAGC 
TCCAATGCAGATAAAAGCTTGCAGGAAAGTTTGCAAAAAACAATCTATAAGCTGGAAGAACAGCT 
OCATAATGAAATGCAOTTAAAAOATGAAATGGAGCAGAAGTGCAGAACCTCAAACATAAAACTA 
GACAAGATAATGAAAGAAITGGATGAAGAGGGAAATCAAAGAAGAAATCTAGAATCTACAGTAT 
CTCAGATTGAGAAGGAOAAAATGITGCTACAGCATAOAATTAATGAGTACC 

SEQ ID NO: 2574 ACATCAAOTCCATCTOACAAAATGGGGCAGAAGAGAAAGGACTCAGTGTGT 
GATCCGGTITCTTITrGCTCGCCCCTGTlTITTGTAGAATCTCTrC^ 

TTATTCCCGACGACACATATACATATGAGAATATACCTTATTrATITrT GTGTA GGTGTCrGCCTTC 
ACAAATGTCATTGTCTACTCXTAGAAGAACCAAATACCTCAATTmGTTI^^ 
TCAAACAATOTCAGTGTCATAAAGTCAAAGAAAGTATTTGGAAATAATCCAGATTAAAGAAAACT 
GAATTAATACAACAGCCAAATGTAGTACC 
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SEQ ID NO: 2575 ACTAAAAAAACK}AGAAATTATAATAAATTAGCCGTCTTGCGGCCCCTAGGCC 
TAAACnrrCTGGTATCnTAGTGTCTCAOTATCTrAGTGTCCTTCACTCGGA 
rrCArrAACCCTCCATTCCTGTTAGATTCAGTCAGGTCTTAGCAATTT^ 
CCTTCnXTGACTCTTGTCXnTrCCACTTCTCrATTCCCCAmCCTC^^ 
CCCAAACCTTCTCAGTGCCCACATAACTrcGTAAACCACTCAAATCAAGGCCT 
GGAGGGAAAGGGCTATAGTGGGGTCTGAGGGAATGTTGACGGGCAGTTTCACACAGATAAATCTC 
TGAACCAGCCGGGCGCGGTGGTCACGCCAGTGATCCCAACGCTTTGGGAAGCCCGAGATGGGCGG 
ATCACGANGTCAGOGGATTGAGAACATCCTGGCAACGTGGTGAAACmCATITCTGCTAAGGGAC 
CTCOGCCOCGACCACNCTAAGGGC 

SEQ ID NO: 2576 ACCCACTGTATrrATTTATGTGCAACAAGAAGTGTCAGCAACTGCACAAACTC 
CTCCCTGTTCAGCTAGTAGGCAGCAATTCTCTTATCrGGCATCCCATAGCTGGGTTAAATTA^ 
AGOGCGTGAGAACAGGTOAGTCTAOAGGTCTAACTCTAACAGGGACCACCGTGCArrTGAATAAA 
CAGTIXiTArrAGGGATGTTGTrAAAGTTAGCCACTGGGCAGAATAAAGGATCCCTTAGGTCAAAT 
ATTTGGGTTTGACATGATGACATGACACATCCAACACIXrrGTCAAATTACCTGCACAAG 
TTGTGAAATTCTAGTTACAGCATTATCnTA(XAAATTAAAGAGG<>GGCATGAGCAGGGAAA^ 
GGAGTGGGATAAGAGCCTCGTCACGATGAGGCTTGGGACATCTTGGGAAAAACTGCCACA GCATG 
AAGTCATCAACTTCTTGTCCTGGTTGGNAATTTGAGTGNCTm 

GGTTAGCTCTTGAATGGGCCACAT^^X:AAGCATTGAAGGGTTT TCCTT GNAAATTACAATGG 

GCCCCXJTCCACCITATAGGTTTNAGOAAAAAACANOGNriTrCNT^^ 

GATTGAAGGNCTTAAAAATCCCNCCCCCTTCGGGNGGT 

SEQ ID NO: 2577 CGAGGTA CiTnTnTrrrn rn - i -i T i invi n G<KiriTiTiTvnTnTirn'i 

TTTTTTCAAAAAACTGNNGGCTITriTNTTCAGTTAAN 

CCCATTTAACrrGCCCAAAGTrCACATTCCACrn^ATTCATCCATCCCACAC^ 
AANAAATAACCAANATTGCCAACAGGClllTlCM-nrriACriCriCrrANCAATAACAGTCAATA 
AACTGNTrGCAAAAAAGTTACTAATTTAAATTTTCACTGNGTTCAAANAACCTCATCCCT^ 
AGCTCCCGAATATGCCAATANATCCTGCCNCTNCTGACAGGAGGGCAAANT 

SEQ ID NO: 2578 A Cl ITl l I ' l - l rrrrni lTl I ' l ITI 1 - ll l CAGNGCCTTCCrCANACTGCTGTGGA 
TCTTCACTTGTAGCACAAATTGTCACAAAAGTAAAAAAGCTAGGTAAAATTTGAAATGCTTTAOT 
AATirAATTAATrrACTTTGCTTAATTmACATAATTGGCTTACCACrrCT 
ATATGGQTGTTCTATCCAAATTCCATAATGTTGGTAATCATTTCCACAATGATATATAAAATGTCA 
TCCAGCTTrACTGGGGCAGTATTCCTATAAATTTCAGCAAGTTGGCAATAAAAATAACAGCTCr^ 
NAATAACCATTAATGCCATACTTGCTrrGGTTTCATTGATATATTACTGGGCrTAAT TATCAG TTAG 
CAAAAAATCGGCCTAKTrAGCAAGCAGATTTCrrmAAAATTAATrCAATCT 
ANAATTAATAAAGCCNAGNATGGGGATAAAATGATTmriTATrAACAGCANTCNTTTGAAACC^ 
NTTTAAAO(>rCCGGTCCAAAACATTNGATGOACCTrGGCOGAACNCCTANGGNNATTCNACCACT 
GGNGGCGTCTTmGGACCrACCCGGNCCACCTGGGGAANNNGGAAAATGTTCCTGGGNAAAGGTT 
CXTNCCA 

SEQ ED NO: 2579 GGTACGCGGGATGTTirrTCTGATTCCATCCTGTGTCCCCrrCATCCTTGACTC 
CTTrGGTATTTCACTGAATITCAAACATTrGTCAGAGAAGAAAAAAAGTGAGGA^ 
TAAATAAATAAAAOAACAGCCTnTCCCTrAGTATTAACAGAAATGTTTCTGTGTCATTAAC^ 
TTTAATCAATGTGACATGTTCCTCTTrGGCTGAAATTCTTCAACTTC^ 

GAAGGTGTTCAAACACAACCTACTCTGCAAACCTTGGTAAAGGAACCAGTCAGCTGGCCAGATTT 
CCTCACTACCTGCCATGCATACATGCTGCGCATGTTTTCTrCATTCOTATGTTAAGTAAAGGTTO 
TTATTATATAnTAACATGTGGAAGAAAACAAGACATGAAAAGAGTGGTGACAAATCAAGAATAA 
ACACTOGGTOTAGCCCAAAAAAAAAAAAAAAAAAAAGTCCTGCXrCOGCNGGCGCTCGAAA 

SEQ ID NO: 2580 ACTATGTCGATTCGACAGAACATTCAGAAGArrCTCGGCCTrGCCCCTTCACG 
AGCCGCCACCAAGCAGGCAGGTGGATTTCrrGGCCCACCACCTCCTTCnXKjGAAGT^ 
TCAAGAACTCTTTATTITCTATCATTCTTTCTAGACACACACACATCAGACTGGCAACTGT m 
GCAAGAGCCATAGGTAGCCTTACTACITGGGCCTCnTrCTAGTT TrGAA TTATT^ i G 

GGTATGATTAGAGTGAAAATGGCAGCCAGCAAACTTCATAGTGCTrTTGGTCCTAGATGA'irrirA 
TCAAATAAGTGGArrGATTAGTTAAGTrCAGGTAATGTTTATGTAATGAAAAACAAATAGCATCCT 
TCrTGTTTCATTTACATAAAGTATmCTGTGGGACCGACTCTCAAGNACTGTGTATGCCCT 
TTGGCTGTCTATGAGCATTTANAGATrAGAAAAAAAArrAAhnrrGGTmACCCTrOG^ 
ITGGTGGTGGTirriTrrTCAAGCCCAATCCTTGACrrAGACAATAAGAGGCCAATITrAOT 
TrrCCTTGGCCGGACCCCCTANGGNGAATCCACCACTOGNGGCCGTCrATGGTCCANCCTGGNCA 
ACTGGGTATATGGGT 
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SEQ ID NO: 258 1 acatgcttttattctttccataggacatatttccaaataatgatcacagtatt 

ATQGTTAATAAGATGCTGGACAGATGCGTAGCCAAAGAATATTACATAACAACAAAATCTCATCA 

ATGATATATCAAATGCAAATTTACTAAGACATCAAGAGAATATCTGTTTGTAGTTACATGGCTAAA 

AACAAGACATTATTACAGTAATCAAGTCCATAAAACTAOAAAAAAAGCACTGTATAACAAAAAG 

GCCrmAGAAGCTATTAAATGAACAAACATTATAGATAAACCAATTITATATCTGAAGCAGCrTA 

TAGCACCAACACGTTGGCAGGACCAGCAGAGGGGTGGGGTCTTTOGACCAAGGCATCTGGGAACA 

GGAAGGGCrCTCAGCCATCATCTACTGNAGCCACTTGTTITACAAATGGAGAAATANAGGCCCNA 

AAAGATGAAACGCCrrGTCCAAGGCCCACAACACAAGACAGACTGGGAAATGATCCAGGGCTN^ 

GATCCAGGCCCTGAC^nt;AGGGAATGCTT^mTACATACAGATATAAAATCT^^^^GGATTGATG^ 

rrmCCCATACAGGCTTCCNTAAAAAAACCTTAANCCCTATGGNmGGCCGGACCCCCTANGGG 

AATCANCACTGGNGGCGTCmmGGTCNO 

SEQ ID NO: 2582 acqcgggggtggttggtgtgcgggtttcggttggaggactcgttggggaggt 
ggcxrrgcgcttgtagagactxjcatccccgagacgatggcxjgagggagataatcgcagcaccaacc 
tgctggctgcagagactgcaaotctgoaagaacagctocaaggatggggagaagtgatgctgat 
ggcrgataaag7cctccxjatgggaaagagccixk3tttccacctgccatcatg ggtc 

GGTGTTTCTGATTATCTACTATCTAGATCCATCTGTTCrGTCCGGCGTTTCCTGTTT^ 
TGTGCTTGGCTGACrACCTTGTTCrCATTCTAGCGCCTAGAATTm 

TGAACAACAGCAAAGATTCATGAAATTTGCAGCAATCTA>rrAAAAACTCGACGCAANCTTGTGG^ 

TTGGTGGAAACCCTCTTCACCrrAANGAAGAAAAACCTAAAATGTCCTTiGGNCGCAACACNCT 

NGGCGAAATTCAACACACTGCGGCCGTCTAATGGATCCACTCGGANCCACCTTGCGNAATATGGC 

ATACTTGTTCTTTGGNAATTGTbTOCCmCAATTCCCCAAATACANNCCGANOT 

GGNGCCAATAAGAACTNA 

SEQ ID NO: 2583 ACAATGTAGAACTCTGTCCAACACTAATTTATTTrGTCrTGAGTTrrACTTC^ 
GATGAGACTATGGATCCCGCATGOCTGAATTCACTAAAGCCAAGGQTXjraTAAGCCACGCTGCOT 
TCCOAGACTTCCATTCCTTTCTGATTGGCACACGTGCAGCTCATGACAATCTGTAGGATAACAATC 
AGTGTGGATirCCACrCTTrrCAGTCCTTCATGTrAAAGATITAGACACCACATACAACT 
GGACGTTTrCTTGAGAGmTAACTATATQTAAAC^TTGTATAATGATATXSGAATAAAATGCACAT 
TTTAGAAAAAAAAAANNNAANNAAAAAAAAGGAAGTTCC 

SEQ ID NO: 2584 CGAGGTACAGTGAGGGTGTTCAGAGGGAGGCACAAAGAATAGCTCTGAGAT 
TAGOCAATOGAAATGACAAAAAAGAGATGAATAAATCCGATTTGAATACCAACAATTTGCTCTTC 
AAACCTCCTGTAGAGAGCCATATACAAAAGAATAAGAAAATTCTTAAATCTGCAAAAGATrTGCC 
TCCTGATGCACTTATCATTQAATACAOAGGGAAGTrTATGCTOAGAOAACAGTrTGAAGCA^ 
GGTATITCTTTAAAAGACCATACCCTTTTGTGTrATTCTACTCTAAAT^ 
TGTTGATGCAAGGACTmGGGAATGAGGCTCGATTCATCAGGCGOTCTTGTACACCCA^^ 
GGTGAGOCATGAAATTCAAGATGGAACCATACATCmATATTTATTCrATACACAGTATTCCAAA 
GGGGAACTGAAATTACTATTGOCmGATTTIX}ACTATGGAAATTGTAAGTACCTGCCCGGCNGGC 
GTCGAAANGGCGAATTrcAQCNACTGOCNGGCOTACTATGGATCCGACXrrCGGACCCAACCTGGC 
GTAATCATGGGCATACATAAANGTAAANCNGGGGGCCAATAANGGCNACTNCATTAATGGGTG 

SEQ ID NO: 2585 ACCTGAGCTAAATGACTGAAGCnTAGGGGTGCATAGAAACCACCATAA'nT 
OTATOACATrrraAAQTOAATTAAATATTriTGAACATGCTrCTrCGACAGCCAGTGTrATArrm 
CAGATCAACACAAAGCACAATGATTACTCGAAATTCAGTATTTTCAAATITACATATTTAAAGTCA 
TGCAAGCTGTAACrnXCTGTCAAAATTACTGGCTGCCAAAmATACCTOTTTOT 

TTtnTrTrrrrrrTTTTTriTnAGGGCAGA^ 

GAGACCCTCTTGCGAGAGGTGAGATGAGGCCCTGCCATGCAAAGGAGTCCCAGCAGAGOAGOAA 

GAATTCCATCCTGGAGTTCAAGTTrCTGTGCANANACAGGACCTGGGGACAGANAACGGCCT^^ 

CCAATTTCAACTGGCTGNCCTCATNACTTTNGGCTGACCTGGGGTGATAAAAGGNGGAGC^ 

GGTGGGGACGOACTTTGGCCAACTrQChriTGCATTITGNCTCATGACAAAAAAAG 

AACCCACGCAATAACCXrCACaX3AGNTGGCCANOTn-AAAAGACGGTTT^^ 

CAANAAGAACCTC 

SE Q ID N O: 2586 GGTAC1^1 i ri-ri-l'rrrrt' ri'rr riMl lM-i-l-r TCTAC AGTTNGGAC TGAA TTCTAA 
lUU l rC'l"l AGCrACATGTCnTCAAAATAATGTnTCAA'l"rri"ri'l'CC"riC"l"l"l"l"l i"l lCTCCAri i riC 
CAAATITGGAGTCACTGGAAACTAANATGTOCnTrcATAAAGCCCrGTGAAATGAAGCT 
CTTGAGCrrCANAANAAAATAGCAGCGACCTATTTACATACATAAGCCACTmrATACCTGCCT^ 
TGATGTATGGACTTCANAGTAATGTGGCATATATCTATmCCAGAATTGTTCTmG 
GGCCrCCX:CCrATTrrCTCTTCACAGGACATGAGACTTCACAACCTrCTAAAANGGAG 
ATAACTCAGGACCTATCTATCTANGAATAAACCATCCTACCNTGAGAGATCAAACNAAACCTGNG 
GGCAGAAACCCA*ln■i'lN^l■^^^AAAATOCGTTCTrCCACAAAATTmAAAAAAAAANGGGGGGAA 
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TGGGAAGTNTTTrGGGCCCTCAATCTrAACTAANGQAAACmAGGTCCrGCCNGGGNGGCG^ 
AAAGONGAATTCXIACCCCTGOGOOCOGTTrATNaOACCACCCGGACCAACTNGGGGAAhfNTGGG 
ANAATGGTTCCTGGGAAATGTNCCCC 

SEQ ID NO: 25 87 GGTACTGATACATGCTATAACAGAGATGAACTTCGAAAACATGCTAAGTGAA 
AGAAGCCAAATCCAAAAACAATAAAAACACATATTGTATCCTCACCCTmTGCATTTTAGTGAGC 
AATCATTGCATATGAATGTn'ATGGGAAAAATCAATGTGTGCTAAATCATTGTATTCCAGTAAATA 
GATTGGACTTAAAAOTGATACAGAAGTTGCAAATAAOTGGGArTGAGTrrcATTATTATATAGAA 
AATAATTACATGATTCATn'AAGAATAATAATATCCACCAmATTGAGCACTrACTATOAGCCTG 
TGTGCCAAACATTTCATGCATTTCTCATTTAATTCTCACAATAATCCTGTGAGG^ 
GTTGAATCATATGAACTTGCCAATATATGATAArrrCTAAAGAAGTTGGGAATmTGAGGA 
AATGGACCTGCCNGGCGGCCGCTCAAAA 



SEQ ID NO: 2588 GGTACnTACTGAAG'rri"l'l"l"ri"l"i"ri l"rri'GAAACCAAGTCTCGCTCTGTCGCC 
AOGCTGGAGTOTAGTGGCATGATCTCACCACACTGCAACCTCrGCCTCCCGGGTTCAAACGATTCT 
CCTGCCrCAGCCTCTCAAGTAGCTGGGACTACAGGCATGTGCCAACACACCCAGCT 
CnTrCAGTAGAGATGGGGTTTCACTATOTTGGTCACTATGGTCITOATCTCITGAT^ 
CCCACCTTGGGTTCCCAAAATCITGGGATTAC^ 

CTTTTAGTGGTGOT'CTTCTCTCTTTTGACnTAAGOATGTTGCCCTrAAW 

ACTGNGATACACTACTTGAGAGATGGATTGNTGCTCnTTCTrCrACAGTCTTTACAAG^ 

TAT^AAGACAGAAGAAGTTACCATTGCATTAATGGTTGGAAGCTGACAGTCTT^^"AAAm 

CAACTGGTn-ANGNAOANONCCOAAAGACTTTCACCAKTTCATTCTNT^ 

ANAACANNNGNGTGGANAATTITNGGTGGG^^■G^r^ITCTAAATGNATAAAGa 

NNGGGGGGGG 

SEQ ID NO: 2589 ACTTGAACrGGTAGGAAATGCATCAAAAGACTTAAAGGTAAAGCGTATTACX; 
CCTCGTCACTTGCAACrrGCTATTCGTGGAGATGAAGAATTGGATTCTCTCATCAAGGCTACAAT^ 
GCTGGTGGTGGTGTCATTCCACACATCX:ACAAATCTCTGATrGGGAAGAAAGGACAACAGAAGAC 
TGTCTAAAGGATGCCn-GGATTCCTTGTTATCTCAGGACTCTAAATACTCTAACAGCTOTCCAGTGT^ 
GGTGATTCCAGTGGACTGTATCrCTCTGAAAAACACAATmCCCTTTTO 
AGTTGOAAGTTTAATTAAGCTTTCCAACCAACCAAATTTCTGCATTCGAGTCTTAACCATATT^ 
AGTGTTACTGNGGCrrCAAAGAAACTATTGATTCTGAAATAATGGGTTTTGATTGAGTTGACTC 
TTTNAAAAACTGGTTGGANTITNATTONGATGCNAAAAGTrATAGTACCAACANr™ 
CTTNGGCGCGACCACNCTAAGGrcAAATCCNCACACTGGCGGCCGTCrrNTGGATCCA 
CAACCTGGGGAACATGGCAAACTGGTrCCTCGGAATGGTTTCCGTTCAATTCOCCCANAT^ 
AACCTAAAATNAACCNGGGC 

SEQ ID NO: 2590 GGTACGCGGGGAACTCGGTGGTGGCCACTGCGCAGACCAGACTTCGCTCGTA 
CTGACTGCTCVLAGAGAAACATACCCCATTGTTTTTCATTrGTAAGCOTTCTGTOATCTrCTACAAT^ 
GGTCACGTCCTCTTCATTTTCCATCTTGAAAGAGAGAGCCAACAAGGACTTTAriTCArrCT^ 
AGGTAAACCTCCTTGCCCACTGGCTGTATCTATACTITCCTTGAGAAAAATCCCATAAAGTGGA 
GACCTOTGAAGAAAATOTATOCTTATGGCCTAGCCrrCATGTCTGGCTGATGTATCCTATAAGGCA 
AGTAAGCCCCTTrrCTAGTCncrGGTAAGATGCAAGAGCrCATATCCCCATCACrrGACAT^ 
TGGAAATAATATTCAGACTCTGCTATGACCAACCCCTOATGTTGGTTTTTCTITTCAAACT^ 
TATGAGTAGAGGAAAAGCCTAAAAGTTAAGTATTTATGTCTGGGGGGATA(XT1CANGTGGCITA 
TCTGGmATGCCAANAAAnATGTGGTCATCrnATTCAGTGCAAAAATITTI^ 
TAATOGAAGGGAACATTAAANAACri>riTCCTCCCAANAAAACCCCTAAAATAAATTCCT 
GGTTTCCTTGGCCTTAAAATACCCTTAAG 

SEQ ID NO: 259 i A CTlVrnTi ' l I ' l 1 1 rn l - I II GGGTGAGGOOACCCTACTCTOTTATCCCAAGT 
GCTCTTATTCTGGTGAGAAGAACCTTACn'CCATAATTTGGGAAGGAATGGAAGATGGAC^ 
(XGACACCACCAGACACTAGGATGGGATGGATGGTTrrTTGGGGGATGGGrrAGGGGAAATAAGG 
CTrGCTGTrrGTTCrCCTOOGGCGCTCCCTCCAACTTTTGCAGArrOT 

GGGATTGTCCAATTACTAAAATGTAAATAATCACGTArrGTGGGGAGGGGAGTTCCAAGTGTGCC 

CTCCTCTCTTCTCCTGCCTGGATTATTTAAAAAGCCATGTGTGOAAACCCACTATTT^^ 

AATAAGAATCCGAANNNNAAANNNNNNhn^NNNNNN^^ 

CCTAAGGGCNAATTCCANCCCCTGGCGGCCNTACTATTNGATNCAACCCNGTCCAAC^ 

ATCATGOOCCNACCNNTTCCNOOGGNAAATGTTNCCGGTAAAATCCCNCNANTN^AACCG GAAC 

(>TAANNNTAAAACCCGGGGGCCTANGNGGNNNNAANTTNAATm 

CCAAAGGNAAC 
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SEQ ID NO: 2592 ggtacagtggaaggtggataaogccctccaatcgggtaactcccaggagagt 

GTCACAGAGCAGGACAGCAAGGACAGCACCrACAGCCTCAGCAGCACCCTGACGCTGAGCAAAG 

CAGACTACGAGAAACACAAAGTCTACGCCrGCGAAGTCAOCCATCAGGGCCTGAGCTCGCCCGTC 

ACAAAGAGCTTCAACAGGGGAGAGTGTrAGAGGGAGAAGTGCCCCCACCTGCTCCTCAOTTCCAO 

CCTOACCCCCTCCCATCCTITGGCCTCTGACCCnTmCCACAGGGGACCr^ 

TCCAGCrCATCTITCACCTCACCCCCCTCCTCTCTTGGCmA^ 

OAATAAATAAAOTGAATCTTITCCKA>mAAAAAANNNNNNNN^ 

CGGCCGTTCA 

SEQ ID NO: 2593 GGTACAGTGAGGGTGTTCAGAGGGAGGCACAAAGAATAGCTCTGAGATTAG 
GCAATGGAAATOACAAAAAAGAOATGAATAAATCCGATTTGAATACCAACAATTTGCTCTrCAAA 
CCTCCTGTAGAGAGCCATATACAAAAGAATAAGAAAATrCTTAAATCTGCAAAAGATTTGCCTCCT 
GATGCACTTATCATTGAATACAGAGGGAAGrrrATOCTGAOAGAACAOTTTOAAOCAAATGGGTA 
mCITTAAAAGACCATACCCnTrrGTGTTArrCTACTCTAAATTTCATGGGCTAGAAATGTGTG^ 
GATGCAAGGACTmGGGAATGAGGCTCGATTCATCAGGCGGTCTTGTACACCCAATGCAGAGGT 
GAGGCATGAAATTCAAGATGOAACCATACATCTTTATATTTATTCTATACACAGTATTCCAAAGGG 
AACTGAAATTACTATTGCCTITGATmGACrATNGNAATTGTAAGTCCTGCCCGGCCGGCCGTrC 
NAANGGCGAANrCCACACACTGGCGGCCX)TCrimGOATCCACCTCGNACCAACTTGGGGAATAT 
GGCATACTGGTTCTGGNGAAATGGTTTCCGTNCAATTCCCCAAATACAACCGAACNTAAAGNNAA 
CCNGGGGCCAAAAAGAACAAC 

SEQ ID NO: 2594 cgagqtactgctcggaggttgggttctgctccgaggtcgccccaaccgaaat 

TTTTAAT GCAGGT TTGGTAG TTTAGGACCTGTGGGTTTGTTAGGTACI T^ 

1 1 1 n r I GGGG' I ' 1" 1 C 1" 1 " l"l rAATTATCTTTAGTCTTGTQATCACACATAATTTTAAAATTTGNGTATA 

TCTCCGTTACTTTAATCXTTTTAAGTTGGCAAA^GCACCATTCCCAATC 

GTTTTGTlTCTGAATGGCTGTTrAAAGACAATCCTAAATTATAACTTAGmGACT^ 

GAATTCAAGAQTGAAGTTTAACTTOCTACTATTTTAAAAGCATGTGACCTTATAAA 

GTGANAAGNGTTGAATAATCTITAATATTACACATAAACCACACTAAAATGCCTTTCAATAAA^^^ 

AAAAGAAAa;ATmAAATACAGGGAATrATAATTAAAATGGCATAATTAAGGCCAAACTATAAA 

CATTG^^TCC^^^ATTAT^^TCAACCNTTCCTITAANAGGCAAtNAAa^^ 

AArITrNTGGGT^^'AAACAGGAACAATT^^CCC^mT^AAA^^TTC 

AAACCACTCCCTAAGTTAAA 

SEQ ID NO: 2595 GCGTGGTCXiGGCCGAGGTACAGAGAAGCACCTATTGACAAAAAGGGGAATT 
TCAATTACATCGAGTTCACACGCATCCTGAAACATGGAGCCAAAGACAAAGATGACTGAAAGAAC 
TTTAGCTAAAATCTTCCAGTTACATTGTCTTACTCTCTrTTACTTCTCAGACACT^ 
TAGAACCrcTrGCATGCAACTTAGTTTCACAGCTTTGCCTCTTCT^^ 

CTTTCTGCCACTTAGCACTTGTATAATCAGACTGGAAATGOGGATGAGGGTGTAAATTGTATTGAA 
AAAGATCXK;GAATAAAAATCAACAAATGTGAAAGCCCAGAAAAATATArrCGTATITCTGGTm 
GCTGGATTTTTACATTTITATATAATAAAAATGTrATTTrGAAATT 

SEQ ID NO: 2596 ACGCGGGATTTATTTrAAATGAGACAATCATTrrAAGTrrTAAGATAACAGAA 
GTGACCAATGTAATTTCACAACAC CTAAGGAT TTTTTGGTTGATCAGGTTACTC 
ATTQTCCTGGATGAATAGACTGTGC l"!"!"!" I'C i 1" ll'lCTCTCCCTrCCTTCTTGOTTTCCC AT AGTATA 
ATAAGCATGCATACTTTAACTTCTATAGTTTTCTCCTTTAGAGGGTCGTCTT^ 
ACTTCTCCCTTGCCTTTGACTCATTGGACrAGTGC^^ 
CrTTTCTAGGTCATTAACGTrrmATTTAAGTITCTrTAGCCAATAAGTGG^ 
GATTTCAATATTITATAGTAAAGAAATGACAAACTGCTrrGGTTCAmCATAAACAAACTCTGCA 
TTAAGATAACTATmAAAOGTrGrrAAGAAGAAAAATTACTGGTTCrrTGGTACTCGTGGGACCT^ 
GGCCGCGACCACNCTAAGGCGAAArrCAACACACTGGCGGCCGTTNTAATGGATCCNACCCGGAC 
CAACCTGGGGGAACATGGCATAGTGGTTCCTGGGGAAATGGTNTCCGTCCCAATCCCCCAAANAC 
AACCGGAACTTAANGTNANCCNGG 

SEQ ID NO: 2597 ACTAGAGCCAGTCATCCTTAACAAATCTTITCACAriTTATrrCTTTCACATGT 
AGTCATCTTCAAAAAGGAAAGATTTCGAATTITAGAAAAGGGGCAACTCriUr]'^ 
ATCAGAAAGTCACAAAAATCGATGGAATCATITCCACTGGGAAGATrGACClll'lGTATTrATTrG 
TGGGGTAAATrAATAAGCATTCCAGATX3CnXK:AGCTrCCTGCATCCAGGAGATGCTG 
OTOATGCAGCTGGAACCCAAGCTGCAGCAaGAOATGCAAQTTTCAGGATGTrccrCACrGAGCrG 
GAGQAATATCTACAGCAGTGATGCrrGAAATTTTTGNATGAATTATTITGTCGCCTA 
CCAAACAAAAATrAGAGGGATTATTTTAATCTmGGATCrTCCCCTrTT^^ 
TATCAAA^n<NNN>^^^nW^INNNNNN^^ 

CCATATTCTAAACCAGGCGACTGNAATCCTGACGTGAAATCAATANACTACTGGCCTACCCGGCN 
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TGGCTGAAACrTXjCCTAAAATCKXGGACGGGGTCCTCnX}ATCCCCrrrGGGG<^ 

SEQ ID NO: 2598 GGTACTGATACATGCTATAACAGAGATGAACTTCGAAAACATGCTAAGTGAA 
AGAAGCCAAATCCAAAAACAATAAAAACACATATTGTATCCrCAa:crTm 
AATCATTGCATATGAATGTTTATGGGAAAAATCAATGTGTGCTAAATCATTGTATTCCAGTAAATA 
GATTGGACTTAAAACTrGATACAGAAGTTGCAAATAAGTGGGATTOAOTTTOATTATTATATAGAA 
AATAATTACATGATTCATTTAAGAATAATAATATCCACCATTTATTGAGCACTTACTATGAGCCTG 
TGTGCCAAACATTTCATGCATTTCTCATTTAArrCTCACAATAATCCT^ 
GTTGAATCATATGAACTTGCCAATATATGATAATTTCTAAGAAGTTXjGGGAATTTTTGAG^ 
AATGGTCCTGCCCGGCNGCCGCTCCAAAGGCGAATTNCACC 

SEQ ID NO: 2599 ACnXKiAACATACAACACACACACTmAGTAGGAGAGTCGGCCACCACATTT 
GCTCAAAGTATGGGGTTTATCAATGAAGACTTATCTACCAGTGCTICrCAAGCrC^ 
TGGCTTGCTCGGAATTGCCAGCCAAATTArrcGAGGAATGTTATACCAGATCCCTCAAAATACT^ 
GGACCCTACAAACCACCTGGTATCTTAAAGCAOGATATTOCTATCCATAAAGAAACAGAAGATGA 
TCGTGOTCATGACACTATTGGCATGGTTGTAATCCATAAGACAGGACATATTGCTGCTGGTACCAG 
CATCCCCAGCGTCTGGCATTCCATGTTTCTGCTCCTOTGGCCTCCACGOTGCAACAAGCT 
TTACTTGGACCTCTGCCTCATC"n"l t'll CL il I GCGCTTAACCTGCGCATTCGCTTCTTNCTNCACTT 
GGCTCTCATGGCGCAAANGGTTNCAAAAAAATGGCGCTAAGGCCGAAACCCCGCGTCCCTCGGNC 
GNGAACACGCTAA 

SEQ ID NO: 2600 ACTOGCTGGGATGGCTCTGATATAGCAGCCTTGGTGTAGTTTCriXjCAT^ 
GAAGAGTGn-rTTATTATCCACCTGCAGACTGGACTGGATTCmTAGCTCOTCAATC^ 
TCCTGTGGCATCACTAAGTATAAQAOTGCIXnCrTCCTGAAGACCrATAAGCrcGAGGTG 
CTCAATGTAAATTTCAAOGAAAAACCCTCATGan'GAGATGTGGGCCACTCAGAGCTAACCAAAA 
TGTTCAACACCATAACTAGAGACACTCAAATTGCCAACCAGGACAAGAAOTTGATGACTTCATGC 
TGTGGACAOTTTTTCCCAAGATGTCCAAGCCTCATCGTGACGAGGCTOT 
TGCTCATGCCTGCCTCTTTAATTIXXn'AAGATAATGCTGTACTAGAATTTCACAATCAGC^ 
GCAGGTAATTTGACAGAGOGTNOGATQTGTCATGTATCATGTCAACCCAATATTTGACCrAAGGG 
ATCCTTATTrrGCCChrrGGCTACTTACAACATCCTATACACTGGrrATTCAATCNCGGGGC^ 
NAATANACCnTAAATACCTGTCTACNCCTGTITATTAAaX;CTTTGGAGCCCAAANA 
TACT 

SEQ ID NO: 260 1 ACGCGGGAGATGAATGCCAGAGGACTTGGATCTGAGCTAAAGGACAGTATTC 
CAGTrACTGAACTTTCAGCAAGTGGACCrmGAAAGTCATGATCTTCTTCGGAAAGG l I I ' l ICl ' l G 
TOTGAAAAATGA ACrm GCCTAGTCATCCCCTTGAATrATCAGAAAAAAATTreCAGCTCAACC^ 
AGATAAAATGAATTTTTCCACACTGAGAAACATrcAGGGTCTAriTGCTCCOCTAAAAT^ 
GGAATTCAAGGCAGTGCAGCAGGTTCAGCGTCITCCATITCTrrCAAGCTCAAATC^ 
TGTTITGAGGGGTAATGATGAGACTATTGGATTrGAGGATATTCTTAATGATCCATCCAAAGCGAA 
GTCATGGGAGAACCACACTTGATOGTQGAATATAAACrrGGGTTACrGTAATAGTGTGCrGTCATG 
GAACCGAGGCTGCATCTTGrrATAGCATCTTTrCACC 

SEQ ID NO: 2602 ACTCrrGATAAAAGACCX}TGAAACCAACAAATCAAGAGGArrTGCTnTGTC 
ACCTTTGAAAGCCCAGCAGACGCTAAGGATGCAGCCAGAGACATGAATGGAAAGTCATTAGATGG 
AAAAGCCATCAAGGTGGAACAAGCCACCAAACCATCATITGAAAGAGGTAGACATGGACCGCCX; 
CCACCTCCAAGAAGTAGAGGTCCTCCAAOAQGTTTTGGAQCTGGAAGAGGAGGAAGTGGAGOAA 
CCAGGGGACCTCCTrcACGAGGAGGACACATGGATGATGGlXjGATATTCCATGAATTTTAACATG 
AGrrCTrCCAGGGGACCACnX;CCAGTAAAAAGAGGACCACCACCAAGAAOTGGGGQTCCTXXTCC 
TAAOAQATCTGCACCrrCAGGACCAGTTCGCAGTAGCAGTGGAATGGGAGGAAGAGCTCCTGTAT 
CACGTGGAAGAGATAGTTATGGAGGTCCACCTCGAAGGGAACXGCTGCCCTCTCGTAGAGATGTT 
TATTTGTCCCCAAGAGATGATGOOTATTCTACTAAAOACAGCTATTCAAGCAGAGATTACCCAAGT 
TCTCQTGATACTAGAGATTATGCACXCACCACCACGAAGATrATACrTACCXjNGATTATGGTCATT 
CCAAGTTCACGTGATGACTATTCATCAAGAAGGATATAGCGGATAGGAAOATGGGATATTGGGTC 
OGNGAATCGGGGACTATTTCAAAAATCAhrrCXCAAGGNGGGGJ^GGGrmcrTACNGGAGAATTT^ 
TTATTGAAGANGrrATTGGGGNAACCTTCACCGTTAAGNGGCTTCCACCCTTACACCGAANGQGCX; 
CCCCCGNCCTT^mTTTTNOGNGGONAAGCCAGTCCC^mT^GA^^rGAA^TACCCCT 

SEQ ID NO: 2603 ggtacaataaaggaatggggaagggggaaatgaaagaatagagaaaactat 

ACGGTAGTAGTCAGGATGTGGTGGAACCAAATTGCV^GTmCTAATrGAGAATQTAATCTTGGTCT 
TTAAAQAACAGAGTTCTGGAGTAAAGAAGCAGGTTCCCTTTTCAGTAGACACCTCCCGTCTXK^ 

tggaacacatcaattgtatcitcatccrccatttccaacrgtgcaggtgtgtctgtttcatrgato 
gttgcccgtcaaatcggaatctgatctocctcattoacaatccctgtcgttcacaataggctttcat 



392 



wo 02/29086 



PCT/USOl/30732 



TAGTn'ACTAAGTCGTGTATGCCTCTTAATCTTAAACrCK;ACCACAGAACCATCClGCa:ro 

CTTCAAATTAATATGATCGrrGTrCTCAGTCrrGACTCOTCCTTGOGCTTITCGTCOGCCATGGC^ 

AGaKX:GGAGTCTCCTCAGCTGCCGCrrCACAAAAGAGGT 

SEQ ID NO : 2604 GGTACGCGGGATG 1 1 ri i I CTGATTCCATCCTGTGTCCCCTTCATCCrTGACTC 
CTTrcOTAmCACTGAATTTCAAACATTTGTCAGAGAAGAAAAAAAGTGAOGACTCAGGAAAAA 
TAAATAAATAAAAGAACAGCCmrCXCTTAGTATTAACAGAAATGTTTCTGTGTCATTAA 
TTTAATCAATGTGACATGTrGCrCTTTGGCTGAAATTCTTCAACrrGGAAATGACAC^ 
GAAGGTGTTCAAACACAACCrACTCntK:AAACCTTGGTAAAGGAACCAGTCAGCTGGCC AGATT T 
COCACTACCTGCCATGCATACATGCTGCGCATGTTTTCTTCATTCGTATGT^ 
TTATTATATATTTAACATOTOGAAQAAAACAAOACATGAAAAOAOTOGTOACAAATCAAOAATAA 
ACACTGGGTGNAGCCCAAAAAAAAAAAAAAAAAAAAGTCCTGNCCCGGCGGCCGTCGAAA 

SEQ ID NO: 2605 A CnUMM ' IU ' r i lJ ' l ' l ' l ' ll ' l^U 1 i ' l^ GGAGGNATTTGAAATACAACTrTATTCTGAT 
TCTAAACGAAAAGGAATGGGAATOACAOTAACAAACAAGATTTCACCACTGAATATTONOATGNG 
ACTGCAGCAGTCTTATATATGAAACTCAAGGAATCAACTGCGTTCCAAAACAGCT 
GNCCAAACAATGAAmATTTmAAACIXjCCACATTCACTCCGAAGrc 

ATCCCACAGATGAAGCACATGTTCCGCTTAGCTAGATAATAATGAGGNGGCACACACGCTGCACX: 
GCTGACATCACAGGACAGCTGCCrATAAAACTAGACrrCTGACGCTGGGCTCCAGCTTCATTCTCA 
CAGGTCATCATCCTCATCCGGGAGAGCAGTTGTCTGAGCAACCTCTAAGTCGTGCTCATACTGTGC 
TGCCAAAGCTGGGTCCATGACAACTTCTGGTGGGGCGAGA GCAGG CATGGCAACAAAT TCCAA GT 
TAGGGTCTCCAATGAGCrrCCTAGCAAGCCAGAGGAAGGGCTmCAAAGTTGTAGTTACTTTTGG 
CAGAAATGTCGTAGT 

SEQ ID NO: 2606 ACGCGGGGAGCGCGGAGCACCTGCGCCCGCGGCTGACACCrTCGCrCGCAGT 
TTGrrCXSCAGTTrACTCGCACACCAGrn'CCCCCACCGCGCTTrGGATTAGTGTGATCTCAGCTCAA 
GGCAAAGGTGGGATATCATGGCATCTATCTGGGTTGGACACCGAGGAACAGTAAGAGATTATCCA 
GACmAGCCCATCAGTGGATGCTGAAGCTATTCAGAAAGCAATCAGAGGAArTGGAACTGATGA 
OAAAATGCTCATC^GCATICrcACTOAGAGOTCAAATGCACAGCGGCAGrrGATTGTTAA^ 
ATCAAGCAGCATATGGAAAGGAGCTGAAAGATGAOTGAAGGGTGATCTCTCTGGCCACTT^ 
CATCTCATGOTGGCCCrAGTGACTCCACCAGCAGTCTTTQATGCAAAGCAGCTAAAGAAATCCATG 
AAGGGCGCGGGAACAAACXjAAGATGCCITGATTGAAATCTTAACTACCAGGACAAGCAGGCAAA 
TGAAGGATATCTTCTCAAGCCTATTATACAGCATACAAGAAGAGTCTTGGAGATGACATTAGTTTC 
CGAAACATCTGGTGACTTCCGGAAAAGCTCTGTTNACmTGGCAGATGGCAGAAAGANAATGAA 
AAGTCrGAAAGGTGGATGANCCATTrGGCCCAAACAAAGATGCCCCANAAATTCTCTTATAAAAG 
CTGGTGGANAACANAAGGGGGGCCCCQGGAhrraAANACAAAhnTCNCCTTGOAAAACCTGTGGT 
TNAANGGANCTTrrcCTCAATTTAAAACCTAACCATITrcm 
CCCAAANGGCCCTTTGGGGGGCANCWCAhfNAAAAGG 

SEQ ID NO: 2607 GOTACGCOGQGCTACAACAGGCAGGCAGGGGCAQCAAGATGOTOTrOCAGA 
CCCAGGTCTrCATTrCTCnX}TrGCTCTGGATCGCTGGTGCCTGCGGGGACATCGTGATGACCC^^^ 
CTCCAGACTCCCrGGCTGTGTCTCTGGGCGAGACGGCCACCATCAACTGCAAGTCCAGCCAGAGT 
GTTTTATACAGCTCCAACAATAAGAACTACTTAGCrTGGTATCAGCAGAAACCAGGACAGCCrc 
AAGTTGCTCATTTACTGGGCATCTACGCGGGAAGTOJGGGTCCCTGACCGATTCAGTGGCAGCGG 
GTCTCGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGGCTGAAOATGTGGCAOTCTATTATTG 
TCAGCAGTATTATAGGAGTCAGTGCAGTTITGGCCAGGGGACCAAGCTGGAGATCAAGCGAACTG 
TGGCTGCACCATCTGTCTTCATOTCCCGCCATCTGATGAGCAGrrGAAATCTGGAACTG<^ 
TGTGTGCCTGCTGAATANCTTCTATCCCTCTOACCCTTTTTNCACA GGGG GA CCTAC CCCT 
GGTCCCITCCAGCTCATCTmCACCTTTAACCCCCCTTCCTrCCTC 

CTAATGGTTGGGNNGGAAGAAATGGAAATAAAATTAANAGNNGAAAAKimTTGCCCCNTbn^ 
TNANAT^IN^^^C^GCNa*^NNAGCXJCT^^ACTX3CCCGC<^^ 

AN^TmcTC^x^■CA^r^IN^^^CNCCCANAN^™ 

AAAAGGGGGCNAAATTTCCNhmc>IACITGGGGGNGNGGKnTATAATGGGGANCCCCCCCX:CGCG 

SEQ ID NO: 2608 GGTACAGTCTITCATrAAATAAGAATACrrACACATACATTrTCAGATATrrc 
TACCTTCCTGTATGTGTTTGGAArrGTATGTAGGTAGCCACTGAAAGAATTTQGGCCCXn^ 
GATGGCAGTGGAAGNCCATGAAGTAAAGAGCATTCrrrAAAAAGCAGAmGATTGCATACCnT 
TAGTTATrTGAGATTCTGAGAATTCTGATAAACCCCAAAGCAGAAAGATrCCTTATACCCTTGGAA 
GATGGGAAAGGTGAGGGAAATATTTGAAGCAGGGTCAGAACATCCACTAAGAACATAGCACCTC 
AGTAGAGCTTACATTATAGTGCCAGGGTAGAGTTATTACTGAATAGCTTA GGATG ATGAACATTA 
ACCrrCCTACAGGAOTAGTAGCAGCTQATrTGOTGACCATGATTGGTCACCTTITAGTGTAA 
AAACTAAOAACAATTATGGCTTGACATATACTCCATGTAGGGAAGTGATGGGAGAGGCAGCCTCT 
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0T0T(K3GTCCATCCTTGGAAGCACTCK:ATCGATmGCTCCCCTCrGGTTTTAAGGAGATC^^ 
ACCTirCCTTGCTAAOCTOCTITrcAGTAAAAnCTTGGATGGGCCm 
CATCTTNTTCTCNCCCCTTCTTTTTCATTGGGAGGCAATGGATCCT^ 
rrGGGCTIX}GC\NGGAAATAAAmCAAGGNTTTrrCNrrAAAAGACC^^ 

TTGGATTACCTrAAAGGCTGGATGCCAOGGNACCCTOGNCCCXiGGGCNGGGCCGGTTCGAAAAAG 
GGGCGGAAATTCCCACrACNCTTGGGNGGGCCCGGTTACTTAAGGGGAAN 

SEQ ID NO: 2609 GGTAL4"l lH l lU ini"i'lMniUnil'i-rCGCTAAATGCCCrGTTTATTCTGCAAACA 
AGGCTTANAAATGTATAOACACAATn'GTAQGCTGAGAAATCrCTGTGTAGTrGNGACTTCX^ 
TTCCGCAAAAGTAGTAAGGACAGCGACTTCTGAACACATTAACCTACCACTOTTITGTTrANA 
AACACAGNGAGAATGGATTCCTAACGGCCTGTrrCTCTAAGNGATCGTCGC^ 
TCTTTAANAACAAGNGGAGCCTTCACTTGAGCCGCCACCTCCTTGCCATTCAGGGCTT^ 
GCAAGCGCAAACTCGAAGCTGGTCCCAGGCCCCCGGCTTGTAANAATCAGGCCGTCTTTTTCCAC 
ACGATTCrCANAGTAGGTGTAATGACCnXATTCATCATTTTGTCm 

AACTTTACTTCCAAAACCCATTTCATGAGCCAACAGAGCAGTAGGACCTGCACAGATGGCGGCTA 

TOAGGCanTCCGGTTrrCCTGCTCCTTCAGTATCrCCI^^ 

NGCGCCCAGATTACCTOrrGGTAGAACCACCACATCATATGGGCCCTXriTn^^ 

GCTGGCATCAGGGACAAAATGGACCNCATCACGGGNTACACTGGTACCTGGCCNGGCGGGCCGTT 

CGAAAGQGCOAAATTCCAACACACTGGGNGGGCCGTTACmAANGGAATCajAGCnCGM 

CNAACNTITGGNGGAAANCAANGGGCCAAAGCTGGGTmCCGGGGGGGAAAATGGGTTTTCCOTC 

CCAAANTNCCCCCCT 

SEQ ID NO: 26 10 TbTTACCAATGGCAACCAGTGCTCnTITCCTGCAAATATrCTGATGTAATTTCT 
CCTGAACrrcAATGAAGCTGTCATATCGATCTITAGTAAACTrTATATTACGOAGAACTGCTGCT^ 
CCGCAAAAGGACGTATXriTAGCTGTCTCTTCTGTGATAATCj^Am 
TACCCGTTrATACACTGGAGCCITTATCCnTCrrTGAAGACCTCAAGTCCTC^ 
AGACNCAGGAGATCATATCTATTGGCAGGGACGTCAATnTGTAAAGAACAACATCAGAGGCTCC 
TGCTGCCTITACATTACCTTCTTCTITACTrATTAmCCTrCTC^ 

CAGACCAAATTCAAAACATAGTTTCATCrAAATTCTTCGTCAGTNGTAGOTOOGGCCCAG^ 

GAAQAOCANATCACCGCTTNACGCTGACAGrrCCGGCATGGGTGGTGTCGAACTCACTGGCCCTCC 

GCGTTACCCTGCCCCGGGCGG<XGGCnTCCNAAAAGGGGCGAAATTrCCANNCACACTGGGGCGQ 

GCCCGGTTACTAAGTGGGNATCCCttGAAGNCTCXKKJTTACCCAAGNCTTTGGGCNG 

GGGNNCCATAANCNGGC^nTNCCCCCGGGGTGGAAAATTlGNTTNTTCCNGNCnCCCC^ 

NCACCCAAACATNACCANGCCGGGGAAGOCACTAAANGTGGTNANANCCCOGGGGGGNGGCCTT 

AATGGAGGGGNAGCCTmACCTCCACCAATTAAATTNGCGGTTGNCGGC^rrcAAN*^GGCNCrc 

NrrT<XCACGTCrGGNGANAAACCCTNGTNGTGCCCCAGCCTGGCCNNTAAAGGGAAATGGGCCC 

ACTCCCCCGGGGGAAAAAAGGNCGmTITGGGCGAAATGGGGNGNCCrTTTCCCCG 

SEQ ID NO: 26 1 1 ACCATGTCCACTCCATCGCGTCGCCATGATGGGCCATCarCCAGTGCTCGTGC 
TCAGCCAGAACACAAAGCGTGAATCCGGAAGAAAAGTTCAATCTGGAAACATCAATGCrOCCAAG 
ACTATTGCAGATATCATCCGAACATGTTTGGGACCCAAGTCCATGATGAAGATGCrTrrGGAC^ 
ATGGGAGGCATTGTGATGACCAATGATGGCAATGCCATrCTTCGAGAGATrCAAGTCCAGCATCC 
AGCGGCCAAGTCCATGATCGAAATTAGCCGGAOCCAGOATGAAGAGGTTGGAGATGGGACCACA 
TCAGTAATTATTCnTGCAGGGGAAATGCTGTCTGTAGCrGAGCACTTCCTGGAGCAGCAGATGCAC 
CCAACAGTGGTGATCAGTGCrrACCGCAAGGCATTGGATGATATGATCAGCACCCTAAAGAAAAT 
AAGTATCCCAGTCOACATCAGTGACAGTGATATGATGCTGAACATCATCAACAGCTCTATrACTAC 
CAAAGCCATCAGTCGGTGGTCATCmGGOTGCAACATIXJCCCTGGATGCTGTCAAGATGOTAC 

SEQ ID NO: 26 1 2 GGTACGCGGGCTCGTCTGACTrCTTTTATTOGTGCCATCGCCATI^ 

GGTAAAGAGCACCTTGGGACCCAAAGGCATGGACAAAATfCTTCrAAGCAGTGGACGAGATGCCr 

CTCTTATGGTAACCAATGATGGTGCCACTATTCTAAAAAACATrGOTGrroACAATCCAGCAGCTA 

AAGTTTTAOTTGATATGTCAAGGGTTCAAGATGATOAAGTTGGTGATCGCACTACCTCTGT^^ 

TmAGCAGCAGAATTATTAAGGGAAGCAGAATCTTrAATTGCAAAAAAGATTCATCCACAGACC 

ATCATAGCGGGTTGGAGAGAAG<XACaAAQGCrGCAAGAGAGGCGCTGTTGAGTTCrGCAGTTGA 

TCATGGKrCCGATGAAGTTAAATTCCCGTCAAGAATrAATGAATATTGCGGGCACAACArrATNCT 

CAAAACT^^^TACTCATCACAAAAGACCACTTrcCAAAOTTANCT^GTANAAOCAGT^CT^ 

GAAANGhTTTTTGGCAAOTGGANGCAATTCTTATTTTCAAGAAGCT^ 

TNCTATT 

SEQ ID NO: 26 1 3 GGTACTATACTGGCAATrGCTQGAGAAGATmGCAATTGTTGCTTCTGATAC 
TCGATTGAGTGAAGGGTTTTCAATTCATACXjCGGGATAGCCCCAAATGrrACAAATrAACAGACA 
AAACAGTCATTGGATGCAGCGGTTTTCATGGAGACTGTCrrACGCTGACAAAGATTATTOAAGC^ 
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AGACTAAAGATOTATAAGCATTCCAATAATAAGGCCATGACTACGGGGGCAATTGCTGCAATGCT 

GTCTACAATCCTGTATTCAAOGCGCTIXnTrcCATACTATGTTTACAACATCATCGGT^^ 

GAAGAAGGAAAGGGGGOtiTATACAGCTITGATCCAGTAGGGTCTTACCAGAGAGACTCCTTCAA 

GGCTGGAGGCTCAACAAGTGCCATGCTACAAGCCCCTGOTGACAACCAGGTTGG'l l'l I lAAOAA 

CATGCCAGAATGTGGAGCATGTTCCGCTGTCCTrcGACAAANCA^X5a}G^r^ 

TTCAnTCTCGGTTGANAAAANATGTGTACCrGNCC 

SEQ ID NO: 26 1 4 GGTACTTGGAAACAGTTACAGCCCrCACCAAGCTAGGrrOGOGACATOAAGT 
CCAACAGCATCTGAAATGCTGTGAAGGCAAACnTTACAAATAGCAATGTAAGCrTACAGAACTTG 
CCTTTACTAGGGTGGTATGrrCATGTAATrOIAGCGCOTCGCAATTTAACGAACCAAATTT 
ACTTCACrTCTAAAAAOrrAGATGAAACATGAAAAAa^AOCCATACAGTATAAATTGCAACTrC^ 
GGAAAAGTTCCTGCTTTATTATTCCAAATAATTTrrcCCATGTAAAATAAATGTCCAATAGTTCCT 
GTATGTGGGAAOATAATGTTTGTTAAAGTCACAATGAAGCCTTATTGACGGAAGCTTGGCTATATA 
AACAGCACAAGTTGAGAAAAGGTTTAATCrcCAACCTATTAGACTGGATGAAGTAAGATC^ 
CGGCCAanrrGCGACATTACAGGANGTCTCATTNCCATCGOAGGNCGAOGAAGGCCCCCCTGGT 
NAGGGGGGCACCATTGGGGGGCCCTGGCCTACNGGGGCATANGCCCCAGGAAGGTNTTCTGNCAT 
TC 



SEQIDN0:2615 GGTAC'i'i'i'iTirri I'l'i iTrrj-iTn-iTri'i'riTAOAOOACCTnTCTAi'ri i'lAA 

AATGGAATGATTCTGNCTNAGGTCAAATGCTTTAGTTGNGTCTGCTTCCTNrrTCCAAATCCANATA 

TTrCAATAATTTGNrGATATCTTCTACrACCTCCCTCCGATAAriTCTGATTTCT 

ACATITAGAGCATCCAGQTCAGCGATCAQCCCCOAGAGCNCACAGGATAAGNGCCTGCAGGNCrrN 

ATTGTTGTTCACACCCATNATAAGNGCAATCAGGACCCCTOGGGCCrrcTTCACCT^ 

GAAGTTGATTTTGGCAACGGAAGOATOTGCATCCnNGaAAAGCGGCAGGGAAGGCTGCTITT^ 

TGCNGTCTrC>rATTATmTCTTGC:ACCCGCACAGANTITGG>rrAAAGTG 

CAAGGGAGATriTrcCTCX:NmTrrAACATGGNGTCNN 

ThTITNTAACNNATmAGTTNCCNCCNGGTGCCrCCTrnAAAAANGG 

AAGGANTNGG 



SEQ ID NO: 26 1 6 AClUMCU'ri lU-riTi-rrN i'i'i N l'riU"l<3GAAACTTTTrArmATATmGGNCrT 
ACAAATGATCACTTTTAAATGGACrnTrmjTAANAATGTAAAACTCAAAAArm 
ATCTGANCCACNCAAATCCCTANAAAGGNTTTTTGNGNANCKITCATTAACGCAAATh^^ 
ATGTTTCACTCTTACTGTNGOATCTTGAATATGTTTTACAATAATGAAGCTACNAAOrrmATO 
GGGCATTCmTGNAAACTATAAATAACATTTGTATTAAAAAGAAAGCTGGGTAATACNAAAATAG 
GAGAGACTTTGAGGAGCAGGCAATCTGTrGAGGCTCANNATATCTTATTTOCTITGN^ 
CCATTCCrmAAAGAGCACACAGCACATCAGGCATAGTAATCATCTGTOT 
TTTANAAATCrGGGGGTCGTAAAACTGNGCCAATTATCACAAACTATNATTTGCATAATTNAACCN 
CAAGTCNGGTACAAAAGCArnTNGTAGGGGGOATCAACCAATGCGNQAG 

SEQ ID NO: 26 1 7 GGTACGCGGGGGCTCACTCTXKXSCTTCACCATGGCmCATTGCCAAGTCOT 
CTATGACCTCAGTGCCATCAGCCTGGATGGGGAGAAGGTAGAnTCAATACGTTCCGGGGCAGGG 
CrGTGCTGATTGAGAATGTGGCTTC0CrCnX3AGGCACAACCACCCGGQACTrCACCCAGCTCAA 
GAGCTGCAATGCCGCrrrCCCAGGCGCCTGGTGGTCCTTGGCTTCCCTTGCAACCAATITGGACAT 
CAGGAGAACraTCAGAATGAGOAGATCCTGAACAGTCTCAAOTATGTCXXJTXXTG 
a>GCCCACCm:ACCCTTGTCAAAAATGTGAGGTGAATGGGCAGAACGAGCATCCTGTC^ 
TACCTGAAAGGACAAGCTNCCCTACCmATGATOACCCATTTTTCCTCATGACCOATCCCAA^ 
ATTATTTGNAQCCCrGTGCGCC(XCTCAGATI^GGCCTGG(^CnTTGAAAAAGTTCT^ 
GCCGGAAGGAAGAAGCCTTTCCGACGCTNCNAGNCGNACCTTTNCCAAC>TTC^^ 
CCG 



SEQ ID NO: 26 1 8 ACGCGGGGGCAGAAGTCTCTCTCAGTCAGGACACAGCATGGACATGAGGGTC 
CCCGCnCAGCTCCrGGGACTCCTGCTGCTCTGGCTCCCAGATACCAGATGTGACATCCAGATGACC 
CAGTCIxrATCTTTCCTGTCTGCATCTGTAGGAGACAGAQTCACCATCACTTGC^ 
GGCATTGCTAATTATTTAGCCTGGTATCAGCAGAAACCAGGGAAAGTTCCAAAGGTCCTGATCTAT 
GCTGCATTCACTITGCAAQCTGGGGTCOCATCTCGGTTCAOTGOCQGTOCTTCTGGGACAGAQTTC 
ACTCTCACCATCAGTAGCCTGCAGCCTGAAGATGCTGCGATTTATTACTGTCAAAAGTATAACAGT 
GCCCCTCAGACGTrCGGCCAGGGGACCAAGTTGGAGATCAAACGAACTGTGGCTGCACCATCTGT 

ATAACTTCTATCCCANAGAGGCCAAAGTACTNGGCCGNGANCACCTTANNGG 

SEQ ID NO: 26 1 9 AtnTCACCTTCCAGGAGGTGAAAGGGAATACAAATtCACAGCAOACITCCAG 
AOGCCCCAGGCCAAGCAGGTAGAAAGTrrCCATCCAATTAAAAAGAGGl 1 1 1'iCl I'l iCTGAATAA 
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AGTCTTCAGTGACGAAATACTATATATGGTGAATAGTAACATGAGTAAGATTTTAATGGGAAGTTC 

TGGTGCAGTGAAGAGCAOAGGAAAGAGGGAATAATGTCCTGTTGTGOTCAOAATCAGAAAAATC 

GAAGCGTCTCCTGCTTTTCCCACAGACAAAAGGCTCATTGGGAGAATrcCTAGA^ 

TCATGAACATGOCACOCAAACATAAAGGAGCTCAAGGCACAAAGAGTTAGACATCGGAGAAAGC 

CTCmGGCCCTTGGGGTTTAAACCAAAGACAGANAATAGAGGGCAATATGGCAATCAGTGTGCAG 

ATGAGGGTTGCCAAGGGAGTCACTGATGGAAGGACrcTGTGTTGGAACTGCTrGAACCAAACCAC 

TTTGTCATTOAQOCCTrGOGAATArrGTTGGOATCCAGAAATTTCANTrCAAACCOATGN 

SEQ ID NO: 2620 GGNAC lTl ' lTtTl 1 1 11 iTl mill I TTNGGGGCAAAGATTCACTrTATTrATT 
CATTCTCCTCCAACATTAGCATAATTAAAGCCAAGGAGGAGGAGGGGGGTGAGGTGAAAGANGA 
GCTOGAGGACCGCAATAGGGGTAGGTCX^CCTQTGGAAAAAGGGTCANAGGCCAAAGGATGGGAG 
GGGGTCAGGCTGGAACTGAGGAGCNGGNGGGGGCNCTTITCCCTOTAACACTCCCCTGT^ 
TCTTTGTGACGGGCGAGCTNATCGCCCTCATGGGTGACTmrCAGGCCaTANACrrrGTGmCT^ 
GAAGNCIXXTTTTTGCTCAACCNTCAGGGTrGCTNCTGAGGOT 

TNCTCCTGNGGACACTCrCCCTGGGAGTTACCCOJATTGGAGGGCGTTATNNCACCTTTTCNC^ 
ACAACTTTTAATGCTTNrrATGAAATTrTTGCCCATTrrrTAACT 
CCATT(XCACTGGGGNATANTrrAmTCGGGNTANTTAAhrriTrNGTW 
GNOGNGCNnTTOGGT 

SEQ ID NO: 2621 ACAGGTTmATGTGAACATACATmCATTTTCTGGGATAAA TGCTCA AAAG 
GGCAACTCTTGGGTTGTATGGTAAACACATATArrTTrGTAAGAAACTACCCTACTCI^ 
AGTGGCTCTACTTTITACATACAGCCACTCATACAATrcAGACAGCAATGTATGATTOATCCAGTT 
TCrrcACATCCTCACCAGCATTTGGTATTACTACTATTTTITATCrr^ 

GTAATGATAa:ACATGTGGTmAATrTGCATTTCCAATGOCTAAT GATGTT QAGT ATCi' rri'lGTG 

TGCTAATTTGCCATCTATGTATCCTCrrCGGGGAAATGTCTTCATGTCTmGTCTAm 

GGGCATTTGTTCTTTTTACTATTGAGTGTTGAGAGGNGTmrTATATATCCTAGATAAA^ 

GTTNOATATGNGGGTGNTrGAATTTTAACATAACTTCTACCAGGGAAAAATANGTrAAATTTCC^ 

CXTTGCATGGNCAGCACTTACTTAATTCCTGGCTTCA 

SEQ ID NO; 2622 ACTTTGCCTACGGCAGCAACCTGCTGACAGAGAGGATCCACCTCCGAAACCC 
CrajGNGGCGTTCTTCTGTGTGGCCCGCCTGCAGGAT^^ 

CAAAACAAGTCAAACirGGCATGGAGGGATAGCCACCATITTTCAGAGTCXn'GGCGATGAAGrrGT 

GGGGAGTATTATGGAAAATGAACAAAAGCAATTTAAATTCTCTGGATGAGNAAGAAGGGGTTAA 

AAGTGGAATGGATGTTNGTAATATAAGTTAAAGTTGCTACTCAAGAAGGAAAAGAAAAANCCTGT 

CTAAGTTATCTGATGACTAATTACGAAAGTGCTCCCCCATCCCCACAGTATAAAAAGATTATTTTG 

CNTGGGTGCAAAAGAAAAATGGNTTTGCCCGCTTGGAGTATCAANAGAAGrrTAAA^ 

NCa^AAATGACCTATNCAGGAAAGGTCbm:AGAAGNAATTGATNACAAThriTNTAAAA^ 

AAACNCAANCTITmAGAACATTAACAGGAATmTTTTNAGGGG>^ 

ATITf 



SEQ ID NO: 2623 A Cn ' rri ' ri ' i ' llH ' ril ' ll ' ri ' l ril ' l ' Il ' ril ' lMH CGNGTTATA ATCC AATCmATTT 
AAAAATCTAATmXjCCAGTTTAGCGTTTTCCACXAACTCGGGGAGCT 
CAATCTTTTGCTTAGGGGCTGCCTTTGNAGGTGCCTTANCAGCAGCCNTTGCA 
CnT^GCTTAGCCTTTTTTGNrrCCrrAGCAGCCXrrGATAGCTra 

GGTTTCnXjATTCCmrrGGCCATTATATCAGCAAGANATGO^CCAGTAATGGCCCTCT 
ACTGCTCGGNGGGTTCrrTTNTrrTGAATTrCTTCX^GAC^ 

GQACAAGTCCANTTTATCTGCCNAGGATINCTTITGGAAAGGAAAGCCGACNCNC>mTC 

AAGAAACTGGAAAAanTCCCGNCGGNCCTGGGGTANCGCCTCCCGGGTCa5GG>n^AATCT^^ 

ACNTCGGCCGGNCCACCCTAAGGGCOAAATTCCACCACACTGG 

SEQ ID NO; 2624 ggtacaggagatctcatttgggacaactaaggataaaatgctggtcatcgaa 

CAGTGTAAGAACTCCAGAGCTGTAACCArrnTATTAGAGGAGGAAATAAGATGATCATrGAGGA 
GGCGAAACGATCCCTTCACGATGCrrTTGTGTGTCATCCGGAACCTCATCX;GCGATAATCGTGTGGT 
GTATGGAGGAGGGGCTGCTGAGATATCCrGTGCCCTGGCAGTTAGCX:AAGAGGCGGATAAGTGCC 

ocacctranaacagtatgccatoaoagcorrrgccgaoocactggaggtcatccccatggocctct 
ctgaaaacagtggcatgaatcx:catcx;agactatgaccgaagtccgagccagacaggtgaagga 
gatgaaccctgctcttggcatcgactgttttgcacaaggggacaaatgatatgaagcancagcat 
gtnatagaaaccttgttggcaaaaaacaacagatatctcrrtgcaacaca aatg gttagaa tgat 

TTTGAAGATTGATGACATTCGTAAOCCCTGGAGAATCrGGAAGAATGAAACTTTGAGAAAACm 
GT 
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SEQ ID NO: 2625 GGTACCGGCTCTACTGCCACCTCTTCCAGCTCCACCGCCGGCGCAGCAG^ 
AAGGCAAACKJCAAAQGCGQCTCGGGAGATTCAGCCGTGAAGCAAGTGCAOATAGATGGCCrTGT 
GGTATTAAAGATAATCAAACATTATCAAGAAGAAGGACAAGGAACTGAAGTTGTTCAAGGAGTGC 
TOTGGGTCTGGrrGTAGAAGATCGGCTrGAAATTACCAACTGCTITCCTTTCCCTCAGCACACAG 
AGGATGATGCTGACTITGATGAAGTCCAATATCAGATGGAAATGATGCGGAGCCTrCGCCATGTA 
AACATTGATCATCTTCACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTrACCCGGGCA 
CnXTGGACTCrCAGTTANTTACCAGCATGCCATTGAAGAATCrGTCGTTCTCATTT 
AAAAACTGCCCAAGGATCTCTCTNACTAAANGCATACAGACTGACrCCTAAACTGGATGGGAAGN 
TTGGAAAAGAAAAGGArrTrTCCCCTQAAGCmGAAAAAAOCAATATCACCTT^ 

^ATAAAGACAAAACTTCAACAGTGGCACCCACCATACACACCACTGTGCC^ 
ACCTACTCCAAAGGAAAAACCAOAAGCrGGAACCTATTCAGTTAATAATGGCAATGATACrTGTC 
TGCTGGCTACXrATGGGGCTCCAGCTGAACATCACrCAGGATAAGGTTGCrrCAGTTATrAACATCA 
ACCCCAATACAACTCACTCCACAGGCAGCTGCCGTrCTCACACTGCTCrACTTAGACTCAATAGCA 
GCACCATTAAGTATCTAGACTITGTCTrrGCTGTGAAAAATGAAAACCGATTrrATCn^ 
TGAACATCAGCATGTATTrGGTTAATGGCTCCGTmCAGCATTGCAAATAACAATCTCAGCTACT 
GGGATGCXCCCCTGGGAAOTTCnTATATGrrGCAACAAAGAAGCAGACTGTTTCAANGTCTGGAGC 
TTTTNAGATAAATACCTTITGTTCTAANGGTCAACCTnCAATGGGG^ 

SEQ ID NO: 2627 ACATGCTCATGGCAGCAACAACCCATTGACCACTTCTTCAAAGTAGTTCACTG 
CCAAGGAGAATCAAAATTCAATTrGGATTCCCAATACTCAGCCTACCTrCAATn"CCCATC^ 
TATATTCTTAGCXnTrAGTTrAAGTGTGGGTrATCTTAACAGCTCACTTGGC 
AATTTCCTGAGTOTTGAAAGAAOTAGAAACCCAATCAATGAGTrTTTCTNTrGG^ 
GAGAAGGATACATTTCTAGAAATTTNAATACCTTCCAAGNGTCAGAAGGAAATATAATGTAGC^ 
TTGAAAnTGCGATGTAGGATAATAAGGTACCTCGGGCGCGACCACNC 

SEQ ID NO: 2628 GGTA Ci ' i ' i ' i i ' l - ri i - ri ' rri - i ' i ' j - ri ' i ' i ' i - rrrrri ' i A i ' m - iTrrrri ' iTri 1 1 1 1 1 j 1 1 

NGAAGGTTTTAGTTTATTAANG^r^CTTGCNAAAAATXXACAGGGGCCNCAGCTAACATCATT<^ 
GCACCTITACTCCTTCGGNT^ 

CXnTTGCAOTCXXICCTGACJlUNTTNATTCTGNTCTTGCGTrCCrT^ 
^TNTTNTCATACAGGCCATGT^^TGCAAGTCTATG^^^NGGGNTCATT^^^ 
GAATCATAAATNATGCCAAAGCCAGTTGTNTTGCCACCACCAAAATGAGTTNTGAATCCAAANAC 
AAAGATGACATCCGGNGNGGNCTTG 



SEQ ID NO: 2629 GGTA CTTTTATATAAAGTAATNCrGGATTTGACATTCTCATTTAGAGAAACCT 
A 1 " 1 1" 1 Cri 1 ' I" 1 1 C'l ' 1" 1 ■ I ' ICTATTTTAONGTTrCATTAATGTGCGGNCTCCAATTTAGG AC i " I'l I'CCAT 
AGTGCCAAAGCCATACATATTCAGNAGAACATCAATAAATTACATCAGAAATTCAACACTrrAT^ 
ATAAAACGGGCnTCGTGTTAGATAATTITGCTAAACAOTAGQCrACnXn"AAGTGTrOT 
TTAAGACAGGCTAGCATTTTTCAATTTACAATATTITrCAACT^ 

ACCCTATGATAAGTIX}AGAAACATCT>mTrmGGTGTGTCAACTGTGATrrAAGGATr 

ATCANCCCAAOCATTOT^^GNAGTAATAAATAATGANTTAATANAA^INTGTNC^^I^^ 

GAAGGGAGAAmTCTrrAArrGGCN>rrTGACAATAAACTKX^ 

TCAAATOTTTATCNATOTTTTCNCTNGGCTTCCTAGGAAATNAAITAAACCT^ 

TTCAGKTTNTACTNN 

SEQ ID NO: 2630 ACGATCGAAGGGACTATGTCTrcATrGAATTTTGTGTTGAAGACAGTAAGGA 
TGTTAATOTAAATTTTOAAAAATCCAAACTTACArrcAGTTGTCTCGGAGGAAGTGATAATTTTAA 
GCATTTAAATGAAATTGATCrrTTTTCACTGTATTGATCCAAATCA™ 

CAGATCAATTTTATGTTGmACGAAAAGGAGAATCTGGCCAGTCATGGCCAAGOTTAACAAAAG 

AAAGGGCAAAGCTTAArrGGCTTAGTGTCGACnTCAATAATTGGAAAGACTGGOAAGATGATTCA 

GATGAAGACATGTCTAATmGATCGTTTCTCTGAGATGATGAACAACATGGGTGGTGATGAGGAT 

GTAQATITACCACAAOTAGATGGAGCAGATGATGATTCACTVNAGACAGTGATGATGAAAAAATGC 

CAGATCTGGGAGTAAGGAATATTGTCATCACCTGGATTTTGAGAAAGAAAAAATAACTTCTCTGC 

AAGATrrCATAATTTGANAGAATTCCTGGAOTTNGATAQCTCTAAAAGCCAGATATOCT 

SEQ ID NO: 263 1 ACGCGGGGCTCrmTCCGGCTGGAACCATGGAGGGTGTAGAAGAGAAGAAG 
AAGGAGGTTCXTGCTGTGCCVVGAAACCCrrAAGAAAAAGCGAAGGAATTTCGCAGAGCTGAAGAT 
CAAGCGCCTGAGAAAGAAOTTTOCCCAAAAOATGCrrCGAAAGGCAAGGAGGAAGCTTATCTATG 
AAAAAGCAAAGCACTATCACAAGGAATATAGGCAGATGTACTrCTCTGTAACAAAGCTTATGATA 
ATCGCCATTTAATTAATTGTrGTGTAATTATmCATTCCATCnTrATCCCC^ 
GAAGGCAAGGATTATGlii'rrJ'GGCTCACCACAGTGTCACTGGCACCTAGCATAATGCXrrGTCrTA 
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CnTCK^AGGTTAAGGACnACTGGCriTrcGAAAGATGAGCCnnGTGAC^ 
TTGAGGGTCATTCAAACAGATAAAATTATCAAAGCnrGGACC 

SEQ ID NO: 2632 GGTACACnrrrmOAATGOTTTTCTAACAACTTGAAGTACAGGATCAAGGAA 
TTAGGGTCGTCTACritiAGGCAGATGGGATAGTAGCrGGGAACTGrrCCCTITCrGATrAAm 
GCAGCATCGGAATATAmGGAGCACACCCTAGTAACCTCTrQAGATTAAATTACATAOTCITAAT 
ATTTCTGTTCCTCCATGCAACTGATGTrrGTTTmAAAGGGTAAGATGCTGCCTCCCA^ 
TGCCATCTGACTGGmCCCCATGTCCTCCCATTCACCCATCTCTGCTCCCACCCTTGCCTGCCTCT 
AACCCACCACTOGCCAGCCCCCTTGCCCTACTCTGGGCTGCTGAACACTGGTGCTGTGGTGGTTTT 
CAAGGGTAATTCCTAGGCTAACCX3TATGGCCTATAGTTTAAAAGCACATCTATGTT 
CTGAAAAAGGQGAATTATTTCTCAAGTCTTTCAAGOCTTTaAGACTAATATANG^ 
AAGAAGAAACCCAAGTTTGGAGGGTGGGAAGAGGGAAAAGGANAAAAAAAGATNNT 

SEQ ID NO: 2633 GGTACTCTGTGACGGAGCTGAAGGACTCTTGCCGTAGATTAAGOCAGTCAGT 
TGCAATOTGCAAGACAGGCTCCTKKXXKSGCCGCCCTCGGAACAT^^ 
TGTATCX:ATCrAAGTTCCCGTTGTATCX:AGAGTlXnTAGAGCTTC 

ACCCTTCCTTATGAGCATTmAGAACATTGGCTAAGACrrATmCCCCCAGTAGCGC^^ 
OATTTGCATTCAGGTGTTATTCTrAATGTlTCTGTCL\AAGCTT 

GCCATAGTTCACCTIXXXTGTTCCAGGTrrATTTAATTCCAAAGGTGAGAGTrGGAGTGAGATGTC 
TTCCATATCTATACCTTTQTGCACAGTTGAATGGGAACTGTTrGGGTTTAGGGCATOT 
ATTGATGGAAAAAGCAGACAGGAACTGGTGGGAGGTCAAGTGGGGGAAGTTGGGTGAATGTGGA 
ATAACTTACCTTTOGCTCCACTTAAACCCAAANTOOTOCAACrm 



SEQ ID NO: 2634 GGTACJTrn-i-i'iTn'n-n'rn-n'i i'i I'l'i ataggttcca 1 1 1 1 1 actgngcat 

ACACATATATACAATGTATTTTAAAAATGGGCTTTACAATATGTAGTITGATC^OTGGrrTACAA 

CTAAATATATTGNGAACATTITQTCrrcrACAACAGrrAAAAQAATTGAATAGCTTGQAGGAAACA 

CAATirATTAAGCAATCnTGKTGGGGACATTGAGGTATAATTTTTrrrCTA^ 

TTTATAATGCCTTTGGGAAAAAAAGGGGArrrCTrGNCrrrATATAGCTrrCT^ 

CTTGCCCTTCCATTTAGCC II 1 1 i ACTTGCTTCTCTACCACCACCTAATCACCAATCAAGTAACXrCA 

TTTrariTrTCAACCTCTCTCTrCTATrrGCrrCCTCT™ 

ATGGACTTCCATTCCTTCAGCACnCrGGGTCCTCCCCTTAAAGANGG'l 11 CI 1 1 ICl 1 1 1'AAAGANA 
CCAATrrTAAATGGATTTGGACAACCCTACTCAAAACTGTTT 

SEQ ID NO: 2635 GGTACGCGGGGGAAGrrAGGGCGTOrGGCGTCACTrCCGGCrrCCTTCAGTCC 
OCTGGTCCCGAQCACOAGTTGTOAGGGGATTCACITGTGTGCGGAACTCCTCGGAACCATGGCGT 
CCCTirCCCTTGCACCTCTTAACATCmAAGGCAGGAGCrGATGAAGAGAGAGCAGAGACAGCT 
CGTCTGACTTCnTrTATTaaTGCCATCGCCATTGGAOACTTOGT^^AQA^^ 
GGCATGGACAAAATTCTTCTAAGCAGTGGACGAGATGCCrCTCTTATGGTAACCAATC^ 
ACTATTCTAAAAAACATTGGTGTTGACAATCCAGCAGCTAAAGTmAGTrGATATGTCAAQGOT 
CAAGATGATGAAGrrGGTGATGGCACrACCTCrGTTArcCGTTTTAGCAGCAGAATTATTAAG^^ 
AGCAGAATCTTTAATnXAAAAAAGATTCATCCCCAGACCATCATAAGCGGGTTGGAGAGAAG^^ 
ACWAAGCTGCAAGAAAAGCCTGhrrcAATTCTGCAGTGATATGGNTCCGATO 

SEQ ID NO: 2636 ACACTGTGTCTCTATGTGAATATGGACAGTTAGCATTTACCAACATGTATClXf 
TCTACTTTCTCTTGTTTAAAAAAAGAAAAAAAAACTTAAAAAAATGGGGTTATAGAAGGTC^ 
AAGOGTGGGTTTOAOATOTrrcOGTGGGTTAAGTGGGCArmGACAACAT^ 
ATGTTTAATTGTGATATrrGACAGACATCCTTGCAGTTTAAGATGACACTTTTAAAATAAAT^ 
CXn'AATGATGACTTGAGCCCTGCCACrcAATGGGAQAATCAGCAGAACCTGTAGTATCTTATrTGa 
AATTGACATTCTCTATTGTAATrrTGTrCCTGTTTATTITrAAATTTTCTTrTO 
GAAAGATGATGCTCAG'l 1 1 1 AAACGTTAAAAGTGTCC 

SEQ ED NO: 2637 A Cri ' n 1 rrrriTiTri ' l I ' l ri ri CTAATTATQATCAAeTmATTGATTTTACAT 
AAATAGCTTATAGAATGCCnTrACAAACTATGTTrn'GCAATATATTTGrrACAACA 
TATGAATGTATTTGTTAAATCTITmGCAGAGGAAOTAGOGAAAAGTGAGGGGAGAAOAGQAGO 
AATAAGATTTTAAAACTAATCGTCAGCAATGGGCCTAGGTbmTGGCAATTATTTC^^ 
AAATTACNGGGGTbn'GTANCnTCAAACATNAAAAGTGGNAGTGCATNAATTTAACTAAAA^ 
GTAGCOCCTAATATATTATCTOGGCATCAAAl'l'lCl 11 i'lTIAAATATATCCAGATTCACATATTTT 
TTACTCTrATTAAATAATCGTTTTAAATAATAATCATCACNCTGTTNAGCmCAT^^ 
GGKmAAANrCATGGGGATATCCQNCTTmAATTCCAATrrATTrhnTAATCAOrrA/^ 
CX}GGGGAAAGNGNGCCCCAANAATNACCX;CCTNTGTGTTOAAGA 
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SEQ ID NO: 2638 GGTACAGAGGATCAOACCXnTATAACAAAAAGAGTTATGTTTGCTrATOAAC 
AGTGCCTGCTTGTGCTGGGCCATCACCCTGATATITGGTATGAAGCTGCCCAGTATCTrGAGC^ 
CAAGTAAACTGCTCGCAGAAAAGGGAGATATOAATAATGCCAAATTATTTAGTGATGAAGCTGCT 
AATATATATGAAAGAGCCATAAGCACTTTATTGAAGAAGAATATGCnTCTTrATnTGCATATGCA 
GATTATGAAGAGAGTCGCATGAAGTATGAAAAGGTrcACAGTATATATAACAGACTrCTGGCAAT 
TGAGGATATTGACCCTACCTKMTATATATCCAATATATGAAATTrcCACGGAGAGCAGAAGGCA 
TCAAATCTGGAAGAATGATATTTAAAAAAGCAAGAGAAGATCCAGAACCCGCCCATGTCTATGTT 
ACTGCAGCACTCATGGAATATTACTGNAGTAAGGCCAATCTGGTGCCmAAOATTTTGAQCTG(^ 
CTTAAAAAATATGGAGACCATTCCANAGATGTCCGGGCTATTTGAC 

SEQ ID NO: 2639 ACCATGAAATATCCAGAACATACTTATATGTAAAGTATTATTTATTTGAATCT 
ACAAAAAACAACAAATAAriTITAAATATAAGQATTTTCCTAGATATTG<^ 

AATAGCAAAATTGAGGCCAAGGGCCAAGAGAATATCCGAACTTTAATTTCAGGAATTGAATGGCT 
TTGCTAGAATGTGATATTTGAAGCATCACATAAAAATGATGGGACAATAAATmGCCATAAA 
AAAmAGCTOGAAATCCnXJGATTITmaXiTTAAATCTGGCAACCCTAGTCT^ 
CCACAAGTCCTTGCTCCACTGTGCCTTGGTTTCTCCTTTATTT^ 

CATCTTACCTCACAGTGATGTrGTGAGGACATGTGGAAGCACTITAAOTITmCATCATA^ 

AATTATmCAAGTGTAACTTATTAACCTATTTAATAATTATGGATTTATTTAAGCATCA^ 

NGCNAGAAATTTGGAAAAATAGAAGANGAATCTTGA1TGGANA 



SEQ ED NO: 2640 OGTACATGACCTAATTnTACATCATAOTAAAACAGGCCCTATGGAGAGAGG 
ACATGGGTTICTCrGCTGAACAGCCATTATTTATACrcGTTCCAAGGCT^ 
TTTCCTCGTATTACCACCATTCCAATATTGTTCTOTTGTCCACTAGTCGCCATCTCCA^^ 
CTATCACAAGGTTCATAAAGGGATCAAATarCGCAATATTCCTTGGACATGTCTGCCACCATTTA 
ATTTCAATOATAACrrcnTGTCCATAAAllI'iriCAACTCGGGAGGGTGAGCm 
TACTCCGCGGGCTCACAOATGCCTTGGAACGCAAOGCACGGCTTTCCrcAC^ 
CCGGCGTCTTGCGTCTGGCCrCCGCGT 

SEQ ID NO: 2641 ACXSCXJGGGAAGACAAAGACCCGCAAAAGATGTATGCCACCATCTATGAGCT 
GAAAOAAGACAAGAGCTACAATGTCACCrCCGTOCTGTITAGOAAAAAGAAGTGTGACTACTGGA 
TCAGGACTITTGTTCCAGOTrcOCAGCCCGGCOAGTTCACGCTGGGCAACATTAAGAGTTACCCTG 
GATTAACGAQTTACCrCOTCCGAGTGQTGAGCACCAACTACAACCAOCATGCTATGGTGTTCTTCA 
AGAAAGTTTCTCAAAACAGGGAGTACC 

SEQ ID NO: 2642 AC'l-l-n-I-riTl'l'l'l IJ'll lM'i'l'i'li ri'iCCAANATTITGTmA'mTATTATGGC 
TANAAAGACNCTGTmrAGCCAAAATCGOCAATOACACTAAAGAAATCCnWONGCTT TT^ 
TGCAAATATA l i IV I I CCAAAAGTTGCCCTGGNGGGACnTCAANAGTTCATGTTANL'riUl'rrtCrG 
GAAACTTCCTTTT>m*AGTTGTTGTATrCTraAAN 

GGGCANNGAACTCXrITGATGTT^m3GCAAGTAANTGTITATCTGGCCTGCAATGANCANCGAGTC 

CTn'CCTGNCAGGCGGCThrrTGGTGGTTAAAAGAGTTTGGACAGGTCaCCNCANGGAGCGGGG 

TCTCCTCGGCT^mJGCNCTXJAATATTCr^CTGCTGGCNACGCTGCNGANAC^^ 

TGGGGNGTACNANGAATTCCCCNNNGNATNGGGTNGGAATTTAACTATTTCTTGNOT 

CACTTTGTCCTTNACCANCTGGTANATTCTCCCCCAAhrimTTGTNC 

SEQ ID NO: 2643 GGTACAGTTGGAOTCTGTGTGTTITCTTGAATGTTrGAGACAGCTTCACCT^ 
AACTTrGAArmTCAGCAGCTGCTAGTTGTGCTTGCTGGGATAAATCTTCGA^^ 
AAAACTATGTAAGTATCTGAAGCAGOGCTCTTGTAGACATCTGGTTTTOTGATGACAAAGAGOA^ 
ATTCTTAGATTTCCGGATAGTGACTCTAGTAACTCCTGTAACCTGCCGAAGACCCAGTTTGGACAT 
AGCCTTCCGTGCCnTCrriTrCACTCCGACTCTGTTTTGCTrTACT 
CTGCTGCOjCCAGCTGGGCTTGrrGTGTGGTTGCCTGGGTGGAATCCK} 



SEQ ID NO: 2644 ACATATTTTCGTIXlAAGACACCAGACTGAAGTAAACAGCTGl GCATCCAA'rT 
TATTATAGTTTTGTAAGTAACAATATGTAATCAAACTTCTAGGTGACTTGAGAGTGGAACCrCCrA 
TATCATTATTTAGCACCGTTTGTGACAGTAACCATTTCAGTGTATTGTTTATTATACCAC^ 
AACTTATTmCACCAGGrrAAAATTTTAAmCTACAAAATAACATTCTGAATCAAG^ 
ATGTTCAGTAGGTTGAACT'ATGAACACTQTCATCAATGTTCAGTTCAAAAGCCTGAAAGrrTAGAT 
CTAGAAGCnXjGTAAAAATGACAATATCAATCACATTAGGGGAACCATTGTTGNCr rCACTrA ATCC 
ATTTAGCACTATTTAAAATAAGCACACCAAOTTATATGACrAATATAACTTGAAAATTITI^ 
TGAGGGGTTGGTGATAACTCTTGAGGGATGTAATGCATrAATAAAAATCAACTCATCATTTTCT^ 
TTGGTTTCAATGGGGTGGAAACTGNAAAATGATACTGNAGAACCTNG 
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SEQ I D NO : 2645 ACOTGATACACATAATCAGCCTTTTCAAAAATGCCTGACAAGAATTAGTCTT 
TCCTTrGTGCTGAAGTCrrCCC ACCCA TOGATGGAAOCAOOCTOACTCCCTGAGGGTCAGGCAAGG 
GGTGGGAAAGGGAACACATTACTTTTGTGAAGGCAAAGCAGAAAGGTGTGTTTGCCAGACCAGCA 
TGGGCAGCTCAGAGGGAGCAAAGCATCCACCAGAAGAGGCTCTCCATTTTCTTTGTAGGQCCTGA 
CAOTTGAQATTTGAOGCTOAGTTAACAATGGGACCACTGAACTTCTTTCCAATGGAAAACTC^ 
GCCCAGTCCCACAGGAACTGTGCGCATACCAAACAACAATGAGGAAGGAAGGGCCGGGTGGCTC 
TACCAAACAGTTCAGOTCCACTGQGTGAATOAAGCCraGTGGQAAAGCGGACTCCTGAAGTTGGC 
GCCCrCTGCTGGTCCCACTTCrCATCGTGGCGGGTCGACTGCCTGAGTANAGGAAGAGGCTCAGAT 
GGGGTrGACAAAAAAAOGGAGTGAGGGGAACCCCANGAAGATOAANCCAN 

SEQ DD NO: 2646 GGTACTGGAAGCATOCTCCAAAGACCTGTAAGAACTITGCTGAGTCGGCTCG 
TCGAGGTTACTACAATGGCACAAAATTCCACAGAATTATCAAAGACTTCATGATCCAAGGAGGTG 
ACCCAACAGGGACAGGTCGAGGTGGTGCATCTATCTATGGCAAACAGrrTGAAGATGAACTTCAT 
CCAGACTTGAAATTCACOGGGGCTGGAATTCTCGCAATGG 

CAGCCAGTTCmGTGACCCTCGCCCCCACCCAGTGGCITGACGGCAAACACACCATTT^ 

AGTGTGTCAGGGCATAGGAATGGTGAATCOCXjTGGGAATGGTAGAAACAAACTCCCAGGACCGCC 

CTGTGGACGACXJTGAAGATCArrAAGGCATACCCTTCTGGGTAGACTTGCTACCCTCrrTGAGCAG 

CTXrrT C TGAG ATGGCCCCAGTGAACCAGCTTCTANATGACATANAATGACATOTAATGCTAAATCA 

TTTTTGCTTTCAAGTCATGAAGCTTAAGAAGNCTGGCArTmGGGTGAGTAA 

SEQ ID NO: 2647 ACAGATATCTTCAAAGGAGGAAGAAGAAAGGGAAAGCAGATGGTGGAGCTG 

aatatgccacttaccaga ctaaat caaccactccaocagagcagaoaggctgaatagattccaca 

acctggtttgccagttcatcttttgactctattaaaatcttcxatagttgtta 

ctcat gagt gtaactgtggcttagctaatattgcaatgtggcttoaatgtaogtagcatcxnttga 

tgcttctttoaaacttgtatgaatttqggtatgaacagattgcctgctttcccttaaat 

gatttattggaa>.gtcagcacagcatgcctggttgtattaaagcagggatatgctgnatm 

AAATrG GCAAAAT TAGAGAAATATAGTrCACAATOAAATTATA llllCri ' i GTAAAGAAAGTGGCr 

TGAAATCTTTTTTGGTCAAAGArrAATGCCCACTCTTAAGAATATTCTr^ 

TTTATATATCGGTCATTGNAAAAAGNCCNTAAAATATGTGGTNT 

SEQ ID NO: 2648 GGTACGCGGGGAGACGAAOACTGAGOOGTrGTOOCCGCOTTGCCGACCTCCA 
GCAGCAGTCOGCT IXn'CTACG CAGAACCCGGGAGTAGGAGACTCAGAATCGAATCTCTTCTCCCTC 
CCCri Ul 1 GTGAGA 1 1 1 ri riGATCTTCAGCTACATTTTCGGCTrTGTGAGAAACCTTACCATCAAA 

cacgatggocagcaacgttaccaacaagacagatcctcgctccatgaactcccgtgtattcattgg 

gaatctcaacactcttgtggtcaagaaatctgatgtggaggcaatcttttcgaagtatggcaaaat 

tgtgggctgctctgttcataagggctttgccttcgttcaqtatgttaatgagagaaatgcccgggc 

tgctgtancaggagaggatggcagaatgattgctggccaggtiitagatattaacctggctccag 

agccaaaagtgaacccgaggaaaagcaggtgtgaaacgatctgcaacggagatgt 

seq ed no: 2649 acatgotagtaaacctattgatgocaatntgctgactgtggaagtgactcat 
ccaaacrccatgccagctgtcaacattcagtatgaagtcatcggtaattactattcgtctgagaga 
atggctgataatgcctgigttctttttgccgtctctgttcttatgtttat;^ 
ttatggagcaatttcttatcaagtgggttggctgattocattcttctgttaccga cl 1 1 1 ig acttc 
gtcctcagtt gcct ggttgctattagttctctcacctamgccaagaatcaaagy^ 
aactacctgatttrccctacaaagatgacctcctggccttgoactccagctg 
tcttgtgttctttgccttattcatcatnttaaggcttatcta^ 

aaatacatcaacaaccgaaacxjtgcxxk}agatg>n"gtgacctcggccgcgaccaccctanggcga 
attctacccactoggcggccgtactaatggatccaactcggaccaaccrggcgnaatatc^ 
ctggttctgggngaaatgtitccctccaatcccncacatacaaccggaacctaangga^ 
gcnaaagngacaactn 

SEQ ID NO: 2650 GCGTGGNCGCGGCCCGANGTACAACATNTGTGCAATAAATTCACAAAACTAT 
ATTACNGCNGGlTAATCAGTTTAAGAATTGTTCCCGTCAGNCACATTrTTTGCCCTCAGAAGT^ 
TTCCTAAAGATTTCAACTACTXn'AAATTTCTAGCTACCAAGAAGTTAAGAATGATTATAAGAAGCT 
TTCCAAGGAGTTATGAAATCTTroTAGACCAGAGGCCAACTATCATCACCTCAAGTCTGCTCTCAC 
CAACAGCCCTTGTATTITTCAGGGAGAAATCTCTAGGAAAAAAGTCAOACACCAOTGTAGTCACT 
ATCTNCCATGTCAAACCTAGGGGACTAAAATGGTCNGTrm-ACCATAAAATGATAATTTTGAGGTr 
TACCTTAAAAGGCTTATTCrGGTCTCAAAAArrAGATAAGArrATCrrCTACTGAAATGAATAT^ 
ACTAACATAQAAOACTGCTOCCrmGCCAGTCTTAGAGCWAAGNCATtWCATGAGTCTTAA^ 
CTGGGTTCIATNNTA^ 

GGAGGTGCTTTTNAAAGGCTTAAGOATTACTACTTCCGim'ANNGCAOOGOC^ 
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NAAATTCTTTTTTGGCAAAAANNAAGTTCTCCAATGTAGCTTCCANG 

SEQ ID NO: 265 1 ACAAGTATAGGCAGAGTrATTTTCCTG'nTACA l 1111 n ' ll GTTTTQGGGAAA 
AAATrGGTAGGTGTCTAATTACTGTrrACTTCATTQTTATATTGCAGTAAAAGTTTTAAAACAACC 
ATTGCATGTTTGCTTTTGATGTATCCCmGTGAAATTAGCACTTTrGGGGC 
AGCATTCACTCTCCCTGTCTTITCCXXnTCCCTCAGCAGAAACGTGT^ 
AAACTGCTOCCTTTTAAAAAACCCACAAAA TGCT GATTCAGTTCAAAATTAATG 
AACTGGGTTTCTGATATTTOTAAATGTGmCT^ 

TAGTATAATATTGCnTCAAAAAGAAATGGTAGACAAAACTATAATCCACATCrrTTATTGCA™ 

GAAAGACTGGCAAAGTCTTTTGGATGGGTIGGGGAGATGTGGCTGGAAAGACCTaJGCCCG^ 

CCTANGGCGAATTCCACACCTGCGGCCCTCTANrOGATCCANCTCGNNCCAACTrGGGGAATA 

QCATACTGTrCTGGGGAAATGTNTCCTJTACAATNCCCAACAACACCCGAACTAAANNNAANCCGG 

GOGCTAAGANGCNACCA 

SEQ ID NO: 2652 cgaggtacacacacatggogaacacaccacagcttagatgtgacaaagttcc 

TTCTCA TATGA CTGrrCCCTCAAGGTAAGACTAAGATGGGAGGGGGAGAAAAAGAATCCTAAAj^ 
AGCCAATTTTTAAAAATTCTGCTTTGGCroATTITC 

ATAri'lCl'lCAAAGAGTTTTATATCGAAAGGAAGACTAAATGTAATGTGCCTrrarr^ 

GGAAAACAAC TGTGGT ATTTCAAAAATTCTAAAAATATCrACTrTCATGAAATAGTAACTA 

CTACAAACCTATTmCACTAAATACACACATTTCCTAATCCTGGAGCCTCCTGGGAAAAG 

GGTTCTCCCTGGGTCTTGCAAGACTCCAGTGTTCACAGGAGAGTGTGGGGAATAGCTTCACCCCGT 

CTCCTGGGATGGCTTTGTCAGAAGTCACGTGCCNANACCCTATCAACITCCCCAGCATGTTTAAGC 

AATACCCCATCTTGACCTGCCCGGCNGGCGGTCNAANGGCGAATCCACACCritiGCGGCCGTCTAA 

TGGATCCAACTCGGNCCAA(XrrGGCGGAACATGG>mACTGGTTCCraGGGAAATGTOT 

ATCCCCCACATACAACCGGACCTAANGTAAACTNG 

SEQ ID NO: 2653 ACi i ri 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ri iGGCAGrrrcTAAGTCATrAcrTnTAT 

TTTGAAGGATITGTGAAACTCITCACATCATGGTGAGAOTTTGTATGATTAAT^ 

TTCATGAAATGCTTGGAQGTGAACGAGTTCTCAGCCTGTGAGATCCGACCATCCCATTAACTTTGA 

AGTrrCTCTTGATTAATAGAAGAAAAAAGGGGAGGGTGAAGAAAAGOAGGAACATGCTAAAAAC 

CTTATGACAATCATCCAAATOTOAGGAAAGAACAACCGATTCACCAACTCCACITmCTATrrTA 

CAACTTTCTACATCTCACTCnTGATTrrGGCCTlCCrcGCTGAA^ 

CCTGAGAAGAGCCXn"GGrrCrCCAAAAGACAGAGGAGGAGAACCCTGCAGGATGCGCTGCCCm 
CCAAAAAACTGACAGTCCGTGCTCCCAAAGTrraAACCACNCCCTAATGTGAAAANAACTCCCT^ 
AAAGGT\\ANGAGGAAATGGNGATAACTGGCTIKr^ 

Atll-riMl'lAArrCAGCCTAOGCAACCCCNWroGTNGGGGGTTGGNGOATTnt^ 
TCCNAACCTCITGNACAT 

SEQ ID NO: 2654 ACAGATTTGCTTTCTCTTACAAAAAGAAAAAAAAAATCCTGTTGTATTAACAT 
TTAAAAACAGAATTGTQTTATGTGATCAGTrnXjGGGGTTAACmGCTTAAT^ 
GATTrAAGGAGGAGCTG<XrrTAAAAAAAAATAAAGGCCTTATmGCAATTATGGGAGTAAAC^ 
TAGTCTAGAGAAGCATTTGGTAAGCTTTATCXn'ATATATTrTTTAAAGAAGAGAAAAACACC^ 
GCCTTAAAACXKn'GCTGCTGGGAAACATrTGCACTCTmAGTGC^ 

TCACTGCAGTCTTAAGAAAGAGGTAAAAGGCAAGCAAAGGAGATOAAATCTQTTCTGGGAATGTT 

TCAACANCCAATAAGTGCCCGAGCACACTGCCCCCGGNTGCCTGCCTGGGCCCCATGTGGAAGGC 

AGATCCTGCTCCTCTGCCCTGTGCCTrCAAAACACCACAGTNAa;CTCANGCTTCCOT 

ATTTATTTGTAGGGGNOGTTTAATAAACAAAAAATCl J'l rrri'l'n'l'n'lCCAATTACCTCnTTAAA 

AGGNGGGGGC^TCCTNAAGGN^^^GGNATTGNGGNAAATNNm^AA^^GGNa:C^^ 

TAANNTTT 

SEQ ID NO: 2655 COAGGTACAGTGTGATCCTGTrTAAAGTTACATAAATTACAATCAGGGTAAC 
TGCTATTATTAGGCAAATTGAAAAACAGCACAGAAAGGCCAGTGTGCAACTAGGACCCACTAAAG 
CrCAGAGGAAAGCAATGCAAGATTATCAGACA0aCATGATTGCATAGAAGGAAAATX3TGGTGTCA 
GAGTTCAAACTGGCACACACACACACAGTTATCTAAGGCAAAGTGGTGGCAAAGAAGTTATTTGT 
TCCATGTCACACAAACAAGAGCAGAAGATTCATGGGAACCCGT 

SEQ ID NO: 2656 GGTACCAAATTTAACTTGGCAAACTTICTATTGCXnXjTCCCATGTGCA 
TTTAAAATTTCCCCCATGGAAATCACTCTCCTGTTGACTAmCCAGAGCTCTAGGTGm 
CGTGTGGTGTCTGAGAGGCCATAGCGCCATCATGGGCTOATrmATrACCAGGTCCCCCAGAAG^ 
AGQTOGGAGGCTCTGCnTCCTGCTGCCGCTCTGCAGCCTGGACCTGTGGACCCTGGTTG 
TAAATTGTATCTTAGGAAACCAGTGTCACCirrTrTTCACX^ 
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CATTTCCTGTAACGGAAGTGTTAATTTTACTGT 

SEQ ID NO: 2657 GGCCTTGATGCGGTACTATGCCAAACACITATAACTTGTATAAAAATTCCACA 
TCCCCATATTGGCCACCTCAAGATGAGAACAGATAACTCCCTAAATGTTAACTGGCTCrACTCCCC 
TAATATTAAACATAAAAACCACATGGGAAATATAGAAATTCAAATAGAAGTAACATAAACCTGTC 
ATAAATCOTAAACAAAAAACTATTTOTGGGACAGCATGGATOACAAATGGTCTACTGTGTAAATT 
rrAGAATGAGGCAGACAAAAGlTGGAAGGCCGGTTAATTTTCCCCTCCTTCTCCTGCTTCAGCTTC 
GTCrCCTTGGGTATCCGATGTCCACAATGTCAAGTTGTCTCTCAGTAATTGCATTATTAGCGTC 
TCTTTGTATGACTCTTCACTTAATGTATCAAGTTCANCAATGGCTTCATCAAAAGCTGOT 
AGAGCAGGCTTTCTCTGGGGGGTCAGAATCrCATAATAGAACACAGAGAAGTTNAGGGCCCGACC 
CANNCTOATAOQATGOGTNGGGTGCN 1 'I C 1 ' l'ri'GCTGATTCAAAACGTCTTGGN >rrCTT GTGGGCT 
GAACAAAATCCTTTTTGNATACChmGGNACXrn^CCAGNACGGGGGAATNCT^^ 
CTTN'l"r]'l"rilGGAAIWTTGGONAAAAATTTCAAAA 

SEQ ID NO: 2658 ACAGCnTTrAGCAAAACTGCrrTCCCAGAA^AGCAAATTAAAATAATGC^ 
CAGCAGATCAAGGAGACTACAGCTAGACATCAGAGCTACAACTTCTCATATCTGTGTCAG AAAA T 
CCTATTTACCCAGAATAGACTAAAAmCAGGACAAAACAGGGACTTTTTAACATTTCTTACm 
CCCATTTTAAGTTCTTTTTATCTCCCCCACCCCCCAAGCCACTAATTCAGCAAGATCCA^ 
GAGGGCTACTTTTAGAATTGGCTACTTTAl 1 1'l I'C'l 11 CTATGGCAAQACTGTCATQGAGCCT GGAT 
ATTTCAGGCTCATATGCATCCATGACCAGTGTCCCACTGTGTCAAAAAACTTAGCAATGGGACTTT 
GTGAACTCGGCTTCCCGCIXjCTGCTITGTTCTCCACTGATGAAGGCAGCTrCAAGGNGNA m 
ATCAAACCTTTTAAGACTGGAOQACAGCTOCCAAATATCATCAGGATCC^^^CCTGNAGGAC^T^ 
GGGGCCCCAAAAANAAGCTTTCn^CTTTTTGANGGTTAAAGGGChWAATCCCCCCC^ 
GGGACCANCTANGGNGAATCACCCCCGGNGOCGTNrrAATGGATCCACCCGGNa:AACTTGGGGA 
ANAGGGAAAATGGTCCGGGGAAAGG 

SEQ ID NO: 2659 ACTTCTTCnXjGCCAAAGGCTGTTCCACATTCACTAC AITrA AAAGGCTTCTCT 
CCAATATGGArmCTCATGCrCAQTAAGGTTGGATrrGCX;ACTQAAGOTrTITCX;ACACTCm^ 
ATACAAAGGGCTrCTCTCCTGTGTGAGTTCTCTGGTGTCTGATGAGm 
CTTrCCX;GCAATCnTTACACTCAAAAGGTTTTTCTCCAGTGTGAATTI^ 
TTCCITCrGGCTAAATGATTTTCCACATTCATrACATTCGAAAAGCTTCT 
TGATGTTTAATGACATACTGCTTTTGGCTAAAGGCTTTTCCACACTCGTTAC ATTCA AA^ 
CTCrCGTGTGAAAATGCTCATGCTCAATGAGaGTrGArrTGTGGCTGAAGACTrTTNC^ 
TACANGCCAAGGGGGTTTNCCCACTGTAAATCrCTGCrGGCTGAGGNGTGACATC^ 
CTTTNCCACATCACAAGGTTCTNTOCAGNAAAAATTmGATGAGTGAGGATTACTTTC^ 
AATTNCNCACTTTACATTATAAGGTTTTTTGGTTGANCNCCAATOCAAAANGC^^ 
AAAGT^WNACCCC^^^ATTCAAGCm^NT 

SEQ ID NO: 2660 CGAGGTACAGOOTCCrTTrGCAATAAAACTQGTTATOACrrGATCCAAQTGTT 
TAACAArrGGGGCTGTTAAGTCTGACX^ATACATCACrGTGATAGAATGTGGGCITTTTCAAGGGTC 
AAGATACAAGTCTTAACCACAGTGTAACTTACAGTTTCCTTTAAAAAAAAAAAGTA AACCrG GCA 
GCTATAGAATACACTATGTOCAmATAATAGCTATTTTATATATTGTAGTATCAACA'1'irJ 1 AAAT 
TAAATCTTTTACATTCACAAGTGGTGGGGAGTCTTGTCATTAAGGTGTGTGTAATrrAGAOT 
ITGGTTTTCriTCTGACTGCACTrGTTCTCATAGTAOTAAAATGCTATGCXiCATTTATACC ^ 
AGTCCTCATTCTACCACATGTTAACCCTCTAGCTGATAATGCAAACACTAACTGGGGGA.TT^ 
TATAAGGGCTCTAGAAAAAACGAGTTArrCACACCACX^ATCATCTTAACTA ACATT CrGACTAGTT 
AGNGCANCTTTCATGGGGTGGGGGGNGGCTCATACTANGGNGGGTTTTCTCCnriT^ 
ANAOTGCCCGGCGGCGTNNAAAGGCGAATTCCACCNCTNGGGCCGTCnTmGA 
AACTGGGGNANAGGGAAACTGTTCCTGGGGA 

SEQ ID NO: 266 1 ACGCXKjGGATGCGCAGACACX;CTCAGGCGACTGGCGGGTCGCGGCTTCCAAG 

ctctaaatggagagttgtccctactgtgcggcaggcggaggagacctatgtccagggtgctgcag 
aaogacgcgqagcaogagtcacaoatgagagcggagatccaogacatqaagcaggagctctcca 
cagtcaacatgatggacgagtitgccagatatgccaggctggaaagaaagatcaacaagatgac 
ggataagctcaaaacccatgtgaaaoctcggacagctcaattagccaagataaaatqgqtgataa 

GTGTCGCTTTCTACGTATTGCAGGCTGCCCTGATGATCrCACTCATTrGGAA GTATT ATTCTGTC 

TGTGGCTGTCGTGCCCGAGTAAATGGATAACCCtrrCTAGACCGCCTGGTAGCCITTCCTACTAN^ 

TAACAGGTOGNGTGGAATTACCTGTTGGATTITAGTCTGTACAAANTrGCGCTATTGTGCT^^ 

GTCACTGAACANGAGGATGATCAGCCNCAGGTrAAAACGATTTCTCTCTACrrAAACTGArrACCT 

QjjQYiTTAAAACAAAQQAlATTAAATnTTTOTQAATQTTQCTQNTnA^A^ 

NCCGT 
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SEQ ID NO: 2662 A fl - 1 1 1 1 1 1 1 1 ' i 1 1 1 1 i i 1 U 1 i 1 1 1 I GGAAAATTATACTTTTATTTQAGTCAOC 
AGGAGAAAGATTCACTTGTG<nTCAAGTCAAATGTTCANAATCATAACGGGCCANAAAGGTITGA 
TCCCGAGCACAAGCCCACGAGGGAGOGGACCAAAACAGACCAAAATGAGACAACAACCCCATAT 
AAAAAGATGAACTGGCGGNTTCACACACTCACACACATACACATACACACGGATGAAATGTrrGG 
ACAGAGGCAAATTrCACOTOGTCATTTCTGTriKrrmAAATACAGGTITO 

TnrrrCCAGCTATAAAAAAAGGCCCAAAAGTGCATNTNTNAGGGGGGAAGGGCAGAAATTAAGC 

AATAAAGTCATITIXXXnXSGAGGGACATGANAGGGAGAAAACAGGAGGCANTGCTTGGANAACC 

CACrnrCTCCCACTGGGCrnmTGTATriTAANArmGGCXCC^^ 

CTTTNATGCCTXnTTCTrGGGATTAAAAGAACTTGAAAAACCl-J-l-liAACTGTO 

GGTTAANCCAACCCANCCCI^ACCCnTAOGGOTTrOGOCTITrrAAGGA^ 

AAAAAACCCCANANCCCCGGGArn-AAACCAGGNGANGTCCTCNTnT 

SEO ID NO- 2663 GGTACGAGTCAAGCACAAACTGCTGCGCCAGGAAAAGACAAGGCTAATTGG 
OCCCAACTGCCCTGGAGTCATCAATCCTGOAOAATGTAAAATrGGCATCATGCCTGGCCATATTCA 
CAAAAMGGAAGGATTGGCATTGTGTCCAGATCTGGCACCCTGACITATGAAGCA^ 
CAACOCAAGTTGGATTOGGGCAGTCTITGTGCGrrGGCATrcGAGGTGATCCriTrAA^ 
ATTrrATTGACTGCCTCGAAATCTITITGAACGATTCrGCCACAQAAGGCATCAT^ 
AAArrGGTGGTAATGCAGAAGAGAATGCrcCAGAATTnTGAAGCAACATAATTCAGGTCCAAAT 
TCCAAGCCTGTAGTGTCXriTCATrGCTGGmAACTGCTCCTCCrGGGAOAAGAATGGGTCATGCC 
GGGGCAATTATTGCTGGAGGAAAANGTGGAGCTTJAAAAAAGATTCTGCCCTrCAANTGCAGGATT 
GTGGCAGTATOTCTTCTOC>lCACTGGGAACCACATCTACAAGGAATTTGAAAAGANGAAGAAGCr 
TTTNNTGA 

SEQ ID NO: 2664 ACAGTKKSAGTCTGTGTGTTrrCTrGAATGTrrGAGACAGC^^ 

GAAmCTCAGCAGCTGCTAGTTGTGCTTGCTGGGATAAATOTCGATCTrGGCrrCCCC^^ 
TATGTAAGTATCrGAAGCAGGGaxnTGTAGACATCTGGTTTTGTGATGACAAAGAGGATATTCTr 
AGAmCCGGATAGTGACTCTAGTAACTCCTGTAACCTGCCGAAGACCCAGTn-GGACATAGCCTr 
CCGTGC CI ' ICl i 1 l CACTCCGACTCTGrmGCTTTACTQACTGGTTCTrCATa}ATTTCAGCTOCTG 
CCGCCAGCTGGGCTTGTTGTGTGGTrGCCTGGGTGGAATCCTGTTCTTCAAGCTCT 

SEQ ID NO: 2665 ACTCCrTXX:AGTIXrrACTCAACAAAAATCATGATAATTGTGATAAAATAAGA 
GCAATrCTACrrTATATCTTCAGTATTAATGGAACTACGGAAQAAAATTTGGACAGGTT^ 
AATGTAAAGATAGAAAATGAGAGTGACATGATTCGTAACTrOGAGTrACCITGGTGT^ 
CCCCAATCTCAACAAGGCAAACCGTT'AAGAAAGOATCGOTCTGCAGAAGAAACTTrrcAGCTCT^ 
TCGGTGGACACCnriTATCAAAGATATrATGGAGGATGCTATTGATAATAGATrAGATTCAAAAGA 
ATOGCCATATTCTTCCXAQnrGTCCAGCAGTNTGGAATGGTTCAGGAGCTGTA^ 
ACCCAGAGCTTATTATTrAOAAGACCGNAAAAATGGGTCAAAGCraATTui ri 1 iGTAATTGGAOG 
GATCACATACIXH'GAAGTCCCTrGTGCTTATGAAGTTrCTCAGCACATAAATCCTGGGAAGTATTA 
TTGGTrCTACACATGTTTAOCCCAAAAGCTiyrGQTGATI NNAGA TGTGATAACCC^ 
OTATTAAAATGAAACCTTTITrcGNGG™AGATmTCT^^TTO 
CTNATTAACAANTAA 

SEQ ID NO* 2666 ACAACATCTACCCCGGACACGGQAGGCGCTACX}CCAGGACCGACGGGAAGG 
TTTTCCAGTTKriTAATGCGAAATGCGAGTCGGCrTTC^^ 

ACTGGACTGTCCTCTACAGAAGGAAGCACAAAAAOOGACAGTCGGAAGAAATrCAAAAGAAAAG 

AACCCGCCGAGCAGTCAAATrCCAGAGGGCCATTACrGGTGCATCTCTTGCTGATATAATGGCCAA 

OAGGAATCAGAAACCTGAAOTTAGAAAGGCTCAACGAGAACAAGCTATCAGGGCTGCTAAGGAA 

GCAAAAAAGGCTAAGCAAGCATCTAAAAAGACTGCAATGGCTGCTGCTAAGGCACCTACAAAGG 

CAGCACCTAAGOU^GATTGTGAAGCCTGTGAAAGTTTCAGCTCCCCGAGTTGGTGGAAAACGC 

CCNACTGGCAGATTAGATTTTAAATAAAAOATTGGATTATAACrC 

SEQ ID NO: 2667 ACATGGCAATTAGAAGTTGTCATCGCAAAAGAAAACCACAGCTGGCCTGCCA 
CAGCCAACACAAGAACCAGAAAATGGTAGATGAAATGAAGGAATAAAGGTGGGGTTTATTCCTTA 
TTATAAAAGAAAAAAAATAATTCTrCAGCAGTCTTAACAAAGACATCAAGATACAAAATTACAAG 
TGTTITGACTCCAGCCCTGTCCCCATCrcCTCCAAGAGCAGAGGTAGGAGACAGTTGAAGCAAAC 
AAGCAATTCTOTAAAAATTACCTAGAAACCCrACAAATTTGATTAAAATCTAAACTTCTATAArrr 
TGCTTTTTAAAAAATTTAATATCAAAAGGOCTGCTITAGTGACATGCTATTCCrArc 
CXCCAATACCCCCTCTGTCAGTAACATGCTCAAGTTGACCAAGCCAACTCTTATCT 
GTGGACAAAGTCTGTCTTrrAAACCAAAGCCACCGGAGATCAAATGACTOCTAGTAACGTGTCGT 
CTGGTAACTAAATCCAGTGGCCCCCTTCTTCAAANGGNGGGCAGGGCAGAAACCXX^^ 
TCNGGGGCTAATNCCAAATOWCATTAANAACCACmxriTGNTrnTrC^ 
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TNTATOCGCTrTTGTmT^AAATrGGGGCCCa^GACTGijrGGTm 

SEQ ID NO: 2668 CX3A0GTACACAGTmCTGTaAAATATGATGCTGTATGTGGTTGTGATTTrrT 
TTCACCTCTATTGTGAATTCTrmCACTGCAAGAGTAACAGGATrTGTAG^ 
AAOAOAAAGAAAAACAAAATCAGAGGGCATTAAATGTTTTGTATGTGACATGATTTAGAAAAAG 
GTGATGCATCCTCCTCACATAAGCATCCATATGGCITCGTCAAGGGAGGTGAACATTGTTGCTOAG 
TTAAATTCCAGGGTCTCAGATCKjTrAGGACAAAGTGGATGGATGCCGGGAAGmAACCTGA^ 
nAGGATCCAATGAGTGOAGAATGGQGACTTCCAAAACCCAAGQTTGG CTAT AATCrCTGCATAA 
CCACATGACTTGGAATGCTTAAATCAGCAAGAAGAATAATGGTGGGGTCmATACTC ATTC A^ 
AATGGTITATCTOATGCCANGGCTGNCTIXXnTTCTCCCTrTGGATG GGTGGTG AAAT^ 
GCCCTGNCTGCTCCTTCTAGCTATTAAGAAAGAACCCAACTTGGGTCin'l I i I I'GCT^ 
AAAAATAAATTGGAAAAANGANACGGG GGTG TGGNAAAG GCn^AA AAATTGCTCTrGmyCCTT 
ANCCCAGGGTTTCAATTTGCCAATGACATnTGGCCGGGNGlI'rii'NGGGNOO 

SEQ ID NO: 2669 acattttttaaagcccgctagcaaagcaaatgtgcaggcatccaaaatgttt 

a:ATCGTAGTCGAGGCAAATGAGATCACAGTATAGAACCCAGACAAGCTrTCCTGGAGAAATGCA 
TAAGTCCnrmCTGAATTATCTGTGAATmCAATGACATCnxaCAATG 

GCCTCITCTCCAGGAGGTCCAGACCGGAATCTCGATGAACACAGGGGTGGTAGATCCACA-n-AGG 

AACAACGTATCCTTTATCAGGGGCATCTGTTGATGGTOCTGCAAArrCTGCm 

TACTGNAGTATTTCCAACTTCACTAAAGCAGAACCATCTGCGGTACC 

SEQ ID NO: 2670 ACTCTGGTGACTCACCACTTCAGGGCTTrACTCCGTAACAGATrrrG'rTGGCA 
TAGCTCTGGGGTGGGCAGTTTTTTGAAAATQGGCTCAACCAOAAAAOCOCAAGTTCATGC^^ 
TGGCAGAGTrACAGTIXnX3TGGTTTCATGrrAGTrACCITATAGrrACTG TGTAAT rAGTGCCAOT 
AATOTATGTTACCAAAAATAAATATATCTACXCCAG ACTAGA TCTAGTATTTTn'GTATAArrGGA 
TTIXXTAATACTGTCATCCTCAAAOAAAGTCTATTGGTTTTITA AAAAAG AAAGTGTA 
TAAAGTCAGATGGAAAATTCATTTirrAAATIXXXXSTTTTGTCA 
TATrACCCXnTTTCGGCCCCATGTATCTCAATACXrrC 

SEQ ID NO: 267 1 ACTGTTTAAGGCCCAAAGTAATAGTTTTTACAGATCTTTTAGTrTCAACTAAG 
CTmACAATAGAAAGGACTTGTATTGCATTGAGTTTATAAACrrrrGGTTTG^^ 
GATCTGTTXrrmCCAACCAAATGTCTAGGCTrcACTTTTCCA(XCCAATGA 

AATATTAGCAGATATACTTrGATAACCAACACAGCTTGTATGAAAACTGAGCAAGTGTCCATCATG 

ACCCATAGGGTTCTaGGTAOTTGTTACCTrACTOTTTAGTCCACAAAAAATAACGACTTAGATAAG 

CTCTCTGATrrcCTTTCGCrrATTAAACACGTTCACCATCAGATGATGTGTGTAA^^ 

TTAAACTAGACTAGGAmAATACAAATCCTGAAAGCrTAAAATAAATAAATAAATAAATGACTG 

CAAGGATTCCACAACAGGGCATGACTATAATTTAACAAAAAAAAAAATCrGTOOCCAGCATGGNG 

GCTCATGCCTATAATCCCACCACnTGGGAAGCTGAGGNGGGAAGACGCTTGAGTTTATCCAACCT 

GGGCAAAATAGGGGGAATNCCTC^^^ITCCAAAA^r^AAAACTACTTGGTGCCCCCOT 

GTTACACCTITATCCCCCXmTNGGNGGCGNGGGGGGGGACTCrT 

SEQ ID NO- 2672 tcttccatttagtttgtggtancaaagcagcannggngnaattgaatganaa 

ACTGAGATTTCTCAATAATGGTGAATAmCGCTCnTrAAACCTAAAACTCTrCATTGAGTAGCrrA 

TATTTGAACATGANGGGGNAAACATTTGCCrCTACCTCTGATmGCTrTGCrGTCAAAGm 

AOCTrCCAATTACTTATGTGTGTCCTGNAACACAGGTGATTGAACGTATGAGAGGGAAAGACA^ 

GAAAAAGGAAGCX>GACACTAGGAGAATTATTAACrrcrCATACTIXXX;CACATTC 

CGOAGTGTATITAGCCTGTAGATGITGTGATATGCAAATATCCCATTCCCTGGTrACTrGGC^ 

AAGATTCTTNATGGTATTITCAAACTTTGGGATAAAriTACAGATTAGAAAa 

AATCTCT>n^TTTCCTrACAAAATNCCTrrTGrmCTGCTTGGAAANGATCT^ 

GACTAKGTTmANTNAAAAGCCCTTTCTCNAAAGCNCTTTCAGTTGCANC^ 

CAGAmATGCTATACCTTTGGATAAGAACCTGTTrTTTGNAAAACCATCATNTNTrn^ 

AhTTATIGGNCGGTAAACAGGGTTrCATGTrCTCTCCaXAATGGANhrrCCT^ 

SEQ ID NO: 2673 AarANGTrrTQOTGTCAACTAOAAGAOQTCTTATTGAA GTTA AAACAGATGA 
GTITOCTCXiCCATGGGAGCAACATAGAAGCCATGTCCAAGCTAAAGCCTTACrriCriACrGATGG 
AACGGGAACAGTCACCCCAGCCAATGCTTCAGGAATAAATGATGGTGCTGCAGCTGTCGTTCTTAT 
GAAGAAGTCAGAAGCTGATAAACGTGGGCTTACACCTITAGCACGGATAGTTTCCTGGTCCCAAG 
TGGGTGTGGAGCCTrCCATrATGGGAATAGGACCAATTCCAGCCATAAAGCAAGCrGTTACAAAA 
GCAGGTTGGTCACTGGAAGATGTTGACATATrrcAAATCAATGAAOCCTrrGCAGCTGTCTCTGCT 
GCAATAGTrAAAGAAOTGGATTAAACCCAGAGAAGGTCAATATTGAAGOAGGGGCTATAGCCTr 
GGGCCCCCrCTTGGAGCATCTGGCTGTCGAATTCTTGTGACCCTGTTACACAC^ 
GGCAGAAGTCmXjGTGTTGCANCCCTGTCATTGGGGGTGGGATGGGAATACCATGTGTGTCANAN 
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AAAATAAATGTTAACTTTGACAACCTCAATrcrmAACTAATAAATACCTGGCCGGACCANCT 
GGGGAArrCACCCNTGGCGCGTCTATNGATCCACTTG 

SEQ ID NO: 2674 CGAGGTACGCGGGGTTCTCTTrurGGTCAAAATGGCTGOTAAGCAGGCCGTTr 
CAGCATCAGGCAAGTGGCTGGATGGTATTCGAAAATGGTATTACAATGCTGCAGGATTCAATAAA 
CTGGGGTTAATGCGAGATGATACAATATACGAGGATGAAGATGTAAAAGAAGCCATAAAAAGAC 
TTCCTGAGAACCrn-ATAATGACAGGATGTrTCGCATTAAGAGGGCACTGGACCTGAACTrGAAGC 
ATCAGATCTTGCCTAAAGAGCAGTGGACCAAATATGAAGAGGAAAATrrCTACCrTGAACCGTAT 
CTOAAAOAGGTTAITCQGGAAAGAAAAGAAAGAOAAGAATGGOCAAAGAAGTAATCATGTAGTT 
GAAGTCrGTGGATGCAGCTGTTATGAAGATGGTTAAACTrcAAACAAACAATmAAGAATTAm 
GGTCTGAAGATGTmACrrTAAATAAATGTCTATTGTAATGGCTGQATAAAAAAAAAAAAAAAA 
AAAAAAAGTACNTCCCGGCGGCCGT 

SEQ ID NO; 2675 TCGAGCCGGCCCGCCCGGCCAGGTCCTTCnTGATGGGCCATATGCTrrATCTT 
CTGGGGTAATATAAACAAAAGAACTATCCATAAGTATGAACAGGAGTCTAAAAAGGCTGGCAAA 
GCTTCGTTTGCATATGCATGGGTCTrGGATGAAACrGGCGAAGAAAGGGAAAGGGGAGTAACCAT 
GGATGTrGGTATGACAAAGmGAAACCACAACCAAAGTTATTACATTAATGGATGCTCCAGGCC 
ATAAGGACTrCArr<XAAATATGATTACAGGAGCAGCCCAGGCGGATGTAGCTGTnTAGrrGTAG 
ATGCCAGCAGGGGAGAGTTIXjAAGCTGGATrTGAGACTGGAGGACAAACACGAGAGCATGGACr 
CnTGGTCCCGTrCTCTGGGAGTGACOCAGCTTGCAGTTGCAOTTAATAAAATGGATCAGGTTAAlT 
GGCAACAAGAAAGOTrCAAGAGATTACTGGAAAACTTGGGCACrmCTrAAGCAGCAGGTm 
GGAGAAGTGATGTAGGTTTTATTCCTACAAGTNGGCTCAGTGGTGAAAATCTAATCAC^ 
AGTCAAAGTGAACTCACAAATGGTrTAANGACTATGTTATANACAAATGGTTCTTTAAGCCrCCCA 
CGACTITGNAAACNTl'AATATTGTTCCATGTTTNAAACAAGATTGGTrTTt^ACT^ 

SEQ ID NO: 2676 ACGCGGGGGGGAATCATGCCTGCrCGCAGAGCTCTGCACTTCGTATTCAAAO 
TGGGAAACCCGCTTCCAQACGGCGajTITCTATCGGGACGTCCTGGGGATGAAGGrrCTGCGGCA 
TGAGGAATTTGAAQAAGGCTGCAAAGCTGCCTCTAATGGGCCTTATGATGGGAAATGGAAAA^ 
ATGGTGGOATTTGGGCCTGAOOATGATCATTITGTCGCAGAACTGACTTACAATTATGGCGTCGGA 
GACTACAAGCrrcGCAATGACTTTATGGGAATCACGCTCGCnXTAGCCAGGCTGTCAGCA^ 
AGGAAGCTGGAGTGGCCACTGACGGAAGrrGCAGAAGGTGTTnTGAAACCGAGOCCCCGGGAO 
GATATAAGTTXn-ATTIXKrAGAATCGCAGTCTGCCTCAGTCAGATCCrGTATTAAAAGTAACTCTAG 
CAGTGGCTGATCTTCAAAAGNCrrGAACTACTGGTGTAATCTACTGGGAATGAAAATTTAT^^ 
AAGATGAANAAAAANCAAGGGCTTTGCTrGGOCTATGCTOATAACCANOGNAACCTGGACtn'AC^ 
GGGCGGNACGGGTGGGGGGGGCCNTTCNACCACmTrGNAAAAATGNCi'inU'l'lUGCCCCCAA^ 
ANAATTGCCCACrrANAAAACTrGNTGAAAAGGGGAACCCNAAAAATTrTATTCCCTGGGQG 

SEQ ID NO: 2677 ACTCCTCTCAGTTTGGTGGTGQAAGTCAATATGCTrATTTCCATGAGGAGGAT 
GAAAGTAGCrrCCAGCTGGTGGATACAGCGCGCACACAGAAGACGGCCTACCAGCGGAATCGAA 
TGAGATTTCCCCAGAGGAACCTCCGCAOAOACAAAGATCGTCGGAACATGTTGCAGTTCAAC^ 
CAGATCCTXjCCTAAGAGTGCCAAACAGAAAGAGAGAGAACGCATTCX}ACTGCAGAAAAAGTTrc 
AGAAACAATTTGGGGTTAGGCAGAAATGGGATCAGAAATCACAGAAACCCCGAGACTCTTCAGTr 
OAAGTTCGTAGTGATTGGGAAOTGAAAGAGGAAATGGATnTCCTCAGTTGATGAAGATGCGCTA 
CTTGGAAGTATCAGAGCCACAGGACATTGAGTGTIXjTGGGGCCCrrAGAATACTACGACAA^ 
TTGACCGCATCAOCACGAGGAGTGAGAACXXnTG<XGAGCArrAAGCCATNTTN^ 
CACCACANACAACCrn-CATTCGCAAGCTGGCAAAACTCANGGGAATGTGTTTGCCCTrGAGCCAT 
NCTGGCCCCCTGATAACTGTNCTTGGCCGNACCCCCITAGGGNGAATrCCACNCCTGGGGGCCGT 
ATTANGGATa:>WCTNGNNCCAACTTGGGGNANNANGGGNTAAhn^NmC^ 

SEQ ID NO: 2678 ACAGAAAAGAAAATCCCCGTrGTrnTCGATTGCAAGAGGGlTATGATCATA 
GCTACTACTTCATTCCAACXnTrATTACTGACCACATCAGACATCATGCTAAATACCTGAATO 
QAAAAAACTCCAAATAAGAGAATCTCTTtAGOATTATAAAAGTTGTAAAATGCAACTGTATrGCr 
GAGCAAAAAAAAAAAATrCAAAACATTGGATTTrATAGTOCTAAAAGGGCTTrATTCTATAGTTG 
AATCACCTCTGAATAAAGATATAAAACCTACNAAAAAAAAAAAAAAAAAAAAAAGTAA 

SEQ ID NO: 2679 AcrrrnTrn 1 1 1 i i 1 1 1 1 l u 1 1 1 iacagi i 1 1 1 acatttatttaaacagaaa 

ACGTGCACATGAGCTGCCTACTCATTTrcTTCACTGCGCAGCCTGGCATTGGGGT^ 

ATG0CCAGTrGGGCAGCT(nTIXX:ACGATGGCTTTCCGGTTCTIXK3AGGAAACATTGTG 

I'CGGCACAGTAAGATTTGTTCCACATCAACAGCACTrCCAGCTCCTTGACGTTGTGGACCAGG^^ 

TTCGGAAGa:ACTGGGCAGCATGTGCITIXrrrrrTTTO™ 

GATCTGGCCCTTGAATCTTCTACGAACCCTGTTCTCAATGCCTCTGC^ 

ATTTTGACATATCGGTCTGCCCXJCGTACGGATGCTACTrGTCCAATGATGGTAAAAAGGGTAGCTT 
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ACTGGGTGGCCrCCGATTCAACnTAAAATGAAGANGTCrGGCGGCTAGGAATCAATAAAAGGGAT 
TQOCTTAATGGGOCGAAAAAATATGCCTTGGTGGTTNOOATATArrOQAOGAAGGGGATTA^m^ 
TTNGGATGAAGGATGOATATNAATAGGGCAGGGNCCCCTCn i 1 1 TriAGGGGCGGATCGGAAAA 
TITITAGGCNAAAAOAAAATTATrCGCCTTATTTGGGNGGGGGTTAANGG 

SEQ ID NO: 2680 A CTn ' i ' n ' 1 ' 1 i ' l 1 1 ri - Il ' l i 1 1 1 I GOATCTrGTCATTCTrGGCACTGTTTCTAAA 
GAAAAACTCCATTATCCCAAGCAAAAAGCACAGAAGGTGGAGTTTGGCTTCAAGAGATGTTAACT 
CAAAATXriTAGQCCTAGCAGAGAATCACCAAATTrATOOAGAGTTAACAGGOOTTrAACAGGAAG 
GAAGTGCCTITAGTAAGTTCTCAAGCCAGAGGCTGGAGGCAGCAGCrAAATCAGAGGACAGCATC 
CTCAGTGAAAGTGAGCCATTCGGGGTGGCATGTCACTCCAGGAATAAACACAACTTATAAACAAA 
TGATTTCGTAGGATAGCACAGTGACATGGTGCACTGTGAACCKSAGGCCACrG^ 
ACTGGTIXJTGAATAGGGGAGAGCCAAAAATTATGTCCTACTGGTAATGAGCTrTCAATGGCT 
CCTCTCCACTOAAGCnmGTAGAOCACTCAAACCCC^CCACTCCCACATrGCCCm™ 
CTGTCTGTGGCACCCACAGGAAGGACTGGANTCCCATANGATNCCOCCCCCrTGAACCCCAAACT 
GCGGGATTGGGTCTATGGTI^CCCrAACTCC^rCNAAAGCNCAATTITGAAANGGAJWACA^ 
AGGCTTTAATOA 

SEQ ID NO: 268 1 GGTACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGT 
TCATAGGCTATCX:CATCACCCTrTATrrGGAGAAGGAACGAGAGAAGGAAATTAGTGATGATGAG 
GCAGAGGAAGAGAAAGGTOAGAAAGAAGAGGAAGATAAAGATGATGAAGAAAAACCCAAGATC 
GAAGATGTGGGTTCAGATGAGGAGGATGACAGCGGTAAGGATAAGAAGAAGAAAACTAAGAAGA 
TCAAAGAGAAATACATTGATCAGGAAGAATTAAACAAGACCAAGCCTATTTGGACCAGAAACCCT 
GATGACATCACCCAAGAGGAGTATGGAGAATTCTACAAGAGCCTCACTAATGACTGGGAAGACCA 
CTTGGCAOTCAAGCACTTTTCTGTAGAAGGTCAGTTOOAATTCAGGGCATTGCTATTTAT^ 
CGGGCTCCCmxrcCTTTTTGNGAACAAGAAGAAAAAGAACACK^^ 

GGTCATCATGGACAGCTGTGATGAGTTGTACCCGAGTATCTCAATTTTATNCGGGGGGNGGTGACT 

^m•AGGACTTONCCTGAAATNTTCCGAGAAAGCTCCGCCOANCAAATCT^GAAGTC^m>^CA^ 

OTGTAAAAGGCCTGNCTNTTTTTGCTGGCGAAANAGGAAATAAAA 

SEQ ID NO: 2682 ACAGTATTGGAAATGGATCTGTCTTTGGTAAAGATCAGCCTATAATTCrTGTG 
CTGTTGGATATCACCCCCATGATGGGTGTCCTGGACGGTGTCCTAATGGAACTGCAAGACTGTGCC 
CTTCCCCTCCTGAAAGATGTCATCGCAACAGATAAAGAAGACGTTGCCrrCAAAGACCrGGATGT 
GGCCATTCTTGTOGOCTCCATGCCAAGAAGGOAAQGCATGGAGAGAAAAGA'nTACTGAAAGCAA 
ATGTGAAAATCTTtXAATCCCAGGGTGCAGCCTTAGATAAATACGCCAAGAAGTCAGTTAAGGTT 
ATTGTTGTGGGTAATCXXGCCAATACCAACTGCXn'GACTGCTTCCAAGTCAGCTCCATC 
AAGGAGAACTTCAGTTGCTTGACTCGTTTGGATCACAACCCGAGCTAAAGCTCAAATTGCTOT 
ACTTTGGTGTGACTGCTAATGATGTAAAGAATGTCATTATCTGGGGAAACCATTCCTO GACT CAGT 
ATOCANATOTCACCATGCCAAGGTGAAArroCAGOAAAOQAAGTTGGTOT^ 
ATACAOTGGTTCAAGGAAAATTTNCX^CTGGGCACAACGTGGNCriUll 1 1 lAAGGTTGAAAATNT 
CCANGCCTTTTTGThWAAAAACNTITNNACCC^ 

SEQ ID NO: 2683 ACTCACATTCATTrGTCACATAmCAGGCCCTCATACACCCCTTTTAAATTOT 
CTAACTCCrATCCCAG l I ' l Ci I i i I ATAGTCTAAAAACAAGGAATCACCCAAGTAAGATACTCCTT 
CAGAGCACTGCTGAAAACGGATCAAACGTGGAGATCCCCCAGATCCCTGrrCTCAAGTGT^ 
ATATTTTATATTAGCACATATAATACCCTTAGATATATTCrGTTATGTTCTAAAGAGTTTGTGTTTC 
CCCCTTTTTGATGATGTCTTCAATTTCTTCTGAGACCTTNCCTGT^^ 
TAACTTCTCTTGATCmCAGCGGCNAACCATTTCTTTTGCACCCATGCTAATAAT^^ 
NGGGGATOGGGGAGCACTTTCCTAATTTGTCATNAGATAACITCGACAGGGTNAiUUCn'l^ 1 1 CCT 
TNTTGNGTrGCAACITTTNACTmATTAC>fAAAACATATC>Cnm 
AGAAATAAAATGCTmGNTCATTITCAAACrACTCrCX:CA>rrCT^ 
GNGGTTTCCCCCTGCCNAGTTGACTWrmGCCTTGGGAATGCNAAAANC^ 
GGGGGCCCNANAAAAACCCCCNGGGGTTTGG 

SEQ ID NO: 2684 GGTACTGGAGATGTATTTGATAACCAAGGTmAGGTAAATnTCACCAGTAT 
TAGTTCTATITOCAAACTGAAAAATGriXn'AGOCTTAATATAAAATAACCACATTAGTGAAC^ 
TATCrCTTAGAAGAAAGGCCATATTITGCTCCTGCrrCTGTAAAAATArTAriTG'nTGAAGGGGA 
AATAATCGTAGTGTGACCTTTCACnTAATrCCTACTCCCTrAATGTGAGAGAGACAAAA TGAGC TG 
AAGAAGOAAAATTCTGGAGTTACACTCCACAACCnTGAACATACTGACGGACAT CTCTO T^ 
AACX}ATITCTCCATGCCAax;ATGCTCTAATGCCnTGTGGATCACGGACAArc 
CTACAGCATCAGCGATGTTATCnTGCAGCAAAGCACrOCAGGOATAAATGACAGGCATTAACrGC 
TCCTGGGGrnTGGCCATCATTACCCCAGTAGCGGCTATTGATCTGAAATATCCCATAATCAGTGC 
TTCTGTCTCCANCArrGTAGmGGNAGCrNGGGTGTTNTAACCACT 
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CATCCAGTmGGTTGGm'GGATtXCCTNGAGCCnOJCTTTCCCAATCTTm 
TXX^^CTTTCAAAAACTTGCXnTGAaXJTACCAAAAGNNGGCAGCaXX 

SEQ ID NO: 2685 ACAAATAAAATCAAAAAOAGCAOTQTrCTQTTGTATTCA TTTCT GCATGTATA 
GCTnATTAArrGCTAATGAAAATTAGAACnTTCTGGGATCTrCTGACAAGA 1 1 1 l"l AAAAAATCT 
TAAAATGC CIUM - IClM tAOTOAAGCCATCTTTGGAGTTAGTCATTACTCrCACCT 
TGACTTCAACCTGATATTCCT Cl IL ' l i 1 1 GGTCCAGACX:CTCAAATTTTAAAAGTAGCTTCAA0TTA 
AGOAAAGGTCATTTrrCCACAGTTCAGTTCTCTGAAAAACTTCCATCTCCCACTGAAAGTCACAGT 
CCAGGAGTOAAGTAATCACATGCTAGAACATCAGGGCCAATTOOAAAGTCATrATOAACA CTTOC 
ATTCGTCGATCTrATTrATCACCACAAGCCTGAAAATGCAATGTCCTGAAAAANGNGACCTCTITG 
TOCCCACGTAATTTTTAAAAAAGGAGANGGGTAATATTAAAGGQGACTOAGGCTTGGTCACCCAA 
AAATNAGCNCAATGAAACCAACAATAATGAATAATGAGCNCTAAA ATCAAAT ACCCGAGTTTCAA 
AAAAAGGGGGCCGGTTTAAATNC^mTGACCCCCNT^TCAAAANAACTT^mAAAANA^ 
TGGGGGAAAAAAAAAAAAAAAAAAAAACCTGGCCCC 

SEQ ID NO: 2686 GGTACGCGGGGGCGTGCTGTTGGGAGTTGCrTGGAGGTTGGCGGCGCGGGGC 
TGAAGGCTAGCAAACCOAGCGATCATGTCGCACAAACAAATTTACTATTCGGACAAATACGACGA 

cgaggagtttgagtatcgacatgtcatgctgcccaaggacatagccaagctggtccctaaaaccc 
atcix3atgtctgaatxnx3aatggaggaatcrrggcgttcagcagagtcagggatgggtcca™ 
tgatccatgaacx:agaacctcacatcttgctgttccgococccactacccaagaaaccaa agaa a 
tgaagctggcaagctacttttcagcctcaagcrmacacagctgtoc^ 

OATAACATTATTATGTTGCOTCTrGTrTCrCACTrrGATAmAAAAGATGTrCAATACACTa 
GAATGTGCTGGTAACTGCrrrGCTTCTTGAGTAGAGCCACCACCCCATAGCCC AACCA GATGAGTG 
CTCITGTGGACCCCCACCTAAGCTCAGTGTGACCCCAAAAGCCACXJATGTGCTCTTT^ 
ACACTTGNCAGATGGAGGAACCTCTNA^^mGAAACATNG^^ITNTNCAGGGACATGTAAAOT 
TGGTTTG Ul 1 1 VlCl I CCCGGGGTNGATGTTGGGGGAmyCGGArrATrmTCN 

SEQ ID NO: 2687 TTNATGGCAATTAGAAGTrGTCATGGCAAAAGAAAACCACAGCrGGCCTGCC 
ACAGCCAACACAAGAACCAGAAAATGGTAGATGAAATGAAGGAATAAAGGTGGGGTn'ATTCCr^ 
ArrATAAAAGAAAAAAAATAATTCTTCAGCAGTCrTAACAAAGACATCAAGATACAAAA>r^ 
GTOTTTTGACTCCAOCCCTGTCCCCATCrCCTCCAAGAGCATAGGTAGGAGACAGNTGAANCAAA 
CAANCAATIXrrGTAAAA^TTACXrrAGAAACCCTACAAAmGATTAAAATCTAAACTTCTATAAT^ 
TCGCTTTTTAAAAAATTTNATATCAAAAGGCCTGCTTTAGTGACATTGCTAm 
AACCCCCAKTACNCCCTCTGTCAGTAACATNCTCAAhrrTGACCAGCCNCCTmTATCTCTAAA^ 
ATGGGGAACAGGTGTCNCirrmAAACCCAAACCCCCNATGATCAACCNCTNGCTGT^ 
GTCrrGTNNCTNrrCCATNGCCNCCTrrTCCANANQOGOOGCrcGGGCTTAACC^ 
TKGGGCCAATCCCNTITACCCNriTAAAACC^mTNCTA^ 
GOCCCATT^m^•AANTTGTNCTCCNCCTGCATG^r^ATIT^n^ 

SEQ ID NO: 2688 GGTACTTTTGGOATAACnTrcOTrACAOTrcmCAAAATGrrCAATGCTC 
ATCTTCAAAAATGACTGTrCCTTGAGGCAATAGTCTGACATCTGrrcCAAC^^ 
TCXnTGATTGTGAATTCCACATCATCGCCAGGCTGTAAGGTTTCTAAGTCACCCTrAAATTCACT^ 
AGTGAAAGAATATCrcrmACAACATCACCrCTTTCAATAAAGCCAAATGCCTCCr^ 
AAACTACrcCCTGACAGCGGGCrrGn'I CnM t ICAACAGCATAATGTTGCGAGCACTTACAGCAC 
CAGTATGTTTATTCTTATCAATTACAAAGrrrATmATCTCCAGTTTCCAGCTOA^^ 
GACATCTTCAGGGGTGTAAGTCAGATAAAACACTTCrrGCCATTCATTCGTTCTTCAGGGAGGGAT 
TTCirGTmATCTTCACCAAOTTTAACAGCAATGGGTTTCCAGTCCGTCGTCCGATGATCTT^ 
rrC^CATCATCTCCTACrrTTTAAGCTTGC^GGGTGCCATATC TGGGA ACAGNGGAAGAA^ 
ACNTGACGTCTGAACCTGAATAAATOCNGAAGAGGTAACAGTTrmATAACCXANTTTACCCANG 
NTTGITAAANACTGCCCGGCGGGCNCNCNAAAGGGAA 

SEQ ID NO: 2689 ACTATGCCAAACACTTATAACITGTATAAAAATTCCACATCXCCATArrGGCC 
ACCTCAAGATGAAAACAGATAACTCCCTAAATGTTAACTGGCIXrrACTCCC 

AAAACCACATGGGAAATATAGAAATTCAAATAGAAGTAACATAAACCT GTCAT AAATCGTAAACA 

AAAAACrATTTGTGGGACAGCATGGATGACAAATGGTCTACCGTGTAAArnTAGAATGAGGCAG 

ACAAAAGTrGGAAGOCCGGTTAATITrCCCXn'CmCTCCrromCAGCTTCGTCrCOT 

CGATGTCCACAATGTCAAGrrGTCTCTCAGTAATrGCATTATTAGCGTGCnXjTCrrrc 

TCACTTAATGNATCAAGTTCAGCAATGGCrrrATCAAAAGCTGTCTTTGCNAGAAAGCAAGCm 

TCTGGGGAGTCA 

SEQ ID NO- 2690 ggtacgtctgcatcgattatcttacgtgoggcaaatgatttcatgtgtgatga 

GATGGAGCOCrCTITACATGATGCACmGTGTAGTGAAGAGAGTmGGAGTCAAAATCTGTGGT 
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TCCCGGTGGGGGTGCTGTAGAAGCAGCCarrra^TATACCTTGAAAACTAT^^ 

GGTCTCGGGAACAOCTTOCGATTOCAOAOTTTGCAAOATCACTTOTGTTATTCCCAATACACTAO 

CAGTTAATGCTGCCCAGGACTCCACAGATCTGGTTGCAAAATTAAGAGCrmCATAATG^ 

AGGTTAACCCAGAACGTAAAAATCrAAAATGGATTGGTCTTGATTTGAOCAATGGTAAACCTCGA 

QACAACAAACAAGCAGGGGGGTTGAACCACCATATTAAAGTAAGAGTTTGAATTTGCACAGAACT 

GCAATCCXXATTITCGAATTGTGATCTATAAATACTCAGAAGAAATATACTGGAGTTAAANTGCCTC 

TOACCTATGTGCnOTGCTTATATACATGTATNATGCTGACTCCGGCGCGTCAGNGATCCCCTGCGC 

GNTAGACACTGCACTGGAT 

SEQ ID NO: 2691 CGAGGTACAGCATCGTAGGGTTCCCCrAAACTrGCCCTG'l'ri'l'l-G l'n 1 1 1'1'I A 
GmOTTATCCCOn'ACTGAGCGGariXn'ACTAGGTGGCTGTGATTAAATGTCCC^ 
GGGAAGGGGAATGGTTGAGCCTCTGGAGATCATTGTAACCAATCCTGCCAGACCTG'nTGGGGCA 
GTGGGGAGCAAACCTAGATAAGGACCTOTTTGGGOCAGCAOGQAGCXAAATCrCCTTrAAC^ 
AAGCAGTTCCTCATTCACATCAACAGAGCGAGGCTGTGATAACnANGAGGC^GCA^ 
GTCCrrCAKTGCATITTAATCTGCCTCCAACTGGACACCCAGTAGGTAAGTGTCAAANCCNAAAA^ 
TICrGGGGCAOTAANATAAAATGGTTCATTTTTACaXiANGCCACITITAGTT^ 
COC^GTTTTNCAAAAAAAT^mX^NGGGCCTmANGCCGGGGAAGTTAGGGCGACCCANA^ 
NGGAGAOCCCCCAATNCCTTGNATTTTOGGOCT^CNA^^^GGTGGGTGGGNCA^TCCCTAATGGG 
GGAT^aCAANTTNCT^CCTGGGGGC^m4C^WAAAANANCT^^C^^ 
ATNGNrrTNGOGrnTANCTNTTTACAACTr 

SEQ ID NO: 2692 ACTTCAGACCATTCTCCTCCCCCATGGCCTTGOTOCTGAAGTTGGTATGGCAG 
CCTGCACCATTCCAGTTCCCAGGAATGGGCTTAGGATCAAAGGTTGCTATCACTCCAAAGTCT^ 
CACACACGATGCAAGATGAAACGGGCXIACXXAGAGATGATCTCCCATOCTGATrC^^ 
TCCAATCTGAAATTCCCACTGGGCAGGCATGACCTCGGCATTAGTCCCCGCAATCrrTGACrCCAGC 
ATACAAGCAGGCOCGGTAATGGGCCTCCACGATGTCCCrGCCATAOGCTCTGTCTGCT^ 
ACAGTAATATGGACCCTGGOGCCCTQGGAAGCCGTTGGAAGGCCAACCAAAGGGGTGCCCATCTG 
TCCCCATGAGGATATACTCCTGCTCCATGCCAAACCAGGGGTGCTGGTrGCTCACCATGTCATTAT 
CGTTrACAGGTGTGCCTCAAATrGGTCTCTGCAGCCTTTOArraTACCTC^ 
ANGGGGAATTCA 

SEQ ID NO: 2693 ACGCGGGGCrCGGAAACGGAAGTOAGCGGCGGGGTCGACTGACGGTAACGG 
GGCAGAGAGGCTGTTCGCAGAGCTGCGGAAGATOAATGCCAGAGGACTTGGATCTGAGCTAAAG 
GACAGTATTCCAGrrACTGAACTTTCAGCAAGTGGACrrTITGAAAGTCATGATCTTOT 
GGTTTTTCTTGTGTGAAAAATOAACrnTGCCTAGTCATCCCCTTGAA^^ 
AOCrCAACCAAOATAAAATGAATTTrrCCACACTGAGAAACATTCANGGTCTATTTGCTrc 
AATACAGATGGAATTCAAGGCNGTGCAGCAGGTTCAGCGTCnTCATTTClTrC^ 
TCACTGGATQTTmGAAOGGNTATOATOAGNCTTTTGGATTTOAOGATATTCTTAATGATCCATC 
ACAANGCGAATTCATTGGGAGAACCC<XCnrrGATNGTGGAATATAAACTTGGmTAC^ 
GGTG>rnnx:ATGGAAACCNAGGGCAGCATCITGTTArrANTCATTTNOrC(^ 
CTTAGGCAATNCANCNCNCTGNGNCGTTOTATATGANTCCACTTCGGNCNANCTTG 

SEQ ID NO: 2694 GGTACCAACCTGGCTACTGGAATCCCCAGTAGTAAAGTGAAATATTCAAGGC 
TCTCCATCACAGACGATGGCTACATTGACCTTCAGTTTAAOAAAACOXrrCCT 
AGGCCATCGCACTTGCCACTGGGCTGTTTTTGATNGGCGCCTTrcrCATTATAATAGGCTCOT 
G^^^GTCNGGC^TCAT^ANCANAGGGGGGGCANACNaKKiCC^^m^CA^^^GCTC 
CTCGCOTrCCTACCXGNATThrrACNACKrGCGGChrmAC^ 

NGGTGTCOTGCTATGATGACTTrTNCAANCTTTrGArrcACCTAGNATCCCrCX;CCC^ 

rmGAGGANGTCACA>riTG0AA<nTGTTCCANQCTmOA0OATCTrCT^ 

hrrGCTGACTAAAAGATT^TTACTGCATCCTTGa^ACNTGGTTTNAACAAAAAAC^ 

TGTATGNGCCCCTTCCCANATAANGNTAANNTTNAAOTCCOTACAACOTAArrAAGGACAAAC^ 

TTTTTTTCATNCrm^IGGCCX^^CCAhrrNT^ 

CTGCAATGGAAAANGNGGCNTCCCCACC 

SEQ ID NO: 2695 GGNA fl i 1 1 III IVUlil rri ' rri - | ' ] - i ' i ' ri €GAANCCAAACTGTATCCAGCTTT 
ATTAAAGATACTTTCCATAAACAATCATGGTATTTCAGGCAGGACATGGGCAOACAATCGTTAAC 
AGTATACAACAACmCAAACTCCCnTCTrcAATGGACTACCAAAAATCAGAAAGCCACT^ 
ACCCAATGAAGTmCATCTGATGCTCTGAACAGGGAAAGTTTAAAGTGAGGGTTGACATrrCACA 
TTTAGCATGTTGmAACAACnTrTCACAANCCGACCCTGACmCAGGAAGTGAAATGAAAAT^ 
CAGAATTTATCTGAAGATCCACAATCTAAAAATGQOAACCACTGCTCTTTTOACAGGTGC^ 
AGNGGCATCACTGGAAAGTCCANATTGCCnXjACACACTGGTAAOCAATGACTAGGGGTCAGGT^ 
CAACAAATGTCTXSGGCrTAAGGGAGTTAAGTCTATIXn'GAAANATGGAAAGGGAOAAAAGGACm' 
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AAAACCAATTTirmTTTrTOCCCAAAGCTriTrcTG 

ATCCIXntnXjGGAGCCAGAGGAATTTTTAAACTANAANGGAAGCKSGT^^ 

GGACATTNTTTNGATATNCCCTTCCCNAAANAA 

SEQ ID NO: 2696 ACGGATACCGGAAAGGCTGGATACCTCGGTTATTAGAGGATTTTGGAGATGG 
AGGTGCTTTTCCAGAGATCCATGTGGCCCAGTATCCACTGGATATGGGACGAAAGAAAAAAATGT 
CGAATGCGCTGGCCATTCGGGTGGATTCTGAAGGAAAAATTAAATATGATGCAATTGCTCGACAA 
GGACAGTCAAAAGACAAGGTCATTTATAGCAAATACAaXJACCTGGTTCCAAAGGAQGTTATGAA 
TGCAGATGATCCAGACCTGCAAAGGCCCGATGAAGAAGCrATTAAAGAGATAACAGAAAAGACA 
AGAGTAGCCTTAGAAAAATCTGTATCACGGAAGGTCGCrGCAGCCATGCCAGTTCGAGCAGCTGA 
CAAATTGGCTCCTGCTCAGTATATCCGATACACACCATCTCAGCAAGGAGTGGCATTCAACrCTGG 
AGCTAAACAGAGGGTTATTCGGATGGTAGAAATGCAGAAAGATCCAATGGAGCCTTCAAGGNTCA 
AGAATAATAAGAAAATTTCCCNGGGACCACCTTNTCrrCTGNGCCTGNrrATTC^ 
AAAAATACTGITAAAGAACAA 

SEQ ED NO: 2697 ACrrrri'l'l i i 1 1 1 1 11 1 ri 1 11 1 1 INGAGNCATTACirnTArnTGAAGGATT 
TGTGAAACTCTrcACATNATGGGGAGAGTTTGTATGATTAATAAGAAGCAGCTTT^ 
CTTGGAGGTGAACNA>nTCTCAGCCrGNGAGATCCGACCATCCCATTAACTTrGAAGm 
ATTAATAOAANAAAAAAGGOGAGGGTGAAGAAAAOOAOGAACATOCTAAAAACCnT ATOA CAAT 
CATCCAAATGTGAGGAAANAACAACCGATTCACCAACTCCACTTTTIXrrATm 
ATCTCACTOTGATTTTGGCTTNCTGGCTGAAACANCCTGGCATGTCCCTAAAGCCCC^ 
NCCCTGOTTCTCCCANANACAGAGGAGGAGAAGCCCTNCANGATGCGCTGACCACTTTCNAAAAA 
ACTGGNCNGTCmjNGCTCCCCAAAAAGrmOAACCNACCNGCCTAATGGTGAAAAGAACTGGC 
CCTGAAANGTAAANGANQAAATNOGGATTAA>rroGGCTT>nTGTGAAATGNCTATT^ 
CCACCCCAGAA^T^TT^^T^AATT^WGC^^rANGCAACCNCIT^^ 

SEQ ID NO: 2698 GGTACAAAGCTTTGCAAGGGTGTGGTrTTGGAATGACGCTAAACTGAAGGTG 
OAGAOAACAGATAAAAAGGTTGGAAGTTGCACACrGTACTCTCTATCACTGACAAATGCAGGCTG 
GATTCTTATTATATACAGAGATGGCrCAAAAATGGGGTTTCAGATCTTTGTGACGAAATAGAATAC 
TGTTTCATATTCGAATCAGAGGGCTTCTTGGTCTNAGAAATANGTNCATAATC^TTOGAACC^ 
ACAAGAATAACTTATTGCTATCTGTGATAACAACTOTGTTCTAAACACAANNGAri l ICl 1 1 1 1 lAT 
TAATGTGCACATAGACATTGCCATTITAGAATNATAAACCNCATGTrGGGGTmAAAAATGAAAT 
TCTGGCTAATTCGAGCCAKITCANCTAATITrCTATNCAAGTAAATGGGTGNGT^^ 
NAAAAACGGGGTNCAAANCOCACrmGCCNCCmaiAANCTATATGGCOT 
GCTTTANTAATGGTCATITCTTrGTrAAAANNACCCAAACTrcAAATCAN^^ 
AAAATTbMNAGTCITNTnTNGGATATTAAANTTNGATNNTO 

SEQ ID NO: 2699 GGTACAGCOGrrGTCAATGGAGAGTTCA AAGA CCTAAGCCTTGATGACTTTA 
AGGGGAAATATTTGGTGCrri' I CTl CTATCCTTTGGATTTCACCTTTGTGTGTCCTACAGAAATTGT 
TGCTTTTAGTGACAAAGCTAACGAATITCACGATGTCAACTGTGAAGTTGTCGCAGTCTCA 
TIXX;CACTITAG<XATCTTGTCTGGATAAATACAOCAAGAAAGAATGGTGGm 
CATCGCACTCTTGTCAGACTTAACTAAGCAGATTrCXXGAGACTACXKn"GTGCTGTTAGAAC^ 
TGGTCTTGCACTAAGAGGTCTCTTCATAATTGACX:CCAATGGAATCATCAAGCATrTGAGCCGTCA 
ACGATCTCCCAGTGOGCCCOAAAOCOTOOAAGAAACCCTCCGCTTGOTOAAGGCXJTTCCCAQTA 
GTANAAACACATGGAGAAGTCTGGCCAACGAACTGGACACC(XK}ATTCTrCTACTATC^^ 
AGTCAGCTGCTTTCCAAGAGT 

SEQ ID NO: 2700 ACATGGGAAATOTAAACAAATGTOAAGGAGGACCAGAAAAATTAGTTAATA 
TTrAAAAAAATGTATTGTGCA'rrTTGGCrrCACATGTnAACnTTm 
ATGGAAAAAAAAATCTOTATACAOTATCTGTAAAAACrATCTTATCTGTTrCAATrc 
ATCCCATATAATCTAGAACTAAATATGGTGTtnXKKXATATTTAAACACCTGAGAGTCAAGCA 
GAOACmOATITGAANCACCTNATCCTTCTrrCAATGCGAACACTATCA 
GGATTITGTCTAACCATATGrnjCCATGAATTAACTCTGCCGCCTTTCTTAAGG^ 
TTTGATTITGGGAATCTTCCCCTTrCOUWU^TGAAAATANANA 

ACCCCTAANGGCCGAAmCACCCACTGGOs'GNCCGTTOTAGTTGGATCCCAOCTCNGTNCCAAG 

CTTCNGTAATCATNGTCATAACTGGTTCCCTGTGTOAAOTG^^ATCC^n^NACNATTTCC^ 

CCGANCCGGNAGCATAANANGAAANCCTGGGGGCTTAANGANGNNCTAAC 

SEQ ID NO: 270 1 A Ci rrri lU I ' riU ' lJ ' l ' l rri4 ^ ^^ rmGGOGNCTCAAAAATCAGTAAAACTTTA^^ 
CGCTTCCATTCTTTCGCCA'nAACAGAAAACTGGAGAAAGCAAAAATGTl^^ 
AAACGGCC^^r^T^ACCCANAaATCAAAACCTCAAACaACAAOGGGGAAGATA 
CCACATNCCCTGAGCTGACCCTTGTCATCTTANACAAAGCCrnJAGTCCACTGGCCANGGACCCTG 
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TrATGGCX:AATTCAAGAAGAGGCKX>AGA AATCANA CCCANCCATTCCCGTGATTTAAAGNGGAA 

CAGAAGCTACACCAAGGGTCAAAhn'GTCTGTTTr iTCAT NG ACACANTA NCX^ 

NTAAAAAAAAACAATATTTATTTCANGCCCCAACCrTTTCAl I I'l Tl n-lGGNTCGGTTTTTAAATC 

AGTGAAANAAACCGGGTNCTAANNAGCT^WNGGGNNNCTAGCCGCCC^mJGTTGGGCNATCAAT 

TTTTTT(KJTITGOTTAAAAATGarAAAhrrATTACNCGGOGGGC^^ 

CATAiNriTACCAATTGGTNAAAhns™CXX:CCANCCACAAAA^ 

SEQ ID NO: 2702 A Uri ' lJ ' ] ' llU1 ' l ' iJ ' ri ' riTi ' l ' ri - l ' lM GG(nTNGAAATTTANAAACAAATTTrATT 
TAAGATCTOAAATACAATTCXTAAAATATCAACTTrTCCAaAAAACajTC 
ATTGCCT^^'ATCATGTTAGAACGTGCATTNGACTCAAATACAAAAACC>TGAAACAAATN 
CCTTTAACAATTTOAGCAAAQATOGAATaCCrAAGAACAACATAQATGGAOT 
CTGTTTTACTTCAAGCACCATAAAAAAAAAAAAAGAGCNCAAATGCCrGGGTTTrcAGGTN^ 
CATTAAGNTGAACCTTTNGCCCTAGGAATCAGGGCGTTITNTa^ATNGCTT^ 
AATTGTGTAhfNGTNAAAGGGNTANQAACX:NCCNCCCNrrCAAGCCAATGTTGTCAACTANG CAAT A 
AAATGNTCTACTGGAANGTTCTTATTTTGmrcnT^JATTACTGNATACCCTTGNGN^ 
AAATGAGAAAAAGOAGCriTNa;CCCKmATTTTNTNOTTTAA^ 
AACATNAAGCCCrc>n"ATACNTrAa3ATTTTAAANAACATNANm 

SEQ ED NO: 2703 GGTACArrGTGmAAGAGAAAAATGAAACCCACATGCCGCCATnTCCTGA 
ATCAAATTCTGCAOTGGAATGGAOAGGAAAATACTTCTAGGCAAGCANCTANACTGGTO/^ 
GGGAAATAGAAGGAACTAGTAACTGAGACTCCTCCAGCCTCTTCCCTATTGGAATCCC^ 
CTGGNGTAGGAAAAAAGTTTAAACTACATTCATGTrCnTGTrcnX}TGTCACTCGGCCCT^ 
CTACCATTTACrrcACCCCAAGTCCTGCTGCCCATCCAGTTGGGAAGCCATGATmCCT 
CCAGGGCCATGGGGAGATACAATnCCANAGTTCTCGCTrNCTCCTTTGGGCATC^^ 
CCAATCAANOAAGC^^rcNCQCTCAOCTCTCAGCIT^CGGCCAATO 
NAAACNTGGGAGACTCCTGGTCTTITACCC TCCX:C nT^NT^ 
ATGGTCTTGGACAATTATAGAAACAAATGACnTrTTGGGAATAGCCChnXiT^ 
GGCCNCCANGAAACACTTANCCTNCATNCXXXIAGACCi i I IL'1 1 GCNTTGACNANTGACAANG 

SEQ ID NO: 2704 ACATAGTGGTGCCrcCTGATAGGACATTGTTAGCATAGAGGTCCTTCCTGATG 
TCAATATCACACTTCATGATGCTGTTGTAGGTGGTTTCATGGATGCCAGCAGACTCCATCCCGATO 
AAGGATGGCTGGAACAGGGTCTCTGGGCAGCGGAAACGTTCATTTCCGATGGTGATCACTTGCX:C 
ATCAGGCAACTCGTAACTCTTCTCAAGGGAGGATGAGGATGCGGCAGTGGCCATCTCAT^ 
AGTCCAGAGCTACATAACACAGTTTCTCCTTGATGTCCCGGACAATCTCACX3CTCAGCA0TA 
CGAAGGAATAGCCACGCTCAGTCAGGATCTTCAIXjAGGTAGTCAAGTGAGATCTCNGCCAGCCAN 

atccatacgcatgatggcatggggcaaggcatagccctcatagatgggggacattgtgggtgaca 
ccatcttcagagtccagcacngatgccnagttgtgcgtcx:aagagccatanaanaacaacacccg 

CTGGATAGa;ACATCATTGG>rrGGGACATrcAAAGTCTNAAACATrATTTGAGTCATTTCTrc 

ttggccttgggggtnaagggggcctc^rigaancaaggtcggatgctcitc^ 

seq id no: 2705 ggtactgtattttccgcaaaagaaaattaaca'ntagtaacacactaatgaa 
tmatxrrccaaagagattagtgcactggcaaagtattcggattacagtggagagctgtgaacag 
cagataccgaaaacaccxcagcctggcatatcccaaggtaacgatggccttnactgaccactccc 
tttgccagaggtccagaaagaactrgcagcatgatcacctgcctgcttgcact^ 
agctaacttcaaggagccaaanaggaotccxragtgggcaaggggaacanaoaaganaaac^ 
ggtaaatgaaaacctaaaggacatgtaaaatgggatacaatccccattcaaatccx;anagatgtg 
ggcaagttccnaaagtngtggttagnttanaaaacrcggcacaccttnot 

AANTAC:A>n'AGAAATTNCCGGGGGATANCXrrGNAAAAAATAGANAAAATGCTCNATGGANNACC 

CAANCNCTTNANCTOTAACAThnXTAAGGGCTGAAACACATTCC^ 

CriTGGCTTTTAACTNGGATGGNA>riXKXnT<AATGACACNAAAATC^ 

GTCCNTTNGNG 

SEQ ID NO: 2706 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTTTATTrGOAGAAGGAACGAGAGAAGOAAATTAGTGATGATGAGGCAG 
AGGAAGAGAAAGGTGAGAAAGAAGAGGAAGATAAAGATGATGAAGAAAAGCCCAAGATCGAAG 
ATQTGGGTTCAGATGAGGAGOATGACAGCGGTAAGGATAAGAAOAAGAAAACTAAGAAGATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGACCAAGCCTATTrGGACCAGAAACCCTGATG 
ACAT^CCCAAGAGGAGTATGGAGAAmCTACAAGAGCCTCACTAATGACTGGGAAGACCACTT 
GGCAGTCAAGCACTTTTCTGTNOAANGTCAAGTrGGAATrCAGGGCATTGCTATTTATTCCTCGTC 
GGGCTCCCTTTGCCTTTITGAOAACNAGANGAAAAAGAACATCATCNAACrCTATGTCCCCCG^ 
TTCATCATGGACANCTGG0AT0AA^^X3ATCCNCAAGTATCTCAATTTTATCCGG0QGNG 
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CTTGAGGATTTGCCCrGGACATTCnxrCNNGAAATGCTrCACCAGAGCAAAATTTTOAA 

SEQ ID NO: 2707 GGCNCGGCOiANGT ACTCT ATGCATTCTATCTATACAGAAACACCTATTTATT 
ATTACAGTTGATrTATOCNTKATTAACATTmGOTOTTAAAATTGTTA AACA TCATXj 
CATCAACTCTCCATAATGAATCTGAACTrCACAATGATITACCAAACACTTTACTTAA^ 
TTAAATAAATGAAAAATGCACCGAGCCAAACAAAACTTAACTGGGCTATAAAGAAAAAAAC CCT^ 
CAAAACAGTCCATATACACATCTATTTCAAAACTACCTTrGCTAGCT 

AGCCTGAAAAAAACAGTAAAATCrACACAAAATnTATTGCAATCATACAAGGGTTACATTAGGT 

CAACAAATACTATQATGCAATTTTACAmATTAAACTACAGTtCAAAGCACAAATTTACACATO 

TAAATACACTAAACATTATCTAATGAAGTCACACTGGTCTTCTAACATTTGATATATCTGGGTG^ 

AGACATGAACTTTACAAGACirrAAACACAAATCCrrAOTATAAAAACTaTOTTOTGT^^^ 

CGATTrAATGGGGAACCCAATTAATGGGCTTCCATCTCCCTAAGTCATCCATnTGGTGCATArTTA 

mTAACGCTTAANGGGTAG 

SEQ ID NO: 2708 ACGa}GGGGAriTCAAAATCAACACCOATGAGATTATOACTrCACTCAAGTC 
TGrmNTGGACAAATAGAAAGCCTCATTAGTCCTGATGGTTCTCGTAAAAACCCCGCTAGAAACTC 
CAGAGACCTCAAATTCTGCCATCCTGAACTGAAGAGTGGAGAATACTGGGTroACCCTAACCAAG 
GATGOVAATTGGATGCTATCAAGGTATTCTGTAATATGGAAACTGGGGAAACATGCATAAGTGCC 
AATCCTTTGAATGTTCCACGGAAACACTGGTGGACAGATTCTAGTGCrGAGAAGAAACACGTTrG 
GTrrOOAOAGTCCATGOATGGTGOTmCAGmAGCrACGGCAATCCTOAACTTCCTOAAGATGT 
CCTrGATGTOCAGCTGGCATTCCnTCGACTTCTCTCCAGCCGAGCTrCCCAGAACATCACAT^ 
CTGCAAAAATAGCATrOCATACATGGATCAOGCCAOTGGAAATGTAAAGAAGGCCCTOAAGCTOA 
TGGGGTCAAATGAAGGTGAATTCAAGGCTCAAGGAAATAGCA AATT CACCTACACAGTTC^ 
GGATGGTTGCACGAAACACACTGGGGAATGGAGCAAAACAGTCrrTGAATATCGAACACCCCAAG 
GCTOGOAGACTAC 

SEQ ID NO: 2709 ACAATGTTGAACAAAAGACCACAGGGGGACCTnTGTTCAA GGTA GCACCAA 
TCCATTKOCnXjATTGNGmCCAACATTAACCTrCCTGTTGACTCTATCATTGGCAC^ 
ACTTCTrCTQCTTTAGTOAOGATTCCTACACnXjACTAAGCACACTGTGTTGCTAA^ 
GTGTGGCAGCATCAACCCXjGGAAATGGCACATTTGAACCAGGATCXrCCCTGACATGGGCT^ 
TTTTGTATGTGTTTTCCCCACCCCCTGACCTAGCTGGTATCTTGTGGATGTTTC^ 
AAAGTAAATAAATTGCATTTAAGGGGTAAGCAGGTGmAAAAACAAAACAAAACAAAACAAAA 
TGAAAGAATGTGGTCXjCAGCTAAGGAGGTTTTCACATGTTGTTrAACTCCAGTAAAGTCT^ 
GCATCTGATGCATACTAAGGTTTTCTTCTrrGGCATGAOa:ACTTT^^ 
TTTCTCCAATTTAGTTACTGACCTCTCCGCAAACTCAGCCXXj 

gaaaggacctroatctcttcctcatatctgcm:cttct0cgaqtc 
ngg 

SEQ ID NO; 27 1 0 ACACTTACTGGGCCTCGCCTrmAATTmACTCTTTTGCCrrCC^ 
NATNCAAOATITITOCTrGCntTCACCTCGGAGTTOAGCCGATTTCACAAATCTGCT^ 
ATGAGCCAAATACTAGCACAACCTlXXCTGGACCrGAGGGTGGCnT^ 
GGATOCTGCTGAAGCAAATGACTCTGCTCCAGTCAGCTCAGGCCTCTATGGAGCAA^ 
TGTCCTTCTTGGAGCX3CTGCCTTCATCTCCTTGAACTGAGC^^ 

AGCCAGCGGTAGGCTTGGTGCTCATGGGAGAGGCGGATCTCCACGTCATAGTCCTTCACCrCCGC 

AGCCAGTAAATGACTOTTITAGGOTGTTCCTGGCCACATAArroAGTTCXXnT^ 

TAATGGTCAAGCTGGCCTGCTTCTATGCXnX}CriTCCTOTGGGTCrrCC^ 

CATCCrCTCCTOOTTCCACATGGCCTrrGGGAGOAGTCCAAGTGATGAATGCCATCT 

ACAATAAAAACTCAATTGCATTGTGTCCACTTTGGGAATGAGGCATCTTOGGAA 

ACA 

SEQ ID NO: 27 1 1 ACATCAAAGATTACATGAAATCAATCAAAGGaAAACTrGAAGAACAGAGAC 
CAGAhTTTNAGTAAAACCTmATGACAGGGGCTGCAOAACAAATCAAGCAC^ 
AAAAACTACCAGTTCmATTGGTGAAAACATGAATCCAGATGGCATGGrmjC^ 
CGTGAGGATGGTGTGACCCCATATATGArrmnTTAAGGATGGTTTAGAAATGGAAAAATGT^ 
CAAATCTGGCAATTATTrTGGATCTATCACOXn'CATCATAACrGGCTTCTG OT 
ACACCAGOACTTAAGACAAATGOOACTOATOTCATCTTGAGCTCTTCATTTATnTOACTOTGATT 
TATITGGAGTGGAGGCATroTTmAAGAAAAACATGTCATGTAGGTTGTCTAAAAATAAAATGCA 
TTTAAACACAAAAAAAAAAAAAAAAAAAAAAGTCCTCGGC 

SEQ ID NO: 27 1 2 ONACAGGNGOAACAATCCAAAG^TITAATCAAAGAAGGNGGTG^r^CAmTO 
CTGCTCAQmNGTTGATA(XCCAGGATriXXiANANGCNGNGGATAATAGTAATTGCTGGCAGCC 
TGNTATCGACTACNTTOATAGTAAATITGANGNCTACCTAAATOCNAAATCACGAATGAACAAGA 
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CCrCAGAATGCCTGATAACAGGGOTGCAAGTONTGTtTATACTTCATTGCTCCTTCAGGAC^^ 

OTAAACCATTGGATATTGACrmATGAAGCCGTrrGCATGAAAAAGTGAATATCATCCCAOT 

TGCCAAAGCAGACACACTCACACCAOAGGAATGCCAACAGTrrAAAAAACAGATAATGAAAGAA 

ATCCAAGAACATAAAATTAAAATATACGAATTTCCAGAAACAGATGATGAAGAAGAAAATAAAC 

TTGTTAAAAAGATAAAGGACCGTTTACCTCnTGCn'GTGGTAGGTAGTAATACTATCATTGAAGTrA 

ATGGCAAAAGGGTCAGAGGAAGGCAGTATCCTTGGGGTGTTGCrGAAGTTGAAAATGGTGAACAT 

TGTGArrTTACAATCCTAAGAAATATGTTGATAAGAACACACATGCAGGACTTGAAAGATGTTCTA 

ATAATGGCCACTATGAGAA 

SEQ ID NO: 27 1 3 ACTGTCACATTCCTCrJ-l ICANATTCl I'll Ti i IGGGATAATTTCTCl'lUl ICTA 
G'nTANCAGATAACATGTGAGAAAGGGACTGTCTGCATTCCTrATTGAAAATGTCATTCATTAAAG 
GTOAACATTCAGACAAGACCTTGAGGCACAGGGAAATTCGATCCACATCATCATCAGTAATTGGC 
TTCTTAGGAAGAGAGGATTTTCCCAAATGCAGGATAGTAGCCATGAGCAACATAGCCTNAGC^ 
AAAAGAATTTTGCTTmcrrCTCCTGAACCAAAGCTACATAGCNCAATGCAATOT 
TGTGGCAAGGGAGGCAGCAACAAAGAAATCTCCATCCAGAAGGAATCCrCTAAAGGGAGGANTG 
TCTTCCrCTrCTTGGGGGGCrAGAACTGCTAAGGGCACTCTGAGTTTGCATAGGTAOT 
CTOGCGCTTGAAAGGNCNAATTCCAGCNTAACrGNOGGCCTANTAC 

SEQ ID NO: 27 1 4 ACGCGGGAGCTCTTTCCnTrCGCTGCTGCGGCCGCAGCCATGAGTATGCTCAG 
GCTTCAAAANAGGCTCGCCTrrAGTGTCCT<XGCTGTGGCAAGAAAAAGGTCTaGTTANACC^^ 
ATOAAACCAATGAAATCGCCAATGCCAACTCCCGTCAGCAGATCCGGAAGCTCATCAAAGATGGG 
CTGATCATCCGCAAGCCTGTGACGGTCCATTCCCGGCTCGATGaXJGAAAAACACCTTGG^ 
GGAAGGGCAGGCACATGGGCATAGGTAAGCGGAAGGOT 

SEQ ID NO: 27 1 5 AOKXKSGGGCAGTCCGCrGGTCCCGAGCACGAGCTGTGAGGGGATrCACTTG 
TGTGCGGAACTCCTCGGAACCATCGCGTCCCTTTOXrroCACCT 

GCTGATGAAGAGAGAGCAQAGACAGCTCGTCrGACrrCTmArrGOTGCCATCGCCATTGGAGAC 

TTGGTAAAGAGCACCTTGGGACCCAAAGGCATGGACAAAATTCTTCTAAGCAGTGGACGAGATGC 

CTCTCTTATGGTAACCAATGATGGTGCCACTATTCTAAAAAACATTGGTGTTGACAATCCAGCAGC 

TAAAQTTTTAGTTGATATGTCAAGGGTTCAAGATGATGAAGTTGGTGATGGCACTACCTCTGTTAC 

CGTTTrAGCAGCAGAATTATTAAGGGAAGCAGAATaTTAATTGCAAAAAAGATTCATCCACAGA 

CCATCATAGCGGGTTGOAGAGAAGCCACGAAGGCTGCAAOAOAGGCGCTGTTGAGTTCTGCAGTT 

GATCATGGTTCCGATGAAGTTAAATTCCGTCAACATrrAATGAATATTGCGGCACAACATTATCCT 

CAAAACTTCTTACTCATCACAAAGACCACnTTACAAAGTTAGCTTGTANAAGCATTI^^ 

AAAGNTTCTGGCA 

SEQ ID NO; 27 1 6 ACTCAAGTCACTTAATGAGGAAGCTGTGAAGAAAGACAACTCTGTCCATTGG 
GAGCGCCCTCAGAAACCCAAGGCACCAGTGGGGCATITrrACGAACCX;CAOGCTCCCTCTQCTGA 
OGTGGAGATGACATCCrATGTGCTCCTCGCTrATCTCACGGCCCAGCCAGCCXX;AACCTCGGAGGA 
CCTGACCTCTGCAACCAACATCGTGAAGTGGATCACGAAGCAGCAGAATGCX:CAGGGCGGnTCT 
CCTCCACCCAOGACACAGTGGTGGCTCTCCATGCTCTGTCCAAATATGGAGCAGCCACAm 
GGACTGGGAAGGCTGCACAGGTGACTATCCAGTCTTCAGGGACAriTIXrAGCAAATO 
GACAACAACAACCG<XTGrrACTGCAGCAGGTXnrATTGCCAGAGCTGCCTGGGGAATACAGCAT 

gaaagtgacaggagaaggatgtgtctacctccagacatccttgaaatacaatattctcccagaaa 
aggaagagttccotttgcrrraggagtgcagactctgcrcaaacttgtgatgaacccaaaocx; 
caccaocttccaaaatctccctaagtgtcaagttacacagggagccgctc^ 
atcgttgatgtgaaga 

seq id no: 27 1 7 acagtgtartacgtaaatatgtaaagattcttcaaggtaacaaggotttggg 
trrtgaaataaacatctggatotatagaccgttcatacaatggttttagca^ 
caaacaagtcctatcttttitrtttggctggggtgg^ 

aagacgtx^itcactgaaaqacagaatgccatctggocatacaaataaoaagtttgtcacagcact 
caggattttgggtatcl 1 1 igtagctcacataaagaacrrcaglgcttttcagagctggatatatct 
taattactaatgccacacagaaattatacaatcaaactagatctgaaqcataatttaagaaaaac 
atcaacattttttgtgctttaaactgtagtagttggtctagaaacaaaatac^ 
aarrttcaaataaaacccaaaataatag<mtgcttanccctgttagggat^ 

GOAGCACATATmrATTAACi IVl 11 IGAGCrrTCAATGTTGATGTAATrTTONTCTCrGTGTAAT 
TTAG GTAAA CTGCAGTGTrrAACATAATAATGGTrTAAAGACTrAGTTTGCCAGTATAAAATAATC 

crGGcmr 

SEQ ID NO: 27 1 8 ACCAAACGGGCAAOGACATCrCTACAAATTACTATGOGAGTCAGAAGAAAAC 
/^TTTGAAATTAATCCCAGACACX;CGCTGATCAGAGACATGCTrCGAOGAATTAAGGAAGATGAAG 
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ATGATAAAACAOTmGGATCTTGCTOTGOTTTTQTTTGAAACAOCAACGCTC 

TTTTACCAGACACTAAAGCATATGGAGATAGAATAGAAAGAATGCTTCGCCTCAGTTTGAACATT 

GACCaX}ATGCAAAGGTCGAAGAAOAGCCCGAAGAAGAACCTGAAGAGACAGCAGAAGACACA 

ACAGAAGACACAGAGCAAGACX3AAGATGAAGAAATGGATGTGGGAACAGATGAAGAAGAAGAA 

ACAGCAAAGGAATCTACAGCTGAAAAAGATGAATTGTAAATTATACTCTCACCATTTGGATCCTGT 

GTGGAGAGGGAATaTQAAATTTACATCAn ICl I I I'tGGGAGAGACTTGTTTTGGATGCCCCCTAA 

TCC CCTTCTCC CCTGCACTGTAAAATGTGGGATTATGGGTCACAGGAAAAAGTGGGTTTTTTAGTT 

GAATTTTTTTTAACATTCCTCATGAATGTAAATTTGNACCTGCCCNGGGCG 

SEQ ID NO: 2719 ACTGCAGTAATAGGAATCTCTTCCACAGAGGCAGCAGAGAAGTGGTTTAGTG 
CCATGGAT AGGGA GGAAAGATAGGAGCCCITGCCCAGAAGGTOACTGOCTTOCTTAGGCCTT 

ccx;aggagtttttatattctotckkxaggacaaaaatagaattcggggaaaataaggta^ 
atctaaaacacttgtagcaggaaagacgtgoagaagagcaattgcaaaggacagggtagactgc 
ttgctggataatatttgcctttaaagagatocattggtccacagcaactggaaaaggggtgtagc 
aagaagaatgoaaagaagagaagcaaggccctctaattccacttaccrcaaaagactgagccctg 

AGGACTATGTGAATACACACCCCTAAAGGCAAAGGCTCTCACTCCACCATCCCTTTCTCACAA^ 

GCATCTGGTGTOCCATCTCTCCCCACAGTAGGGTATTGCCCTCAGCAGAGAACAGAAGCCCA 

AGTACCATTGGTXKK;CAATTGAmGATGGTAAGGGAGGGATCGTrGCCTCaTCTGTTATGTAAAG 

GATGCCGTANGGATGGGAGGGCQATOAOGACTAGNATNATGGCNGCCAGGATANTTCANACGGN 

TTCTAnrCXrrGAOCGGTTG 

seq id no: 2720 acccagtaaaaaccagaatgaoratixkxv^goacgcatcaaagttoactrr 
gtgatcxctaaagaaotccctrtoqaoacaaaqatacgaaatccaaggtgaccctgctggaagg 
tgaccatgttaggtttaatatttcaacagaccgacgtgacaaattagagGgagcaaccaatatag 
aagttctgtcaaatacamcagttca 

GAGATOaiM'riGGTITCATCAAGTGTGTGGATCGTGATGTTCGTATGrrCTTCCACTrCAGTGAAAT 
TCTGGATGGGAACCAGCTTCATATTGC^GATGAAGTAGAGTTTACrGTGQTrCCTGATATGCT^ 
TGCTCAAAGAAATCATGCTATTAGGATTAAAAAAClTiXCAAGGGCACGGTITCATITCATTCCCA 
rrCAGATCACCGTTTTCrGGGCACGGTAOAAAAAGAAGCCACTTrTTCCAATCCrAAAACCACrAG 
CCCAAATAAAGGCAAAGAGAAGGAGGCTGAOGATGGCATTATTGCTTATGATGACTGTGGGGTGA 
AACTGACTATTGCTTTTCAAAGCCAAGGATGTGGGAAGGATCTACTTCTCCTCAATAGGAGATAAG 
GGTGAATTAAT 

SEQ ID NO: 272 1 ACACTCTATAAATAAACACATCAATTTTGCTCTATTACTACTCTrCCAAAAAC 

gtctcatataattggaattaacctagcatttacttatcattaaatata i 1 i l l lf ig ttagaaagga 

tatacagcatatgaaattg<>ggctrmatgagttaaagacx^aatttagaacrctcaaatact^ 

ttta caatgtaaacttcacagttattacattagctagaatacatcacaacattcacaacacaatat 

ttttatttggt itgga tattgaagttgttttcttgaattacttatgtagaact 

gttgc gtatcg ttttccttcaagaattcactgggaagtigtx:atcagctttgct 

tcatcatttttctaactritctggaaagtcggg 

ttgagttcaaatatttcttracagccatttgttttctaaggcoogtatagttgtcagto 
atctgagtqaccotttgactggtaccrcggc 

seq id no: 2722 actacttggttcccgatatggatgatgaagaaggagaaggagaagaagatg 
atgatgatgatgaagaggaggaaggattagaagatattgacgaagaaggggatgaggatgaagg 
tgaaoaagatgaagatgatgatgaaggggaggaaggagaggaggatgaaggagaagatgacta 

AATAGAACACTGATGGATTCCAACrrrCC^^ 
AGTCTTnriTTTTTITITITITrrCCCTCrrGT^ 

ACTCCATGGTTCTCAAriTATTTG GGGG GAAATACCTGANCAGAATACAATGOGAAAAQAGTCT 

CTACCCCTTTCTGTTCGAAGTTCATrmATCCCTTCCTGTCTGAACAAAAACTGTATGG^ 

ACCACCGAGCICTGTGGGAAAAAAGAAAAACCTGCIXXXnTCGCTCTGCTGGAACI^ 

TAGGCOCCTGTGTAGTAGTGCATANAATTCrAGCTTTrTTCCTCCTTTC^^ 

AGAGTACCTCGGCCX3GAAC(XGCTANG0GCG 

SEQ ID NO: 2723 ACCAAGAATGTGCTGTCAGAGGACTTGNGCTCTGGTCTGGTAGCAAATOCCC 
TCAGCGCTGACTCCATATCCCnGGCTGTCANATTCTCTTCTACATCTACACTATAGT 
AAOTAGCTOriTCATCCCTGTGATGTCAAAGTCACCTCCATTCCTCXK^ 
TGT ATrGCATA TGATGAGAGCCAGGCGTGTGCGGTTGTTIXn'CTanTrAr^ 
GCTCri'rCl^'l'ACATAKTCTCAGGAATTCITCATGAGGACAAAGCTTGAGGGCATCTGTAGATTCT 
CCTGACTCANGTGGTCCAGCCTCCATATTXXKjATGAGCTTrriTATTGG 
TAAAAAAGGTTNGAAOAAGCATrrorCXn'GCCATACQTTGCnCTCTTGC^ 
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CCCGAACTTTGTCTTCAUri 1 1 AGCATNGNAATAl I'lCTlTl ri'lCCTGTTCCrNNCAGNTCAGT 

SEQ ID NO: 2724 ACAGACAAAACTACAGACTTAGTCTGGTGGACTGOACTAATTACTTGAAGGA 
TTTAOATAOAGTAmOCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAATAAGACTCAAA 
CTGCTCAAAGTGACGGGTTCTTGGTTGTCIXrrGCrGAGCACGCnXSTC^ 
CTGACTCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACCTCAmGAarmiCCAG 
CTTCAAACCCTGCArrrGAACCGACCAACATTAAGTCCAGAGAGTAAACTTCAA^^ 
CATTCCAGAAGTTAATCATrTGAATTCTGAACACTGGAGAAAAACCGAAAAATGGACGGGGCATO 
AAOATACTAATCATCTGGAAACC»ArrrCAGTGGCGATGGCATaACAGAGCTAGAGCTCGGGCCC 
AGCCCCAGGCTGCAGCCCATTCGCAGGCACaX}AAAGAACITCCCCAGTATGGTGGTCCTGGAA^ 
GGACATTTTTGAAGATCAACTATATCTrCCrGTGCATTCCGATGGAATTTCAm 
ACCATGGCCACCGCAGAACACCGAAGTAATTNCAGCATAGCGGGGAAGATGTTGACCAAGGTGG 
AGAAGAATCACGAAA 

SEQ ID NO: 2725 " ACAOACAAAACTACAOACrrAGTCTGGTGGACrGOACTAATTACTTGAAOGA 
TTTATATAGAGTATTTGCACTGCTGAAOAGTCACTATGAGCAAAATAAAACAAATAAGACTCAAA 
CTGCTCAAAGTGACGGGTTCTTGGTrGTCTCTGCTGAG<^ 

CTGACTCAATATGAAAACCCAAGGCATAAGGTTGGGAAAACACCTNATTTGACCrrGCCAGCTO 
CCTTCNAACarrG<mTTGAACCGACCAACArrAAGTCCAGANAGTAAACriTGAATGGAATAACT 
ACATTCCAGAAOGTTANTCATTTOAATTCTGAACACTGGAGAAAAACCOTAAAKraGACOGONCA 
TGAAGATACTAATCATCTGGAAACCGATGTCThrrGGCGGTGGCATrGAC 

SEQ ID NO: 2726 ACTGTCACAGAACTTTTACATACATTCTCAGTCCrAGTTGTGAAAGGCCTAAA 
OAGNNhrrAAACTCAATTTQCAGTCCAACACAAAGGGGGGAATTTCTAAAATAAATAATCCAACAG 
TTTTITGCATTTTTTTAAATTAATTTTTC>TT^^ 

ACAAAAAATGTCCGTTGAAGAATAATATATTAAAACTGTGGAAAAAAAGGAAAAAGACACGTCA 

CAAAATTTTAAGATrAATATGAAGATCATAATTTAACATAAAAGAATATATTCTA TGGAT TTG 

TCCCGATAAATATGAACAAAATTAACAAAAAAAAGCATAGTTTGGCAAT 

TTAAATAAGCTTTTTrATATTGATGTGCAGTGACAAGCAAAATTTTTGCTCTCCAATTTC^ 

TATATOAAGnTAAAACCCAGGGAAGAAAAGCATGOCGTGAGTGCrcrAAGGATAGACCTACGGT 

ATTCTAGAGCAGAACCATTAAAGCTACTTCTCAOGAAATCGTTTACACAGATTTGOTNTG TQQAAT 

GGAAACCATTAACTGCCACCCTTAC 11111111111 i AGCTNTCTNTCAN 1 U 1 1 1 iGNGGGTTTTTT 

TTAAAAAGA 

SEQ ID NO: 2727 ACTOTOATTGAACATCCTGAATACQGAOAOGTTATTCAOCTTCAAGGrTGACC 
AAANNNNNNACATCTGCCAGrncrCTTGGAGGTTGGCATrGTAAAGGAGGAACAGOT 
CATGGATTCTAAAATGAACCTAAATACGTGGAGAATrrC TrGAA TAGTTrrGTrCTCTAAA^ 
TTTGOCTOCCTTQTOAAATGATTCCCTGCAGTAAACGGACTTTTCATTTArrr 
CCATTCACATCTGCATGATTACAGAAAACATGGGGTATGTAGACTAGTAACACATAAGAAAATTG 
CAOTAAGATGOTAACAAAACCTCATArrGCrrrACATOTTIXrAATGGAAAATQrr^ 
ATroTrCAGTrTArrACCGTTTCACTTGATTAAAl 1 11 1 1 1 IGTTGTTGTATTAAACCATGT 

SEQ m NO: 2728 ACAAATACGCAAATTTTCATAOTGCCTAGAAATAGCACAGATCTATTCTACTC 
A^^r^r^•AmOTAT^T^TCA0O0TATTCTACC^AGA0CCTOTG0TTAA^^ 
TACCrmATTCCCTACCCrcrCAGGGAATTTGGATACATGTGAGGAATAGTCXnTrG ri I Tl CI 1 A 
TGAACCTAGAAAATTACAGATCATAAAATCTGGATATTAAAGTAGTTTCCAAAAGCATCTCATGG 
GAAATCAAAGTGCTCGGCATTrCCGAGCTGGAGAATAAAATCAAGAATCCTTAAAAA AAAA AAA 
AAAAAAAAAAAGTCACCCTATGAGTGGAAGGGTCCATTTTGAAGTCAGTGGAGTAAGC^ 
CAGTrrGATGGTTTCACAAGTTCTATTGAGTGCTATTCAGAATAGGAACAAGGTTCTAATAGAAAA 
AGATGGCAATTIXjAAGTAGCTATAAAATTAGACTAATCTACATTGCrnTCrOT 
ACCTnTATOaTraATAArrAGCAGTTTOTCTACTTGOTCACTAOOAATGAAACTC^ 
GGCTAACAGGNOTAATAGCCCACTTACnXXTGAATCmAACATntiNGCATT^ 
TCGCGATCTT 

SEQ ID NO: 2729 CAGCTGATOGGAACOGGCTCCAATGOACTOOATrGCATrCAAAATATTATnT 
GGACAGGTTTGGAAAATGTGAGGGCCCATATCATCATAACCAGCATAAGGAOACCAACACCATAT 
GGTCCCGGCCTATCTGGTGGTATCTGGGTCTIXJCTTCCAATTAGAATACAACGAGACAC^ 
OTCTATIXAATACAAATCTXjGAATCCAAACACTCCrcACGCATAAAArrACATAACAGTCT 
CACAGTAAGCCCCXJCAATTGAGATACCAATATGGTGTCAACATGGAGAATTrT^ 
GCAAGCTCTGATTGCGCCCTTTTCAATGCAACCAAAACTGCATQAGTTn^ 
GTGGCTGAACCTTGTTTAACAGCTrCATTGCATATTCAATTTGATGAATO^ 
AAACAGTGACATCATTOTCATACTOAmCOAAACATGGTGOCOGCGCGGCXTGGrrGCGO^ 
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AAAACTGAGAATCAAGGACKjTGCTGCCGAAAGTATCGCTCAACGATCTACAAAAAGTCTGGCCCC 
CCOCOT 

SEQ ID NO: 2730 GTACCNGGACAOTGCATGTCACATATGATrrCACAAAAGTTCATCTrCATTGC 
AGATACOTGCCTTTCTTICrAGGTIXJTATCTCCCACTTCACCCnTCTAGAC^ 

TAAOATrrCATCTGGGAAATCACTAOGAGTTCTTOGAAOOOAAAGAAGGAAGATTGTTGGTTGGA 

ATAAAAACAGGGTTGAATGAGTTCCAGAAAGCAGGGTTCTCAACCTCGTGGACAGCAATCTGCAN 

AANAAaAGAACrrCAAAAAACCAACTAGAAOCAACATGCAGAOAAOTAAAATGAGAGOGOCCTC 

CTCAGGAAAGAAGACAGCTGGTCCACAGCAGAAAAATCTTGAACCAGCTCTCCCAGGAAGATGG 

GGTGGTCGCTCTOCAGAGAACCXCCCTrCAGGATCCGTGAGGAAGACCANAATGAACAAGCAGA 

AGACTCCTGGAAACGGAGATGGTGGCAGGTGAAGTCnCTCCACAGCATGCCrrcGGCCrr^^ 

CAGGAGAAAAAGCCCCCCAGTTCTACATGATTCCTGAGGTCTTCTGTTCTTCTGCGGOT 

GQGCCCrOCXJCCCTOOAAATAQTGGGTANOACTCGGACACTGCAGAGGTCCTGCCC 

SEQ ID NO: 273 1 ACACAACTGCAACTCTACATAAATGCCACAGATOCAGAATACTU i l l l CI 1 GC 

tctatttacacagctgatatacctattctaacgaaggagggagaggagtaatgcacaagaaactc 

aggccaatgggggagcaagaagaaaacgaagaagtgcagtgcatgcgtcatcggtgtttaacag 

tcagaagcgaaacagttcagaacaaggcctgccctgtcaaaagaagagctaaagacagttatata 

aaaattaaggtgggctttcagactgoctaacacaacaacattccatgagtagatggtaaritattt 

trgtttatccatttcgttgggagcaaggacaaaaatgtaaatctacaccttgc ntat caaaattc 

cgaaaaaagaatgctctgccttrtaaaaaagtatcatgattttgtagaccttgt^^ 

tatttgoaaaaggtgtcattttcatattcctactx:aaatgccagtgtitix^ 

GAGTTTATITAAAACGGTmG<nvrri'ri"l'l'iCGACAAATATCCrrTO 

OAGACACCTCAAAATGCCTGTAAAATTATTGfl ri'lCl"! ICrCTAAQTCAOGCAGGCGAGGCTACG 
GAAAGGAAGANA 

SEQ ID NO: 2732 ACCTGOGTGTTCCCCACCTTGGGCATCATGCACCACAACAAACAGGCCACTG 
AGAATGCAAAGOAOOAAOTGAOGCGAATTCTGGGOCTCCTGGATGCTTACTrGAAOACOAOGACT 
TTTXTTGGTGGGCGAACGAGTGACATTGGCTGACATCACAGTrGTCTGCACCCTGTTGTGGCTCT^ 
AAGCAGGTTCTAGAGCCTTCmCCGCCAGGCCTITCCCAATACCAACCGCTGGTTCOT 
ATTAACCAGCCCCAGTTCCX}GGCTGTCTrGGGGGAAOTGAAACTGTGTGAGAAGATGGCCCAGTT 
TGATGCTAAAAAGTTTGCAGAGACCCAACCTAAAAAGGACACACCACGGAAAGAGAAGGGTTCA 
CGGGAAGAOAAGCAOAAGCCCCAGGCTOAGCGGAAGOAOGAGAAAAAQGCGGCTGCCCTGCTCC 
TGAGGAGGAGATGGATGAATGTGAGCAGGCXjCTGGCIXKTCAGCCCAAGGCCAAGGACCC 
CTCACCTGCCAAGAGT 

SEQ ID NO: 2733 ACTTATITCAACAAnOTAGAQATQCTAGCTAOTOTrGAAGCTAAAAATAGC 
TTTATTTAlXJCTGAATTGTOArrr IT I ' 1 ATGCCAAAl" nTlTrrAOTTCTAATCATTGATGATAGCTT 
GGAAATAAATAATTATOOCATGGCATTTOACAGTTCATTATTCXTATAAGAATrAAATTGAa'nTA 
GAGAGAATGGTGGTGTTGAGCTGATrATrAACAGTTACTGAAATCAAATATTrATr ro 
TTCCATTTGTATTTTAGGTTrCCTTTTACATTCTTmATATGCAT^^ 

GACTATGGAAATAATTTAAAGATTTAAGCTCTGGTGGATGATTATCTGCTAAGTAAGTCTGA^ 

GTAATATTrrGATAATACTGTAATATACCTGCACACAAATGCTmCTAATGTTTTAACCITGAGTA 

TTGCAAGTTGCTGCTTTOT 

SEQ ID NO: 2734 ACTGGTCCAGGAGTTATCCAGGATAGATTTTCACXCACCATGGGG CGTC ATC 
GTTCAAATCAACTCrrCAATGGCCATGGGGGACACATCATGCCTCCCACACAAT^ 
AGATOOGAQGCAAOTTTATOAAAAGCCAGGGGCTAAOCCAGCrCTACCATAACCAOAOTCAGGG 
ACTCTTATXXCAGCTGCAAOGACAGTCGAAGGATATGCCACCTCGGTTrTCTAAGAAAGGACAGC 
TTAATOCAGATGAGATTAGCCTGAGGCCTGCTCAOTCGTTCCTAATGAATAAAAATCAAGTGCCA 
AAGCTTCAGCCCCAGATAACrATGATTCCTCCTAGTGCACAACCACCACGCACTCAAACACCACCT 
CTOGGACAGACACCTCAGCTTGGTCTCAAAACTAATCCACCACTrATCCAGGAAAAGCCTGCC^ 
GACCAGCAAAAAGCCAOCACCGTCAAAOGAAGAACTCXriTAAACTAACTGAAACTGTraTGACTG 
AATATCTAAATAGTCGAAATGCAAATGAGGCTGTCAATGGTGTAAGAGAAATGAGGGCTCCTAAA 
CACTITCITCTOAGATQTTAAGCAAAGTAATCATCCrrOTCACrAGATAGAAGCGATOAAGATAAA 
GAAAAAGCAAGTTCTTTG 

SEQ ID NO: 2735 ACAAAATCCCCTTTGTTGAAAAATAAGGGGCnTCTAAACTAATAAAAAAGO 
AAGTITTCAAAAArrATAOTITATrAAAACAAC TTITIT GGCAAA CAAAQT TACTr^ 
AATnTATACTGTGAATAAAATICCAAATGAATCl 1 1 ICl lAAAACTTTTTAAAAAATTATGTGCXZA 
GTOTATACTAATGCTATAGATTCTrcTCTTAGAAGTTTrrAAAGCATICrGTTAATGC^ 
CAATGGGACTCCAAAAATATAGTCAATAATCATGATAAAAAATTATAATATGATTATCAAGTGAA 
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GCAGGTATTGAGAAATAAAAATTCTCACTTGCTCACTGGCAATTTCTTOT 

GAAGGCCTGAAGTAATTCAGACAGATAGCTGTrTATGGTGAATTATAATAATACTTCATGAGGGC 

AGAGCTAArrAAAACTAGTAATTGCTTAAAATTCAAAGCXATTTCTGOACATATAAAATGAGAGA 

TGAATCTGAAAGTTmCTmGTAAAACCTCTTTCCAGTTCTTAAAGTCCAGT^ 

CAATCTGATCrACCATTGCATATTAATGGATCAACTTAAATGTGGTATTTAGATGTGCAT^ 

TTGGTATAAT 

SEQ ID NO: 2736 ACANCCACCNNCTCTGCGAGGACNTTATCCNACTGAANCCCGACGNGGACAT 
T^^nWNTAAGGGCAN^rTGAGA^m4AGCNCAGCACTAa^TATGCGGGCCAAT^^ 
GCAGAGNCX;GGAmACANACAATAATCGNATTGCTAaAGCCrGTOOGGCCCGGATAGTCAGCCG 
ACCANAGOAACTGAQAGAAOATaATOTTOaAAtyLGGAGCAGGCCTGTmCKJAAATCAAOAAA^ 
TGGAGATNAOTAC^OTACTTTCANCACTGACTGCAAAGACCCCAAGGCCTGCACCATCC^ 
GGGGGGCTAGCAAAGAGATTCTGTCGGGAAGTANAACGCAACCTCCAGNATGCCATGCAAGTGN 
GTCANACTCNTCTXXnXKJACCCTCANCTGGTGO^AGGNGGNGGGCCTCCNAGATGGCT 
CATGCCmGACAGAAAAAGNCNATGGCCATGACTGGGTGTGGOAACAATGGGCCNTACAAGGGC 
TGT 

SEQ ID NO: 2737 ACATCCAGCAGCrCTGTGAGGACATTATCCAACTGAAGOCCGATGTGGTCAT 
CAmTAAAAGGGCATCTC^GATTTAGCTCAGCACTACrrTATGCGGGCCAATATCACAGCCATC^ 
CAGAGTCCGGAAGACAGACAATAATCGCATTGCTAQAGCCTXjTGGGGCCCGGATAGTCAGCCGA 
CAGAGGAACTGAGAGAAGATQATGTTGGAACAGGAGCAGGCCTGTTGGAAATCAAGAAAATTGG 
AaATOAATACTrrACTrrCATCACTOACTQCAAAOACCCCAAGGCCTOCACCATTCTCCTC^ 
GGCTAGCAAAGAGATTCTCTCGGAAGTAGAACGCAACCTCCAGQATGCCATGCAAGTGTGTCGCA 
ATGTTCTCCTGGACCCTCAGCrcGTGCCAGGGGGTGGGGC^ 

TGACAGAAAAATCCAAGGCCATGACTGGTGTXKiAACAATGGCCATACAGGGCTGT^^ 
TAGAGGTCATTCCTCGT 

SEQ ID NO: 273 8 ACCATOAOAAATGTCGOTTCAGATATTGCrGTGCTAAGGAGACAGCAACGTA 
TGATAAAAAATCGAGAATCCGCTTGTCAGTCTCGCAAGAAGAAGAAAGAATATATGCTAGGGTTA 
GAGGCGAGATTAAAGGCTGCCCTCTCAOAAAACXiAGCAACTGAAGAAAGAAAATGGAACACTGA 
AGCGGCAGCTOGATGAAGTTOTGTCAQAGAACCAGAGGOTAAAOTCCCTAOTCCAAAGCGAAGA 
GTTGTCTGTGTGATGATAGTATTGGCATTTATAATACTGAACTATGOACCTATGAGCATGTTGGAA 
CAOGATrcCAGOAGAATOAACCCTAGTGTOAOCCCTOCAAATCAAAGGAGOCACCTTCTAGOATT 
TTCTGCTAAAGAGGCACAGGACACATCAGATGGTATTATCCAGAAAAA CAGC TACAGATATGATC 
ATTCTGTTTCAAATGACAAAGCCCTGATGGTGCTAACTGAAGACCAl^ 
CTTGTCAGaXXTAArrAACACAACAGAGTCrCTCANGTTAAATCATCAACT^ 
ATANCATGAAGTAGAAAGGACCAAGTCAAGAAGAATGACAAATATCAACAGAAAACCCGTATTC 
TTCAGGGTGCTCT 

SEQ ID NO: 2739 ACAGOATOAATTTAAATGTGTTmCXrrGAGAGACAA GGAAG ACTItK^ 
TOCCNAAAACAGGTAAAAATCTTAAATOTGCACCAAGAGCANAGGATCAACTTTTAGTCATO 
TGCTONAANOACAACAAATCCCri'l I It 1 1 ICTCAATrGACTTAACTOCATGATTTCTGTTTATCTA 
CCTCTAAAGCAAATCTGNANNGTTCCAAANACrrrGhWATGGNrnTAAGCGN^ 
AAAAATONAAATTCrITCAANCAGAGCTT^WCTGGNAAAAAGCATATT^■CT^ 

TTAAAATAGGATCTGAACATTCAAANANAAAGCTTTGGAAAAAAAANAGCraGCTGGGCCT 
ACCTAAOTATATGGATGAANAnGGrmGACTGTCTTrCCCAAGCCNCATGTTCATGGGNAGGGGC 
AATCGGrrATTNNGGNTATTmACTAAATTGGGmCTCTCATTTTC^ 
TNATATTTTGGATCANGOTOTTTGGCrOTTACTAANAAACOT 

SEQ ID NO: 2740 ACGGTAACTGACTCCAGGGTCACTCATACTOTGTCCGTGGTAACGGTA^^ 
GCNNCTCCATCAGGATGGGCCCXrnrCXAGATCTACAATAGGCAGCAGCAAACCTrGTTGCCrrCT^ 
GGACGCACAOGATATCCATTCCATXrACTCTCAGCCCAQOAATCAAATCaCC^^ 
CAGTGCTGGCTGCCGCrCIXnXIAACAGACGTTCXXATTCCATAGCOATTATTCTCACAGATGAAAA 
TACAAQOTAATrrCCACAAAOCTGCCATOTTGTAAOCTIXXlAATATCTGGCCCTGG^^ 
CATCGCCATATAAAGTCAGGCAGACCTCATCTITTCCATTATACTrACAGGCTAGAGCAA 
CGCCCAGGGGCACXrrGCGCTCCCACGATGCCATrcCCCCCGTAGAAGTrCTTGGCATACATGTGCA 
TCGATCCTCCTTrCCCmAGCACAACCTCCTTrrCGTCCTGTA^^ 

GGAAAGGCCCCGGGTGAAAGTAAAGCCGTGAGCCCGGTAGGCTGTGATGAGATGGCTGTGGGOTT 

GATGCCGGCXTTCCAGCCCACACAQCAGCTTCCTXiACCATCXCACAAC^ 

ATTTTCT 
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SEQ ID NO: 2741 Aa:AGAGTCAAGQTGCCAGGOTCCTGOOAOCAACAAATOCATTCCCATTGAT 
TTATIKITGAAAAACarrCTGATCAGCCTACCCAAAGGCCCTGCCCTACCCT 
ACATACGTTCAOTTTGTAATICriTrrCCnTrrGCTGCACAATAACTGOT 
GOAAGACTCTGGCTTCGAAATCrcCAAAATGCATACTrTATCTGT^^ 
CTrCMGACTCACTTGCTGACTTCCTAACCatlAGTrrCTC^ 

GGCTITAAAGAGGTGATTAAAATGAAGACATrAGGaTGaOTCCTATTCCAATATQACTOATGTCCT 

TATCAGAAGAGTAGAnTAGGACACAGACAAAAGGAAGGCCACGGAAGACAGAOTGAGATATGA 

CCATCTGCAACCAAGGAGAGAGGCCTCAGAAGAAACCAACXXTGCTGACATCT^^ 

CTAGCCTCTAGAACTGTGAGAAAATAAATGTCTGCTGCTTGAACCATCTGGCTGTGCTAC^ 

ATGGCATCCXTAOAAAACTAATATAATCAAGrrrGACAAATGCATCCCAAGCATTANTAACCAGT 

GAGCATCnTG 

SEQ ID NO: 2742 ACTGGGAGATACAGCCATCCACCTTCAGATGTGTCTACGTGCXjCTCTGCCATr 
CAACTCGOAAACTATAAOTAATTCTCAAOAAAGCCXn'CArriTrATAACCT 
ATGTCATTGCTAAAAAATAAATAAAAGCrAGATACTGGAAACCTAACroCAATGTGGATGTTTTA 
CXX;ACATGACTrATTATGCATAAAGCCAAATTrcCAGTTTAAGTAAriXX:CTACAATAAAAAGAA^ 
TTTTGCCTGCCATTTTCAGAATCATCTTTTGAAGCTTTCTGT^ 

ATTCTTATTTCACTAAATGTAAAATTrcGAGTAAATATATATGTCAATATTTAGTAAAGCr 

TTTrAAmCCAGGAAAAAATAAAAAQAGTATGAOTCTTCTGTAATTCATTGAGCAGTTAGCTCAT 

rroAGATAAAGTCAAATGCCAAACACrAGCTCTGTATTAATCCCCATCATTA CTOOT AAAGCCT^ 

TTrGAATCTGTGAATrcAATACAGGCTATGTAAAATTTTTACTAATGTCATTATriTGAAAA^ 

AATTTAAAAATACATTAAA 

SEQ ID NO: 2743 ACAGCTTAAACCACAATGGTATAAATCTTCArrTTGTAATTAATAATrrCT^ 
CATAACAATGTTTGATATTTGCAAACAAACAACATITITGGAAGCATrAGATTCAGTC^ 
CTGTCACAAAATrAACTACAGTCAGTCrGTOCAATGAAATTGATGTTGGAOTTCTATQTGTG^ 
ATTTCATGTTGAAAACAGATGGTAGTGCTCCTAGAAATA l-riCl'l'C"i i CTAGCTAATGTGCNTrGG 
AACTCACATGTATAACCAATGACTGACTCTGAAATATCAAGCACTGTOOGOTGGCTOQAAGGTAA 
AGGTCTAAGCrTTGTGAAACACTATACATATATAATCTATATTTACTTATGTTX3GC^ 
ACAGTAAAAOTCACAATACAOCTAGAACATACCAGAAAAGCAAGCTrrcTCATTCCTGCm 
GGTATGATCTXXiTCTAAACAAACATTTCAmCAGAAAATCrGCATCAATCTACACGGAC^ 
CAGTGCACAAACTGAAAAGGGCrri 1 1 1 1 1 1 1 1 1 U I CTCCCGCGT 

SEQ ID NO: 2744 ACCAGCOATTCCTGCGGCAACACGTGCACCCTGAGGAGACAGGTGGCAGTGA 
TCGCTACTGCAACTTGATGATGCAAAGACGGAAGATGACTTTGTATCACTGC^ 
CrrCATCXV^TGAAGATATCrGGAACATTCGTAGTATCTGCAGCACCACCAATATCCXAT^^ 
CGG<>AGATGAACI<KXATGAGGGT0TAGTOAAGOTCACAGATTCCANGGGACAC>GGGAAGTC 
CAAGGCACCCAACTGCAAATATCGGGCCATAGCGAGCACTAGACGTGTTGTCATTGCCTGTGAGG 
GTAACCCACAGOTGCCTXnXK:ACTnGACGGTTAGATGCCACCATGTAGGGATTATCGC 
TGACCTTACACn-ACTCXriTAAATAGCAGTa^GTAATGCATTTGAGCTGTCCCAGGCTC^ 
CAQCTCArrrCCTACTCrnTrCTCTATATAACTCATrcrATTAAATACATTG C^ 
GGAGACATAAACCTGTAATGAATGAGGCTGGGCTTTrCTGTAATAAGCTTCCTTTTATAA^ 
TCAGCTrAGCrCTCTCAGATCCTATCCrGNGGAAmAGTTATrATGGGGTATTTATO^ 

SEQ ID NO: 2745 ACAAaTATAGAAAAGGTAAAOGAAACCCCAACATGCATGCACTGCCTTGGT 
GA(XAGGGAAGTCACCCCACGGCTATGGGGAAATTAGCCCGAGGCITAGCTTTCATTATCACTGTC 
TCCCAGGGTGTGCTTGTCAAAGAGATArrcCGCCAAGCCAaATrCGGGCGCTCCCATOT 
GTTGGTCACGTGGTCACCCAATTCmQATGGCTTTCACCTGCTCATTCAGGNAATOTGT 
AAGTCACACAAATGGGGGTCATirrnjrCAGTGGCCAGTTTGTGCAGTTCCAGTAGTGACro 
ACATTTTrnCCAAATGTAATGCACACTCCATTGCATrCAGCCCGCTCTCCC^ 
GTrrCTTOATAl'CCTGAAGGAAGATTCGGCCACCTX;GrrGGTTCTGCAGCrTCAT^ 
ATQTTCCCTCrCCTCATGAGATTGGTGAAGAAAGTAmGGCAAAGTrcrrCAAAGCCACATCATC 
GCGGTCAAAGTAGTAAAGACATGGACAGGTAAACOTAAGGAGGCGTANAGCTCCAGGTTGATCT 
GGCGGTTGATCGCXMCCTCTGAGTCCTGGGGGTAGNTmTGGCGCCCT^ 

SEQ ID NO: 2746 ACTGGGAGATACAGCCAT<XACCTTCAGATGTGTCTACGTGCGCTCTOCX:ATr 
CAACTCGGAAACTATAAGTAArrCTCAAGAAAGCCXnx:ATTmATAACCTGGCAAAAT ^ 
ATGTCATnKn-AAAAAATAAATAAAAGCTAGATACrcGAAACCrAACTGCAATGTGOATGrrrr 
CCCACATGACTTATTATGCATAAAGCCAAATTTXXIAGTTTAAOTAATTOCCTACA^ 
TTTGCCIXKXATTTTCAGAATCATCrmGAAGCmCTG^ 

TrCTrATrrCACrAAATGTAAAATTTGGAGTAAATATATATGTCAATATTTAOTAAAGc 
rrTAATrrCCAGGAAAAAATAAAAAGAGTATGAGTCTrCrOTAATCATTGAGCAGTTAGCTC^ 
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GAAATAAAGCAAATGCCAAACACTACKrrNTGTArrAATCCCNATCNrrACNGGGAAAAGC^ 

TGAATOGGNGAATCCAANAC^iGGCTTGGAAAATnTrACTAAGGNCATTATTTNGAAAAAATAAN 

TTNAAAAATACmTCAAATTTACTATNGTAANCACCTTAATTGGNNAATATTCCCTAAACAC^ 

SEQ ID NO: 2747 AOCACTa;AGTTGTCrrCACAArrrAATGCTACAGAAACCTAAATGTrTCTAC 
TTCAGCTACGAAATCAnCAATrGGCV^GGCAAAACATTTmATrACCTAmATAA^ 
AAAGGGAGGAAAAATAATAGAATGTATAAACTCATGCTATCATAATTCTAATG TCTA G CTATA AA 
GGCATATGOrrGTAACAATKKn'ATCTAAAATTTmATT AGCTG CAGTAGCTC 
TACCATTrmAAAGTCTrTTTGCTTGGTCATCTAAAGACGTTITrcAA^ 
TTATTGTCCTCCTTGTGCCCCTAAGGAATrAATAAAATGCACATTCATmX^ 
CCAOTTAAAAACQAAACAATQCAAAAATGGCA AACA TTTTC TrQOAC CAri'l 
ACAAAAAATGTGCAAGGl'AAATAGAACTX^AGTmGTTTTATTlTn'GGCT 
AGTCCCCCCAGAATAATACAGGTTACATAGCTGTAACCATCrrAAGGAAAATGACATTAAGTrTG 
CAACAAATACTTGCTGGGTGAAAACACATTACTAGNCATn'CAAAAATTrACAAAAATGTAAACA 

SEQ ID NO: 2748 ACAGGATGAATITAAATGTOTnTIXXnXjAGAGACAAGGAAGACTTGGGrAT 
TCOCNAAAACAGGTAAAAATCTTAAATOTGCACXAAGAGCANAGGATCAACTmAGTC^ 
TGCTGNAANGACAACAAATCC CI 1 1 111 U I CTCAATTGACTTAACTGCATGATTTCrGTTTATCrA 
CCTCrAAAGCAAATCTGNANNOTTa:AAANACrrTTG>WATGGNrm^ 
AAAAATONAAATTCTTCAANCAGAGCTrhn^CTGGNAAAAAGCATATTTTCTOTOTTTC^^ 
ACTGrrGTCCTTCCCCTCACATAGACACTCAGACACCCTCACAANCNCAGTATrCTATT^ 
TTAAAATAGQATCraAACATrcAAANANAAAOCnTGGAAAAAAAANAGCrGGCTGGGCCTAAA 
ACOAANTATATGGATCAANATTGGTNGACTGTCrrrrcCAAGCCNCATGTTCATGGGNAGGGGC 
AATGGGTTATTNNGGNTATTTNACTAAATTGGGTrACTCrCATTTTO^ 
TNATATTTTGGATCANGGTGTTTGGCTGTTACTAANAAACCTr 

SEQ ID NO: 2749 ACAGCTTAAACCACAATGGTATAAATCTTCATTTTGTAATTAATAATTTCTrG 
CATAACAATOTTTGATATTTGCAAACAAACAACATTmGGAAGCATTAGATTCAGTCCATAGAT^ 
CTGTGACAAAATTAACTACAGTCAGTCTGTGCAATGAAATTGATGTTOGAGTrCTATGTGTGTGGC 
ATTTCATGTTGAAAACAGATGGTAGTGCTCCTAGAAATA'l' I'lC 1" IL" 1' 1 CTAGCTAATGTGCNTTGG 
AACrCACATOTATAACCAATGACTOACTCTOAAATATCAAGCACrOTGOGOTOOCTOGAAGGTAA 
AGGTCTAAGCTrrGTGAAACACTATACATATATAATCTATATTTA CTTAT GTTGGCAATTAATATA 
ACAGTAAAAOTCACAATACACCTAOAACATACCAGAAAAOCAAOCTTTGTCATTCCnXK^ 
GGTATGATCTCXiTCTAAACAAACATTrCATTTCAGAAAATCTGCATCJ^ 
CAGTGCACAAACTGAAAAGGGCn'l'n 1 i 1 1 1 i 1 1 1 ICTCCXGCGT 

SEQ ID NO: 2750 AOCAGCGATTCCTGCGGCAACACGTGCACCCTGAOOAOACAGOTGGCAGrGA 
TCGCTACTGCAACrrGATGATGCAAAGACGGAAGATGACTrrGTATCACTGCAAGCGCITCAACAC 
mCATCCATGAAGATATCrOGAACV^TrCGTAGTATCTGCAGCACCACCAATATCCAATGCAAGAA 
CGGCAAGATGAACTGCCATGAGGGTGTAGTGAAGGTCACAGATTGCANGGGACACAGGGAAGTC 
CAAGGCACCX:AACTGCAAATATCGGGaV^TAGCGAGCACTAGACGTGTTOTCATO 
GTAACCCACAGGTGCCTGTGCACnTGAOGGrrAGATGCCAOCATOTAGGGA™ 
TGACCTTACACTTACTCXnTAAATAGCAGTGAGTAATGCATTTGAGCTGTCCCAGGCT^^ 
CAGCTCATITCCrACTCITriTCTCTATATAACTCAT^ 

GGAGACATAAACCTGTAATGAATGAGGCTGGGCTriTCrcTAATAAGCTTCCnTTTATAAT^ 
TCAGCTTAGCrCTCTCAGATCCTATCCTGNGGAATTTAGTTATTATGGGGTATTTATO 

SEQ ID NO: 275 1 ACAACTTATAGAAAAGGTAAAGOAAACCCCAACATGCATGCACTGCCTTGGT 
GACCAGGGAAGTCACCCCACGGCTATGGGGAAATTAGCCXXiAGGCTTAGCTTrCATTATCACTGTC 
TCCCAOGOTOTGCTrGTCAAAGAGATATTCCOCCAAGCCAGATTCGGGCXICTC^ 
GTTGGTCAa>TGGTCACCCAATTCTTTGATGGCTrrCACCTGC^ 
AAGTCACACAAATGGGGGTCATrTTTGTCAGTGGCCAGTnXSTGCAGT^ 
ACATirrmCXAAATOTAATGCACACrCCATTGCATrcAGCCCGCTCrCCCAGTCAT^ 
GmCrrGATATCCTGAAGGAAGATIXXJGCCAOCTCGTTGGTrCTGCAGCrrc 
ATOTTCCCTCTCCTCATGAQATTQGTGAAOAAAGTATTTGGCAAAGrrCnCAAAGCCACATC^^ 
GCGGTCAAAGTAGTAAAGACATGGACAGGTAAACGTAAGGAGGCGTANAGCrCCAGGTTQATCT 
GGCGGTTGATGGCGGCCTCTGAGTCCTGGGGGTAGNTNTTGGCGCCCTGNGAAGTGGACGCCG 

SEQ ID NO: 2752 ACTGGGAGATACAOCCATCC^CCrrrCAGATOTOTCTACOTGCGCTCTGCCATr 
CAACrCGGAAACTATAAGTAATrClCAAGAAAGCXX:TCATTTTrATAAOCTGGCAAAAT^^ 
ATOTCATrOCTAAAAAATAAATAAAAGCTAGATACTGGAAACCTAACTGCAATGTGGATGrnTA 
CCCACATGACTTATTATGCATAAAGCCAAATTTCCAGmAAGTAATrGCCTACAATAAAAAAA^ 
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TTTGCCTGCCATmCACUUTCATCTrrTOAAGCmCTGTTOAT 

TTCrrATrrCACTAAATCTAAAArrrGGAGTAAATATATATGTCAATATrTAGTAAAG C l 1 1 ICl r j 

TrrAATrrcCAGGAAAAAATAAAAAGAOTATOAOTCTTCTGTAATCArraAGCAGTTAGCrCAm 

GAAATAAAGCAAATGCCAAACACrAGCrmXn"ATTAATCCCNATCNTACNG^ 

TGAATGGGNGAATCCAANACAGGCTTGGAAAATTTTrACTAAGGNCATTATTTNOAAAAA^ 

TTNAAAAATACmrCAAATTTACTATNGTAANCACCITAATrGGNNAATATrC^ 

SEQ ID NO: 2753 ACCACTCCAOTTGTCTTCACAATTTAATGCTACAGAAACCTAAATGTrTCTAC 
TTCAGCTAajAAATCArrCAATTGGCAGGCAAAACATTITn'ATrACCTAm 
AAAQOGAGOAAAAATAATAGAATGTATAAACTCATGCTATCATAATTCTAATGTCrAGCTATAAA 
GGCAT ATGO TrGTAACAATTGCTATCTAAAATTTmATTAGCTGCAGTAGCTGCTTTC^ 
TACCATnTTTAAAOTCTTTTrGCTnaOTCATCTAAAQACGTITrTC 
TTATTGTCCTCXrrrGTCCXrCrAAGGAATTAATAAAATGCACATTCAm 

CCAGTTAAAAACGAAACAATGCAAAAATGGCAAACA rr IH ClU OOAOCA 11 ■ I'l'I riCACTGTAACA 
ACAAAAAATOTGCAAOGTAAATAGAACTCAAGTmGTTTTATTTTTTGGCTGT^ 
AGTCCCXK:CAGAATAATACAGGTrACATAGCrcTAACCATCTTAAGGAAAATGACATTAAGTTTO 
CAACAAATACriTGCTGGOTOAAAACACATOCTAGNCATTTCAAAAArrrACAAAAA^ 

SEQ ID NO: 2754 ACCCTGATGCTACAGACGAGGACATCACCrCACACATGGAAAGCGAGGAGTT 
GAATGGTGCATACAACGCCATCCXXXiTTGOCCAGGACCTGAACGCX}CCTT^ 
OTG0OAAGGACAG1TAT0AAACGAOTCAGCTOGATGACCAGAOTGCTGAACCCCCAGCCACAAG 
CAGTCCAGATTATATAAOiGGAAGCTATTGATGAAAANCATGGANNTTCCOATGTGAATGGATAG 
TCAGGAACTITCrAAAGGCNGCCGGQAATTaiACAOCCrrGANrri^ 
TGGTrGNANACCCCCAAAAGGTNAANAANAANAOTAACCCCCCCAAAATTTCNG>ri^ 
ATTAAAANGGGC>nTITITTNGGGGCCAATTTAAAG>nWAAAAAAATCCCATTm^ 
TrrrWCCNAAAAAAAAAAATNGrnmTTTNCCO^AATGGNAAAAGAACNCC^ 
TTTCCAAATTTTANOGGGGOANGGGGNGTTTTTrmKKjGGNNOT 

NANAANAAAAAATAAAAGNOGGGO^^^TmANOOAANCCCCCNNGANAAAAAAAAAC^^ 



SEQ ID NO: 2755 ACGCXGGGGACTTCCGGGTCGCGGTGCTTGAAGGGAGTGTTCCGTCXiTTrCC 
GTTGCCGGCTGTTTGCAGTGOGGAAACCOAOGCAGCTCCTGCTCCCCCTAGTTCTTCCGCTCCTG^ 
GAGGAAAAAAAATGTTTA II 1 C CI 1 GTGGGAGTICTTCTATGGGC AC 1 I'll T 1 CGATTTTGGATNAA 
>mjGCTATTACAAC>GATGACTGGNAAGTGTGAATTNGCAKa^AATNTTGATCOT 
CCCAAAOGAACCCCCGGOATAAAAAATTCCTNGACATACNCNAANAmAAGGTTTNACANAAGG 
CAACACATGTTGTTCANAGTGAAGNGGACAAmOTGTANATGATTTTATGANGGAAAAGAATm^ 
CCOCCTGAGAAGGANGCCArmANAAAAATATGCATGAAGATGTGCCTTCTGNAAATANCTGm 
ATAAACAGKrTGTTTTTGGATGAAAAAAGGGGGNANNAAAAANGawrATTATTTTO 
ACCGAANNGAAAANGCTNCrCATGAAGNTTNGGAATCrTCTAATNCCCACGAAAAAATTAACCCC 
CrAATATrCTCCAAAG<>IGGNGGACTTAAANTTCTrrTNGNGGGGTGATGAACCCAAAA^ 
TT 

SEQ ID NO: 2756 GTACATTCGAATGTCTGCCAATCAGGAAOCrCCCCAAOCCrTCACrOGCCAG 
AGTITTTATTTGGGGTTTCmACATANGCAGCATTGCTrAAATCACTGG^ 
CTCAANTrCCAGCCCCATCACCTCCrrGGAGATTGGGGGTGGGATGGGGCTGAAAATINCAATGCr 
TTAAOIACATOCrrGOTCTITCrGGCITraGCCAACTCrrOTGAAACTAAN^ 
A<XTTOGTTNGmTAACCCC3^AGafITGGGTCAA\AAGGGCT^^ 
CTCCTATCANGAAACTTTTNGAATTTTTGAANCaXJTGTGCTrAGGAACNA 
NNA>^NC^^TC^ITATTGTATNACCACTGGGGAT^^"GCCCACA^^^GATAAAA;^^ 
GN^mJNTGAGG^^^UAAAACACCTANACNCTTNAAGT^^^IT^^ 
AAAAOTICrrTTmnrrOGNAAAGAAAGANNCm'CCTGGGGN^ 
CTTWTAAAAGGAAAANAACNACaXKrCTO 
GCCnTTG 

SEQ ID NO: 2757 ACCAGCAGTGTGTCAGGTGCTGCAOAGOGTTCTTGGAGAAGGCCCACTGAGG 
CAGGTTCGTGCCCTGCTGCGGCCAGCCTGACTAGACCCCACCCTGAGGTCCTGCAm 
GTGTOTAATCACGTrCX:AGGGOCCAAAGCCCAGCTCTTTOTTCAQT^ 

AAAAAGTAATIXnAGATGGAAATCAGTTGTGTITGGCANGGANAATCAATAAAAAArrmGmT 
CAAACAGCmATGGGGTATTTTAAGCATTCTrAAACTAGTTOAACATCrcACTTTGC 
AAAAATAGTAGAACAAAGCAACATAAAACAATGAAGGAAAACCTCACTTGAAGGCCCAGGTCAA 
CATCTAAG<XTGrrGAGACITANATAATCGAGTCTACCTCTTrAGTAGTrrGT^ 
AAOGCAAGGTGCCTnTGNTCOCCANTQOTTA Cl i riTl 1 - n CCTAAGGGCCTTTTGNNGGATTGAC 
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AAGTAATCCCCTTCGTAGGANCTrANAANTCTAAAATTAGAAAONGNTTrmATTTr^ 

CCATAAGTGCNNACTTGT>nTm'GGAAAAAATANGGGAAAAANAAAAANATTT 

TC 

SEQ ID NO: 2758 ACGCGGGGAGCGTAAGACGGCGCTATTCOGCTGTAACAGCTTCCGGCGGGTC 
CTGGATGTTGATGTCCTGCATCTAACGCGGTGTAACCCCCGAAGCCGAGCGAGCTCCGGAGGAAT 
TTCAOTATCTGCTACOGTAACrrcATCAGCCXXJCCAAGATGGCaATaCAAGCOGC^ 
AACATTCGACTTCCACCTGAAGTAAATCGGAT^^^TGTATATAAAAAATTGCCATAAAAAA^^ 
CNTGAANAAATOTTTA'l'l'l" n'l '1 ' IGGGAAAAATTGACCCTATrTCOT CAAAAT CAGAHNGGGQAAC 
ACACCCTOAAACTNGAGGAACAGCTrATGTTGGTCTATrAAGGACArrriT^ 
GCATGTGATCACCTATTGGGAT^CAATGGT^^■GTAACAAGATACCTT^mTGG^^IT^ 
NOGCNGCCNOTTTOAAAAOGCNANTrcCACANAACTGGCCGGCCGl^ANNTAGTTOATCCGAQCT 
TGGTCXX:CAACTITGGNGTAAATAAATGGGCANATNCTNNTTTCTGGNNOAAATTThO^ 
TCACCAATTCANCAACTTA^^'AACCC0NAAACTATAAANTT^AAAANCTOGGOTNCCCT 

SEQ ID NO: 2759 ACGATrTCGTTCTCTGATTCCAGCAOAGGGAAOGCTATCC AAAA AGTTGGCA 
GCAATAAAACGTAAAATTCATCCrGATCAGAAAAATArrAATGCCTATGTTGTGTTTAAGGAGGA 
GA0TGCIX3CCACGCAAGCATTGAAAAGAAATG0GGCCCA0ATT0CAGAT^ 
TTGATCTCGCATCTXiAGACCrrCATCTAOAGACAAGAGATCGGTnTrGNGGGGAATCT 
AGGTGAANAATCTGCCATTGAGAAGCACnTCrGGACTGNGGAAGTATCATGGCCGTGAGGATTG 
TGAGAGACAAAATOACAGGCATCGGCAAAGGGTTTGGCTATGTGCTCITrGAGAATACAGATrCT 
OTTCATCITGCTCTGAAATTAAATAATTCTGAACTCATGGGGAGAAAACTCAGAGTCATGC^ 
OTTAATAAAOAAAAATTTAAACAACAAAATTCAAATCCACOArrOAAOAATOTCAAGTAAACCT 
AGCAGGOACTTAATTTTACTTCCAAAACTGCAGAAGGACATCCTAAAAGCTrATrrATT^ 
AAAGCTGTrCTCCTTAAAACCGAAGAAGAAAOGACAGAAGAAAAQTGGACOCCCrAGAAACAOA 
GAAAACAG 

SEQ ID NO: 2760 ACCAAGGOATGGAAGAAGTAAATATAGCTCAGGTAGCACTTTATACTCAGGC 
AGATCTCAGCCCTCTACTGAGTCCCTTAGCCAAGCAGl' r I'Ci'i'iUA AAOA AOCC AGCA GGC^ 
AGCAGGOACTGCCACTOCATTTCATATCACACTGTTAAAAGTTGTGTTTTGAAATm 
TGCACAAATTGGGCCAAAGAAACATTGCCrrOAGGAAGATATGATTGGAAATTC AANAGTGTA NA 
NAATTAATACTGOTTTACTOGCCAAAOACATOTITATAGNGCTCroGAAATQTTC 
GTCTCTGGCAAGATGCTTTAGGAAGATAAAAOTrrcAGGAGAACAAACAGGAATTCTGAATTAAO 
CACAGAGTTCAAGTTTATACCXGTTTCACATGCTTTrCAAGAATGTCGTAATrACTAAGAAOCA^ 
TAATGGTGTTTTTTAGAAACCrAATTGAAGTATATTCAACCA AATACTT TAATGTATAAAATAAAT 
ATTATACAATATACTTGTATAGCAGTTTCTOCTTCACATTTGATTT^^ 

TAGAGATCTATATATOTATAAAATATOTATrrNGTCAAATTTGGTTACTTAAATATATAAGAGACC 
A 

SEQ ID NO: 276 1 ACCCAACTGGArrGTCATCTrCTGAGAATGCAAACrGCAGACCCCATATCnT 
aAaGATCCTTTCAGTATt:AACTCTGGAAAGAAAATGACTACATCAAC^ 
CACAATTCTGGACTGCAATACATGTAAATTTGCTAa:ATCACATGTAATCTCACT^^ 
AGCCAAGTCAATOTTTCOCTTATCTTOTGGAAACCAACTmATAAAATC^ 
TNTTACTATNAGGGGAAAACTTCGGAAGTGAAAATGCATCTCTGGTTTTAAGTAGCA^ 
AAAAGAGAGCTTGCTATTCAAATATCCAAAGATGGGCTACCGGGCAGAGTGCCATTATGGGTCAT 
CCTGCTGAGTGCrmGCCGGATTGTTGCTGTTAATGCTGCTCA TTTTA GCACT 
TTCTTCAAAAGACCACTGAAJ^GAAAATGGAGAAATGAAATATTTTACGAAAGAAAATAATAAC 
AATrATrCAATAATCrATCCTCANOTTTGCCTCAAATATGTGACAAOAAATOTATAATT^ 
TAGTCATGTTACTATGTAATCCATCAGGGATTCATTACTTGGGAAAATGACAGGGTCATGCAT^^ 
CC 

SEQ ID NO: 2762 A Cl l 1 1 U 1 n rnTl - l Ti r rn'riTl rn rACAAAAOTTGTTACACAAATACAO 
CTGACCAGAAGGTCTAAAAACAGCCCAAACrNTTCCAACCCrCATGCAl'NTGTANATANAAGGAG 
AGCTGTGOTC^TQC^X:ACACACAGGGOAGCCX^T^^TANAAAAACTGCCTGTXXXr^^ 
AAAGTCTTGGGTCCAGCANCANANAGGAGCCCAACCTGCOTGNACAACCCTTNGAGGNNN 
GGNT^WNAGCT^WNNTGGGNGGGCNACANGTTTNAGr^TCATA^^TCACATQT^rc 
AOTCAAATCAAGGCNTGAAAATAAAAGGGAAAAAGGGGAAGGCTGGAAAAGGGAGCCTGGAAN 
AGGrrGCAGGTAGGGGAAGOAGACACAGTGGGCTTCCGANAAGCrcGCAATTTCTTO 
GGAGTrACGTAATNTTTCACTITAAAATrATraThMAArraTACGTAC^^ 

GTTANAGGTGGCANAACTATTTCACACTAACCAGmGAAGACCTACNC AGGA TrAATTACCTTCC 
AGCATCAGGATTATAGCTGGGGGATmACAAACCATTCnTATTTTCTAACnTCAGGGAG™ 
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TGTTmT 

SEQ ID NO: 2763 ACCTCAGGCCTGGGCACCTCTrrGCrTGAAATATGGCAAGACTTGGAAAAAT 
GrrrGCCCTTAOAATCTATCTCACTACTrrAOTrAaTTGTCTCCTTTGGGOT 
CCCTGATCTGGAACAGACTCCCTrrrCTAAAACTAAACnGACCACATCAAAAGTTrGTA^ 
TCTCCATGGTAATTAAACTTGCATTCAACACCATATGGTAACAGAAGATGGCAAAGGATAAAArc 
CAAANNTAAANCTTTCCAAGGAGGGCATGTTAGATGATAGAAGGATTAATTGCTiAGCTGGAT^ 
AGCTCAGGCrrGGGCATGAAGGAAACTGTCTCCCATGTGGTTTGGAAGAGTTAGOGOCTCXrCTGA 
GCrCTATraTGAACTATACGGGTrrCATCCAAGGAATGGTATGATGTGGGCATAAAACCATrCTTC 
AGACAACTCAAGATGGTCCCCTlXnXn"AGCCAGAAACACTAGCTGTCCTGCATrGT^ 
TAGCCCCAGGCGGTCCTGTGTGTACCrCGGC 

SEQ ID NO: 2764 ACATCAAAGATTACATGAAATCAATCAAAGGGAAACTTGAAGAACAGAGAC 
CAGAAAGAGTAAAACCTTTTATGACAGGGGCTGCAGAACAAATCAAGCACATCCrrGCTAATT^ 
AAAAACTACX;AGTrCTITATTGGTGAAAACATGAATCCAGATGGCATGGTTGCntrrATTQGACTAC 
CGTGAGGATGGTGTGACCCCATATATGAl 1 1 ICl 11 AAGOATGGTTTAGAANTGGAAAATTGTTAC 
CAATGNNGCAATTATTTIGGATCTATCACCTGTCATCATNACTGGCTTCTGCTO 
CACCAGGACTTAAGACAAATGGGACTGATGTCATCrraAOCTCTTCATTTATT^ 
ATTrGGAGTGGAGGCATTGTrmAAGAAAAACATGTCATGTAGGTTGTCrAAAAATAAAATGCAT 
TTAAACACATAANAAAAAAAAAAAAAAAAAANGTACCTCOGC 

SEQ ID NO: 2765 ACGAAQTrCTCAGTrrCACTTTAGTAGAAAGAGCTCTAGAAATGAG GCTGA T 
AAACACATCTAAGAACACTGGTTGCmCTAAAATITCCAAAGCTCCACCATAAATGTAATTTTrA 
GTGTrrcAAATOATTGCATnTAAAOTATATAAATATOOGTTATCCAATATCAATGCTATAGTAAC 
ATCCTGAAACAAAACAACCACAAAGGTATAAATOCCTAAACTCGAGGAAACTTGGAACCCTO 
GGTAAANNCTAAANGTAGTATTTCThU^CTTCNGAAGACANATTGGTAGGCAGCCAT^^ 
CTTAAAATAACTGGGGGCATAGTTAAAATTTTATACATCAAGTGATTGCTATTATO 
GGTGAOATGTOGTTATTTITAGTTTAmGAAATGTTTGACTGGAAAGGGGGGAGGGGGAAGCAA 
ATATTTOAAATTTOOAAAACCCTAAACXriTrTGGTAAGAAATTGTAAT^^ 
TAAGGATATAAGAGGTTTATAATTGATGTAGTTAAATTGAACAATAAACCATTGGTGACTGGAGC 
AGGTAATTATAGCCTGCAGAAAAAATTATCTAAGAATTTTAAAAATAAQAACCTOAAAGTTOGTr 
TAAATTGC 

SEQ ID NO: 2766 ACTAGGAAGGTTAlTGCACAQCTCXnTAACTGATCGGGGTCAGGGCAGAGTG 
GTCACmCCCCACAGCCAGGCTCTGGATrrGCCTCCTGTGAAOACACCATGCCTAGCACAGGCTG 
ACGGGGCGGCTGCAGTCGAACCTTGCCTCCAGATTATGAACCAGTATAAGTAGCACAATTCTCGT 
GGCTACT^^CACTrcAGAGTGCATGTTTATTGAT^^'GGAGCTT^C^ 
CrG>rrcAm"AAAACAAATATTrCnGNGTAGCGTTCrANATTGAGGGCAGCAGTC^ 
TAAACTTCAGTAGCATGTrCCTGTCAGGTTATAT GGOVG CCCTGGCTCAAG GATr TCTO 
ACGATrcCTAAGACTGrrrGCTAGCTGATTCTTTOTTITGOCAATTrGATOOT^^ 
TGTGATCCAGGGOTGTTCAAAOT 

SEQ ID NO: 2767 ACCAAGGGATGGAAGAAGTAAATATAOCTCAGGTAGCACTTTATACTCAGGC 
AGATCrCAGCCCTCTACTQAOTCCCTTAGCrAAGCAGTnCTnCAAAGAAGCCAGCAQGCGAAA 
AGCAGGGACTGCCACTGCATTTCATATCACACTGTTAAAAGTTGTGTTTTGAAATm 
TGCACAAATTGGG<XAAAGAAACATrGCCTTGAOGAAGATATGATTGGAAAATCA AGAAGNGN A 
AAGAATTAATTCTGGTTTACrcCCAAAGACATGTTTATAOTGCTXrraNAAATGTTCC^^ 
AAGTCTCTGGCAAGATGCTTTAGGAAGATAAAAGTnGAGGAGAACAAACAGGAATTCrrGAATTA 
AGCACAGAOTTOAAOTTTATACCCGTTTCACATGCTTTTCAAGAATGTCGCA^ 
GATAATGGTGTTlTTTAGAAACCTAATTGAAGTATATTCAACCAAATACTTrAATGTATAAAATA^ 
ATATTATACAATATACTirGTATAGCAAOmCTGCTTCACATTrGGATTTTrCAAATTTAAT^ 
ATATrAGAGATCTATATATGTATAAATATGTATTTTGGCAAATTTGTACTTAAATATATAOAOACC 
ANGTT 

seq id no: 2768 accttgatacacataatcagccrmcaaaaatgcctoacaagaattagtctt 
tcctttgtgctgaagtcttcccacccatggatggaagcacgctgacrccctgagggtcaggcaagg 
ggtgggaaagggaacacattactmgtgaaggcaaagcagaaaggcgtgtttgccagaccagcn 
tggcagctcagagggagcaaagcitccaccanaanaagcttcrcat^^t^^tg^^ 
agt>faaatttgaagcrgagtraacaatggga(xactgaacttcntccaatggaaaactcacggt 
cx;aotcccacagoaactgtgcgcataccaaacaacaatqaogaaggaagogccx)ggtggctcta 
ccaaacagttcaggtccacnkkjtgaatgaagcctggtgggaaag^ 
cctctgctggtcccactixnx:atcxitqgcgggtcgactgcctgaot^ 
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GGGGTTTGACAAAAM.CKjGAGTGAGGGGGAACCCCANGAAGATGAAGCCAAAACTC 

ATTCrGCTAATTCAGTITrTTGATC^AAAAArrCAAAAATrAOAG^ 

GGCCCGCG 

SEQ ID NO: 2769 acatcagggacctgtgcagtgggctcaagccagacacgcagccacagatgat 

TCAGGCCAAGCTCnAMGGCAGATCTTCACGGGGCTATTATTrCAGTGACAAAATCCAAATGCCC 

CTmATOTGGGTATrACAGGAATCCTrCTACAGGAAACAAAGCACATTTrcAAAAT^ 

AGAAGAO^GCCTGAAAGTTATCCCCAAGCTAAACTGCGTGTrCACTGGGGAACCCOK^ 

TCCTACATTTACGGGAGCAAATrCCAOCTTCOGTCAAGTGAACGGTCrGCGAAGAAGTTCAAAOC 

GAAGGGAACGATTGACCTGTGAATTCTITGCCGTCTAAGGCV^GTTGriTATGACAGCrcAAAAC^ 

GACACTCCCTAAATOTCCACCTTTCAGrGAAGAGATAGTrAAGCCAATTCCATTTATAGACCAOT 

CCAGCCAOTGACGCTCCGAQTTOAaOATGTTGAACAACATGGGAAGGTCGCAGCGT 

SEQ ID NO: 2770 ACTTACAAAGTTACAACCATTTGCTTCCTTAACATITrCCATGTTAAGTTCATA 
CATGTAGATATGATCAGATTTACCATrmAGGGGGAAGAGGGAAAAAAAGGTGTATTTATCATC 
AOCTAGATGTGCTCACrcTATCCTCCGTTATTTATATGCAAGGCCCGGGTGACTGGAAGTTC 
GTCAGGCATTTTAATAAACTGGACAGCCATrrcmcrOCCGACAAGGCTTCTrACACAC^ 
TCXK}0AAAAAACAGOAAACAACCAA0CACT^m)CACTONAACAC0CCACX^TAACA0CTAA 
GCATTACTCAACTGCTACACAACTGCGCCTAGTGCACAAAAATACATAAGAGAAGAGATTAGAAT 
TGTGTTTGATAAACAATCXnTrCAAAAAATCAAGTCnTnCACCTGAAAAGTCmATA 
GGrrTCAGATTACCATTTAGATCTGAAGGTAAATAACATATACAAATTTACACCAACTTTO 
TmAATTITAAGOAATGAAGGCAATGCCGAGTCAATATCTTGACAGCmAGGCTGTGTGTAATG 
GCTGCACACOTTnXXriTAAAAOTCAOGGAGGCCACAAAATTAQOOTACCCAATCTOGT^ 

SEQ ID NO: 277 1 ACGCGGGGTATCTTGATGCACrTAACTrATAGTTGTGATCCTGCACCTGGACCA 
GATCCATTITC^CnTATTGGAGAGAGCACGAmATTGTGOTGACAATrCAGTGTGGAGTCGTGCT 
GCTCCAOAGTGTAAAGTGGTCAAATOTCOATTTCCAGTAOTCQAAAATGG AAAACA GATATCAGG 
ATTrGGAAAAAAAAAAAAAATANNNNGNGGGTCC'l-l 1 1"!'! I Tl I'l'l l l l'l 111 IT 1 1 1'NTOCGGGAG 
GGAACTGNNCTTTTATGGGGNGCCCNANGACAIWAAAANC^m'AG<XNNAGNCANAGOG^ 
CAACCAGCAANAAAAGAAAAGGTTTTNGCAGTAATTGNTANAACCNGGCGANAAACNCAGGTGG 
AAAAGGGTTGGGGTGCCCAACCCCTCANNC^a^GGGGAG>mTrnXKX;CCTTOT 
CCGGNANCTOCCCGGGNGGOCCGTn'AAANOGGCNAAATTTCCAACACCNCTTC 

SEQ ID NO: 2772 ACTCTTTGTTTTGGCACACrmCCTOACAAACAGCCAGTGTTCrCAACACAT 
AAATACTAGTCCACGTTAACAACAATAGCATATGAGACCOCTCTCCGTAAAGATGCCAGATTGG 
TGCAAATOGACTOGAAATACCTTGGAGOGTTTCACAAAAATAAGACAAAGGGCAAAGGAACITrG 
(XAAAGGAGATGGAGAGCyUTTCrrrAAAGATAGTGGGAAGGAGGAACCAAAAACTTATAAATT 
CCAGCCrrm-AAAATCGGACGCATlTGCCTOGCOCCTACTGGGTGTCTOCAGCTCAQC^ 
CCACACAGGACACCGACmAAGTGGCTGCCTTTGCAAGGCTGAGAGGCCATGAG 
TGAAGTGTCAGCGCCATCTAGTCGAAACATGGGGCATGGCCGCrrroGATO 
TOAGAGGCAOAGAGACACGCTCTATTTCGCATCTCACAATGCAAGATGAGAGAAANGCrrGTTGA 
GTTirrATTTCATCATCGCCCGTrrAAGGTCAAANGAGATGCCCrTTGGCTT 
ACTGGCAACACCCAAGGACCAGGCTGGOCTTCAmGCACATTCCATTITAGCCCAAOG^ 
NGGAN 

SEQ ID NO: 2773 ACATCTTAGGTTmCn'CCmAGTGTGAAQAGGCGTmCCACCAACCCACA 
GCTCTGCGThFNAGTTTTTACTAGATrOCTOCAAAmCATOOAATCTTTC^ 

ATTTATTGGAGCCAAAAATTCTAGGCGCTAGAATGGGAACAAGGTAGTCAGCCAAGCACAAAAAC 

ATAACAAAACAGGAAACGCCGGACAGACAGATGGATCTANATNGTNGAATAATCANAAACANCA 

AAAAAACCCm^CCATGATGGGNAGGNGGAAACCAAGGCThrrrrcCATCGGAAGACm 

CATCAGCATCACITCnrCCCATNCTTGCAGCTGTCTmCAACTTGCAGTCT 

OGTGCTTGCNATTATtrrCCCTNOGNCATCGCTrWQGGQATGCAATCThn-A 

CTNCCCAACGAAGTCCTTCAACCCAAAACCOGACACCAAA<XAACAAACCCCCGGCGTTAC^^ 
GGCCGGGACAOG 

SEQ ID NO: 2774 A CiTi 1 l l ' l i I ' ll in i ' l ' i iTnTn - i - | - i ' j - i ' i i ri i ri 1 1 ri rnri ' iTi i n i aaa 

AAANAATTTTTAAAA^lTAT^rr^AT^r^r^CCTAGGAAGG^mTACNC^ 

mAAAGGACTGACATANGAAAOCNNCCTCCAGTrAAAAATAOTGNGCCCGGGCrCGQGGGCrCA 
CCarmAATCCCAACACTTTOGGAGGNTCAGGGGGGNGGACAACCTGAGG^^ 
CCANCCTGGCCAANATGGNGAAAGCCCCOTCCCCAATAAAAATACAAAAAAAAATTANCCWGG 
OTOOGGGCCCATGCCTOTANTCCCANaXjmTGOGANCTaAGGC^ 
• AGGCANAGGCTGCAGNGAGCCCAAATNTCACCATTGCACTOTGCCrGGGC^ 
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CCATCTCAAAAAATAAAATATAATTGCTCKiAGGGANATAACAAGACTXjAAAACAGTCC^ 
GGACCACGC 

SEQ ID NO: 2775 ACTCCrGGTOTAnTATGGGACCXACAAATATAOCA'nTACXTCACAAGCACC 
AGCTGCATTTCAGGGCTrTCCATCGATGGGCGTGCCTGTGCCTGCAGCTCCTGGCCrrATAGGAAA 
TGTGATGGGACAGAGTCCAAGCATGATGGTGGGCATGCCCATGCCCAATGGGTTTATGGGAAATG 
CACAAACTGGTGTGATGCCACTranX^GAACGTTGTTGGCCCCCAAGGAGGAATGGTC 
ATGG<jrGCACC<XAGAGTAAGTTTGGCCTGCCGCAAGCTCAACANCCCXAGTGG AGCC ^ 
GATQAATCAGCAGATGGCTGOCATGAGTATCAGTAOTGCAACCCCTACTGCAGGTTTTGGCCAGC 
CXnrCANCACAACAGCAGGATGGTCTGGAAGCTCATCAGGTCAGACTCTCANCACACAACTCTGG 
AAATGAAAACTCCAATACAAGTTTCATCCAGAACTACCACXn"GACATrCCTTGCTOAAACGCATCT 
AGTTCCCCTGTTATTCATATGCATAl 1 1 111 1 1 C H 11 1 ACCCATTTGGTCATATTAAGAATG ATCTG 
ATTGACCGNGTTTGGTCTTGTACCT 

SEQ ID NO: 2776 A ClTn ITi I ITVl H ' l 1 i i 1 U i U i A GAANANATGOGGTTTCACGATGTTGQC 
TAGGATGGTCTCGATCTCTGGTCANAGTCnrrCTGTAAATATCCTrGGTAAANAANCAAT^ 
ACTGAAACNGCCGCAAATA>OTATATGNATAAACCTGCNNGANGAGGAGANAGAANGGANAAAA 
CAATTAT^rrrCCTGAOGTCTNAGCCrrcNATCAGGAGATTATTGAAGTANATCC^ 
AATGCTGAAGCTTITGGACTT(X;GNAGTCTGTCCXACCTTCAGGTCACrCAGCCTA 
OAATTTCAAAACGCCnXXjGGGACCrcrmGAArrTTTI^ 
TGGOACAACGAAAAAAATTAG 

SEQ ID NO: 2777 ACXTmATrGGTATAAGAACGTAAGrrCCAGATTAACCATGTCATTGNTTCA 
TirrcACCATGOATTTTrTTTCACAAACTCXnTrGJ^ 

tacattaaaactacti^caacccacaaagaccccacitactactaattt^ 

actcgtitnatctaatattanaaaatcancnttitgcx:agcagttnaaaanacaacc^ 

tcaaatnactttccancantnccggaaaagaanacc>icccathn'ccaaanc^ 

natttctantttnccaaaaagta^mcagac^gaa^™canactctacntt^^ 

m*attcmgancanactootaancx>ioggtakraacotcnantttcnnt 

ACTGG 

SEQ ID NO: 2778 A CITI t rn 1 ITl J 1 1 1 1 1 i 1 1 1 1 1 i GGATAAAANATGTCTTTATAAAGTrTTrr 
TCAANACnCATTCTAAAfAafCAOAATAAAAAATGGNGTCAGCTCACrrGTAANACACCAACCA 
NATTrTCCTTATACrGTCrCAAAATTTAAAGATCAATITCCCCANAGGC^ 

AATCGCCCTTmTGAGGATGGGAAAGGAAGGGTTGGGCAGGA TGGA ATATTAAATTGTAACATG 

ATAAACATGCAANACTGTTATCCAATCTAKATAATTTATATACATTTTGATGACTTAGGAAAACAA 

AGCAATC^mTGTGACAAG<XTAAAAAG<^TGACATATTTAACATAC1TAGGAACl^|■^ 

OGNOGGAATrCTCTAATTGTATCATGTGGGCCnTnXiAAAQTAACAAACANAAGOCCAQTCTGTrG 

CAAGrmSCTGCroAAC^TCACATTCCACCCrAANAAAACACAAGGTGG 

ATACCTTACCTTANCACAGAAGGAAAAAGTATGTCAGNGCA AAGTA TGGACTAAACTGCTTrCAN 

GAAAAAAGTTGTAAAAAATTGATACAGGTKKjAAAAGOGAATmCCTTNCCGGCTT^ 

CCAATTTAANGG 

SEQ ID NO: 2779 ACXK;G0aAGCGGCAGAGACATrGTTCTTG0COOCTaXTACQGT0CCX)TGTG 
TGCGTGAGAOAAGACCAGTCTTTCCTCTAGCATTTGACATTGTGTAGCAAAGAAAT^ 
AAGCCCAGTCCG<nX3OTGTAGG0CGGGAGTTTGT0AGGCAATATTATACTTTGCTGAATAAAGCT 
CCGGAATATTTACACAGGTTTTATGGCAGGAATTCTKXn'ATGTrCATGGTGGAGTAGATGCTAGT 
GOAAAGCCrcAGGAAOCTGTTTATGGCCAAAATGATATACACCACAAAGTATrATCTCTGAACTT 
CAGTOAATGTCATACTAAAATTCGTCATQTGGATOCrCATGCAACCTTGAGTOATOG AOTAG 
CCAGGTCATGGCTTTGCTGTCTAACAGTGGACAACCAGAAAGAAAGTITATGCAAACXnTrGT^ 
GGCTCCTGAAGGATCTGTTCCAAATAAATTTTATGTTCACAATGATATGTTrCGTTATGAAGA 
AGTGTTTGGTGATrCTGAGCCTGAACrrGATGAAGAATCAGAAGATGAAGTNAAAGAGGAACAAG 
AAGAAAGACACCATCTCCTGAACCTOTGCAAGAAAATGCTACCAGNGGTACTATGAANCnXIACCT 
GTGACTAATGGCATA 

SEQ ED NO; 2780 ACTGCAAGTCAAGGGGACTCTTTGCAGGCGrrGTCTTTAGAAGGGA GCTGT TT 
GATTGAAAGGAAAGAAACTAATAGAAAATmATTGTCAAGATATCCGAGCTT ATGAC ATTTTATT 
TGGAGATACACCGCGGCCTQCTCAAOCCGAAGATCTTTATGAAATTCrTGATrCCm 
OTATGAAAATOAAGGACAACGAATCAATGCAAGAAAAGCAGCAAGGGAGCAGAGGAAGTCTTCT 
GCTAAAGAATTACXrrCCAAAGCCATTGTCAAGACCACAGCAGTCA TCTGC ACCAGTCCAGCTGAA 
CTCTGGCTCTCAAAGTAACAGAAATGAATATAAGCTCrATCCrGGACTrrCCAGCTATCATGAGAG 
AGTTGGCAAmGAATCAACCCATAGAAGTGACAGCGCTGTATrCATTTGAAGGACAGCAGCCTG 
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GGGATrmAATTTTCAACCTGGAGACAGAATCACAGrrATATCAAAAACAGATTCACATTTTG 
OGTOGOAAGOAAAACTTCGAGGTCAAACTGGCATTTTTCCACCAACTACOTAACCATGAATTAAA 
GCCGTATACTA l I 'i- iCl ICl 1 l OAOAATTACAAAAAAATrATTTCrACACTGACAGGATrTACTAGT 
TAAGCATGTTTAAT 

SEQ ID NO : 27 8 1 AATAAAGAACCTCTATCAOTGAGACrrCTC ATTH ATAGC AAATAC ATTTrrO 
CAGCTTAAATITIXrrroAATTCATATACGCTTCTGTCATTTAAACAAAC^ 

CTCTATATATTTAAGTAACAAATTTGACAAAATACATAmATACATATATAGATCTCTAATATAA 

ATATTAAATTTGAAAAAATCAAATQTQAAGCAGAAACTGCrATACAAGTATATTaTATAATATTTA 

TTTTATACATTAAAGTATTTGGTTGAATATACTTCAATTAGGTn'CTAAAAAACACCArrATCTGCT 

TCrrAGTAATTGCX}ACATrcrrGAAAAOCATOTOAAACGGGTATAAACTTCAACTCTOTC 

TCAGAATTCCrGTrrcTCTCCTCAAACrmATCTTCCTAAAGCATCTrGCCA 

AAGGAACATTTACAGAGCACTATAAACATGTCTTTGGACAGTAAAACAGTATTTATTCTTCTACAC 

TCTTGATTrTCCATCATATCTTXXTCAAGGCAATGGTTaTrGGCCCAAT^ 

AAATTCAAAACACAACCTTTTAAC 

SEQ ID NO: 2782 ACACAGGTATrrrCAAAGGAAACAAGTCATCITAAAGTAATATrrTTCTATAT 
GCTAATTGATACATCTTTATAGCAAATTGAAAATrCrrcAGTAAACrGAAAGTATGCTTAACOACAA 
AATAAATACAGCATATATGGTTAACATATACATTTCTrAGTGTAAAGGCAGCAGTGAATTrGTGTC 
TCACAATAAATCTOTAAATCCAGTTGClll'CrilCrOGAATrrTATATAGTOTCTC AOCA TC 
CAATGCTGGAAATGTCrnTITGGCATCAATCTATGCACAAATTO 

TOACATGTAACrn'i' 1 1 1 AACTITTCCAGAAAAATATGGAAACTITATCAA CCAC TTATTAACTGAA 

CAAAAAGTTAQATTACTACCAAATGCTCITITAATTrrGCTCTAACAGATGTm 

CATCGCrCATGTnTTGAGGATAACTGCATACAACACACTAGATGATTTCAAACGATGCATaTAG 

TATCCGAATCATTTGGCCATCCTTAGTATCCAAAATAAAATCAGTAOAAATAAAAGTAATATAATT 

TTCAAAQAATTCATACATACTAGAAGTCrrAOGAAAAGCAGCrTCTAAATGCAAGGACTAGGAGG 

OTTGCCCA 

SEQ ID NO: 2783 ACTATAGGAATACATTAAGTAATTCAATGGAAATATACCrTGCTAATATTATA 
ATGGTATAGCTCrGTTAATGAATTCTCTTAGAAACATTATACTTAATGTATTCTGTTGCTGTATGT^ 
TCATTTTAATTOAOCATTAAOOGAATGCAOCATTTAAATCAOAACrrCTQCCAATGCTm 
AGGCGTGTTGTCATTTTTGTCnXTATGAAATTTTTGTCCCAAGAAAGGTAG^^ 
TAACAGATTAAGTTGGTGTAGTGTATrCTTGTTTATCAAAATACTAATAAAGCTITGGGAT^ 
ATTGGTAAATATTCATOATGTGTCAAAAATCATGATACATACrOT 

SEQ ID NO: 2784 ACTAAATAAGACCATGOATGTTAGTAAACTCTCTGCTGAAAAAGTGGAAATT 

gcaacactaacaagagagaatggaaagacagtaatcagagttctcaaacaaaaagaagtggagc 
agttgatcaaaaaacacgaggaagaagaa gcca aagctgagcgtgagaagaaagaaaaagaac 
agaaagaaaaggataaatagaatcagagattitamctcattrggggcaccatrtcagtgtaaa 

AOCAGTCCTACrCTTCCACACTAGOAAGGCmACrrnTn-AACTOGTGCAGT^ 
CATTACATACTGAATTGGGTCCTTGTCATITCTGTCCAATTGAATACTrrAT^^ 
TACCCTrCATOGACOTCTTAATCTTCGACACACATCCCCTrrnT^ 
AAATGAANGAAAAAAA 

SEQ ID NO: 2785 A(XV^TAT7X:ACAA0CAAAACTATt:ACTA0TATaXATTAATrTAAATrCGCC 
ATQAAAATGGTGCTCAATCATGAATTATGCCAGGAAAAGAAAAAGGAGATTGGGATCACTTATTG 
ACTAAGACATGGAAAGCKCTCCAGGTGACATTACACAGACTGCAATGGGTGTTAATCCATGTGTT 
CTGCAACTCTrGCTCTATGTGGAGTCTAGGTAAGGATmGACCTAAATTCTCTA AAACAAG ATTA 
GAGATGCTCCTAGATATATGTGAAACCCTTATTAAATCTCTCTCrCAATTTCTC^ 
TTGCTGTCACnATTTATCAAATAAAAATITrTAGCITATAATGTAATrGTTATTTrC^ 
TATTTAAAAOCTTAATTATAAAAAArriTrrATGGGCCATATAGGTAOTAAATGTTAAATAAC 
ATATAATTTGTATAA Cn i - 1 11 Tl 1 1 l GACAGAGTCrCGCTCTGTGGCCX:ANGCTGGAACGCAGTGG 
TGCAATCTCAGCrCACTGCAACCrCCACCTCCCAGGriTCAAGTTATGTTACCATCTATTAAGGGAC 
TAAATTGNGGCCCCTCTAAAATTCCATGTGTTGAAACACTAACCCTAGGGTGACCATATATGGACA 
TAANCTTTT 

SEQ ID NO: 2786 ACX3CGGGTAAAACAGGGAAGAACATCATCTGGGTTCTTTGCGCAGTAGCAGC 
CACTCnXnXrrCCCACATGATGCrGGCCTTTACGATGTGGAACXrCT^ 
GGACTCTCXrTACrCATTGCITGCCTGTGCATTQTGGCCAATGGTGGCATTTGTAGTTCCT^ 
AGCTGGGAACTGCATATGGCTTCATGCAGTCOVrit^GAATCTTGGGTTGOCCATCAT^ 
TrccrGGTATGATACTGGATTCTCGGGGGTATTTGTTTTrGGAAGTGTTCT^ 
TTTGTCACrmATCTOTGGriCrrACnTrArrraGrrGAATCOT 
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TCTGCAAGACAAAGGGMGAAATAAAATrrrCCCATACTGAATGAGAAGTrAAAATGAATGTGTC 

ATGAGAATGGGCTTAACACATCGTTGOTrrGAAAACTTCCATITrrAAAAA 

ATTANAAAAAATAATGGACTGGAAAGTTATATTTATATCX:AAATATACCTATTrC AAAGT GTATn 

GTGAGGGCCnXjTTTTAACCTGGaNCTmGNATTGGGNGNTOCTAAAGAATT^ 

CTAATTC 

SEQ ID NO: 2787 ACAGCCAACGGTTTCCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTOOTTCAGGOCCAOTGCATATTAOTGGACAGCACTTAOTAGCTOTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTCAAACTCTTAAGTATATCTGGAAAGCGGTCrrG 
CCCOXjGAGGTGOTAOCAAGGTTCCACAGAAAAAAGTAAAACTT GCTG CTGATGAAGATGATGAC 

gatgatgatgaagaggatgatgatoaagatgatgatgatgatgattttgatgatgaggaagctga 

agaaaaagcgccagtgaagaaatctatacgagatactccagccaaaaatgcacaaaagtcaaat 

caoaatggaaaaoactcaaaacxiatcatcaacaccaagatcaaaagoacaaoaata^^ 

aacaggaaaaaactcctaaaacaccaaaaggacctagttctgtagaagacattaaagcaaaaat 

gcaagcaagtatagaaaaaggtgotixrixntcccaaagtcgaaccaaatrcatcaattatgtgaa 

gaattgcttccgoatgactgccaagangctattcaagatctctggcagtggaggaagkrrc^ 

naaaatagtttaaacaattt 

SEQ ID NO: 2788 A ClTlTn - l - ri ' iTl I ' l ri I ' l l n I CNAAAGGrmTNAATATCCTGCAGGTAATA 
ACACTGATTTmn-AATACTCAAAAACATm-ACTTANCAGTTGTGATACrAA 
TANTGTTATNCAAAT>rrAANATNCTACTAANTOCOTAGTAAAAATAATNGCATGATTCCT 
TATATTOU^GGTATAAACATGACTGATITCOCrQATNCTACANAATAAAAAAAATAAAGCTGNTA 
TGTAAAAAATTAAANAATATCTCAATTANATATTTTNGTTCCCAATCTmTT^^ 
AAATAATATANAATATOCTATCAATATQTICnTCATATaAAGNOAAAAAAANGOOATTTAAGTA 
GNGANATAATTTCTATnTITACTITmAAAAAAATANACAGGGTCTTC^ 
GTCTNAAACTCCNGGGCTCCAGNGTGCTTCCTGCCTCAGCCTCCCAANGNGC^^ 
ATGANCCATTGNCCTGGCCTGNGGGATCATTCTATGOGTmATATCAGNATTAATTANTAAGTCA 
ATCAATrAATTAACXrrTAATTATTTrrmn'AAATAAA>n^AGNTGGAT^ 
AATGCNTN 

SEQ ID NO: 2789 ACAAGTATTCrrGATCTCAGAGACAAGTrcAATGAATCTCTTCAAGTGAATAC 
TACTGCrCTCATCCCAAAGGAAGCCAACTCTGAGGAAGTCTTTITGTTTAAACC^ 
TTTTOAAAATGGCACAGATCTITTCATTGCTArrCAGGCTGTTGATAAOQTCOATC^ 
AATATCCAACATrGCACGAGTATXrmGTTTATTCCTCCACAGACTCCGCCAGAGACACCT AGTCC 
TGATGAAACGTCTGCTTCTTGTCCTAATATTCATATCAACAGCACCATTCCTGGCA TrCACA rnTA 
AAAATTATGTGGAAGTGGATAGGAGAACTGCAGCTGTCAATAGCCTAGGGCTGAAi'il 1 IGTCAG 
ATAAATAAAATAAATCATTCATCCAAAAAAAAAAAAAAAAAAAAAAAGT 

SEQ ID NO: 2790 ACAATAGOTOAQAAAAATCTGCAOTATAGAAGAATAGAGGCAGAOAAATAT 
GAAGGACTAAGGAGAAGGOTATCAGAGTAAATAGGATITGTAATAGCAAACCCAGAAGTTAAGT 
AGAAAG<Xn"GTAGCTGTGCATGCTrcATTTATCCAACCTTAATACCAGGCAAC^ 
AGTCAGATTTGCTATTGTrrGGGGAAATmCAGTAAGTTGAATTAAGAACCATCTAT^^ 
TCAGGGGTAATGACTGGGATCAAGAGCTCACAACGGCCAATAAATGTTATTGCTATATCCAATCC 
AAGAGGTGGGTGCAACTAGAATACAAAAAGOACAAAGCACCTAGTGATGACrcCTTCGAAGTC^^ 
CATTAAAAATCAAGTAATTGCTCATTAAGTATITCAGTATTTGAACTAAATCATTTGAGAACATO 
TGGCAGCTAAGAGATAAAAGCAAAGGGGAGTCTGTCAAAACAAAGAAGATCAAAAGTTTAGTAA 
GTGAAAAGAGTTTAATGAAGCCrCAAGGGTTTAAGGQAAATTAAATGGAACTTTrAATCCTAATT 
GCmrCATCTGACATGCTTAAAGGAATCCTmAAACTGATAAGAACATGAATAGGAAAGAA^ 
CCTGCTAAAATCTCTTAAA 

SEQ ID NO: 279 1 ACAAATGAACACAGrrTATATTCTAATTCTTACTGCAGCTCATTTTAATTTTTA 
GGATGCAAGCACAATTTAGTATTCAACTGAGTAGCAACATATTCAACTTGATCCCATTGTCT^ 
TTACTCTTGCCCATGAAAAATOTICATAAATAAACAOGGTATITOA CCAT ATGA 
CAGCACATTACTTTATGAGAAACTACCTACTGATATGGGCTTGAAATTT^ 
ATTrCTACACTAOAAGTAATTTCAAAATTOTTGOTTmATAAACAOGAAAAAOGT^ 
GACTTTTAAGCATCTCTGAAATAAAAAACTTCTTmACAGACAAGCAT^^ 
ACAACAGTGTGTATATATGTAATATATATATAGTAAAATGAAATTTAAATATOAAGCCA AACl 1 1 1 
TAAAATTAGAAACTCAAATGGTTATACTGATTAGTGTCTAGCCTAGAGTGG TAACCATC CT 
AATTO^GTTATGAAATACATTAmATAATGCATTAGCTGTATTAGCTGTTGCI'i'i'rri'GATGGTCA 
GGATAACTATGTTACCmATTTCTGOCATTTAATTAAATAOCTCGA0GTATTAAAAOCCXX:CTC^ 
TTCAANAAA 
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SEQ ID NO: 2792 ACAGATCCACrrAGTCATTTTCTCCTTTTmAAOAACCATm 
TTAAACTCACGATACXIAOTTATCTGTTAATCAAAATTaCATTTTACAATn 

CTATOTCTACAGCATACCTTATrAGGTATAAAACCTACTGCAACrrAGAAAAAGGAAAGAAAAAA 

GAAAACTirrcCAACTOCTGCATrAAQATAQOGTGGATTTrATaTGCITTTTT^ 

TTCnTrCCTCACirrTACCnTTACAGCGTATTAOTAGTGAACATTAC^ 

AATATTnATTGAGGGCCTATGTGCTAAAAACTATGCATATCTATATATTGGCCAATTATCTT^ 

AATTTACCTTTTGAAATTGCATGTTrATCATATATCCTrAAGTGGACACATAC^ 

GTGCCTCTCAGTmATTGAAAAGCrGCCCCACAGaX:ATGTCrCTTGTCTC^^ 

GAAGTOAOCrCTCAACCACA0ATAOCT0T0GCTTCTCAAQAAGCAGCTCATTG<XAA 

TGAGAGGGGACCTGCrruCTGNGGTGGGTTGCCTAGCCCAAATGAGCATTTACCTACCACCTTCCC 

ATTT 

SEQ ID NO: 2793 accagotcccccttcaccatcctgggagaaggatggaggacagaggaaagg 

GAGATGCAGCCAGTTCAGATCTTCCCGGGAAACGTGCCTTCAGCOC^^ 

ACCCAGGATGTGGGCCAGAGGOACTGGTACATAATOTATTTATATQTTAATTTGrrATGTATATAG 
ATGTGCAAGTCirGTCAGAATTGGCCTCAGTGTAGTrAAAGGGCAGAAGGGGAAGATACTGACTA 
GTCATAGAAATACCTCATTCWCCTGTGGGAAGAGAAGGGAAGCCTCTTCAGGGTGAGTGAATOGC 
AAAGCGOTTGOTCTQOCnXXrrCXrrrCCCCTGTGGTCTTGGAA^ 

GATGGAGGCX;GAGCCAATAGACTGAAGAGA(rACAGCAATTGGCTCCTCATCTAGAGATrTTC^ 
GGCAGTArrCCATGGOATGTTAAGCAAAOOAAACCAAAGGAATamTCAAATGGACTCATGGCr 
TANAAAKCnTATTCTTAGGGCA 

SEQ ID NO: 2794 ACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCOGCAOOAA 
AAAACAAAAACAOCAAOmACATGATCCCTOTAACAGCCATGGTCTCAAACTCAGATGCTTCCTC 
CATCTGCX:AAGTGTGTTCrGGATA(>GAGCACATCGTGGCTTCTGGGGTCAC^ 
GTGGGTCCACAGAGCACTCATCTGGCTOGGCrATGGTGGTGGTGGCTCTACTCAAOAAGCAAAG 
AGTTACCAGCACATTCAAACAGTGTATTGAACATCnTTAAATATCAAAGTGAGAAACAAGAAGG 
CAACATAATAATGTTATCAGAAAGATCTTAGGAAGTAAGGACAGCTGTOTAAAG<n^ 
AAAOTAGCTTGCCAGCTTCATTTCTTTGGTTTCTTGGGTAGTGGGOGCCGOAACAGCAA^^ 
GGTTCTGGTrCATGGATCATATAATGGACCCATCCCTGACrCTGCTGAACGCCAAGATTCCTCCAT 
TCAGATTCAGACATCAGATGGGTmAOGGACCAGCrrGOCTATOTCCTTGGGCAGCATGACATGT 
CGATACTCAAACrcCTCGTCGNCGTATTTGTCCCAATAGTAAATTTGTTGGTGCGACA^ 
COGTTTGCTAGCCCTT 

SEQ ID NO: 2795 ACii'i"rrii"n"i'i"i'i"rrrrrrrri"rACATGANAACCAAGGAcrrri'i ATAcrTAN 

AACAC^TACTGAAAAATGAGGAGGCAGTAGTTAAGCTACAANAAGCACCANATGCCTTCTGGGAT 

GAGTTGCCTTCCTTGCCTCCACAACAGCAGCCANACTOQACCTCrGQACAACATGATAAGT^ 

CAATCACACTTCCTAGAGTAGGACCAAGGGTAGGAGGCCACCAGGCrCAGCTTACAAGCCCC^ 

TAATCGTCCATCAGGGAGCCAATAGCAAAAATGTCATTCTTGTCCAC\CG^ 

AAGCGGATCTCATCX3TCrrOCTGAATCACAATATCCTCATCCATTGTCTTC 

GAGTTAOGATCAAACTCXATCTCTGAAGGGATGGAATGTCGAGAGATGAAGCAAGACATGGGCCC 
AATTrCTOTGAAGAGTCCAACCTTGTTGACCTGAOTGACAACAGCATCCACOACCTCCCCTTTAAA 
TGGOCGGAAAACAATGGCCrrGT 

SEQ ID NO: 2796 ACAAGGGCATAmAACGGATrCTCAGTTACACTTAAAGAGGATGGTGTTCG 
TGGTrrGGCTAAAGGATGGGCTCCGACTTrCCTrGGCTACTCCATGCAGGGACTCTGCAAOm 
CTTrrATGAAGTCTTTAAAGTCTTGTATAGCAATATGCrroGAGAGGAGAATAOT 
CACATCACTATATrrGGCTGCCTCTGCXAGTGCTGAATTCTTTGCTGACATrOC^ 
GAAQCTGCTAAGGTTCGAATTCAAACCCAGCCAGGTTATGCCAACACTTTGAGGGATGCAGCTCC 
CAAAATGTATAAGGAAGAAGGCCTAAAAGCATTCTACAAGGGGGTrcCTCCTCTCTGGATG^ 
AGATACCATACACCATQATaAAOTTCGCCTOCnTGAACGT 

SEQ ID NO: 2797 ACTGCCACCAGATirmATrACATCATrTGAAAATTAGCAGTATGCTTAATG 
AAAATTTGrrCAGGTATAAATGAGCAGTrAAGATATAAACAArrTATGCATGCrGTOACTTAOTCT 
ATGOATTTATTCCAAAATTGCITAGrrCAOCATGCAGTGTCTGTATTriTATATATC^ 
ACATAATGArrATAATACATAATAAGAATGAGGTGGTArrACATTATTCXTAATAATAGGGATAAT 
GCTGTTTATTGTCAAOAAAAAGTAAAATCGTTCTCTTCAATTAATGGCCC^^ 
GCrrTTATrrrCCCTGATATTATTTCTATTTAATACTCTTTTC^^ 

I'l'r i 'C t'l'l ArrGTCXnTCATAGCAGGCCAAGTATrGCCrCIXn'GCAATAGACAOCTACTOTCAATAC 

ATGCTGTAATTTGACATTCTGGGTCACAGATATAAGGTATrrAAAATCTArrTATGCTTrATAGA 

AAACCAGACATTAAAACTTCATGCACrACTTATTTCX/AATrACrGCCTCGOC 
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seq id no: 2798 acao rtgcctgaagttactataaatgaagaaactgcntagcagaagttaat 
traaagaagaaaagttamgaatattagaactcatccacrrtgcaacttcctt^^ 
gacacmgcttatagttgaccctactggagaggaggaacatctggcaacaggaaccttaacaat 
agtaatggatgaggaaggcaaactcrgttgtcntcacaaaocagatggaagtggoctaact^ 
ctaaacttcaggactgtatgagccxiagcagttacaagacacaaaqaagttaaaaaactgatggat 
gaagtaattaagaotatoaaaoccaaataaacag<x:accacattttcaaaacagatttotaaaaa 
trgrrattrgttaa(>ctgtgcacaaacgttitatactaaataaatatcaaacta^ 

AAAAAA 

SEQ ID NO: 2799 ACAAGCrrrrOTCCAAAATGGCACAGTOAGCACAAATGAGTTCCTGTGTGAT 

aaagacaaaacttx:aacagtggcacccaccatacacaccactgtgccatctcctactacaacacc 

tactccaaaggaaaaaccagaagctggaacctattcagttaataatggcaatgatac^ 

tggctaccatxxiggctgcagcroaacatcactcaggataaggttgctrcagttattaacatcaacc 

ccaatacaactcactccacaggcagctgctottctcacactgctctam 

ccattaagtatctagactttgtctnx)ctx)taaaaaatgaaaaccoattntatcto 

acatcagcatgtatttggttaatggctccgttttcagcarrgcaaataacaatctcagctac^ 

atgccccctggqaaq1tcttatat0t0caacaaaganca 

seq ed no: 2800 acocgggoagocattgaoocagc cagc qcaooggcrrctgctgaoogqgca 
ggcggagcttgaggaaaccgcagataaoi 1 lu i ictctttgaaag atagag attaat acaa ctac 
ttaaaaaatatagtcaataggtractaagatattgcitagcgttaao'lm'rriaacgtaatntaat 
agcitaagattttaagagaaaatatgaagacttagaagagtagcatgaggaaggaaaagataaa 
aggttrctaaaacatgacggaggttgagatgaagcttctrcatgoagtaaaaaatgt ama aaa 
gaaaattgaqagaaaggactacagagccccgaarraataccaatagaaoggcaatocnriagat 
taaaatgaaggtxmcttaaacagcttaaagtitagtttaaaagttgtaggtgattaaaataatttg 

AAGGCCGATCirrTAAAAAGAGATrAAACCGAANGGTGATTAAAAGACCTTGAAATCCATGACNC 

AGGGAOAATTGCGTCATrTTAAAGCCTANTTACNCATTrACTTAACCCAA^ 

ATTAATTGGGAGNGGTNGGATGAAACAATTTGGANAAAATA 

SEQ ID NO: 280 1 ACCATOACCCTACATAAOGCTGQATGGCACCTCAGGCTGAGGGCCCCAATGT 

atgtgtggctgtgogtgtgggtgggagtgtgtctgctgagtaagoaacacgattttcaagattcta 
aagctcaattcaagtgacacattaatgataaactcagatctgatcaagagtcc 

GTCCTTGCrrrGGGGGGTGTCCTGACAACrrAGCTCAGGTGCCTrACATCT^^ 
TGCATATGAGCCTGCCCTCACTCCCTCTCCAGAATCCCmGCACCTGAGACCCTACTGAAGTGGC 
TOGTAGAAAAAGOOGCCTOAOTGOAGOATTATCAGTATCACGATTTGCAGOATTCCOT^ 
TrCATTCTGGAAACTTTTG'rTAGGGCTGCITn rTT^^ 

AATrrGAATGTATTroATTTATAACri-ri"ri-rri"l-rJ'riGGOTrAAAAGATGGTTOANCATTTAAAA 
TGGAAAATTTCIXXnTGGTTIXiCTAGTATCTTGGGTGTArrCTC^ 

ATCA^^^GAAAGG^TAAAAAANCCAAGGTGGCCATNTTATGCTGGTGGTTAAGGCNANGGCCT^^ 
CACCACTGGCCC 

SEQ ID NO: 2802 ACGCATACTAGCAAAGGTAATGGTGATCTAGCAAACAAAATT GGTTTC TGCA 
GTTAGAAGTGAGCAGGAGCACTrGTArrATAGTATTTAAATAATOTOTT 
CGAGTAACCCCTCXlAGATTTTGCCTrmATTATTGAGOCTGGCTTrA i ri'I C'J 1 CTACI in i J I CC 
CGTTTTATAGCAGTTAATTATTTrrGTGATTATTATGCAAGAAGCATTGCCCnrrOAGTTAAACT^ 
ATTOmCATAAGCAOCTATTAAAATAACTOAGCATIXnTITATOAACATACACTAATCTGAGA 
CraAAAAGCTTTGCAACTAAAAAGCAAAACAACCTACATTAGTCATCT 
TrQAQTTGATTrmATGGTGCCTCTTTTAGCnTGGAATATTACGTTrAC^ 
CCTTrrAAAGGGTOnTAAAArrAAAGrrcAGAATGTGAATCCCTrTGACATCT 
AGGACCrTTTTGGTTGTGATrACTGTmCAATAOJATTGTATTAA 
TTAAAATGGAGGTCATAGGAGTCCCOGOAOAAATOGCTCTCCTOTTTCT^^ 
GTCACCC 

SEQ ID NO: 2803 ACVW^CCAAATGTrroTTACTATAACTTCTOCATCACAATTA AAA^ 

AGTTTTTTAAAAACAGTCAACTCAATCAAAACCCACTACTTCAGAATCAATAGL'rivrriOAAOCC 

ACAGTAACACTTAAATATGGTTAAGACTCGAATGCAGAAArrroGTTGGTTGGAAAGCT 

ACTTCCAACTTGCTCAAATAGAATTACAAAAAGGCAAAATTCTOTTTrrCACAGAG^^ 

CTGGAATCACCAACACTGGACAGCTGTTAGAGTATTTAGAGTCCTGAGATAACAAGGAATCCAGG 

CATCCTTTAGACAGTCTTCTOTTGTCCTTTCTrCCCAATCAGAGATTO 

CCACCACCAGCAATTGTAGCCTTGATGAGAGAATCCAATICTrCATCTCCACOAATAGCAAOTrGC 
AAGTCACOAGGGGTAATACGCTTTACCTTTAAGTCrmGATGCATTTCCTGCCA 
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SEQ DD NO: 2804 A CiTl 1 1 1 1 1 1 ' l ' i ' i 1 1 11 H 1 1 1 1 1 lA AAAATCTGCCTATTTTATTTC CATTA NA 
AAAATATTATCTATATAACATTriXnTCCACTATCTTCTCCTTGATCTCAAATTTAAA 
rrTACTCAAGTAAAAGCANAATCACATAACGGACATCAAAACTVV^ 

AATrAAATATOTTCTTGATTATTTCTCAGOAATAGTAACtCTTCrrrCCTACCT^^ l 

(jmACTGAGTAACTATGTAANGGGTATCTCTTTCCTATATTCAGTAATACAG 

TAAmAAAAAAGTAACTGGATTCCTTCTCTAATATTCATGTTCAACTCTCCCTATTACATG 

TCCATAATAGCTTCAGATATlTrcATCAAACTCACACrGCATCAAlTGTGAAAATTAAAAG^ 

TrAAGATGTTCATCAATTGTAAAAATTAAAATGTTAACAATTCCAGACTACACAGCACTGGGCCCC 

TTATAGTGATGCCGTTGGCAGNGGCCTTGGGATAGCTACTGGTTCAATTTCAATTCGGGTGCC^ 

GGGGAAAACAACCAACTTGGGGAAANC^WCTTTANCCAGGAAAATTACCCTTOGAAANA^ 

ONAN 

SEQ ID NO: 2805 ACCGTCCAGOiAGTTOCAGATCTACACTTGGATGGATOCAACTrTG AAAGA A 
CTGACAAGCTTAGTAAAAGAAGTCrAOOCAGAAG<n'AQAAAGAAGGGCA(^ACrrcAATTTT 
AATCGTITTTACAOATOTTAAAAGACCTGGCTATCGAGTTAAGGAGATTGGCAGCACCATGTCTGG 
CAGAAAGGGGACTCATGATTCCATGACCCTGCAGTCGCAGAAGTTCCAOATAOGAGATTACTTGG 
ACATAGCAATTAOCXX:TCCAAATtXKK}CACCA0CTCOT 

TATTTACTATTTGTTGAAmATTTTTCCGTCAGTTATGTAAAATAAACATACTCTT^^ 
GATTATTGCCATTAAGCCTTTAAATTCrAAACAAATTATAATGCATCATCTATTTAGGAGTTAAGA ' 
TTTGOATGTGCTATTGTATGATTACOAATAGTCTCTATGTTTCAAGOCCTTCrrc 
AAAGTGCTCTTANCArrCTGTGTAAAACTGACCrCGGCXrGNGACCACACTAA 

SEQ ID NO: 2806 ACTGTrATTAAAAGCATATTGTATTATAGAGCTATTCAGATATTTTAAATATA 
AAGATGTATTGTITCCGTAATATAOACQTATOOAATATATTTAOOTAATAGATGTATrACrTGGAA 
AGTTCTGCTITGACAAACTGACAAAGTCTAAATGAGCACATGTATCCCAGTGAGCAGTAAATCAA 
TGGAACATCCCAAGAAGAGGATAAGGATGCrrAAAATGGAAATCATTCTCCAACGATATACAAAT 
TGGACTrGTTCAACTGCrGGATATATGCTACCAATAACX;CCAGCOCCAACT^^ 
CAAGCTCCTAAGAGTTCTTAAmATAACTAATTTTAAAAGAGAAGmCTTTT^ 
OGGAATAATCATTCArrAAAAAAAATGTATTOTGGmATGCGAACAOACCAACCTGGCATTCAGT 
TGGCCTCTCCTTGAGGTGGGCACAGCCTGGCAGTGTGGCCAGGGGTGGCCATGTAAGTCCCATCA 
OGACOTAGTCATGCCTCCrGCATTrCGCTACCCGAGmAGTAACAGTGCAGATTCCACGTTm 
TTCCGATACTCTTGAGAAGTGCCTGATGTTGATGTACCGGCCCCOGCGGGCCGCrCGNAAGGGCN 
AATTTCCCACNCAC 

SEQ ID NO: 2807 ACAmACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTC 
ACITOGGCTAAAATGCAGAAATGCATGCTGTGAGCGTTGGTATTTCACATKXATGGAGC^ 
TTO^GGACCTCTTCCCATTGAAGCTATAATTTAmGGACCAAGGAAGCCCTGAAATGAATO 
AATTAATATTCATCGCACTTCTTCTGTGOAAGGACTrraTGAAGGAATT^^ 
TGTTGCTATCTGGOTTGGCACTTGTTCAGATTACCCAAAAGGAGAT GCTT CTACTGGATGG AATTC 
AGTTTCTCGCATCATrATTGAAGAACTACCAAAATAAATGCmAAT^^ 
TTTTATTATCCXrrrGGAATGGTTCACTTAAATGACATmAAATAAGT rrATGT ATACA^ 
AAAAGCAAAGCTAAATATGTTTACAGACCAA AGTGTGAT TTCACACTGTrm AAATC TAGCATTA 
TTCATTTTGCTTCAATCAAAAGTGGTrTCAATA r L'l l" I'l 11 AGT T GGTTAN AATACTTTCrTC ATAGT 
CCATTCTCTCACCTATAAmGGAATATTGGTGGGGGCCrmGGTTTTTCTCnAAA^ 
TTAA 

SEQ ID NO: 2808 accatcgcacacactgttgacgtcattggaaagaaggaagacgactttgtct 

GCTOC CM ' rC ' ri ' ll GAGTGGCAAGCCACTGCACTGGACCCATCTCTGCn'Al'riUVri'll'rC^ 

TTTCAAGGATGACCTCACTTCTOCAATGOTTTrOAAGAAATTCAOTOAAOTAACAAA™ 

GGAAACATATTTCAGATGGGTAAACCACAAGAACCTTAATGGGGGGCAGTAGTGTCGTC 

AAGQAAGTCTIXrmATCCTTrCGTGCXrrCCACATTAOATAGATCCCTGC CACCA GCACCCATC 

GCCACCAGCAGAGACAGCAGGAGGAGAGGCAGCCAGCCTCCCGGCTTGCTriTGTrGTTAT^^ 

AGGGAAAGGGACGCCTOmGTGGCAGAGCACAACTGTTCCmATGTCGGATGCAGTCGCTGCC 

ACAAGTAAGGAAAATATGGAGTCAGCTGCCCGTANCACCTTCACTATCCCCATCACTGOANTC 

AOTGAAACTAOCGTTTGTTTCTThmjGTGTGGCTCAAANACCNGANAANANCCC^ 

OCTNTGTTGGATAATANCCATGTATCr 

SEQ ID NO: 2809 ACCTAGAAGAOAGGCGGGTCAAAGAAGTAGTGAAQAAGCATrCTCAGTTCAT 
AGGCTATCCCATCACCCmATrrGGAGAAGGAAOGAGAGAAGGAAATrAGTGATGATGAGGCAG 
AGGAAGAGAAAOOTGAQAAAGAAOAGGAAOATAAAGATGATGAAGAAAAGCCCAAGATCGAAG 
ATOTGGGTTCAGATGAGGAGGATX3ACAGCGGTAAGGATAAGAAGAAGAAAACTAAGAAGATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGACCAAGCCTAnTGGACCAGAAACCCrGATG 
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ACATCACCCAAGAGGAGTATOGAGAXTTCTACAAGAGCCTCACTAATGACrrGOGAAGACCAC^ 

OCAOTCAAGCACTTrrcrGTAGAAGGTCAGrrGGAATTCAGGGCATTGCTAT^ 

GCTCCCTTTGACCTTTTTGAGAACAAGAAGAAAAAGAACAACATCAAACT 

TTCATCATGGACAGCTGTGATGAGTTGATACCAGAGTATCTCAATmATCCGTGGTGTOGTTGCT 

CTGANGATCTGOCCCTGAACATCTCCCGAGAAATGCTOCACCAGAGCAAAATCTTGAAGOTCATT 

CGCAAAAACCTTTGTTAANAA 

SEQ ID NO: 2810 ACAACAATTATGACAmGATTAGOCTOGACTCTAACTrGCrrGGATGAGATT 
TATTn"GGAATCACAGTATCCAAGCAAAGAAAAACAGTCAGTTTATAAGACACA\CAAAAATGAA 
AAAAAGCGAGACAAATGTCTGOn^GAAGTTTATCAATAACCTCrrATATACACATATGTATT^ 
CAAAOTGOTTGAGAGTCATTTTATACCATCCTTTAGAAGAGGTCCAATOG 

GAGACACAGTGGGACAGAAATTATCATGACTn CAATGATCI rnCl I ICCCCTAACTTTAATATCC 

TITAGrrGOGOAOAGAAAGAAOTCCATmCATCTGCTGTATCTAAGATmACAGATCACTGGAG 

ATTCAACCCCAAGAATATATTGACAGGAGTCAGGCTCTAGCATATATACAGTAACAGCATO 

GAATCTGATTCTTrGCACTTTAGTTrTACAGTCACCTOTCnTOG™ 

CrCCATTTCCATAAAAATGTGACACCATCCTGACTGTCTGGGGNXOCTGCCCCOGGCGG 

GAAAGGGCGAAATrGNCAGCACACTGGCCGGCCGTTACTAANTGGAATCCGAGCTCCNGTCCCAN 

GCTGGOCOTNATC 

SEQ ID NO: 28 11 ACCCCAATXTOAAGTCAGTAAATGAACTAATCTACAAGCOTGGTTATOOCAA 
AATCAATAAGAAGCGAATTXSCmGACAGATAACGCTn'GATTGCTCGATC^^ 
CATCATCTOCATGGAGOATTTGATTCATOAOATCTATACTGTrGOAAAAOOC^ 
ATAACTTCCTGTCGCCCrrcAAATTGTCTTCTCCACGAGGTGGAATGA^ 

TTOTAOAAGOTGGAGATOCTGGCAACAGGGAGGAOCAGATCAACAGGCrTATTAGAAGAATGAA 
CTAAGGTGTCTACCATGATTArrTTTCTAAGCTGGTTGGTrAATAAACAGT 

SEQ ID NO : 28 1 2 AO^rNTNTTATTOTANCAGATKTNAAGAGTCCATTrm 
AAAOAACCTAKTATCAmtlNNACTTm'CATTTrATAGCANATACATOTGCTGC^ 
hrrGAATNCATATACCCTTCTGTCATrrAAACTAACTGNCAGAGAAAACTGGTCTNTATAT^ 
GTWCAAATrGTGACNNAATACOTATGTATACATATATAGATCTCTAATATAA ATAT TAAATrTGA 
AAAA^^rcAAANGTGANCCAGAAACTGCTATACANCTATNTTGT^^^ACTATITATmATACATTA^ 
AGTATrTGGTGAAATATACm:A>rrrAGGTmCTAAAAAACACCATTATCTGC I AGTAATNG 
CTOCATTCrrOAATGAOCATGTNAAACGGGTATAAACTTCAACTCTOTOCTTAATNCANAATrCCT 
Gm■CGTTCTCCTCANACTITNAT^^^ACCTAAACCAT^fITGCC^ 
TrrCAGAGCACTATNAACATGTCTTTGGAOCAGTTAAACAGTATTTA^ 
T^TNCAATCTT^^rCTmCTCAGGNCAATGGTTT^^ 
TCAAAACACA 

SEQ ID NO: 2813 ACCGGATTCTGTCmAACCCTCCCCirOGTGmCCCCCAATGTTTAAAATGT 
TrGGATGGTrrGTTGTTCTGCCIXjGAGACAAGGTGCTAACATAGAriTAAGTaAATACA^ 
TGCTAAAAATGAAAATTCTAAC^AAGACATGACATTCTTAGCTGTAACn'AACTATrAAGGC^ 
TTCCACACGCATTAATAGTCCCATTTriXn'CTTGCCAmGTAOCmC^ 
ATGGGTGGACACGGATCTGCItXjGCTXnXJCXrn-AAACACACATTGCAG^ 
OTGTTCTGTTTGAAACTAATACrrACCGAGTCAGACrrTGTGTT^ 

GCCTGTGGGCrrCCCCAGGTGGCCTGGAGGTGGGCAAAGGGAAGTAACAGACACACGATGrrrGTC 

AAGGATGGTTTTGGGACTAGAGGCTCAGTGGTGGGAGAGATCCCTGCAAAACCCACCAACCANAA 

CGTGGTTTGOCTGANGCTGTAACTGAAANAAANATTCTGGGGG 

ATTCTCACATTANCOOCAGTrrAATNACCCATTTNCTNCrmACC^ 

ACATTAA 

SEQ ID NO: 28 14 ACGCGGGAGTTTATAATGAAACTATCTACAATTCTTGTTTTAGCACATCTGTT 
ATCCGTAAAACACCTGTAACTAGCITITnAATrrATrATTTXjAATmAGGAT^ 
TTNTTAGrrGCraANGTrcGCATTTTAACTGATTATTAAGCACTT^^ 
AACGTATrTTrrCTGCmGAANGATCCTCTGAANNAATTTCNT^ 
GTATTONANCAGN^rmATNTCNAATGANCTGTGC^GTNGAAAAAACATTAACCCT^GTT 
AA 

SEQ ID NO: 28 1 5 acaaattccagtgtgcagaccacancctcaaaacaaaaaagatctatttcta 

CTGGATCTGCAGGGAOACAGGTGCCirncCTGGTTCAACAACCTGTTOACTICCCTG^ 
GATGGAGGAATTAGGCANAGNGGGTITlCrAAACTnXKjCTCrrCCTNAi^^ 
^r^ATTOTaGACATGCAGCAT^AAACTTTGACAAGGCCACTGACATCATGACAGGOT 
AAAACCTCCrrraGGAGACCAATGGGGGACAATGAGTTTrCTACAOTAGCTACCTCCCACC^ 
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TCTGTAGTGGGAG iri ' lVl ' l ATGTGGCCCriTGGACTn^GCrAN^ 

CQT^^ITCCAGCTGGGATCCTAGAAAANGTTCAATNCTNCTTCAACNAA^ 

T 

SEQ ID NO: 28 1 6 ACAAAGATGACTATAAACAANATGCATNCCTCGGTTTCCATGAA CAGN ACAC 
TATTACANAAAACCAAAGTTTATATNCCACCATCAAaTONOOCTCTCCChmiACT^ 
TGGATCATNAATGANNATCCTCAAANCCAOTAONCTCATCATTACCCCNCANAACATCCNGNTGA 
AAGATTraAGCTTGAOAAOAAATGGAANAACGCraAACCTOCTGCACTGGCCnTOOAATOCC 
CNGTNATGTNTTAGCOGGAANCATATNCAACCCTGAATTGTrTCTCAGGTGTGGAA^ 
T 

SEQ ID NO; 2817 ACATOAATTAGAAGOTGCATCTAOGATTATOOCCAAACTOTTITAAAAATO 
CAGAAATGTAAAATTACATCrrcAAAATATGAAGAGATGGTCTACACACTTCAAAAATCAAATGT 
TGCTTATACCAGAGATGTATGACAATCACGGGATTCAAGTGACAAGCAGTAAGGATCTCAAAAAT 
TAATACTGGCAAAGATAACNGGGAATATTTTGCATrrACTGGAAATACATTGGCCTACTAGAATCC 
NAATCTACCGGGACTCAGGGAAAAAArrANCAAATCTAA AOCCA TTAKCTAAATATTNCTTT^ 
CCTAANCACCAGCTTGGACTITACCAGAAACTGGCCCCOGCITmATTGGCCATCrC^ 
AAGGTrcrCCAAAATTGAATNCCACCC^AAACCGGGTTCAACCTCCAATGAAAGTN^ 
CAAAmAAAC^M^GGATNGTTNAAATGNAA^^^ATTAAAACCCTNTGG^^mGGTCC^^ 
ATGGGATNAAATGGTGGNCNTTXrrTTTGAANCTTGAACTNCNTAATTTX^^ 
TCChnTirmCNTrAGGGGGNCAGGTrGGChm'AAGNTGCCTAAGGAANNCAT^^ 
CTNCCCGOCGOOCGTNCAAAA 

SEQ ID NO: 2818 ACATGAATrrAGAAATAAAATCGTCGCAGGATrCTCAAATACTGATACCTrA 
AGATCTAACACACTGATATTAGTCCATTCCCTACAAAGCAGCCACATTAGCAGTTCAGATrTGGTC 
TTTGTrGTAGCTGTrGACATTAAGTTCTTTAAGTOAAATGCCCAGCAGCATrrAAATAA 
AGCAGACATGAACTAAGTTTCAATATTOC^TCTTroGAAC AAATT ATGCT^ 
QOTGGCTATATTTACTNCCCTATTGTGAGTTTAATGACTOATTTTAAACTACAGAAG^^ 
GCTATTArrrCCTTTAGTTCTAAAAGTACCGACTTATATTAATGTrTTATAAAAGATAGTGATGAAA 
AAAAGGTAATGCTGAATAAAGGCG<nTTAGAAATA'nTAAGGACAACATAAGTATTAATATTGGA 
AAAAACTGT 

SEQ ID NO: 2819 ACCTCriTGAACGCATrGATTCTCAAAATCGAGAOATCATGAAACACXn"GAA 
GGCAATTrG ri ' i ' [UJ ' l '(XACCTACAAAGGAGAATGTGGATrATATTATrCAGOAGCrCCGAAGACC 
CAAATACACTATATATTTCArrTATTTCAGTAATGTGATCAGCAAGAGTGACGTGAAGTCATrGGC 
TGAAGCTGATOAACAGGAAGTrGTGGCTGAGGTTCAGOAATTTrATGGTGATTACATTGCTGTGAA 
COCACATITGTTTrCCCTCAATATTTmOOTTGCIX^ 

ATCTAGAACAACTCAAGGGCTTACAGCTCTCCTTrTATCTCTGAAGAAGTGTC^ 

TCAGCrCTCATCAOAOGCAOCAAAOAGAmGCAOAQTOCOTrAAOCAACn-GATAACTAAAO 

ATGAACTGTITGAATTCCGTCGGACAGAGGTTCCrCCATTGCrCCTrAT^ 

TGCCATCACCCCATTGCTAAACCAGTGGACATATCAGGCCATGGTCCACGAACTACTAGGCATAA 

CAACX:AATCGGATTOATCTTTCX:AGAGTGaXIGGAATCAGTAAA^^ 

ATCTGCTGAAAATQATGAATCTA 

SEQ ID NO: 2820 ACACATGTCATAGTGACCACAGCrTGTGGCTOCTTGAGGGAGGAGATTCANC 
CCGGCTGATATTGTCATTATTGATCAGTTCATTGACAGGACCACTATQAGACCTCAGTCCTTCTAT 
GATGGAAGNCATTCTTGTGCCAGAGGAGTGTGCCATATTCCAATGGCTGAGCCGTTTTGCCCCAAA 
ACGAGAGAGGTrCTTATAGAOACTGCTAAGANOCrAGOACTCCQGTGCCNNTCAANQOGGACAAT 
GGTCACAATOAGGGACCTCGTTTAGCTCCOGGCAGAAANGCnx^TGTT^ 
NGGATOTTATCAACATGACCACANTTTCAOAGGTOGGTITrGCTAAANGANGCrOGAATn^ 
CGCAAAGTATNCGCCATGGGCANACNGGATNNTTAACTGCTGGAAAGGAGCTOGAGNAAGCANT 
T^CCGGTGGACCGGOT^m■AAANGCCCTGGAAGGAAAACCGCT^^TAAANCCATAAAGC^T^ 
NTCNATTACAATACCT^WAGATAGGGTCNACAAAATGGTCCANAAAACCT^ 
ATNGCCO^pnrnmGGTITmACAAOGACTTTNAN 
ACACATTTTNATTCCAONTCTTT 

SEQ ID NO: 282 1 ACTCTTGCITATATCATCACAGAGCTCGATGAGAGAGAOCGAGAAGAGTTCT 
ATAGGTTAAAGAAAATACAAGAGAAGAAAAAGAnCTAAAGGAAAAATCTGAGAAGGACTTGGA 
GCANAGQAGAOCAGCTGGAOAOOTGrrGGAGCCTGCTAATCTTCTQGCTGAAGAGAAGG ACGAO 
GATCTTCTATTTGAATAATCTITCCTglTCnXiGTTCTrTGA GAAA COT 
AATrcACAGTGTOTANQTTTGATTTTGTOTGGCTAmATITmGGCCTAAOAATTT^ 
GAAAAATrrACCTANATGTCTATTTATTGGQGATTACTTITGCAGAAATCATAATTrAGCAACCAT 
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TrATCATGGATGAAAAGAAGATCTGTAAAAACCTGCCCANGAACTTACAGAATTTACTTTGCAGA 

AGCGGTCANCATACTCCATTTCATTCTGNGTTCACGTGATCTGCrrACCAAGCATTTrAGGAA^ 

CXTrCTTAAGGAAGCATTANCCGQCTrcATCCCTATTACCraGGGGAGCNCTIT^ 

CTTGCAAACCrrGGCGCroTrAGCTGAAANTTGCITGChWCCCTTCT^ 

COCOAACCCNTCTAANOCC 

seq id no: 2822 acatgatacagattggtitrccagttttraatgaactgaaatagaaatgtcta 
aatacagcagtatctgcctgtgcaacaaatatctgtaaggtaaaataaggta'ntga taga agan 
catctgcanqaacaaatcaoatgaaaaatctoaaaaogtrrctatataccttctxloattitaa^ 
aaccx:anaaaitaatggctcaagatactacattgctaaagttaggggaaaaaantaaaaaggctg 
toaorrctgttgcaaoagctcatttgtanacttgcaaaanctaactaatmatattatgcttc 
gtaagagcagtgctcaaaatnncaoaagcttcaaat^tjott^^'gtrtcacaaaaatttgct 
tgttggcatgaatgtotgtcagggaattcatacccaggtnaatgacaattacatcagtatagct 
ttirmccacctrggqaooaatggaattctggctatmcnaattaatcctacaca^^ 
aaactaacagcccatggo^ccataattacatmgtgnggtccctctggnc^^ 

SEQ ID NO: 2823 ACAGTCCCTCAACTGGACTAAAATCATGAAGACCATTGTTGATGACCCTGAG 
GOCTTCTTCGAACAAQGTOQCroGTCnTrCCTG OAGCC TOAOGGTOAOGQGAGTOATOC^ 
AGOGGATTCAOAGTCTGAAATTGANGATQAGACrmAATCCTTCAGAAGATGACTATGAAGAGG 
AAGAGGAQGACAQTOATOAAGATTATTCATCAGAAGCAOAAQAGTCAGACrATTCTAAGGAGTC 
ATTGGGTAGTGAAGAAGAGAGTGGAAAGGATrGGQATGAACTGGAGGAAGAANCCCCGATTNGC 
NGACCGAGAAAGTCGTAOGANGAANATAAGAACAAAGTCNGAATTATGAGCCOGNANANGANNG 
TTNTGTOCACANTrrGGCCNGTGTCTNCCGTGTTTCANACAANTTCTGNCACCCAAAAAAAAG^^ 
NATCT^m3ACTTGGCCNGNm■0C™TT^CTTAGCCAC^ 

SEQ ID NO: 2824 ACXiCGGGTCTCTTTCCGGCGGTGCrCGCAAGCGAGGCAGCCATGTOT 

GCrcATGATTATGAGTCTGAGGCGGCrTATGACCCCTACGCTTATCCCAGCGACTATGATATGCAC 
ACAGGAGATCCAAAGCAGOACCTTGCTTATOAAOGTCAGTATGAACAGCAAACCTATCAGGTGAT 
CCCTOAGOTGATCAAAAACTTCATCCAOTAmCCACAAAACTGTCTC^ 

AGTGTATGAGCrACAGGCX:AGTCGTGTCTCCAGTGATGTCATTGACCAGAAGGTGTATGANATCC 
ANGACATCTATGAGAACAGTTGGACCAAGCTGACTGAAAGATTCTTAAGAATACACCTTGCCC^ 
GGCTGAACCATroCTCACAGGTTCCATTGATCCrGCTTCCTNATrrATACAAAGAT^ 
ACTA>m)CCAAAGTAGGGGGGGACCrrCTTGGAGCAAGGTTGAATCTTTANACACCGGATm 

SEQ ID NO: 282 5 ACATCnTGCCTAGATGTCGATGACTGCAAGTAATAATACAGT TTATAAT GAA 
ACTATCrACAATTCTTGTTTTAGCA<>TCTGTrATCa3TAAAACACCTGTAACTA G^ i 111 1 AATT 
TATTATTTGAATTTTAGGATAGCGAATCACTAATTmAOrrGCTGAGGTrGGCATm 
TTAAGCACTTCTGTCAGTCTTTGAAAAAAQAACGTATTTTTTGTGCTTO 
TCTTTTATAATAGAATGGGCATGTATTGTAACAGTTTTATGTCAAATOATCTGTGCTGTAGA 
CATrAACCCTTGTTCAAAAAAGAAATGGATAAACTTGCCTTTCTAAGTGGAAGAATGOT 
TAATATCTGNATX3TTACArrrATTAAATrrAATCTCTrATGATAGGGTGATACCrK^ 
ANGATGCAATGTTTCTANAACimTTAAGTGCCCAmGCAGNNCTGCCOGaKSCGT 
AATCAACA 

SEQ ID NO: 2826 ACTGGAGGGAGAGGCCGGGCTCrCAGGAAGCAGCAGGCACarGCCAGGTGG 
AAGCCAGCTGCAGGCAGGGGAGGAAGGAGGCCXnTACTCrTCCTrCTTGTCCATGGGACCA^ 
CrcCAGCCTGCAGCTTTGCGTGCrCXrrCCAGCAAGCGGTCGTACl CI ri'l 11 U rn-l'ITl Tl 1 1 1 IN 
GACGGAGGCTCACTCTGTCAOCCAGGCnXjGAGTGTAGTGCCACX:ATCTCAGCTCACTOT 
GCCTCCTOOOTTCATOCCATTCItXTOCCTCAOOCTCCCOAClTC^ 
NAGGANACTXjACAGTTTGCTCAGTCCTGGGCTACATGGTCATTCTGCTGTAGCATGCA 
AGAGTGAGCACCAGCAAAAAATGTCrTANCTGAACATCTCTrrAAGTCAAATTAAAAAN^ 
AAACAAAmCTAACCATGCCATGAAAATCTCTTANNATTITNT^^ 
CNAAAAAATAANAATX:CAAAGTCTCCAAAAATGGmAAAAA AANTTT TGCGGTTN 
^rmGAAAAAATTTTTN^^^C^WGCCNCNT^^^■ANNO^ 

SEQ ID NO: 2827 ACAAAACCAAATGTTrGTrACTATAACTTCTGCATCACAATTA AAAT CCAAAC 
AGrirrTTAAAAAa^GTCAACTCAATCAAAACCCACTACTTCAGAATCAATAGCnT 
ACAGTAACACTTAAATATGGTTAAOACTCGAATGCAOAAATTrGGTTOOTTOGAAAGCT 
ACTTCCAACTIXKnx;AAATAQAATTAC:AAAAAGGCAAAATTGTGTTTrrCAC^^ 
CTGGAATCACCAACACTGGACAGCTGTTAGAGTATTTAGAGTCCTGAGATAACAAGGAATCCAGG 
CATCCmAGACAGTCirCTGTTGCCnTrCTrCCCAATCAGAGATrrc 

CCCCCAGCAATrGTAGCCTTOTGAGAGAATCCV^TTCncrCTCCACGAATAGCAAGTTGCAAGTGA 
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CGAGGGNAATCCCmACaTAAOCTrTGATGATrCTOGCAGTCACrTCC^ 
AATCACANCTGCfCGGCGGT 

SEQ ID NO: 2828 ACTi l'l m I'i i ! U i 1 1 J l-iTI'i l"l'ri ACKKjACTCATAAATTCnTATnTOOCT 
AACACTGATTGCATTCATTACTACCATTTTATTrATTACAGCTCANAGGANAGTCATITACTANAC 
AriXX}<7m}TTATTAAACACTAAAAATTAGGCAGAGACTGGAGGGGATAAGGATAA0A0AACAG 
GATAAGGAGAGCAGAAAGACGCTCAGGAGAGTAGAAAGACACTCAGGAGAGAACAAGAGATCXT 
GCCCATTCTTATTATGAGGATGAACTGGCTACAGGACATGAGTAGCCCAATCAGATCTAACTTAAA 
TACCAOCCTCTCCAAAAAAOTITrCCAATAAAGATCCCCrACAATTATrrACTO 
TGTATTrCTAGAAOAATGATCTXTrrG<:ACAAAAGACCACTrAGACCCACTTATACTAAT(K^^ 
GCAGACTGO^TCXTTAAAATGACrGGAAAATACArmAGOACNCCTGCCTCCACTOAT^^ 
T^WGCCAAAAAGNGTAAGGTCTAAAAAAT^f^^TTGGGGGGGGTTTTT^GO^ 
TNGGGNGNNCTATrCCCGGTItX;GGGGGAANAAGAACAGGATAGATITCKmNATAANGGG^ 
AAAAAATGG 

SEQ ID NO: 2829 ACAGATtnX3CAGGAATGCTAG0TGT G0TTGG TT0ATG CCGA TTGTAACTATT 
ATGAGTCCrAGTTOACTTOAAGTGGAGAAGGCTACGATTTTTTTGATGTCATmGTGTAAGGGC^ 
CAGACTGCTGCGAACAGAGTGGTGATAGCGCCTAAGCATAGTGTTAGAGTTTGGATTAGTtKK} 
ATTTTCTGCTAGGGGGTGGAAGCGGATGAGTAAGAAGATTCCTGCTACAACTATAGTGCrTGAGT 
OQAGTAOGOCTGAOACTOGGaTaGOaCCTTCTATOGCTGAGGGGAGTCAOGGGTGGAGAOT 
TGGGCTGATTTCCCTGCTGCTGCTAGGAGGAGGCCrANrAaTGGGGTGAGGCTTGGATTANC^^ 
ANAAGGCTATr^GTTGTGGOTCTCATAAGTGGA^ITGTANGATAATCATGCTAA0CC^ 
CGTATCCCCGTTCGGTTGATAGNATGCTTNATGOGTGCNGGTTGCrrGTCNGOGT^ 
ACAANANGTTTA1TCTCX;CC7TT 

SEQ ID NO: 2830 ACATATTTTGGTTGAAGACACCAGACTGAAGTAAACAGCTGTGCATCCAATr 
TATrATAGTTrrGTAAOTAACAATATGTAATCAAACTrCTAGGTGACTTOAGAOTGGAACCT 
TATCATTAmAGCACCGTrTGTGACAGTAACCATTTCAGTGTATTGrrrATTAT^ 
AACTTATTTTTCACCAaOTrAAAATmAATTrCTACAAAATAACATTCTGAA^^ 
ATGmAGTAGGTTGAACTATaAACACTGTCAlX:AATGTTCAGTTCAAAAGCCTGAANGTrTAOAT 
CTAGAAGCTGGTAAAAATGACAATATCAATCNCATTAGGOGAACCATTGTT OTCT TO 
ATTTAGCACTATTTAAATNAGCXACCAGGTTTATGACTrATATNCTTGAAANTI^ 
TGl^ATAACT^mT^AGATGTAATGCTTATAAAATNACT^^^CATmANCTT^^ 
ACTGATAAGATNCTOAAAC 

SEQ ID NO: 283 1 ACGCGGGG<XrTTCTAACTCCGCTGCCXjCCATGGCTCCTCTGAAAAAGC^ 
GGTGAAGGGGGGCAAAAAAAAGAAGCAAGTTCTGAAGTTCACTCTTGATTGCACCCACCCTGTAG 
AAOATOGU\ATCATGGATGCTGCCAATTTTGAGCAGTTTITGCAAGAAAGGATCAAAGTGA^ 
AAAGCTGGGAACCTTGGTGGAGGGGTGGCGACCATCGAAAGGAGCAAGAGCAAGATCACCGTGA 
CATCCGAQOTGOCTTrCTCCAAAAGGTAnTGAAATATCTCACX:AAAAAATATTTGAAOAAGAATA 
ATCrACGTGACrGGTTGCGCGTAGTrGCrAACAGCAAAGAGAGTrACGAATTCOTTACTTCAGArr 
ACCAGNCXJAANAGANGAGGAAGACXJAGGATAAATrrcATTATCTOGAAATrrrGTrr^ 
AATAAACTTGGGACCCNAAAAGAATAAAAAATA 

SEQ ID NO: 2832 ACGCGGGGAGAAGTTAGGGGCTCCAGCGGCGCTGGCTTTAGGTGAACGACGT 
GGTGAGGAGTGGGTTTCGGGCATGAGAAGrrCACAGGGCOGTTTCCTAGTCT^^ 
0GTmCTCAGAGAAAGAAGOCrG<XOTGGGTAGGCrGGGGGCXjGAGACTATCGGGAAGAGAAA 
ATTACTITTCCCACTGAAACACACOCAAGTATATGCCCAGCUITCATGAAAGTGAACAGAG^ 
OAAGCGCCTTTATOTGGGTGGCTTAGCCAGGACATTTCTGAGGCAOACCrACAAAATCAGTTCAC 
AGATTTGGAGAAGTTTCGGATGTGGAGATCATCACACGGAAAGATGACCAAGGAACCCCAGAAGT 
TriTOCATATATCACATCAGTGTACAOAA0CGGNCT0AAAAATGhm3CTOmAAATAAAACAAA 
TGGAAGGGGACNTTACAAATCACTAOCAAAGAAGCnTCTGAANATGGCCA 

SEQ ID NO: 2833 ACAAGTTCGGCTntiAGCTnxrrCAGGGGCCTCrGGGAACATCC^^ 

AAAATATOaOTOTGTAGACTACTOGOTGAAGGC I'l VJ Vl'1'GACCOCCCGAOCCAGCCAACTCAAG 

AGACAAAGAAAAACTrrGAAGTAGTGGATCTGGTGGATGTCAATACCCCTGATTTAATGO^ 

GTGTCrcCTAAAAAAGAAAAGAAAGTTTCCTGCATGTTCATTCXrrGATGGGCGGC^ 

GCTCGAATrGACAGAAAAGGATrCTGTGAAGQTGATaAGATn"CCATCCATGCraACTrrGAG^ 

ACATGTrCCCGAAm>TGGTCCCCAAAGCTGCCATTGTGG<XCGCCACAOT 

AQACAAOGTGCTOACTCAAAGTraCATCATCANAOCATCATTTATCTAAOGAKr^^ 

GTGCAAGANCCmJGGTTAAAAANNAGCCTITrrCTGGGCTOAAATCITNAGT^ 
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SEQ ID NO: 2834 ACl'l'lTl 1 1 i'l'l'l'ITl'l 1 HI 1 1 1 1 lNATTnTTAAAACAACTTTC3T0ATATTTAT 
ATTCCATTGCACANATATaGGTTATACAAATTAOTAACACAAGTTATrrCTTAAATTCCAAACATr 
TTCTAAAATATTATTACAGCAGATOCCrACKjATATT AAAA CATAAAGGNGGAAATCATTCTCTAA 
ATAOTTAAATATATTTGACACrrGAAAATACTOCACATmAATrAGAGGAAAATTAAAATGTGTGC 
TCAAATGCACTAACAAACTAGGTAGNANGNCVW^CrNAC^CTACAANrAAGTCACTCT 
TTTACGTATTGCATATATATNATATTCTCTAGATAATCCGACThmX3NAAATNCrrATTAAAC^ 
AGOXXiATNATGATTAAAGAAATTGNAKIxrrTCTTANNCTAANGGGANTmTt^^ 
TTNAAGGNAAATTACAAANNTGGNGGNChn'ATAhWGGATCCAACCTCGNTCCAAC^ 

SEQ ID NO: 2835 A Cn ri Tin 1 rrriTl l m m l m 1 1 ANAATCTOAAQTCTTONGmTACTA 
ATGGAAAAAAAAATACAGAAGAGOTmGTTCTCATGGCTGCrCACCGCAGCCTGGCACTAAAAC 
AGCCXAGCGCrcACrrCrGCnTGGANAAATATTCrrrGCrCTrrrGOACATCAGGOT 
ACraCCAGGTTTCCAGCCAGCTOGGCACACrrCCCCATGTTO 
TAGTCrCAAAGTCTXIVVTOCACAGAGCGGCCAACAGGGAGGTCATTTAC^ 

TACCCTTATCATCAATGATAAAAAGOCCCCTGAACGAQATOCCTrCATCACCTTAANCCCATAATC 

CTGAGCATGGTGCGCTTKGOTCTGAANCAAAGGAATGTCATGGNCCChWCCTC^^ 

TT^Na:rrGCCmATGACAAATNANAT<XCAAANCCCAATCCOTGGAG 

NTNCTGAACAAGNNTCCGGGGGCNCAAGGNGAATCAAANGNGAAAAAAAAACNCAl Vi II I I I IN 

AAAGNGGTTrrTAANACCTOGNNAAAm'GGTTATTGGGGNGaGCCATTTWT™ 

NCCANAAGGOAAAATCCC 

SEQ ID NO: 2836 ACATOTTraAAGAAGTGCCGATTGTAATTAAAAATrcACATCTGATtl^TCT 
CTAATGTGGGAACTTGAAAAGAAGTCAOCTGTrcCAGATAAACATGAATrGCnX:AGCC^ 
CAGCAATCAmOOGGAAGAATCTACAGTrOCTOATOGACAOAGTGGATOAAATOAGCCAAGATA 
TAGTTAAATACAACACATACATGAGGAATACTAGTAAACAACAGCAGCAGAAACATCAGTATCAO 
CAGCXmXXCAGCAGGAGAATATCCAGCGCCAOAGCCGAGGAQAACCCCCGCTCCCTQAGGAGA 
CCTGTtXAACrCTTCAAACCACCACAGCCGCrTGCCAGGATGGCTra 
AAACTTACTGCCANACTTAANGAGTCACTGGCCAAAACTrAGGNAGCmfrCATGCC^ 
AAATACAACACTANAAANGOAGGTTCNGAAANAGTACATGACTrrGNAGCCCXCAGGCANTrTTG 
GANAATTTTTGNTTTGAANCCAAGNNTT^r^^NAGC7^TGCC0^ 1 ANANGnTTNCCAATNAA 

AAAANAANTNGAAAAAAAAAAAAAAAACCmCCG 

SEQ ID NO: 2837 otggtccggccggogtacctttitgatoctatattactgcgattaaaaagttc 

TTGCAGGnrAATGTTTATGATATCrrAAACGrrrGTAATrrCCTATCXjTAATrATAACA 

TTTGTAGATOAAACTTCTACATATTGAACCACAaATrrTCnXJAOCriXrrAAAT^ 

CACATTTCAGTGATCAGAATAGATATCCrmACACGCACAAAAGCAATAGATTCATTCAGTGGAC 

AAGTrcCrTGTTTAACTACACAGCTATGATGGAATGATATATCCAAGTTCCTTGCCTC 

ATGCATATTOTATATCATOAAAGTGQGATGCCAAGTAAGCTTAAAATGNATTCT XJJGC^ 

TTAGACrrrAATACTCTATAAACANGTTGCGNCATrCCCAGATNGNTTCC^^ 

ACTTATNATTITQCCGTTAAACACTACAACGOAAG<XTTCTAAAATTXniJCCim 

CT^mTGGTTNAGGGNTNGCAAACNGGAAANTTITmCCCCTTTC^ 

NGGNCCTTTAAATTAACGGANATT 

SEQ ID NO: 2838 ACOOGGGATTCTATTGAGCTATTACACCAOTTITAACACCTTCCTra 

GTTrAAAAAAATAAATAAATrTAAGAAAACCATTTTAAATAATGCACAGTTGCAGCCTGGAAAAA 
CTTAAGGTGGCGOCTTAfAGTATCAATmAGOAGCnTAriTGGTOCAmAACGCAACT^ 
TTOCAQAATCCACTTTCCCTGTGTAAGTGAAAAATATAGACTGTrATCTO 
TCTGCACrmCATTATATACTCTACCTTCTTAATTACTrCTGGC^ 

NTTGCATTCTTTC L r i ' rJi C 1 1 C CTXiTTCTrATGCTTTAATTCTOAGGACATATGAGGGTANATA TAT 

ATCTTTTAAAAATACAANATrrNnTTAGGCAACCAmCTTAAGTTGTTOCNAAA^ 

T 

SEQ ID NO: 2839 ACTCriXnCTTTTGGAAAAaTTCITGATCCCCAATGCr^ 

AAGTCTICTAmGAAAATOAAAGOAOATTACTACCGTTACTTGGCTGAGGTTG^ 
ACAAGAAAGGGATTOTCGATCA0TCACAACAA0CATACX:AAGAAGCTriTGAAATCAGC^^ 
GGAAATGCAACCAACACATCCTATCAGACTGGGTCTXKKXXrrrAACTTCT 
GATTCTGAACnXCCCAGAGAAAGCCTGCTCIXrrrGCAAAGACAGCIT^ 

CTTGATACATTAAGTOAAOAGTCATACAAAOACAGCACGCTATAATGCAATTCTGAGAGACAACT 
TGCATTGTGOCATCOGATACCCANGGAGACNANCTGAAGCAGGANANGAGGGAA ATTAC CGGCT 
TOCACTriTGCTGCTATCTAAATTACACANAAACATTmCTCAThrrGC(^^ 
TATGCAGGTTNGTACl l'l l I IGAATTrTAATTCCCAGCGGl I'l I'l 11 lATTT 
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SEQ E> NO: 2840 ACX3ATCGAAO(KjACTATCn'CTrCATTOAATmOTGTrGAAGACAGTAAGGA 
TGTTAATCrrAAATTTTOAAAAATCCAAAaTACATTCAGTTGTCrCGGAGGAAGTGAT^ 
GCATITAAATGAAATTGATCnTITCACTGTArrOATCCAAATGATTCCAAGCATAAAAGAAra 
CAGATCAATTITATGTTOTTTACX}AAAAGOAOAATCTGG<XAGTCATX3GCCAANG 
AAAGGCAAAGCrTAATTGGCTrAGTGTCGAOTCAATAATTGGAAAOACTGGGAAGATGATTCAO 
ATOAAOACATGTCTAATmQATCGTTTCTXTrGAQATGATCAACAACnTGGGTGGTO 
TAGATmX:AOAN>nTGATGGANCNGATGATGATrcCAAGNANNGATCTGAAAAAT0CC^ 
AOTAOGAATTTTGGATCNCCrQATriTGAAANAAAAAAACTTIX^ 

TCCTCAATCNi^ACTnAAGCAMmjmCmTGCCtm-AACrrnACCGl mil 1 AAAGTNTATT 

seq id no: 2841 acacctgtaatctgttccrctrcaacaaaggaagcagaagatgcaoctgaaa 
aactttcx:agagcatctgatatgaaggacacacaoctcctcaagaaaataaaggaaocaattoot 
aagatccctgctgixaccaagoagccagaggaacaaactgcatgtcatggcccatcagottgtct 
tagcaacagccttcaagtgaaaggcaatactgtctgtcatgotaotottitcacttcraacto 
gtctgactgoagcatctcttcgtrrtcaacgttcac^^ 
ccrrgcggcattagatgccaacatagctagactccagaagtcrtranggactgt^^ 
oaattcaoaagaaaatcatcaggtgctcntraaacnagaacttgctatattgaatgtgati^^ 
ttagngaaagangtttatgtntatgnggangaanatatggacrgcccggcgocngttagggc 

seq id no: 2842 actacacgcocctgggcaacgacttccacacgaacaagokxitgtgcgagoa 
oatcoccattatccccagcaaaaagctccgcaacaagatagcaggttatgtcacgcatctgatga 
ancgantrcanagaggcocagtaagaggtat^r^0catcaangc7ixk:aogac^ 
oganagacamtatgttcctgnngtctcatcctrga^tcaqgagattattca^ 
ctaaogaantgctgaat l ' l 1 1 1 i ggacitcngcagtctgtncanccttcangccactnagcctaca 

mGGGATGA>nTrCATAAACGCCTTOGNGACCTG>rrmAArrnTATTG>^ 
TNATTNAATTCTGGGACAATNTGCAT 

SEQ ID NO: 2843 ACTGTCGGTTTCAOAAATGCCTTGCAGTGGGOATGTCTCATAATGCCATCAGG 
TTTGGGCGGATCCCACAGGCCGAGAAGOAGAAGCTGTrOGCGGAOATCTCCAGTGATATCQACC^ 
GCraAATCCAGAGTCCGCraACCTCCGGOOCCTGGCAAAACATTTGTATGACTCATAC^^ 
CTTCCCGCTGA(XAAAGCAAAGGCGAGGGCGATCTTGACAOGAAAGACAACAGACAAATCACCA 
TTCOTTATCTATGACATGAATTOmAATOATOGQAGAAOATAAAATCAAGTTCAAACACATCACC 
CCCCTGCAGGAG<>GAGCAAAGAGGTGGCATCCGCATCTrrCAGGGCrGCCAGTT^ 
AGGCrGGCAGGAGATCACAGANATGCCAAAAGCATCCTOGTron^AAATCTTCCTTG^ 
TAACTTCTCAATTTGGANCACGANACATTrCACATOTTGCTCTTGNTATAAGAA 

SEQ ID NO: 2844 ACTAAACTGATGGGCCGGGACGGGGCATTCACAGTTGCTGGGCAGAGCAGTG 
ACAGGTCAGAGTTGTGTTTCAAATAGGCCTAAGTAGGACTAAGTTOTATTAACTAACnTOAATGGA 
CTCAGCTXjTAACATTCTCTGTCTrGAACACTACACACTmjGGCCCA 

GTACrGTTTATTAACCAACCAGCrrAGAAAAATAATCATGGTAGACACCTTANTreAn CTrCTAA 

TAAGCCTGTTOATCTGGTCCrCCCTOTTGCCAGCATCrCCACCTrCTACAA^ 

CTTCATTCCACCTCGTOOAGAAGACAATTrGANGOGCCCAGGAAGTTATTTGCTC^ 

NCAACAmAATCmATGAATCAATCCTCCATOCAAATGATGCajNT^ 

AGGTTTNTCCAA 

SEQ ID NO: 2845 ACGTATAGTTAAGTGATCAAAOAAAGGTAmGGTTTGTGTCGTCACCTTATA 
CATCCTAaTaATGAAACTAAa:AAACTATGCCTCCATGCXTrCCCTC 
GA(XiAAACAGACTGAGC:AGCXjTGGTIXrr(::vVGTCCACTrTATCAGAG<mGAG<^^ 
GGGCTAATGTAOTCTACCAGCAGCTraTGCOQATAACCAAOAOTTCTTOOCTGTTOCAGTAT 
TGTCCCAAAGGCTAAGCCACTOAOGTGGAAATCAATCG<>GCATAAA0AAGACTGANC^ 
GCATAACCTGCGAGACTACCTGGAGAGGACAAAAOAGGATCAGGATOCTCATTOCCACAAAATGT 
OGTGCXSGQTTTTAGTGCTCTATGCATTANCTGCTCAGANAANTACATTAC^ 
NATTNAATGCTACACTNAAAATCNCCCCmGATTGNGGGG>nTrTT>^^ 
CCCa^TCCCOTCTTrNTTTTTAA 

SEQ ID NO: 2846 acgggtaotogcgcacatggtagcattgagcatatggacaaactccaccttg 

TCCATCATCTTCTTGCrrrcCCCATATCXXjATrCOAAGCCGG 

OAACAGOAOACCAACrCACOGAAGOCTCCTaAGaXXjGAAACCAOGCCrCXAGGTCAAGU 
ACTGGCAGCATGATTCAAAGAACCTCAOACAATATTCACAATOTGGTAAGGAATCCCXI^AGGGACr 
GGTAGAACTCCTCTGCOGTOGTAATCATCTCrrCAAACATCrrcOCATGACTrOTTC 
ATOAGTACCTO 
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SEQ ID NO: 2847 ACCTAOAAOAGAGOOGCKJTCAAAGAAGTAOTQAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTITATTTGGAGAAGGAACGAGAGAAGOAAATrAGTGATGATGAGCK:^ 
AGGAAGAGAAAGOTGAGAAAGAAGAGGAAGATAAAOATQATGAAGAAAAQCCCAAOATCOAAO 
ATOTGGGTTCAGATGAGTAGGATQACAGCGGTAAGGATAAGAAGANGAAAACrATNANGATCAA 
AGAGAATTACArrGATCAGGAAGAACTATCAAGACCAAGCCTATTIXJGACCAGAACCCTGATGAC 
.ATX:ACCCAAGANGAOTATOONOATITCTACANGAGarrcACrAATGCT^ 
TCAANCACTTTCrGTNAAGGCNGTTGOAATTCAOGCTTGTTTTATTCCTCGCCGG 
TTTGGGACCTGNTG 

SEQ ID NO: 2848 ACAOATTrOCTTItrKnTACAAAAAGAAAAAAAAATCCTGTTGTATTAACA'rr 
TAAAAACAGAATTGTGTTATGTCATCAGTrrTGGGGGTTAACTITGCrTAAn 
ATTTAAGGAGOAGCTtKrCTTAAAAAAAAATAAAGGCOTArrTTGCAATTATGOGAGTAAACAAT 
AGTCrAGAGAAGCATTTGOTAAGCTTTATCOTATATATTTTTTAAAGAAGAGAAAAACACCTTGAG 
CCTTAAAACGGTGCTGCTGGGAAACATTIX3CACTCTmAAGTGCATn-CCTCCT0CCT^ 
TCACTXKANTCTAAAAAAGAQaTAAAAGCAAOCAAAGGAGATGAAATCTGTTCTGGGAATGTT^ 
GCACCNANTAAGTGCCGAGCCACTGCCCCXimXSCTGCraGCCCm^ 
TNTCNCCraTGCTITAAAAa^CAOANTTACCTmANATTCCNCTGCTAAATATrr^ 
AGGTrrATNAACAAAAAAATTTTTTTTTTTT 

SEQ ID NO: 2849 AATAAAGAACCrcrATCAOTGAGACTIXnx:ATTTrATAGCAAATACATTTTrO 
CAOCITAAATTITCTraAATTCATATACOCrTCTOTCATTTAAACAAAC^ 
CTCTTATATATTTAAGTAACAAATTraACAAAATACATAmATACATATATAGATCrCTAATATAA 
ATATTAAATTTGAAAAAATCAAATOTOAAGCAOAAACTOCTATACAAGTATATTOTATAATATTTA 
TTTTATACATTAAAOTATTrGGTTGAATATACTTCAATrAGGTTCTAAAAAACACCATTAT 
CTTAOTAATTOCGACATTCTNAAAAGCNTGTGAAACGGNTrAAACTCNACTC^ 
ATTCNOOTNGTCTCTTAACl'l rrilTIXTAAANATTTGCCGQAGNTACAAGGAA 

SEQ ID NO; 2850 ACTACTGCTGTrTTCTGAAQACGCGAGOGCAAGTGCAGCCAGCCG' J'l'l'Cl" I ri' 
CCl i Cl'l 1 AAGCG ri'ltl'lCrCCTGTTTCTCCAGCnCCrAGTAATCTCAGCAOCraCTTCCT 
TGAACCATTGCTIXXnTCATGACATXXAGATTCTTTCGTGOTATCTC^^ 

NGTCGCrCTTCAACTTGrrCrCGAAGCITCTCCCCGAATACACTCGTGGGCACCTCAOAGAAGCAA 

TCGATTCGTGAGGCAATACrOCATITOTTTGCCAQOTATCOGGAGATGCGGC 

GCTG<naKjCCAATGAAAGGTGGAGTGGAAAATGAGTCCATATTrrGGGAGTG^ 

AGGCTCTGACAGGGCCTTTTCAGCCNAAGGATCTGA>aGNGGTNNTGAT^ 

GGTGCAANTGNOCGANANACNTCCCCAACGNTCCCAATANGGGG 

S EQ ID NO: 285 1 Acu 1 1 i 1 i 1 1 rrci J 1 1 i 1 1 1 1 1 1 1 1 1 1 nggananacaaggnctnactatgttg 

CCCAGGCTGGNGTCAAACTCCrrGGCCTCAGGCAATCCTCCTGTCrcAGCCTCCCAAATTO<^^ 

TTACAAGNGNGAGTCACCACACCrGGCCTTATmCGAGTmAAAGGCAATTTrCTrGGC^ 

ANACTOCATGTrcAGTCAGTATCGTCTGGGGGTTGAAAAATTTAAAAAATCCTATTT^ 

CTGCANAOCCCATQATATAQGOGATTITITrxXnTrTCCATGTCrC^^ 

AATATTTATATTCCTGATACTTTTCAAAATCAGTATATmACAAACTTT^^ 

TTCTACrATTTATGATTAAAOCACACA>rimGATrrNTATAGCNQACAGTrNATNTA 

AAANAATCTTrrcrCTTGTXKiAGGCAAATTNACNATNAAAAAANGANTGGWTT^ 

NTTAAAAAAOTNTTTTAli 1 1 Ul INNAANCCCAAAAAAAGNGGGGNCCCTNTrTTTTCOOOAAAAA 

AAAANAAANQGGTTTNNGNGGOG^f^r^ITAAAAAAAAATTANGAAAAAATAATCC 

SEQ ID NO: 2852 ACCTGTGGACCAAGTCTITGGGCAGGATGAGATaATCOACOTCATCGGGG 
ACCAAGGGCAAAGGCTACAAAGCGGTCACCAGTCGTTGGCACACCAAOAAGCrGCCCCQCAAOA 
CCCACCGAOGCCTOCGCAAGGTGGCCTOTATTGGGGCATGGCATOCTGCT^ 
TGGCACGCGCTCGGCAGAAAGGCTACCATCACCGCACTGAGATCAACAAGAAOATTrAT^ 
GOCCAGOQCTACCTTATCAAOGACGGCAAGCTGATCAAGAACAATGCCTCCACTGACTATCAOT 
ATCTGACAAGAGCATCAACCCrrCTGGOTGGCITTGT<XACTATGGTGAAGTOACCAATGAC^ 
ATGCTGAAAGGCrGTGTCGNGGGGACCAAAAANCOGTCrACCCrCCOAAGTCCTrOCrGGTCC^ 
CNAAGCGCGGCTCTGGANAANATGACCTAAATTCATTGCCCACTCANTTTGCATGGCC^ 
TGOAGAAAAAAGATTCTTGGACCCTTTAAAAACNATTrTNGNAGANAAGThmXNCCAAA^ 
TNGQNGGGGOGhrrAAAATTTTNNAAAAAAAAAAAAAAANTTCGGCC 

SEQ ID NO: 2853 AATAAAGAACCTCTATCAOTGAGACTTCTCATTTTATAGCAAATACATTTTTG 
CAOCTTAAATmcnTGAATTCATATACGCTTCTGTCATTTAAACAAACTrCCAGAG 
CICTATATATTrAAOTAACAAATTTGACAAAATACATATTTATACATATATAGATCIXTAATAT^ 
ATATTAAATrrcAAAAAATCAAATGTGAAGCAOAAACrOCrATACAAGTATATTGTATAATAm 
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TTTTATACATTAAAGTATnXJOTTOAATATACTTCAATrAGOmCTAAAAAACArc 

TCTTACrrAAriXjCGA(>TrCrrTGAAAGCATGTQAACCKiGATAACCTCACTCT^ 

TCTGTTNGTCTXXX:AAACmATCTCCTAAACATKrOCN0A0CrCCAA0OAANGGACA 

SEQ ID NO: 2854 ACGCGGGGAGTCAGTCCCAGTCAGGACACAGCATGGACATGAGGGTCCCCO 
CTCAGCTCCTGGGGCTCCTGCTCCTCTGTCTCCGAOGTGCCAOATGTGACATGCAGQTGACCCA 
CTCCAGCCTCCCTOTCrOCOTCTTTGGGAGACXjGAOTCACCATCTCTrGTCGGACA^ 
TTACCAAGTrTGTTAATTGGTATCATCAOAGACCAGGGGAAGCOTTAAACT^ 
CTTCCATmGCAAACTGGAGTGCCATCAAOATTCANGTOCCAGTGOATTTOGOACAOAOTTCACT 
CTACCATCAAGTAGTCTGCACCTGAAGATTTGGGAATrACTArrGCAACAGATTACACTrATCX^ 
CACCTTCGCCAAGGACACGGTGNACTCAACANCTGGGCTGACAltnXjCTr^ 
CA0TI7MATCN0GACTGCTNTNTG0TGCCTGTAATACTNATCAAA 

SEQ ID NO: 2855 ACACTGTTGGTGTTATATGGGGATGGGGTrCTCGGTAATmGTTTATTATTTA 
TQTTrATTATTATGTrrrATCATTAATrATTCAATAAArrmATTrAAAAAOTCACCCrrAm 
AATCITCTGTGOGOGTGOOAGGGACAAAAGATTACAAACCAAAACTCAGGAGATGGTAACACTG 
GAArrcATAAAATCACCTGGGATTAGTTGTATAACTCTGAACCACCAAACCIXn^ 
TGCTACAQTCATGGCraNCCANAAAGAATTACCNGTAriTmCNNAGAAAGGATCCATC 
AAGAACTTCANAACnTrAAaAACTCANAAGTCrrAAGTrGCTGAAGCTCAAOTA^ 
GCAATCAAAAAAAAATNCANGGAGCAANGCITGGAGOCNArrCrATOCTAAGGACTOTCN 
NACAOACANTAArrACT 

SE Q ID N O: 2856 ACl 1 1 11 1 1 1 1 1 1 1 1 I'l l i AGTA TTI X:AGC AGGATCTGCTG GCAGG GTrrTnTG 
TTTTATTTOTTTGCTrATTTTTAAATTAACTGTTTTGAGCnTO 
AACX;CAATTTTCAATTATGrrGGCTTTTTATAAAGCTTGAGTTATGTAAGATTTA^ 
CTA(XAAGAT0ATTGCCTTATTGAATAGGTCACTATTAAAmCTnAAATGTIX3ATAT^ 
TOTGOAAACAACaJTAAATTCTACTTAAQTOTAAACAACGCAAGCCTCAGACCAGCAATAAATTA 
CTCAGTTroGATAACATTATTTTGTGCAGTAATCAAATTTGCCAAAGCrrrATCTG 
GTTGAGTAAAAATAAAGGNATTrrAATCAATGOOTCATCATTTNOCTNAATTAATCmAAGAANG 
GACTmCAAGGCAATTAAAC^m•ANAAAACTCNAACCITGGACTTG^TAAAJ^ 

SEQ ID NO: 2857 ACATTTAAATTTTTGGKKnXKJTTraTGTTTTAAAAGAAA 

ATriXK:CTCAAGTTCnX3GTCGAAATGCTrACA0GAACTA0CTAATAAACAAAAAACAAGAGAAG^ 

GCATTCAAAATACTGATTrACTTTGGTAGCAAATGOl 1 11 I C I 1 IGAAGACTAATGAAOATAGACA 

AGACCCATTAAGGTGAAGTGGACTATTTCAAATATTCAACAGTTTACATAAAAAAAAAAT^^ 

AOAOCTAAOCOTCTGTATCCACGGATAGCAATGCAATACCTAGTrrATGATGACTTGAAACA^ 

CAAATGC(XATAAGGAAAAAAGCTGTATTITATCTAAATTrACTrTCAGCAATAGTrCAGTAACAT 

TTTCTrCCAATCAATACTCCTCTmATAAG>rraTNCTGaGACCAACAAGTTAGGAAT^ 

AAAAAGTrA(XGmTCAGTn"AAATAACAGGACCTTGNTN(XTTA>n"GC^^ 

KrGCATTrTAAAAAGANCTGGOCGGCCCCTAOOOONO 

SEQ ID NO: 2858 ACOCOGGGAGCOOAAGTAGGAGCTCrCAGAGGCTAAGAAGGTGGAGACCOG 
AOAAGCTGTGAGGTTCTTTAGCGTCACCTCCCTCACrGGGa^GCATGGa 
TGTOGGGTTCCAGAGGATCTGTTAAATXKiTTraAAGOTTACAOATACTCAOGAAGCra^ 
TGOCCCTCCAOTTCCTGATCCCAAAAATCAGCATTCCCAGAGTAAGCItK^ 
CCCATCTCCAGOAGGACCAGGGAGAAGAOGAGTGTTTTCATGACTGCAGTOCCTCATTTGAGOAG 
OAOCCAONAGCGOACAAGGrrGAGAACAAATCTAATCAACATGTGAATTCCTCTGAACTAGATGA 
AOAATCCTANTAAACTGGAAAAAACATGTCGGATOAAGAGAACX^AGAAAAGAAGAGAAGAGAC 
NCTrACTAANGAGOAQGOAATGACNOTTAAAANQAQATTrTTNAAGCraAAAXriT^ 
CTCAAATGGCCATCTNCTCCAAAG 

SEQ ID NO: 2859 ACXAAGGGATXjGAAGAAGTAAATATAGCTCAGOTAGCACTTTATACTCAGGC 
AOATCrCAGCCCTCTACTGAGTCCmAOCCAAOCAaTnCIT^ 
AGCAOCGACTGOCACTGCATITCATATCACACTGTTAAAAGTTOTGTrrTGAA^ 
GTTGCACAAATTGOGCCAAAGAAACATTGCCrrOAOGAAOATATOATrGOAAAATCAAGAGTGTA 
GAAOAATAAATACTGTTTTACTGTCCAAAGACATOTrTATAGTGCIXrrGTAAATGTTCCT^ 
GTAGTCTCTGGCAAGATGCrrrAGGAANATAAAAGTTTGAGGAGAACAAACAGOAATCTGATT 
GCACAGA0TTGAAaTTATA0CCGTTX:ACATX3CTTTCAAGATGCCCAATrACrAAAAGC^^ 
TGTTTnAAAAACTATTGAANNrmCACCAATNCTrAATTG>n'AAATNAm 
NNC>QTTCTONTrAAATTGATITn"CAATTNATTTTTTm 
ANNTNTGAAAAAANAAAAANCCNTNGTGTGGTNTIt>AAACNm^ 
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OTTTATAAA 

SEQ ID NO: 2860 ACCrGTTACTGCTGCTACrTCCrCGTTOACACCITCCTGOAATCrCTC^ 
rriTGAGGAAATACCTACn'AACAAACATGACTOACnTCTIXXKnTATATCTGC^^ 
CCTGTCAGCAAACATCGGAGAAATIXjCATGCGGCAACITCAAAGAACAATAAT^ 
TCCAAGTTCAATATTCCTGAmAAAGGTTGGCACOTTOOATOTCTTGGTTGGCrrOTCAG^^^ 

ctgctaaactggatgcatttctagaangagtggttaanaaagtagctcaatacatgoctgatgta 

rrGGAAGATAGCAAAACAAA^f^r^CAACAAAATClXr^GG^^•AAT0GAGNGOC^T0GCTA^TATATA 
ACAAGOCTCCAGOGGACT 

SEQ ID NO: 286 1 ACAGCCAACGGrrrOCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTOTGGTrcAGGGCCAGTGCATATTAGTGGACAGCACTTAOTAGCTGTOOAGOAAGA 
TGCAOAQTCAOAAaATOAAGAGGAGQAGGATGTGAAACTCTTAAGTATATCTGOAAAGCGGTCTC 

■ CCCTOGAGGTGGTAGCAAGGTTCXIACAGAAAAAAGTAAAACTraCrGCTGATGAAaATGATGACG 
ATGATGATGAAGAOGATGATGATGAAGATOATOATGATGATGATtTTGATGATGAGGAAGCTGAA 
GAAAAANGCCAOTGAAGAAATCTATACOAGATACTCCANCCAAAATGCACAAAGTCAAATCAGA 
ATGGAAAAACTCAAACKITfrcACCACCAGATCAAAGGACAAOAATCXn^ 
TXriTAACNCCNAAAGQACQm>m}TNAAACTTTAACAAAATCAGCAGT^^ 
TCCAAANGGAGCCACTNTTAhTTTITITWAAANATmrcGGATACCCAAONT^ 

■ GGGAGCrmTAAAAAN 

SEQ ID NO: 2862 ACrrrGCCTACGGCAGCAACCTGCTGACAGAGAGGATCCACCTCCGAAACCC 
CTX;GGCGG<XTTCrrCTGTGTGGCCCGCCTG<>.GGCAAGAAGGGGTTAAAAGTGGAATGTATaTT 
GTAATAGAAOTTAAAOTrGCAACTCAAGAAGGAAAAGAAATAACCTGTCGAAGTTATCrGATGAC 
AAATTACGAAAGTCCTCCCCCATCCCCACAGTATAAAAAGATT^mTGCATGGGTGCAAA^ 
ATGGTTrGCCGCTGGAGTNTXX^GAGAAGTTAAAAGCAATAOAACCAAATQACTATACAGGAAAG 
GTCTCANAAGAAATTGAAGAO^CATCAAAAAGGGGNAAACACAAACnxnTrANAAC^^ 
AATATATCTAAGGGhnrCTm'GGCTAATATAAATTTTrAANCTTGAAACAC^ 
TTGAOCTTTCAGCAGNOCTTGAAGAGATNTACrTGG 

SEQ ID NO: 2863 ACTTTGAAACTCATOT TGAOAT mACX:CriCTCCTCCAACCATTT^ 

TTATXjGAC TGGG ACTCTTCAGAAATTCTOTC'J "J "riCr t'CTGGAAGAAAATOTCOCTCX:CTTACCCCC 
ATtXTTAACTTTGTATCCTGOCTTATAACAGGCCATCCATTTTTGTAGCACAC^^ 
TATATACCCrGGTCCCATCTTTCTAGGGCCTC 
AACTnTACCTAOCCCGGCTAATCATGGAAGTGTOTC^ 

1 ■ I'll 1 1 ' I' I IJ riGGCAGAGTAATOTAAAATTTAAATOGGGAAAOATTTrTAATArrrAATACTAANCT 

TTAAAAANAAACCNGCTATCNTrGCTrmiATUITOATGCAAANNCm 

TOCCOGGNCGGCGGNCCNAANGGCOAATTChfNCNACTNGGNGGarGTm-ATG 

SEQ ID NO: 2864 ACTATTACCAAAGGGGTATrCTGGCAAAAGTGGAAGGTCAGCGCTTGGTGTA 
TCAGTTTAAAGAAATGCCAAAAGATCTTATATATATAAATOATOAGGATCCV^GTrCCAGCATAG 
AGTCrTCAGATOCATCGCTATCITCATCAGCCACTTCAAATAGGAATCAAA 
TATCTrcAAGTCCAGGGGTAAAAGGAGGAG<X:ACTrCAGTTCTAAAAC^ 
GCAAAACCCAAAOATCTGTOGGAAOTTGCACACCATCAAAAAKnTGANGANAGNaCANCCACG 
CAlWTTTCCNTTTmACCCAGTITITCANACTGGTTAATGOTAOT 
TGAACCCNANOCGGGCTGAGCXTrCCAAAAATNGNOOCNACmmjCTmv^CTAAA^ 
AAAA ACAAN TTGCGAATAACAGTXXXCGNGGCCTrCCTGNOANGGGNGACGTTGCATTANC^ 
CCTCNTTTTAGTTGCCTAAGCATrGCCTGGCCrTOCTGNCTAANCT^ 
TGGACATTCCANOGAGTTGGAAGTGAAGCrmGATrGTOAAACCTTTTGCT^^ 
GGCGOGACCCCTAAGGGNGAATTCCANC 

SEQ ID NO: 2865 ACACATGCACATCAAAACACTNNAACTOAATATAOATGCCATTACATTATTT 
AOTTACOTTACAAAGCAAATOGCAOGTTCATAAACGTTOTTCTATTATGTATCAACT 
ATArrCAAAAAAAAAGTrTTTGAANACTCATGGGAGTGGAATGTOCCCACATrAGGAATAAAOCT 
TTTACAGGACCACCTGTCTCCAOCrGOCTCCCANOOACEAhrrGAAAACAG 
AGACX:AAGATGGG<nTGNTAATAATTTTCAATTGGACmCTAANCTTATTC 
A>ICGOOAGGGNCNAAa;CmATAAATNGGCCCa::ATGGGGTTOGa>rrTGGGCNCNAAA 
ATrraONAAG 

SEQ ID NO; 2866 ACAGCAATGAAACACCAAAGGGACGTTTTCCTCCAAArrGTGTATAAGCTTG 
TTTGATATCACACAGCGCTGTAACCAACTGCrCACMGOTATTGGCTrCCTOATACTGTA^ 
CCTTTQAGCAATXjAGCCrrAGTrCATTAGTCAGAACATTAGCATX^GAAGTTATGCCrGCC^ 
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OCAAGCCATOTCCTCATTGAGTrrATAAATTITnX>GAAAAAAAGAC^^ 
GATGTTGCGTCTXrrCTGCTGCAAGCAAAACACCATO^mGCTTAAATTC^^ 
TGTTCAATANOCTTCCATOOATATTCAACTTN QGATA AGCGACCTTrTGGANAAATATAGNGGGCC 
TGGGNG<::ATAT>rmX/AAACATGGTTTCTGAACmATrmAGTT^ 

ANACCTGGNAATNOOCACGOGGCXXrmACTrCCCCGOAAAAATAAACCNCCNCCGTCACCQAAT 
ACCCX3TTQATCCCCGGGGTACCTIWGGCCGNOAAaXXjCTr^^ 

NGGGGCGGTAKTAAOTGGGATNCCAGCCNCGGANCAAACnTGCGGGNATCATGGaCANANCTG 
TTNCTGGGGGAAAATTNGTNCCCNC 

SEQ ID NO: 2867 ACATGGCAATTAGAAGTTGTCATGGTAAAAGAAAACCACAGCrGGCCTGCCA 
CAGCCAACACAAGAACCAGAAAATGGTAGATOAAATGAAGGAATAAAGGTGGOGTTTATrCCTTA 
TTATAAAAGAAAAAAAATAATICITCAGCAGTCTrAACAAAGACATCAAGATACAAAATrACAAG 
TGTTTTOACTCCAGOCCTGTCCCCATCTCCTCCAAGAGCAGAGGTAGGAGACAGr^^ 
AAOCAATTCTOTNAAAATTACCTANAAACCCTACAAATTTTGATTOAAATCT 
TIXI>riTITITAAAAAATrrrAATNTCAAAAGGCCTGCmANGTGNC^ 
ATAACCCCCATCCCCCCnTITGGCAGTAACATOCTCAAG>rroANCCaX:CACTC^^ 
CCThrrGTGGGACAAGTOGGG>riTTTTrAAACAAAGCCCXXraAGAATCA^ 

GGTTNGNTTGTTANTAATCCNGNGCCA Cl 1 1 1 I CCANAAGGGGGGGNGGGGCNAAAACCCCAAAA 

GGGOTTTGNGGGGCCTTNTCCX:CNATNCCCCTTTAAAAAACAC^ 

NGAAGCTTTrrrnTAATCCCANCT 

SEQ ID NO: 2868 ACl I'm i'l'lTl 1 1 1'l 1 i'l l'l 1 1 1 i riATATCACAACATCGnTATTATOTOAAT 
TTTn-ACAATACAAACAAAAAATACAGAAATGCAATATATGAATACAOCTAAATGCAGAATOGTO 
A UI J ri ' i I CTCTTCAAGAGGCCATGATrCCCAmCrAGTAAAATAAAGAGACTGCATATAGGTAC 
AAACAGGTTGGTCATTAGCTTCACAATTTKjCCTANAAATGATCTAT 
CTACTTACCATAAAGTGTAAAAAGGGAOTTAAAGOAAAGTTIXXrrTGm'GGGTCC^ 
AANANCTN■i■l■lUlHU■l■^i'AGC^WGGCCCAT^^^TTTGGGAAAAT^mT 
AATQAAGCCGTNATGAGATrCTGGTAAAAGAGGGaXTAATCNOAATTATATITm^ 
CAAAACAACNAACAAAAAQACCNTGTT^a■AAAAAAAGCT^^'ANGGCCCCAOTAGCATATAGGGG 
1 1 1 1 n 1 ri CCATGGG Ori 1 i 1 \ l AANAAAGGAAGNGGCAAAAAATAGGGGCAACTOGOGmJAA 
ANAAATNAlWTT^^TGGGTTTTGCGCCNr^GAATGGGAACCNTT^GGm^ 
NGCNNGOTTGCnCNNNTnTGGNA 

SEQ ID NO: 2869 ACAAGOTOTTrrOCAOCOTGCCTCAG<>AAATGGAAAGACGAT(nTCAAC^ 
TGGCTCTCCTATGTGGCTITn-GTAAGAAGTGGGCTACTAAAACTCGACTrAGCAAGGTArrCT 
GCCATGTTGGCGArrCATTCCAACAAACCAGCTrTGTGGAn'ATGOCAGCCAAA 
AGATCGATTGTCnTCAGAAAOCGCAAGGCAACTATTTCTrCGCOCACTGCGC^ 
CCCAAAACrn-ATAAAGAATACTTTANGATGOAGCTGATGCATGCTGAAAAACrGAGGAAOOANA 
AGOAAAAAhrrTGAAAAAACCAGATGGATGGGGAGAATCCTGATTATrCTGAAAAATCCTAA<^^ 
AAGTn}GCATGGATCATTTCAAAAATTCTGTAAGCATAArrAAAGGGG<>NAAT^ 
CTTrCGATTGNACAG>rrATTraNTTrrOCCAAAGATCrACAAAAQAAAATn^ 
CT^^TNACAAANATGATCCTITACTTGGGANTNTGNGGCAAGGGGANAATTA;^^ 
NACNAGAAAACAGCCITCAACNAAACAGNCAAACNATNGGANGTCGQCCOQAAGGAGGAAAAG 
GGCTTNGNCTTTNTTTAAAAGGCCANTGA 

SEQ ID NO: 2870 CGCGGCCOANGTACXrcATmCATlTrCAGTGCTNNTACAAGGAAAAAAGGTa 
ATATG^IT^AA^TTTAAAATmAATTGGCTAGCTCTNGCCCTTATATGACT^^'^ 
ATTCCCAGCTTANATNAACAA^WGNNAGTAT^AOTCrCACACATANGTGCCATACAT^ 
ATGGATGTGATGCANTGAAAAGTTAGNTGCTCTCCri 1 i'l'lCl't 1 1'l'l I IGOGTOCATATTTNATTr 
CTGNAA^^^IraXKJTTAACTACCCTAAA^r^GA^^^AANAANNAACAATCC^^^ 
GGAAG 

SEQ ID NO: 287 1 ACG<X}GGGAGTCGCCGGCGCTGCAGAGGGAGGCGGCACrcGTCTCGACGTG 
GGGCGGCCAGCOATGAAOCCGCCCAGTTCAATACAAACAAGTGAGTTTGACTCATCAGATOAAGA 
GCCTATTGAAGATGAACAOACrCXAATTCATATATCATGGCTATCmGTCACGAGTGAA^ 
TCAOTTTCTCGGTnATOTGCrCTrCCAOGTTGTAAATrrAAAOATaTTAGAAGA^ 
AGATCAGAAOAACrAAAGAGCroTGGTATACAAGACATAril'lUl'rriCTGCCCAGAAGGGGAAC 
TGCAAAATATAGAATCCOCAAACCrrrmKiATCTCTACCAGCAATGTGGAATTATCACC^ 
ATOCAATTXXSNAGATGGAGGGGACrcCTOACATANCCAGCTGNTGGGGAAATAATGQGAAGANCT 
TACAAOTGCCTTAAAAATTACCGAAAAACCCTTAATACACTTO 
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SEQ ID NO: 2872 ACNCGGGGCTCATTGAACrCGCCTGCAGCTCTTGG O I'l 1 1 'l I'GTGGCrTCCTT 
CGTTATTGGAOCCAGGOTACACCCCAGCAACCATGTCCAAGGGACCTGCAOTTGQTATrGATCTT 
OOCACCACCTACrcrrGTGTGGGTGTTrrCCAGCACGGAAAAGNCTANATAATTGCO^ 

ggaaaccgaacx;actccaagctatgtcgcctttacggacactgaacggttgatcggt^ 

AAA0AATCAA(3TTGCAAT0AAaX:CACXAACACAA0TrriTGATGCCAAACCGTCTGAATra 

OCAAATTGATCATGCTGTTGTNCAGTCTGATTTOAAACATTGGOCXrrTATGGNGGNGAATGAT^ 

TGGCANGCCCAAGGTCAAGTAGANTACAANGOGAAAGACCAAAAGCTTCTATCCAOAGGAAOGT 

GTCTTTTATTNGGTTTTACAAAAGATNAAAGAAATTTNCANAAACCCTACC^^ 

ACAATNCTGGNGTCNCCAGGGGCCNGCTTACnTAAATXjGNCl'l 1 1 lAACONCANOaiTNCCAAA 

AATNmOAACT^^^■0G^^^GG^r^TNAATGNAACTTGCCCGGGGGGGNCGGTTNAAAGGGGNy^ 

TCANNACAATGGCGGGCCGTTANTATTG 

SEQ ID NO: 2873 ACrraTITGTariTAAATACmATGCCTCTOAACTTrCATAGAAT^^ 
AAAGTTAACTTCATCAATAGACGGTTAATATTAATAGAGCCACAGTGCTACCAGTAGCAAACTAG 
GTAGACX:ATTArrTGTITraCAACAAGATGCTAACATGGCAGACTTTGAAGTTGCGT^ 
AGCACCAAGGGAGGTAACTTrAAGGTTXKXAOTOGTGGATCCAOCTCCOTTAGOCTAAO^ 
ACAGCTAATGATTGGGTCTTTATTCTATATCCCCAGCACCCTAAAACAGGGG^ 
TAAATGTTGO'nGAATAAANANGTAACX:ACCTAATTGAAGC I'lT I'l' 1 If i'lCCTATNTTAACATOA 
AOACTGCATTGTTTCTCTAGGAAATCNATGAAATCTGAACCnTTmGAC^ 
NTrriTTTACA«^ANATTTGGACTITGNrcATAGGGT^ 
ACCTaNCaKK)CaKHKX3GTCOAAAAGGONGA>rrrcC>rOACrc 
GATCamG<TOXK;GACCAAACTTGCGCGGAATCNTGGCNTAACTGTT<XTGGGG 
NTTCNNTNCNAAATTCCCCCNA 

SEQ ID NO: 2874 ACAQTTCACTCTGCAAAAAATACTCCTTCTCAGCATrcACATTCCAT^ 
TAGTCCTGAAAGGTCTGGGTCrCGTTCTGTTGGAAATGGATCTAGTCGATACAGTC^ 
TAGTCCAArTCATCACATrcCTTCACGAAOAAGTCCTGCAAAOACAATCXrcACCACAGAATCC^ 
CAAGAGATGAGTCTAGGOGCCGTTCCTCGTTTTATCCTGATGGTGGAGATCAGGAAACTGCAAAG 
ACTGGGAAGrrCTTAAAAAGGTTCACAQATGAAAAGTCTAGAGTATTCTGGCTTrWNAGGGGOT 
ATNCCAAGQGATTAAAQAOOCTTNCAAAAGAOAAAAOGGATCAAANAAAGGGAAGGCCANANG 
GACAATOOGAAGATCAGGAAGCTCTAAATrACTTrANNTC>n'AAAAGAGGTCTmW 
AAAGTTTATGATmAAAAGGGGGATGACCCNGANGGAQACAGAGGAT^^^ITNGCNNGNTC^ 
AAATCAGTCCTCGCCANATNAAGGTAAAAATTTrrGCTNCrrGGATmTACCGGGA>n^ 
GGAAAGGCCTNAAOTCCTNGCCGGGCCGGGCCGNrrhWAAANGGCGCAATTCC^ 
CCOTTNCTTANTGGTTCCOANCTCGGNCCCAAG 

SEQ ID NO: 2875 ACATCAGTGAATTTTTAAATGCTAAAAATrrATGATAAAAGAATACTGAATC 
AAAACATCAAAQAAAGAAAATAGAGGCTCAGCAGCATGATTrcAATATATTTTCCTGGA;^^ 
TTTTTTTTTITrrAAAGTOTATGACTrmATCCAAGAACAAAA 

CTTGAl m i i 1 1 lAAACTAAOCCCAGAAAACTGGGGCrCATAAAATAAAGGCACAATGTGGGCA 
GCAAGAOAAAAAAGGAOAAAAAATOOGGAAAAAATTGGTrrrAATCrCACGNNTAACATAAGGT 
GGCTGGTnTG Gl 1 1 1 I GTTTTAGACAATrrAAGGCAOAATTATCI'NCCTTGCAACACTGCAGATA 
AATCCTAGNGOTTTAGCTATAAGTn"CAAGAGTTCAGATmrCAQATNAAGCAAT^ 
CTTTTrTTATTXTAAGGCTTTrcATGCATTATATAAATCTTTAAAAATTCCT 

NACATACTrCTATACCTTTAATAAAGGCNCTCTrAAOATATAAAACNAGTTTTAAAAACAATGCCA 

TTCTrACTTTAOOGNTTACCCATrCCTIKrQCCATAGOQGTGGOGGQGAATTC 

ANNCAATNTTTGGGGCCCCACAA 

SEQ ED NO: 2876 ACCAGAAGTATAAGTITATGGAACTCAACCITGCrCAAAAGAAAAGAAGGCr 
AAAAGOTCAOATTCCrOAAArrAAACAOACTTTOGAAATTCTAAAATACATGCAGAAGAAAAAAG 
AGTCCACCAACTCAATGGAGACCAGATTCTTGCTGGCAGATAACCTGTATrGCAAAOCr^ 

CTCAGGCATTGTTGGAAAAGAATTTATCGACrCCCACAAAaAATCnTGATTCC^^ 
CCn-GACTTTCTTCGAGATCAATTTACTACCACAGAAGTCAATATGG 

GTAAAAAGAAOAACAAGGATGACTCTACCAAGAACAAAGCATAATGCTGGCATTAAAAATGTC^ 
nTAATnTCCAAACATTG 1' 1 1 C 1" 1 AATACCCXrmATOTACAGGGTTGCATAAC^TTGGAATGTTT 
TAACAOCAAOAATTTTAANAAAAAOATAAACCCAl 11 n I'l l 1 1 I INTAAAAAACAAAATTTAGTT 
TCAAAATA l 1 1 1 I GGATTGNGA l 1 1 J 11 1 1 11 1 I CNCAnTNTCAGCAAAGTTAANGGNATTTAATC 
ATrATrmTGCTGCATAAAAAA 

SEQ ID NO: 2877 AC'lTlTrri ill I'l 'I'l 11 IM 1 1 1 IGCTTGCAGAATCCrAOOACOTnTATTAAT 
TCATGATGCCtlACrATCCITGTGGOAGCAAGAATATGCGGCOTrn:^ 
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catttgacaaagttttatatcagtccaagagcccccaccxx:atctocccac^ 

aagagccttaatcagaaaacacatgagaaggccaogcatgctagctcatgccrgtaatccca 

ctttqooaggctgaqgtgcgccogatcacctgaggtcanggagtttogggacc^ 

seq id no: 2878 acgcggggactcagggaggcogaggggacgcgccggaggaaagatgoaao 
actaccaggctgcggaggagactgctrrtcttgttgatgaaotxl^ocaacattotaaaagaggct 
ataoaaag<x)caattogtogtaacgcrratcaacacagcaaagtggac^ 
agtaoaacaaactttaagccaactcaccaagctggqaaaaccat^raaatacatcgtqacctgto 
taattatgcagaagaatgoagcrooattacacacagciaagttccigcttnta 

ACGGGAGCrrGCACTGTGCGATKGGGAGAATAAGAOiATXjTte^ 

SEQ ID NO; 2879 acgcgggoatcaagcctcagtccccttcatattaccctctcctttttaaaaat 

TACGTGTGCACAGAGAGOTCACCirnTCAQQACATTGCATTTTCAGGCTTG^ 
ATXXiACXAATGCAAGTGTrcATAATCACmCCAATTGGCCCTQATG^ 
AC TCCTX jGACTGTGACTTrCAG TGGOA GATGGAAGTrmCAOAOAACTGAACTGTOQAAAA^ 
ACCTTTCCTTAACTTGAAGCrACTTTTAAAATTTGAGGGTCTXKjACCAA 

OTTTOAA GTCA AGATGACAGATAAGGNGAGAGTAATOACTAACTCCAAAATGOCTTCCTOG^ 
AAACGCATmA AAGA TTTTTAAAAAT^^^roTCANAAGATCCAGAAAAAGTC^AATITNAT^ 
AATTTAATAAAOCTTrrOmjNCGGAAATTGAATACAACAGAACACTGNTCTT^^ 
GACCTCrGGCCGGGACCANCCCITA 

SEQ ID NO: 2880 ACAAAATCCCCnTraTTGAAAAATAAGGGGCTTTCTAAACTAATAAAAAAGG 
AA Grrrr TCAAAAATrAT/.GTTTATTAAAACAACTTTTITGG<:^^ 

AATTTTATACTGTGAATAAAATrcCAAATOAATCri riVl'lAAAACTmTAAAAAATTATGTGCCA 

OTOTATACTAATGCTATAGATTCTTOTCTTAGAAGTTmAAAGCATTCIXjTTAATG 

CAATGGGACltX:AAAAATATAGTCAATAATCATGATAAAAAATTATATATGATTATCAAOT0AAG 

CAGOTATTOAGAAATAAAAATrCTCACTTGCTCACTGGCAATITCTrrCTAACAGA 

AAGGCCTGGAGTAATTCAGACAGATAGCrGGTTATGGNGAATrATAATAATCTTCATOAGGOCAG 

AGCTAATTAAA ACTA GTAATTOCTTAAAATCAAAGCCA'nTCTGGACATATAAAATGAGGAGATG 

AATCTGOAAGGTTTTCTTTTTGTAAACCTCrrrcCAGGTTCTTAAAAGaX^ 

CAATCTTGATCCTCCATTGCATATrAATGATCCACTTAAATG^^XK3NA^ITTAA^ 

CCATTTOfTTNTTATAATAATTT 

SEQ ID NO : 288 1 A C ' 1 1 1 1 1 1 ll Itl 1 n ri ' ll 1 1 1 1 1 1 1 IN ACAGCAAGATAAAATOAATCAATnT 
ATTXXAATTCTTCAAAATTTATACGTAATATGTTGTriXXIA^ 

GnrrATTATTTCAT f 1 J ' lf J ' l l OATAG lTJTrfrrri CAT CJ i I ICJ MA NGO" J Jl- ' I I CAGTANAANC 

CANAATCTTGAGTTGCCCAGTTAGGAGCCThn'GACCTGCTATTCrOATTAAGTTTC^ 

CATOGCCAACATCTGOCTTCTAAAOQAAAOGCrrrrOONCTTrrCAATCCA^ 

GANACTGCATTTOTTCCCATACCCCATOACCTAATTTAAAATCAATCr^ 

TrnfTAATCAAATTTACAATCCCCTTITCAGCncmm^ 

CATGTAAATGCTGAACCCIXjGGGQNGGCNCAAGAAAAGTNTCAAAA 

SEQ ID NO: 2882 ACGCGGOA TCCAC AGCAAAACAAAAATAAGCTTTTATTTTATTAATAATTTCG 
TTOCTCTTGTGCCCAATCAAATCrmAGOAACAAACTGCAAGAAAAGCTAAG^ 
OAACTAAATACAGACATTGCrrACTTGTTTIXjAAGAGGGTr^^ 
OTTTTCTGATATGCCCCCITrCAATATTTAGATAmATrrGTT<^ 

GTTCTTATrCCAGATTCTGGGCAGTGOTCTOTGAGTAAGl J 11 i'l 1 CCTGGG ATGN AAAOGG AGCA 
AGCCCACTTGNCACTAAAATGAATTGGGGTGAAATGTGCTCACTTGGGACTCCA^^ 
CTGCTaXAAATTGCCATGCCAGAAGGNmCOGGAnCTTNmCTATCACCTCTCGCT^ 
CAiVrCTTGTTANAANGGCATGCCTrTTX^ 

ATTTTAAAAATGCAATAAAGGNGGCAAATGCATTGGATGAAAAAmCTCANNGOTrAATCTOAN 

AAATrTTTGCATONTGOGTTAATTGGGGGCATTCITlTAATTT^ 

GGGGAAAAANCnrmTA 

SEQ ID NO: 2883 AC"]4N'1-I117 ril l-ri'l-rJ-i-riNW-iNOGNAATnTGAATGTATTTTTAAA'nTAT 
rT TTTCA AAATAATGACATTAGTAAAAATTrTACATAGCCTGTATTOAATTCACACATTCAAATC^ 
GGCTITACCAGTAATGATGOGOATTAATACAGAGCTAGNGTrrGGCATTrcACTTTATCT^^ 
AGCTAACTGCTCAANGAATTACAOAAGACTCATACTCrnTrTATTTTITCCTC 
AAAAGCTTTACTAAJ^ATTXjACATATATATTTACNCCAAATTTTACATTTAGTONA^ 
CCTCTAGTNOCTCAGTTAACATCAACANGAANGCTTCAAAAAOATOATTCTOA^ 
AA liTfniTl 'ATTGG 
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SEQ ID NO: 2884 ACATaAATTAGAA<K;GTGCATCTAGGATTATGCKX:AAACTGrnTAAAAATO 
CAGAAATGTAAAATTACATCTrGAAAATATGAAGAGATGGTCTACACACTTCAAAA^ 
TGCTTATACCAOAGATGTATGACAATCACGGGATrCAAGTGACAAGCAGTAAGATCTCAAAAATT 
AATACTGGTCAAAGATAATGGOAATATITnXJCAmCACTOAAAATACATTOACTACTAOAATAC 
GAAATCTAGCAGGAACTCAGGGAAAAAAATTACAAAATCTAAAGCCAA'n'ACTTAATAi 1 lui lA 
TTACCTAAACACX^^GCATGACATTNACAGAAAACTIGCCCTGCATTTCAATrc 
TAATTAAGTTCTNCAAAAATGAAATGCAACCAAAACCGGGGTTCAAACTCXXAAATGAAAAGTCN 
NGCVU^GGCTCAAArrAAAACATGGGAATGTTTCAAATGAAAAN TNATr AAAAACACTATTGGTn 
TGGTCTTTTATrGAATXKKlAATAAAATGTTGGCA r r IXJ I'll GAAACTTTGOAACTACTTAAAnTAT 
rrTTrcAAATGGGTCTNCA'i-ri-l"rUCriAANGGGNGOCAANGrr 

SEQ ID NO: 2885 ACTGATGCCATGOATGTATTGATGGGAAGGGTTATTCCAGTCAAACTTGGAA 
TAATTGGAGTAGTTAACAGGAGCCAGCTAGATATTAACAACAAOAAGAGTGTAACTOATTCAATC 
COTGATGAGTATQC J'r i4 C I' I CAAAAGAAATATCCATCTCTGGCrAATAGAAATGGAACAAAGTAT 
CTTOCTAGOACTCTAAACAGGTTACTOATGCATCACATCAOAGATTOTTTACCAGAGTTGAAAACA 
AGAATAAATGTTCTAGCrGCTCAGTATCVU^GTCTCTTCTAAATAGCTACCGGTGAACCCG 
ATAAAAGTGCTCrmACTCCAAmAmCCAAATmGCCACAGAATATTGTAACA CTAT^^ 
GGGACTGOCAAAATriTrroAAACTTCGGACrATG<XGGGGTGCTAGAATTrO 1 ICAT 

GAAACTrrmGGOjAACCCTrANAATCTGGTGATNC XrrXKjNGG GCTT^ 
0ACT0CCAT^IAAAAANCTACTN0GNCCC^0GGGCCOC^T^mONGCCT 
TNCTXjGGGGAACCGGCNAACCAAACGNTTAGAAAANCCCACCNCCGCTTTGGGGGAACT 
TTGGGOGAATGCCAAGGGTCCT 

SEQ ID NO; 2886 ACrrOCAAOTGTCTTCATGTGOTCTITCTaXAOTTTAACCAOC 
GAGAGCTOGCAGGTCTGAOTAACCCCAGTaACTATTCrrrCCACCTTATAAAAAOT 
ACAOTGCATTAGCTQATGACAGCAGAGGGTOGCAGGGCTGAGOACOCAATA'n'C A'nTC CCAGGC 
TGGTGOAGAGTGAGTGAGTATGGTTCCAAAACTAAACAAGGGAGGTCAGAGGCTCnTCCAACCT 
TACCTQATGGCTIXTOGCCAAAGCAAGGAAGTGTCATXIGAAGGTGTTGGGTGGT^ 
AAAGGAACrraAOGAGGAOGOAGAAAAACAGCACCCCTXXJATAAAOOTGGOGGAGAQAAAATO 
GGGAAAGCCCTGAATTNCTCACCCGAGGCCAATCTCAGAGGGGGAOTGGAGGGGGTACAATCTAT 
QCTrrCGaCAAAOGCOGGAAOTGQAAGAACCCTNGGGACAGGATGAATGGTTrGGAAAGGGGCA 
TNACTTAAGACTTAAAGAAOCAAGGGGGNCCAGCCAAAACACCCbn^CACACCTANCA GNATC CC 
CCANGCIXWGACCTGAAACCTTTACAmACrmAACAANCOCCCCAAGCTNA^ 
TCAAGG 

SEQ CD NO: 2887 ACTGCCTCTGTAAACAGGGCCAATTCACATCArrCCACAGCGTGGCCATGrTC 
AATAGGTTTCACroCAGGTTTrCAAAAAGCAGATrCCACATCACCCATCTC^ 
CAGCIXrrGAATAATATTCGATTGTCACACGGCCGTTCTCrCTGCCAAATCCTGAC^^ 
CCAAAGGGCAACTCCACTGGGCTGACGTTATAGTTGTTAATGAAGCACGTCCCAGCCTGAAGCrc 
AGCrACCACTCTATGAGCCCGTTGQATGTCOCTGGTAAAQACCCCAGCTGCTAGTCCAAAAGTC 
ATCATTGGCTCTITCTAAACCTCAG<nTCAGTGTCAAATGATAAAATGGA^^ 
TCIXrrrCTTCACACAGGTCATGGTCGGCTCTGCAATTAGTTAATCACAAC^^ 
ATCATTTAATTTGGGATCTrCAGGGCCCTCGGGCCGCGAACCACCCTTAAOGCCOTACT 
ATCCGAGCT 

SEQ ID NO: 2888 AATAAAGAACCTCTATCAGTGAGACn-CTCATmATAGCAAATACATnTTG 
CAGCTTAAATTTTmGAATTtATATACGCTTCTGTCATTTAAACAAACTTC^^ 
CTCTATATAmAAGTAACAAATITGACAAAATACATAmATACATATATAGATXTCTAATATAA 
ATATTAAATTTGAAAAAATCAAATOTGAAGCAGAAACTaCTATACAAQTATATrGNATAATATTT 
ATTTrATACATTAAAAGTATTTGGG1TGAATATCrrCAArrAGG>m'CTAAAAAAACACCATTATC 
TGCTrCTTAGTAAAnOCGACATrCTTGAAAAGCGTCTGAAACGGGTATOAACrrCAACT 
TTAAATCAGAATTCTGNTTOGTCTTCTCAACTTTTATC^ 

GGGAANGGACAT^^ACAGGNGCACTATAAACATGT^^TNGGCAGGAAAAACAGNA^T^AT^CTTC 

TCANCTmGOATITCCCAANNN^^^C^^^CTCAAGGGAATGGGTT^^T^ 

CTTAACCTTNAAATTrcAAANaXACTTTTAACCAGGGG 

SEQ ID NO: 2889 ACTACATATTTCAGCACTAAGGCGGTTGCTTCACnTrATATCTATATAAAAAA 
AGTGGTAAAAATCTTITCCITrrGTGCAGTTGAACCCATCCTACATTCAOATrCTCn^ 
TAAAATACTrATrTGGTrOAQGAAGATTTAAGGCAAGTTCGGGCCCrTCCAAAGGCACrGTGAGA 
CTCCCCCCCCACTCOCCOTTATTGCTACATOTCmATACTTO 

AATAAGCAAACACTrTmACTACTrrATAAAAGTTGGAATTAGAAAAGCA TGCC ACATTTAANCT 
OATTNOCAAAGOATGTGGCATTTTTTTTCTTGAAGTTGGANGGGCTACAAC^^ 
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GAAAACCTCATKGGGATGGTrrCCTNAAACTA>nTrCOCAAGCATCAAA>m:Qhrrrr^^ 

GAAATCATNGCATbnmCANAATrTNCGTNAACAGGGGAAGGAATTAATGAAATAAAANTT^ 

TACAATC 

SEQ ID NO: 2890 ACCATAGAACCCCCATCACTGTCTAAGCAGCCAGATITGCAACATGCGTO 
AATGrrCTrcAGCGTGAAOATCXXAGTrTGAAAGTGAGGCTAGATC^ 

TGTOTGGTATGGGGGAGTTACATATAGAGATrATTCATGATCXjAATCANGAGOGAATATGOACTG 

OAGACXrrATCTOOGOCXn'CTCCAOaTOOCATATCGAaAGACCATCCTAAACTCAOT^^ 

GATACCTTAGATAGAACTTTAGGAGACAAAAGGCATIXTTGTaACCrGTANAAAGTCGAAAGCNA 

AGGCCCATTTGAAACATTANNNTOTTATOCCTGNGATTTGAOTATGCTOAAAaTATTC^ 

GCCTTTTGAANGGCTTNCAAGAGGCCATTTGNAAATGOGAATnW 

CAATGGTTTGGANCCAA 

SEQ ID NO : 289 1 CAGNaGCGCCOGGCAOGTACGCOOGGOOGCTGOTrrCOOOAOTGCGTCTCTA 
GAGGGATGCACGTTGCX7ITAGCCGAGCrta3GAGAGAAG<XTGATATGTAACCCACGCAGQT^ 
AGCCTCAGTCTGTOGGGCTX)AGGTCTOG<>TCTA(>AAGCCTCrrGGCCGTGTT^ 
CCTGGAGOAQTTCTCTGCTCAGCACAGCCAAOQAACAGAATTAGAAGAAAAGGAACCCTCGC 
AGGCAGGTOACAAACATTACCACCCCAGCTGTXSCACGATGCAGCAAATGCCACCAGATGTTACAA 
OAAAGNAAAAGQAAATNTITTTITAANa>fraXm)TTT0OCNrCCC^^ 
AACOGNGGAAAAACTGNNGGCCAAAANTAOACAATTGAGAATCCTGGATTACAAGGATITAAC^ 
OCCOTTCCCAAGGGTCAAAGGNOATTTATTQACATTGACGTNCAAAAACNTT^ 
TTTCTACNCTTirXKKjCrmTGATOAAAACAGGGAGA 

SEQ ID NO: 2892 ACTGGAGATGTATTTGATAACCAAGGrrTrrAGGTAAATTTTCACCAOTATTAG 
TrCTATrrGCAAACTGAAAAATGTTGTAGGCTTAATATAAAATAACCACArrAOTOAACATTATAT 
CTCTTAGAAGAAAGGCCATATTITGCrCCTGCTTCTGTAAAAATATTAmQTrTGAAGGGGAAAT 
AATGGTAOTGTCACCTITCACTrAATrCCrACTCCCTTAATGTGAGAGAGACAAAAT GAGCT 
AAOGAAAATTCraOAOTTACACrCCACAACCTTGACATCCTGNCCGGNCAi 1 11 1 IGNTnT 

SEQ ID NO: 2893 ACATTCAGGAT(XCTCGGCCAAGGACTGGACCAGAAGAACACTTCCGAA^ 
TOGGTCCACTTATCAAAGGTGAAGTTGGTGATATCCTGACrOTGGTATTCAAGAATAATGCCAGCC 
OCCCCTACTCraTOCATOCTCAT^AQTGCTAGAATCTACTACraTC^ 
CTGCCAAGGTGGTCACTTATCAGTGGAACATCCCAGAGAGGTCTGG<XCTGGCCCAA 
CTTGTGTTTCCTGGATCTATTATTCTGCA 

SEQ ID NO: 2894 ACmGGCCTCTCTOGGATAGAAOTTATTCAGTAGGCACACAACAGAGGCAG 
TIXX:AGATrrcAACTGCrCATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGT^ 
TTAATGrCCAOTCGTOTCCCITGOCCGAAGaTGATGOOOGGGOCACTOTAACTCraTrG^ 
TAAGrrGCAAAATCTTCAGGTTGCAGACTGCTGATGGTOAOAGTGAAATCTGTCCCAGATCCACrG 
CCACTGAAOOGTGATGGGACCCCACnTGCAAACTGGGATGCTGCATAAATTCANGANCTTrANG 
GGCCTITCCTNGTTTrCTTGGTGANACCAAAAT 

SEQ ID NO: 2895 ACTTT Nl 1 ll lUl 1 1 1 1 i 1 1 1 U 1 1 IG GONTTATGCAACTITATTGAAGAAAAA 
TAAATCAATrACTAGCAGAGCTATTAGTTOATCACTCATC<>TrcACAACTTGCATCATrrA^ 
GNGCTATATTAAACAGNGTATTGGAAGATAGATrAACTAATAGCTCCAAGCCrCCTAACAATTTA 
AATGAAAATTACAAAATGTTTGAGACCCTATTTTGGAATACAAAOGGTGTTraACTTCCAAT^ 
ATTCTCTOTAGAACAAGAACAGGNCj^TTCCTTrmTTGGGC 

SEQ ID NO: 2896 ACGCGGGGGGGCraGTTTCCGGAGTCCGTCTCTAGAGGGATGCACGT^ 
TAGiXGAGCTTCGGAGAGAAOCCTGATATGTAACCCAGGCAGCTGGGAGCCrCAGTCTGTCGGGC 
TGAGGTCTGGCATCTACAAAGCCItnTGGCCGTGTTCTGAA(^^ 

TCAGCACAGCCAAGGAACAGAATrAGAAGAAAAGGAAOCCTGGCCTGAGGCACGTGACAAACAT 

TACCACCCCA0CTQTGCACGAT0CAGCN0ATGCCXn>4CNT0NGGNTNCANAAAAGGAAAGOAAAT 

TTTTTTTITCANGm'GNbWCCGTTAGGCCACNCCCCATTT^ 

AACTGCNGGCCAAANAGTCCCAATAANTATCCTGGATTANAAGGGNTTAACAA^ 

TNAAOGCAATTTATGACTTTGGAACOTCCAAAAhmTATTNCCrOTOAGCX> ^ 

CTTNTTGATGANAAANAGGANANACAGACCCTTGGAAAAGTTTNCNAAGAAh^^ 

ATTANNCANAAAGOGNTCCTTCGGCCONOAACNCCCn'AANOaNOGAATTCCACCNNCTO 

CCGT>nTAANNGGTTCCCAA(>frGGGNC 

SEQ ID NO: 2897 GGTACAGAAGATACACAAAGTOOCACCAGCCTCACTAGCACTCrcATTTCTC 
AAGAGCCAOAACATtKrroAATACAQTCAAOTrCAOTOTATTTGATAATATCXATITGGGCCCC^ 
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ATCTCTCTCACAGTCTTCCTGGGAGTCTATTCrrCTrCXrAAAACTC^ 

GGTAAAGAACTTGGCCGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTITAGGAGGCrGAGOC 

TGOAAGATTGCrrcAOCCTAGGAOTTrOAAACCAOTCTGAGCAACATAGTAAGACCC^ 

TCTAAAAAACAAAATAAGTAAAAAGGACTGTAGGAGGCCAAGACAGGTACTGATmAAAAACT 

AATAACTTAAAACTGCCNCACGCNAAAAAAGAAAACCAAAGTGGTCCACAAAACATTCTr 

CTTTTTGAAGGGTrTACNATGCATTnm'ATCATTAACCAAGTC^^ 

OCATTCNAACAAACAGTTbnTAGACCGGTCmCNCCmXjATTAAAAANGGGGGNGNCNi^ 

AGGGhn^TATATrTCATTTTCCrnTTCNCCnTrrGGGNA 

SEQ ID NO: 2898 GGTA Cl 1 H U 1 U H lH 1 Tl 1 1 ill 1 i ogagtcttgtgttttactaatggaaa 
AAAAAATACAGAAGAGGTmGTTCrCATGGCTGCCCACCXjCAGan-GGCACTAAAACAGC 
CGCTCACTTCrcCTTGGAGAAATATTCTTTGCrCTITTGGACAT 

GGTTTCX^OCCAGCTGGGCACACTTCCCCATGTrTGTCAGTGAACTGGAAGGCCTGAACrAGTC^ 

AAAOTCTCATCCACAGAGCGOCCAACAGGaAGGTCATTTACAGTGATCrGCCOAAGAATACC^ 

ATCATCAATGATAAAAAGGCCXXn-GAAOGAGATGCCTTCATCAGCCTlTAAGAa:CCATAATCCT^ 

AGCAATGGTGCGCTTOKjGTCTGATACX^AAAGGAATGTTCATGGGTCCCAANCCTCCTTOTr^^ 

AGGTGTATTGCCCATGCTAGATGACAGAAGTGAGAATCCNCAGAAGCCCAATtXTTG^ 

TTCTTAAATITINTGCCTATACTGAAACAATNATTTCNGNGG^ 

NNAAAAAAAAACCACAT 



SEQ ID NO: 2899 ACTrAAGTAAATCATGAAAATTCTACTTGTAACTATAGAAGTGAATTGTGGA 
CGTAAAATGGTTGTGCTATTIXKLVTAATGGCACTAGGCAGCArn"GTATAGTAACT 
ATTCATGOCTAGTOATOTATAAAATAAAATATTCTITGCAGTAAAATATTCCCmGTIA^ 
AGAAGGGGGGATACAAAAAGGAACTAACAATTTGTATOOCAGTCTCAGATATTTTrATr^ 
mCCTGTmGaTTTATTIXK;ATCTrAOAAOAOCATAATaACATTOTITaATGA^ 
CTGGACrGTTrrGACCTGGTTTAACCCTTCTGATAGGTAGTTGTGQATGC^ 
ATAATCTITGCCTGGAGTOACACTACACTCTAGAATTTCXiACTrTGGAGAATACTCAG 
TGTGATrCCTGATAGAACAAGACTTTACTTTICTAOCCANATTGATCTANAAGCAGANGAATCC^ 
CCCTTTTAAAAGTGGTATGTGGTTTCrmAAAACTCCGOTTT^ 
GCOCA 

SEQ ID NO: 2900 GOTACACCTTGAAGGCGAGGrrAATTAAATCCTGTTGTGGAGTTTGAGGGCC 
GGAAl'rTAATnTTGGAGTTTrATTTAATATCGGGAGCAGATTGGGTAATAAAATGTATATTGA<^ 
ATAAGACGGCCTTTTOACCrmAGGOTCTAGGOCTOTAAAOTGTCTCAGGGTTGC^ 
GCCATGAACTGGGCTGGGTTmATATrroATGAAAAAGAGCXn-AAACGCTTCTGAT^ 
AGAAAAAGGAGCATTAACCTTGACTATGTCTTTAGCTCCAGCCACCTrTTTAAGAGTA^ 
GGCAGGTQGQOGAGGACTAGTCACGG/VACXiAAACTOT/U^GCCGGACCAGGTGTGAGCAGGGGAG 
GTGATAAAAAGATTACAGGGTGGAGOAGTCGACCTGANGAANAATrGGGACCTAACTrGt^^ 
AOAAGAAOGOAOAGTCANATGGTTrOTAOAAAAOAAOATAACAACTCACACCCTGGGTrGGNTA 
GGGCAGTGOAGGAAAAGANATTNACAGTNCTGCXIAAGCAGAGGCGGTTNAA 



SEQ ID NO: 290 1 ACTGrATmCCGCAAAAGAAAATTAACATrTAGTAACACACTAATOAATITT 
ATCTOCAAAOAGATTAOTOCACTOGCAAAOTATTCOOATTACAGTOGAGAOCTGTGAAC^ 
TACCGAAAACACCCCAGCCTGGCAGATCCCAAGGTAATOATGGCCTTCACTGA 
CAGAGOTCCAGAAAGAACTTGCAGCATOATCACCTGCCraCTOCACTTO 

ACTTCAAGGAGCCAAAGAGGAGTCCCCAGTGGGCAAGGGGAGCAGAGAAGAGAAACGTGGTAAA 

TGTAAACCTAAAAGACACTAAAATGOGATACAATCCCCATTCAAATtXX;AGAGATGTGGGCAAGT 

CCCCAAAAGTAGTTGTTAGATrAAAAAACnXKjGCACACCTCACTrCCCAAAAAATATAATA^^ 

OAAATTTXXiGGGATACTX3GAAAAATAQAGAAATGCTCAT0AAACCAACrCTIX^ 

AGG>m }AACCATCrTimATTCAGCACOAGCCmOTTmACTNOQTOC^ 

CTTTNG 

SEQ ID NO: 2902 ACTCTTGATGAAAGACCGTGAAACCAACAAATCAAGAOGATrrGCTTTTGTC 
ACCTTTGAAAGCCCAOCAGACOCTAAGGATGCAOCCAOAOACATOAATOGAAAGTCATrAGATOG 
AAAAGCCATCAAGGTGGAACAAGCCACCAAACCATCATTTGAAAGTGGTAGACGrrGGACCGCCTC 
CACCTCCAAGAAGTAGAGGCCCTCCAAGAGGTCITAGAGGTGGAAGAGGAGGAAGTGGAGGAAC 
CAGOQGACCTCCCTCA(XGGGAGGACACATGGATOACGGTGGATATTa>TGAAT^^ 
GTrCrTCX;AGGGGACCACTOCCAGTAAAAAGAGCACCACCACX:AAGAAGTCGGGOTC^^ 
AAGAOATCTOCACCTTCAGQACCAOTTCGCAOTAGCAOTOOAATOOOAOOAAAAOCTC CTGTA TC 
ACGTGGAAGAAGATAGTTATNGAGGTCCCCTCGAAGGACCGTGCCTCriCT 
CAAAAATATGGTTTNTCTAAAACACTATNAGCCAGATACCCAGTCnrnr^ 
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CCCGA 

SEQ ID NO: 2903 GGTACrri'l'l 1 1 1 i i 1 1 1 1'rrin'i'ri'ri'i'lNGGCCATTTGCTATGTTTTATTTTGC 
TAOTAGCTTATTAAACATAACATOCAAATAATCAAAGU^OAAACATACATOACrTANAGTOAAAAA 
TAATTCTAGAAAAGTTTCACTAGGTAAOTATGCAAAriXnTAriXTrAAAA ATACTT N^ 
ATOAAGCrrCATGTATTTTGCAATATTCTrGGCCTCAATATCTACeACCTATTT^ 
ATTTGAATGTATCAAGATAATTTGGNGCAAGAGAGTAACATCCATATGTATTTAATCCAAGCTTTG 
AGGAACATTAAGATTTAAGGATTATAAAACTTGGCTGATn-CCATGCAACCAGTAAAAGGrnTGC 
ACATNATTTGACAOTAOAAATNAAAAACACTAANTTrACAAATAAAQCATTGAQTTTGATGTCTA 
TTCNGGGATATGTGTGNGTTCITGTGANGAAANNGCCTCCmANCl 1 1 1 U 1 1 N AAAAAAATNAA 
TG^^^^ACAAAACATTTCCN(>GATTTAAAA^f^CNTGGGNCG^^^TAACC^ 
TNTNGAAACNGCCCCCCAAAAA 

SEQ ID NO: 2904 TCGAGCGGCGCCCGGGCCGGNACTCOTCCrarrGGCCGANTXjGANACTGGTG 
TTCTCAAACCCGGNATGGfn^GNGNNariXKrrcCAGTCAANGTTACAACGG^ 
GAAAATGCACGATGAAGCrnGAGTOAAGCTCTTCCTGNGGACAATGNGGGCTTC^ 
ATGTGTCTGTCAANOATGATCTNGCTTGGCAGCTGTTOCTGGTGACAGCANAAATGACTCCACCAA 
TGGAAGCANCTOOCTNCACTGCTCAAOOTGATTATCCTQAACCAArcATAGCCAAATAANOCGCC 
GGCTATGCCrCTGTATTGGATGTGCTACANGGCTCACATNGCATGCAAGTTATCT 
AAAAOATTOATCGCCGTCTOTOTANAAAGCTNONAGATXKXXn'AAATTChn'GANGTCT 
CTNCCANTGGTNAThnrbn^CCTGGCANGCra*ATGTCITmt3AG 
GNGTNGCrmCTGGTCXXjAGATATAAAGAAAACAGGTGCriTT<KK}NGNTTT^^ 
AAAAAAO>riTITONAGNTQGCNANGOTTC(XCNCTTG<XCCAAA 

SEQ ID NO: 2905 GGTACCGCGGGGCTCrrCCTGCTCTCCATCATGGCGCAGGATCAAGGTQAAA 
AGGAGAACCCCATGCGGGAACTTCGCATCCGCAAACTCTGTCTCAACATCTGTGTTC 
GOAOACAQACTGACXJCGAGCAGCCAAGOTOTTOGAOCAGCTCACAGGGCAGACCCCTGTOTTTTC 
C:AAAGCTAGATACACrGTCAGATCCTTTGGCATCCGGAGAAATGAAAAGATTGCrGTCCA<^ 
CAGTTCXjAGGGGCCAAGGCAOAAGAAATCrTGGAGAAGOOTCrAAAGGTGCGGQAGTATGAGTT 
AAGAAAAAACAACTTCTCAGATACTGGAAACTTTGGTTTTGGGATCCAGGAACACATCGAT^ 
GTATCAAATATGACCCAAGCATTGGTATCTACXjGCCTGGACTrCTATGTGGTGCTGGGTAGGCCAN 
QTTTCAACATCGCAOACAAQAAGaJCAGGACAGGCTGCATTGNQOCCAAACACAGAATCAOCAA 
AAAAGAGCCATGCGCTXKjTmCAGCAAAAGTATGATGGGATCATCCriXnXKK^ 
GTTTTATTCCAAAANANCAATANAAAGTTTTAOTGGAA 

SEQ ID NO: 2906 ACATAGTOOTGCCCXXrrGATAGGACATTQTTAQCATAGAGGTCCTTCCTQATG 
TCAATATCACACTrcATr.ATGCIXjITGTAGGTGGTTrcATGGATG<^ 

AAGOATGGCroOAACAGGOTCTCTOGGCAGCOGAAACGrrCATTTCCGATGGTOATCA CTrG Ca: 

ATCAGGCAACTCGTAACTCTTCTCAAGGGAGGATGAGGATGCGGCAGTGGCCATCTCATTrrC^ 

AGTCCAGAGCTACATAACACAGTrrCTOCTTGATGTCCCGGACAATCrCACG 

OGAAGGAATAGCCACGCTCAGTCAGGATCTTCATGAGGTAGTCAGTCAGATCTCGGCCAG^ 

TCCAGACGCATGATGGCATGGGGCAAGGCATAACCXTCATAGATGGGGACATTGTGGGTGACACC 

ATCTTCAOAOTCCAACACCOATOCCCAGTTOTGCCGTCCAGANOCATAAGAAOAAGACAGCCC^ 

GCCTGGGATAAGNCCACATACATTGGhriNGGGACAArrcNAAAGTCTTCN>^ 

GTCANTrrTrrrrCCCGGGTGGGNCCTTNGGGGGTTAAAGGGGGGGCC^ 

SEQ ID NO: 2907 ACACGTTCTTGTrGTCTQGCTCGGCAACAAACACCACTTan"GGC^ 
TGGTTATGGAGTCGGACAAATGTCTXKJTGAGGAGTGAGTTCAGCACCAGTGTTCACA 
CTGGAAGAACAAGGCGAAGTTCTGGTGOCTGTCTGCGATQAATGTGCCCTTGGCTITGGCTG^ 
TGTCA(XX;GGGTAGTTTTGOOTGCAATGCTCTGATCCrrATCCACGGTGG 
GATGCCAACTTCAGTGGAGATCITGACTCrGAOCTCrAaKn'ATTTGCAATATAC^ 
TTCAACTTCGACAAGGAAGTCATAATAACCACTGOAAAATTrGACGTTCATGAAATTTAQTTCA^ 
AACATCCCCTACAGGGGTGAAGGATGTCTTCTGGAGGACAGTGGCTCITGAAGCAACA AGAT TTA 
ACATGTTCTAATrrAACAATOGCCTGAGTCAAAAGCTTGAGACAAAAACAATrGTGGACnrCAAC 
CCCCAAGANAACCTGTTtATTAAATGGTNGGAAGCAGAAACOCTTTGGhWCC^ 
GNCGNGGNGGCGAATNTTCCAAANCCCACCACCTT 

SEQ ID NO: 2908 GGTACGCGGGGAOAAAGGAACACAGTAAACTGAATTGATCCGTTTAGAAGTT 
TACAATGAAGTTTCTKn'AATACTGCrCCTGCAGGCCACTGCTTC^ 

TCTACAAGCCTGGAAAAAAATAATGTGCTAmGGTGAAAGATACTTAGAAAAATITrATGGCCTr 

OAGATAAACAAACTTCCAOTGACAAAAATGAAATATAGTGGAAACTTAATOAAOOAAA^ 

AAGAAATGCAGCACTIXrrCGGTCTGAAAGTGACCGOGCAACTGGACACATCTACCC^ 
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ATGC\C<X:A0CTCGATGTGGAGTCCCCGATGTCCATCAmCAAQOQAAATGCCAOGGGGW 

GTATGGAGGAAACATTATATCACCrACAGAATCAATAATTACACACCTGACATGAACCCGTGAGG 

ATGTraACTACNCAATCCWGAAAGCrmCAAGTATNGAOTAATOTACCCCmOOAATC^ 

AATACCAGCTXKn-GACATTrrcnXKSrriTGNCCGNGGGCrATCG^ 

GGATCTAOCNTCN 

SEQ ID NO: 2909 ACTGGGATAAATGAAGAAGAAGGCATAAGGACAATAAACATOOAACTCCAC 
TOCAAATOQATTTrATGCAGCTGAGGAAAGTTKKKjCTTATTAGTAT^ 
AGTTITCTCCATTGCGGACAACXn'AACrACXAGCTXXTrGGCTX:^ 

GTTCCCAGTAOGTTCraTCATTATTOTrGGCACATAOGCCCTGAATACAGGTGATATAGGGCCCCC 

ATGAGCOCTXXrrCCArrOTGAAACCAAATATAGTATCATTCATmCTGGGCmCT 

GAGGAAOACAGAACCATTrAGCACAGTOACATTGGTGAAATATGTTTCATTGAnCTCACAGAGT 

AATTQACOGAGATATATGATrGTOAGTCAGOAGGTOTCACAGTTATAOGCTCATCAGCGGAGATG 

TTGAAGTTACCTGAAGCAGAGACGCAAGAAGAOTCTrrGTTAATATACAAGAANONCnTrrc^ 

CANGOCAGOTAAAACCTGOOCrOCAACGTrrGGGAATGCTGAATGCTOCrrGAGAAAriTCCGN 

ACCWCTOrmCNAAGTTTGTTGGCAATrCNG 

SEQ ID NO: 29 10 GOTACl 1 1 1 1 1 J 1 1 1 1 1 1 1 n ri n n i l riGNCCAGAGAGCAAGnTATrrOGT 
OAATOCTOACOOCAAACATCATCCAAGAGAGACAAGATGGGAAAGTTGCTGANACAAGAAAGCC 
TAGGGAAACTTTAGOCTAGATACAAAATTCACACAGGGAAAOGCACGGACTCTGGGGAGACTGG 
GAAOGTCCTCAGCCATTCAGCACCATOCOOACOAGCTCCGCCCCATGGGCCCGTCACCCCGACAC 
GATATGCCTCACAAACGCrrcATAQTTGATACAACXlATTGCTGTar^ 
CTCTACTICITCCTCTGTCATXnTCTCACCCAG^ 
ACOOTGCCATTIXrrrCTTGTCAAACACCXXSAAGTCCTrCGACATAAT 

CTrGTTCrrGGCCCTOTCTGCAGCATGGGCAOAAAGTCTCAAAGTCACACCn'CA(>TrCATW 

NCThnTGGGGTCCCAAOAOTACCCTCGCOTTOQAGGOThrrGGCANGCCTATAATC^ 

TG 

seq id no: 29 1 1 ggtacagtattggaaatggatctgtcrrrggtaaagatcagcctataatrctt 
otgcrgttgoatatcacccccatoatgggtgtcxrroaacgototcctaatggaactgcaaga 
gcccrrcccctcctgaaagatotcatcgcaacagataaagaagacgtroccttcaa^ 
tgtggccattcttgtgggctccatgccaaoaaoogaagocatggaoaoaaaagarrractgaaag 
caaatgtgaaaatcttcaaatcccagggtgcagccttagataaatacgccaagaagtcagtt;^ 
gtrattgrrgtgggtaatccagcx^aataccaactgcctgactgcttccaaotcagctccat^ 
cccaagoaoaaotcaottocttoactogtrrogatcacaacccgagctaaagctcaaattt^ 
taaacttggtgtgactgcraatgatgtaaagaatgtcattatctgggggaaaccatt(xtcgact 

AATATTOCAGATGTCACCATGCCCAGOTOAAATTOCCAOOOAAAGOAAATIXjGTGNT^^ 
TCrroAAAAANGACACTGGGTTCAAOGGAGAArnT 

SEQ ID NO : 29 1 2 ACCCAGTAAAAAOCAGAATGACXXATrGCCAGGACGCATCAAAGTTGACTTT 
GTCATCCCTAAAGAACTTCCXrrTrQGAGACAAAGATACOAAATCCAAGGTO 
TGACCATGTTAGGTrTAATATTTCAACAGACCGACGTGACAAATTAGAGCGAGCAACC^ 
AAGTTCTGTCAAATACATTrcAGTTCACTAATGAAGCCOOAQAAATGGGrrOTGATTGCrc^ 
OAGATGQTTTIXjGTTTCATCAAGTOTGTGGATCGTGATOTrCGTATG l I L' l I CCACrrCAGTGAAAT 
TCTGGATGGOAACCAGCTCCATATTGCAGATGAAGTAGAQmACTGTGGTT(XTGATATGCrCTC 
TOCTCAAAOAAATC^iTOCTATrAGGATTAAAAAACTTtXCAAGGACAOGOTTTW 
TTCAGATCACCGTTTrCraGGCACNGTAGAAAAAGAACCACTITmC^ 
AATAAO0CAAAGAAAAGGAGNTGGGATXKnTNTONTATATGACT0OOQOOAACTG>r^ 
OCCAGNN 

SEQ ID NO: 2913 ggtacactgaaacataaatcogcaagtcaccacacatacaacacocggcagq 

AAAAAACAAAAACA0CAAOTITACAT0ATCCCT0TAACAGCCATGOTCTCAAAC7tl\GATG^ 
CTCCATCrGOCAAGTOTtiTTX:rnMATACAGAaCACATCGTGG<^ 

GCTGTOGGTCCACAGAGCACTCATCTGGCTGOGCTATGOTGOTGGTGGCTCTACTCAAGAAOCAA 

AGCAOTTACCAGCACATTCAAACAQTGTArrGAACATCTmAAATATCAAAGTGAGAAACAAGA 

AGGCAACATAATAATGTTATCAGAAAGATGTTAGGAAGTAAGGACAGCrXjTCTAAAGC^ 

TOAAAAGTAGCTTGCCAGCTrcATTTCmGGTrrCTI^GTAGTGGGCCC^ 

TGAGGGTCTGGTTCATGGATCATATAATGOACCCATCCTGACTCTTGTTOAAC^ 

CATTCAGATTCANACArrAAAATGGGGrnTANGGACCCAOCTroQ<nTrm 

OAAATOTTGATACCTCAAACCTCCTCNTNGNNGGATTrTTNCC 
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SEQ ID NO: 29 14 ACrGAAAGAAAATGTCATGCTWAOTCACAAAAGGOGGGCTCTCCTCCn^ 
ATTGTmOOOCCCTGCCCAGOTTCAGCACraTCCTrACTAOCAATGTTCAGACCATTTGCAATGGA 
ACCGTCAGTGTGGCTTCCrCTCCATCCTTC^GTGCTACTGCACCT 

GTTCACAGCTGGTTGCTCACCCACCTOGCACTOTAATCAOTCAOTTATCAAAACTCAAOAAACAA 

AAACnrrTACACAGGAAGTAGAGAAAAAGGAATCTGAAGATCATTTGAAAGAGAACACrGAGAA 

AACGGAGCAGCAGCCACAGCCrrATGTGATGGTAGTGTCCAGTroCAATGOATnAmCTCAGGT 

AGCTATGAAACAAAACOAACTGCrGGAACCCAACTCTrTTTAGTTAATATACCAAAGC^ 

AATTGTTGGTAATTGAACATTTCAATrATATGCAGACTGGCTGATTCTAAAATAAATrCT^^ 

GGTTCTAATTTTGTAATTO0T^UAAATAAA00TAATTTTOCTTrGTTQANGJ^ 

CTGTTCTCTTTGTATCTAAAAGG 

SEQ ID NO: 29 1 3 CGTACTTGAGTCAAAGACGACATTrAGATTmCAGCTTTGAAGCArrrAGTA 
ACATCATTGTCTAGTCAOOCAGAGOAAAaAAATTCAAAAOCACl r 1 0 ITl'I' lU 1 ICTGTCAGTTCC 
TTTGCAGTAAGATCXiATCTTCTCACGTGAQAAATCArrAATAGCGAGACCT^^ 
GTCTTATTCrrGATTACAACAGGGAATGAGTAGAGCAGATCATCAGOAACACCATAGOAOTroCC 
ATCAGAGATAACACCCATGCACACAAACTCTCCXn'CTGGGGTTCCAAACCAGATGTC^ 
GGTCACAGATGGCTTTTGCAGCAGACATGGCACTGGATAGTriTCGAGarrTGATGACAGCAGCO 
CCACOCTOCTOCACAOTCOTOACAAATTCTCCmGAGCCAGCrGTCAT^^ 

TGGGrmCCCCNAAAATGACATIXnTrACATCATrAOCCGOCCCCCCCAGGTrAANAGCAATTGGG 
CCTTACTCGGGTGGGAACANACAAGTNAACN 

SEQ ID NO: 2916 ACGGGGGGACTCTGCITCXGrnxn-GGTTTTGCTCTAGTGTTTGGG 

CCGCTGCTCAAGATGAACXGACTCTTCGGOAAAOCGAAACCCAAGGCrcCGCCGCCCAGOT 

TGACTGCATTGGCACGGTXjGACAGTAGAGCAGAATCCATTOACAAGAAGATTrCTCGATTGGATG 

CTGAGCrAGTGAAGTATAAGOATCAGATCAAGAAGATGACAGAGCCTCCrOCAAAOAATATGQT 

CAAGCAGAAAGCmOCQAGTTlTAAAGCAAAAGAGGATGTATGAGCAGCGGCACAATCTTGCCC 

AACAGTCATTCAACATGGAACAAGCCAATrATACCATCCAGTCTrrcAAGGACACC^ 

GTTGATGCTATGAAACTGGGAOTAAAGOAAATGAAGAAGCATACAAGCAAOTGAAGATCGACCA 

GATraAGGATTTACAAGACCAGCTTGANGATTNATGCAGATGCCAATGAAATNCAAGAACAC^^ 

AGTCCAGTATTGCCCCCAACTGGTGAAGATATTTAAAACCAAAGTTGGATCNCTAGGTGATGACTr 

NTOCrGNTAANANAGnrr 

SEQ ID NO: 29 1 7 GajTGGCGCGGCCCOAGGTACOOGCCAAOOTGGTCCCCrrt^ 
CTGTGTATGGGCGGAGAAAATCCAGCTrGTTCTTGCTGATGACGCAGAOGTCA^ 
AGCCCAGQTCGCTOAAOATGCCAOCTGCGATGGCTTCGCTCACCAGAT^ 
CCATOTCTCGOTAAACTTATCTTCAAATACAGCCATTGCTGCCAAGOAGCCAGu^ 
CATAAGGCAACTTATCAGTrGATCCATOAGOATAGATOCTOTAGAOGTOAGGTCCAGTAACATCT 
ACTCCCCCTAAAACTAGGCCTGCACCAATGTAACCtrOATAOClXjAAAAGCATCTO 
CGATrGGCTGTCACAACTCTGGGAAGACCG(XAGTGGAGAAGGAGTGGAGCr<XANGTTOGAAG 
AAATGANCraGGTTGTCATGTCTGTGTCrcCAGCTGTOOCAGCAC^ 

ATATGAAGTGT N till 1 GAACAAOTT^aTGTAGCAACNACCATCCCTTCAATTGC^CTTGT^^ 
TCCXAAAACTATTNChrnxnTOTANACCACCCCNNCOAAGGNGGGGC 

SEQ ID NO : 29 1 8 GGGACACGTAAATCTGTTCAAGGAATCATTGTmKK:ATrGAAAATAGGAGA 
GAGATCATGAGAAATTACCAAAATGTCCACCAATAGAATGAGTCAATCACCTGTGGTATATTrAT 
AAATAAAAATATTAAATAGAAGOTAAAACAAATCAGACATACACACATTAGATAGATAAATTTAG 
AGGACAAAATAAGGAACATGAAAAGCAAATAOTAGATGGCAAATXjTAATATAATATACTATGCA 
TAAATTTTAACAACACACAATATTATATTOCrn'ATracCCAATAAOCAGAAOCCATAAQaTC^ 
A C I l 1 1 KJI I ATAATGCAATAGAATAAAAATGACAACAAAGTAAAAGGOTATCCCAAAAAAAAAAA 
AAAAAAAAAAGTACACATTTCAATCTTlXTAATACTrAAAAGTAATrGGTAAC^^ 
GAATATGTCTrcCAAGTCACTGAATAATATCAAAAGGC^^m^AACCCCA^^ 
TTGGAAAACCAAGGAAAAAGCCNGGAAGCCTrAAAAATGGGGAAGCCTTAAAAT^ 
AAAAAAAAAAAAAAAAAAAAGTrCCTTGCCNGGONOQGCCGTTTTAAAAGGGGGAA^ 

SEQ ID NO : 29 1 9 ACATGCrCCTGCAGATCTGTATTTTOCAATGTrGTTAACATCAGCAAGCTGGC 
AGTCTACAACCTGTCTTGTATAATGTTCGAAGAGAGGCATCXTCCAGACACGGTX^ 
TaCTOOCCTCOAAOAGTTOTrCCAOAGCCAGGATOAATTGOTAAAGACCCXAGTGGCACCTGA 
CCCAAAGCTACATCCATGGCACCTOTTAAGGTGGOGGCATTGAGGATGACCTTCGGGrrA^ 
GTGTGCGTAACAGAGCGCATCAGCCAGTATGAGCCTCCOCTCAQCATCAGTGTTATCAACC^ 
GGTCrrCOC ai 1 1 1 I GGCTCTAACAACATCCXXCGGCnGTTGGOCTIXjCCGCrGGGCATATTTTCA 
CAAAGAGGGG<XAGACCrATAATATTAATXKKK:AAATrAAGCTITGCAGCAaACACGATGGCrc 
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GCATATAOTTQCAOCTCCnXX;ATGTCAG<XXrrCATCAGQ^CCATATTrGCAG>^ 

AT CCAC CCCTOT<XAANGTAATTQnTIXX:AACAAACACCAAGGGNGGGTCGTrTC 

OCrnTAONGAAATTNCA 

SEQ ID NO: 2920 ACTTOQCATCGTCATCCCCGCAAACATCACrAATAATrAAAGAGCAOTrCCC 
GTOCrcATCGTAGTCTATCTGGAAGTGGCGGGACTCCCTGATTGACrcGTCA^^ 
AACCTCGOaOTCTOGOTATOOTCAATCTTGCAGTCAAATCTAGCAGCAC^ 
TAAATCGCGAATGGTCTTAOAGAAATAGGGrmACATGAGGCTTTTCCrCAGCAACAGCCTCAAG 
GAAAGCTTOGGACACATCTrCTTCAGATTCTAGTITrTCIX^ 

TGTTGAGQATrrcCTGCCACTGAGCCCTGAGATCATTGCCATAGAGGACAGTCTrCCAAT^^ 
CACAGCATrGOCCGTmCTGGAAAATAGACACraAGGGTTGGACTCAGGCGTTOTCCCTGOTTNC 
AGCTGOGTAOTGCGAOOaOCTOCAOGOAGTAAAGCCCACCCCAGAGAAGCAGCTG-rTCCTGGAA 
CAAGGACCTCTGAAGCCGAGGTrhm}CCAAAGAACGAAOeAACCTTTm^ 

ccttggcoggac 

seq id no: 2921 ggtacqcaogggctgactctctntcggacrcagcccgcctgcacccagotg 
aaataaacagccatgttgctcacacaaagcctgtttggtggtctcttcacatggacgcac 
tttggtgccatgactcggatcx;ggggacctx;ctttgggaqatcaatcctccgtcct^ 
ctccctgagaaagatccacctacgacctcaggtcctcagaccgacgagcccaagaaacatctcac 

C AATI TCAAATCTGGTAAGCAGCCTCTTTTTACTCTCTTCTC^ 
ACTTTCTCCTTTCAATCTTGOCACTACACTTCAATCTCTCC^^ 

TTCTGGTAOAGACAAAAGAGACACGTTTTATCCGTGGACCCAAAACTCCGGCGCTGGTC^ 
TGGGAAGGCAGCCrrcCCTKKSTGTTTAATCArrGCAGGGATGCCTC^^ 
TCAAGGGTGTCAGACCACGCAGGGACAACTGGCTIXKiTCCTTCCXXTITACAACAAGTCCGC^ 
TGGGGAAGGGGCAAGTACCTOC 

SEQ ID NO: 2922 ACTl 1 1 I Tl 1 14 1 1 1" ["l I ■l-riMni'iMIX}CTmCTGATOCTTTTCATTATCACAGA 
ACACACCACCTXiTAATOTGTGCAAAAAGGAAATCAGGGGTGGAAGGAGAGGAAGTCTAATTGGG 
AAAG GCr<K ?ATGGTCACTrCATTrCrrCTC'i'riCi 'IC I'l ATCCmTCATTCATGOOCAGGATCATC 
ATOAGTTrTCTOAAQ ACAGT AATGAAATCTAAGAAGAGATCAATGCAGTGCCAGATATAATCTTG 
ATCTCCATOTTCGGCCTTTTCAATAATGACnTGAGTATCAAAAAGGACGAAGCCAC^ 
CAGTCCCACATACAGOTTTOCXrraOAAAAGCCAAATGGATCCAAAQAAAACATrcCCCAGGGAAG 
ACAAAAGCAACAAGCrCAGGGCTGACATCAAGATACCrCCCAGAAAGAGGTAGCTAOGGCGCCT 
GCATAAAGTGCACTGAGGGTGAAGCAGGTAAAAATCATTOCCOTGCCCATOAAAAOCATGGGAA 
NGATCTGOGGTTGACANCAATCNAAACTrCAGGGGGGGGCCAGCCAACrrCTGTAGGAAGCNATC 
CNCCAGAAGTCCAACNTTT 

SEQ ID NO: 29 23 G GTACATAGTGTCGCGAACTCAAATCQQCATrTAGATAOATCCAQTQAITTA 
AAC OGCACGT TTTrGCTrATAAAAA AAGT GCAAAAAAGATGTGGTTrACAAGTTAAAGCT 
ATCCCTTTITGCTGTAATrGCACCAGTmAAACCCTCTGGAC^ 
TGrrmCTTAAAAGCTTACAGTOTTrOGCTAATTCTCCTCCCCTTTTrACAAGAC^ 
GGTOGACACTGOTGGCAGGTTAAGGGATACrcTCACTrrAAGAAGCCTOCAGAT^ 
CATGGAGAAATrAGGGGCrGATTTmAAACT0TGTX3AGATATrAACCAGCX;0GCCCTGTTATA^ 
ATCAGGAAATCCAAACAGCGAmACACCGATTAACACCCCXrmATATATTTTTTACAAAAAT^ 
ACTGAGAAAATAATCAAACGTmCATCTCTCTTGNCTTTTITTGGT^^ 
CTACATTTAAATATAAAAAANTAAAAGTTAAACTCrACCCTIXiAGTGGANGAGACGT^^^ 
NGGNGACACACCTCCCCAAAAAAAAAAA 

SEQ I D NO : 2924 CGAGGTACl 1 1 1 1 i i l i 1 11 U i l M'l I riTl'l'CGGANGOTTAAOGAAmTCTT 
TATTrmACAAATTAANACTATCCAGATTTCATATATTTCIXiAAT^^ 

AGTATaAGTrA>UATGCAGCCrOAGCrcAAAATCAANAAACTAGAAAAGAAAGNGGTAGAGATA ~- 

ACTATATTAAAAATCTOTTAGOTA l I llCin AAAAGTAGGNG l H i 1 IJ HI 1 1 ICI 1 ICl U 1 i 111 

TTrAAANATCTTGGCATTCTNOGCNCTGTTraTAAANAAAAACTCamNT^^ 

CANAAGGGGGAGTTTXXjCTrCAANAGATGlTAACTCAAAATCTrAOGCCTANCANAGAATCACC^ 

AAmATGGAGAGTTAACAGGGGTTTAACAGGAAGGAAGTCCCTTTAGTAAGTTCTCAAGCCAGA 

GGCTGGAGGCAGCAGCTAAATCAAAGGACAGCCrcCTCAGTGAAAGNGACCCTTCNGGGNGGC^ 

TGTCNCTCCNGGAATAAACNCAACTTAAAAACAAATGATTCTAGOATACCCAAGGAATGGGCNCr 

GGGAACCTGNGGCCCTGGGTNAAA 

SEQ ID NO: 2925 GCGTGGGTCCCGGCCGAGGTACGCGGGGGCACAGCCAGAGCCTAAAGGCTA 
GAGCCGGAGCrGCCOCOCCAGTCGCCTAGCAGGTCCTCTACCGGCTTATrCCTGTGCCGGATCTTC 
ATCGGCACAGGGGCCACrOAGACGnTCTGCCTCCCT C l 1 l U l I CCTOCGCTCTTrCTCTTCCCrCT 
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COmAGTrTGCCTGGGACKTrGAAAGGAGAAAGCACCKK}GTCXK;CCCAAACCCCn 

CCCATCACAAGTGCCACTACCGCCATGGGCCrCACTATCrCCTCCC^^ 

AGAAGCAGATGCGCATTrrGATGGTrcGATrGGATGCTCCTGGCAAGA 

CTOAAOTTAGGGGAGATAGTCACCACCATTCCTACCATTGGTTTTAATGTGOAAACAGTAGAATAT 
AAGAACATITOriTCACAATATGGGATGnCGTGGCAAAGATAAAAATAAGCCTtTTCT 
ATTACTTTCAGAATACCCANGGNCnTATTITGNGGNAGATAGCAACGATCGTGAAAGAAATC^ 
AAGTANCCATTAGCTGCNNAAAAGCnVrOGOAOAAOAAATGANAAAA 

SEQ ID NO: 2926 ACACAATGAATTGCTmATTTCGGTATGCATCCACATTTCAGCATTTAGTGG 

tcctgaacagcaagtggaaagacgcagcaatitgccaggaggtcaagcccaccaattrcgoogat 

ctgctgtgcacaocgoatrccttctraatcccixkngaggatctt^^ 

aacx:aagocatgcaccggattcaagottctttitgttccagttgtcag^ 

atogattgcaaggatgaccaaatoaaagccctomaaaacrrcttcaattmaaaaqc^^ 

caottacaggaaagtaaaacaatrcaoagggatcatgtgtgcrracaagtatcttcatgcggtc^ 

tctccaagttcaaccaa^aggactccgagagctggcaggtctoagtaaccctggtqactarr^ 

ttcaccitatcaaaacxrraaocraaaaacaatocatcaactgatgacagcaaaagggtgg^ 

ctganqacccaatattcatttcxlaagcttggtgganaatgaatgaatatggtr^^ 

aaggoaggggagangntntntcaacctr 

seq id no: 2927 cgaggtacam'gtgatocrcgagcaatixkktcnxjcttnagagggtgcccag 
agcrccttgcaagaagmacxiacaagtcratgactttgaaagaagccatcaagtct^ 
atcctcaaacaagtaatgoaogaoaaoctgaatgcaacaaacattgagctanccacagtgcagcc 
tggccagaatttccacatgttcacaaaggaagaacttgaagagottatcaaggaca'ntaaggaa 

TCCTGATCCTCAGAACTrcrCTGGGACAArrrCAQTTCTAATAATGTCmAAATTTTAT^^ 
TanXrrTNCTTOQAAAATCriXXIlATTGTATGTGCAl 1111 lAAATCATGTCTGT 

SEQ ID NO: 2928 CGAGGTACTrn ^ r i l i i U rrri - il - i - l - ri^ - rri - lTC TTAAAOAATGCTTTATTAAT 
ACAAATACACACAAACTCTGAAOCACTAANAAATTTAAATATCTATGTCACAOCAAACAGGTGGC 
AATTCAACATCCAGGGTCGACAGAATGCTrGAAGGANACTOCV^CANArrGGAT^ 
AGAOGGCATCrTCACAGGTGAAGGGGGGCCCAOCTGAAACAOCITrTCAAGCT^ 
AAGQATCATGANAGGCACTCCACTCAAGOGGAOGNGCGCAATCTGGTGCrCTTO^GGC^ 
AACTCTCAAAGTCTAGAGGATTGAAGGGAAAGAATITTTCTATTnnXjG 
CAGGAACAGAGCTrTTrccrn-AACAGTXnTCTCAONCATCn^^ 
TTOTn'GAGGGGCCCTTGGTCTrrACAAACTnTCTGTACCrCrC^ 

AATANCTTTAAOTAAGmXKiNGGGGCATCAACGTrrrGCCAAAACNOOGOTGNAACCTGAA^^ 
CCNTTAAGGCTTTANTGAAOGC 

SEQ ID NO: 2929 A LU H 1 1 m m 1 1 1 1 i I CCTGrrGTCCCAGATTTATrGAAAATAATACAGC 
ACTACAOAAAAAATTCAAACAGGTCCXXGAGGCGTmGAAATTCATCOCAACrOT^ 
GACCTGAAOGTTOGACAGACTQCCGAAOTCCAAAAGCTTCAGCATTrCCTrAGTGTC^ 
TTCAATAATCTCXTGATCCAAGGCTGAGACCTCAGGAAa^TAAnGNCTCTCCn^^ 
TCCTGCAGCTTGATCGAOATACCTCrrACTOGOCCTCTCTGAATTCGCTTCATC^ 
TAACCTGCTATCrrcTTGCGGAGCrTTrrGCTGGGGATAAT^ 
TCGTGTGGAAGTCOGTTGCXXIANGCGCCGTGTANTACCrC ' 

SEQ ED NO: 2930 OGTACl 11 1 1 ill 1 ii l i 1 11 11 M'i'i 1 11 11 i INGGCCOTTTCCACACCTGCCCT 
TTATTGGTCT^m^TANCANAGNGGCTCCAGGCCCT^CAC3GCCTO^ 
TTAGGAAGGNGCCATCATTCTOTGAAGGCCCANAOCTTACCCAAGTCT^ 
CACCAACCANAOGGTTGQGAGAGGAAAAGGAAACAGGCNGAGGGGAAAGGCAAGGCrNTGTAG 
TOAAOOGGACrGATATCAAGGGAATGCTGAGOTCCANCAGTGTCTCCTGAAOQC^ 
TAAOGCTCCTCAGQACTQOATGGAGTAGGANATCTOTGTGTTGAGCAGTTCACATCTATATGGCA 
ACirrAAGGAGGCGCrrGATGTCAGGCrCAATGTTGATGOTTGGGAAAGNGCOKr^ 
GGAAANGGCTCTCCCrrCOOOCCTATOAAAGAAACTTCTTTAAAAGTrCCACG^ 
CGGGGCACAAAGGCmCAAAAAGAATGAAACTTNGGGAANCGGNCATTGANGGNAAAAANGGT 

SEQ ID NO: 293 1 GGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGOOCrrCTATTTCWAC^ 
GATGATGTOOCTCTGQAAGGCGTGAGCCACTrxnTCCGCGAACTGG<XGAGGAGAAGCGM 
CTACGAGCGTCTCCTOAAGATGCAAAACCAGTGTGGOGGCCGCGCTCrCrr(^ 
AGCCAGCTGAAGATGAGTGGGGTAAAACCCCAGACOCCATOAAAOCrGCCATGOCCCTGOAGAA 
AAAGCTGAACCAGGCCCTTITGGATCTrCATGCCCTGGGTTCTOCCCGCACGGACCCCX;ATCTCTG 
TGACITCCnXSGAGACTCACriOTAGATGAGGAAGTGAAOCTTATCAAOAAGATGOOT 
TGACCAACXn^CCACAGGCrGOOTGGCCCGGAGGCTGGGCTGGGCGAGTATCTCTT^ 
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CACTCTCAAGCACCGACrrAAAAGCCTTCTGAAGCCCAGCGAACT^ 

AAOTAATAAG<Kia>rrTCTOCKXnT*AANCX:xnTrCCCTm^CAAGCCAAAT^ 

AACTTAT 

SEQ ID NO: 2932 ACGCGGGGTTCTCCCCTCGCGCTACCCGGACATCT^ 

atoaaoatctooacttcgoaocacotctttgaccacccgtggoaaacto^ 
gcagaaatacccaaaccctatgaacccaagtgtggttggagttgatgtgttggacagacatatag 

ATCCCTCTGOAAAGnGCACAGCCACAGACTrcnx;AGCA<>GAGTGGGGACTG^ 

AGTCTCTTATTGGTGCAGCAAGAACGAAAACATATGTGCAAGAACATTCrGTAGTrGATCC^ 

AGAAAACAATGGAACTTAAATCTACTAATATTTCAmACA AAAC ATGG'mCAGTAGATGAGAG 

ACTTATATACAAACCACATCCTCA0GATCCAGAAAAAACTX3TTITQACACAA0AAGC^ 

CCGTGAAAGGAGTTAGCCTCAACAGTTACCTITGAANGACTGATNGCAAGTCCCTCGGGCCGCXJA 

ACCCCG 

SEQ ID NO: 2933 A Cl l - lTri 1 11 1 U 1 1 11 1 U H I U l OanT mAT rTOOCCAAATCCA TAGC O 
ACTATAACTAAACCAAAACATGAGCTAAGTAAATGAAAACTACITr^mCIXK3A^^ 
ATAAAATAOTTGAAGACAGACTACAATGAGACATTOTAAAATAAGACTAGAAAATAATTrAAGAG 
CGTATTTTAAGCACCGGGAriXXXJACACACTCATAAACGTGTrrGCTCCCAGCIXn^ 
AATAAGGOTTTTCTCCATCAAAAACAACCCAOTTACAOCAGTAAATCTIXX^ 
OAAGCAACTOATACTCCATTAAATCNGGAAACACTQTCTACTCAAAATATTTOTAAT^^ 
TGACAAAAATTTCCCACXXXn^ATATAGGGOTAACCATAGTTITCAGCAAAAGCAACAAGGGCm 
ACTTArrnXSGCirmCAGGTrATCAGAGCrcCAATITATGAAACATACTGNGCCNAC^^ 
ATAACAGCACACmTCGAAGCCCNCCACCCAA 

SEQ ID NO: 2934 TCNAGaKKXX}CCCNGGCNGGNACGCGGOGCCAGNNANAANCCAGCNGGGC 
TGGTOCraGGOC^T^mX^XX^X3AAGaGGCT0CANGAOGGWANGCT^ 

AGAAGGGTGATCAGCACCNNNAAAGCCCCANGGQCCATTGGNCCCTACACNCAANCTGTATTAOT 

NGACCAGGACCATTTACATTrCANGACANATAGGCATGGACCCTTNAANTGGACAGCTrGTGTCA 

NGAGGOGTANCmAANAANCTAAACAAGCTCTTAAAAACATGaGNGAAATTCTNAAAGCTNCNG 

GCTGTGACTTCACrAACGTGGTGAAAACAACTGTTCTTTTGGNTGNCAT 

NCAATGAAAimACAAACAGTATTTCAAOAOTAATTmxnxaCTAGANCrOC^ 

CTTTACCCAAAGGCAGCXgAATTrNANATTGATGNGTATCTATNCAAG GACC^ 

ThWCTTTAATGGGOCCAGTGCrGCCmCTCTGGAATTGGTNACATTrAA^^ 

SEQ ID NO: 2935 GOTACCXXTrAACCCCTTCrcCTTCACCCrrAGCAOCAAOTC 
GGGGGCXAGAAACCCCAAACOXTTCCCTCCGTGTCTTTAaxn^^ 
TTCACTATGGGCAACCTTCCATCCTCCATTCCTCCTICTCCCTTAGC^^ 
AACCTCrrCAACTCACACCrGACCTAAAACCTAAATGCCrCATTTTCTT^^ 
CCCAATACAAACTTGACAATGGCTCTAAATGGCCAGAAAATGGCACrrrCGATTTC^^ 
AAGACCTAAATAATTTTTGTCAAAAAATOOGCAAATGGTCnXlAGOTGCCTGATGTCCAaGCA^ 
TTTACACATCCGCCCTrCCTAGTCrCTOTGiXCAGTGCAACTajTC^^ 
CCCACCTGTCCCCTCAOTCCCAACCCCANGCOTrOGTOAGTGT^ 
AT^^roACCTr^CCCTCTTTCACGCCAAGTANGGCCATT 

SEQ ID NO: 2936 GGTA fi 11 1 1 1 fl 1 1 lA TTmATTACGAAOTTTCATTCnTTTGAGCAAAAAA 
GTCGAACTTTITCTOTTGAACAAAATATTCACAACAGaGCAGTTGTGATACGAATAGAACAAAAA 
AAAAAAACACTTAAACrrrGTTAGGACTOCGATGAGTTTGGGACTTCAGGAAAAATCAACrc 
ACCAGCAGCTACCAACCACCATTCCATCnCITCAmGAACAOCATTAGTTAAGT^^ 
AACCCnrCTCITANAAGAAGriTOCTAATTGTGTCTCAOACCGGTGTA^ 
ACCTTGCTAAACCTATAAOCrTTrrAAAATCCAATATATTCTGCCAAGAATATGCCTTGATA^ 
GCCCTCAGCCCATAOOTO rn ' rri ' Gri ITl 1 AACAOAArrATATATOTCTOGOQQTGAAAA AACC C 
TroCATTCCAAAGGNCCATACroGTTACTTGGTTCATTGC ^ 
AAACCATTKntrrGGTCCCTCTNGAAGCCTTCCCANAGCnT>^ 

SEQ ID NO: 2937 GOTACrrCTAQCAAGTGCACCAACAGAAAAACAACQCCTGGAATGQOTOOGC 
rrGGTGGAATCAAAAATCCGAATXXTOGTTGGAAGCTTGGAGAAGAATO 
TCATOTGAATCCCCAGTCATrrcCACCACCCAAAGAAAATCX;CGACAAGGAAGAAm 
TGTCGGTGATTGGGTTAGTGTTTAAAAAAACAGAAAACTCTCAAAACCTCAO 
ATGATATTCAGTCTTTCACAGATACAGTTTATAGCCAAGCAATAAACAGCAAGATGTTTGAGGTO 
GATATOAAAATrGCTOCAATOCATOTAAAAAGAAAGCAACTCCATCAACTACrACCTAATCATO 
GCTTCAGAAAAAGAAAAAGCATrcAACAGAAGCTGTCAAATTNACAGCTCrCAATGACAGCAGCC 
CTCGACTTOTCTATGOACAOTOATAACAGCATGTCTGTGCCTTCCCTACTAGTGCTACGAANCCAG 
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TCCTTraACAGTCTGCAGCrmAAGGCAAAAACATCCT^^ 

SEQ IDNO: 2938 TCNAGCGOCCGCCCNOOCNGaCACi'l'iri-ri-l l ITn-fl 1 i il l 1 1 1 i lOATGA 
AAAAGAGCCTAAACGCTT>rraATTTCOCATAAAGAAAAAGGAGCATTAAariTGACT^ 
AGCTCCAGCCA(XTTTTrAAGAhrrAAAriWn"GGGCAGGTGGGC}GAGGG^ 
ACTOTAAGCCGGACCANGTGTGA0OAOGCGAGaT0ATAAAAA0ATTACAOO0NOOA(3OAQT0GA 
GCCTGANGAAGAATTGGGACCTANCTIXKiCGNGGAGAGGACGGGAGAGGTCANATGGGmG 
OAAAAOOAAOATTAOACACACTX^TCAACOCCTOGGOTTOQOACTGANGGOACAGOTG^ 
AAAGAAGGAATATTTGGGACAANTTGCACTTGNGCACACAAACTANGAANGGGACNGGATGT^ 
TAAAAAAATOCCTGOACATCAAGCGCCrCAACCATTTTCCC^^ 
TCITNTTNGATGGAAAAAATCGAAAGCNCCCTTTCTTGGCCCT^^ 
TTGGGNCAACCGT 

SEQ ID NO: 2939 ACCTGCATCAOCATTAGTAATCAACCTGTTAATCCAAGGTCnTAGAAAAACr 
TGAAATTATTCCTGCAAGa:AATTTTGTCCACGTGTTaAGATt>TTGCT^ 

TGAGAAGAGATCTCTGAATC(>GAATCGAAGGCCATO^AGAAmACTGAAAGCAGTTAGCAAGG 

AAAGGTCTAAAAGATCTarrrAAAACCAQAOGOOAGCAAAATCQATOCAOTOCITCCAAOGATaG 

ACCACACAGAGGCTXKCTCTCCCATCACTTCCCTACATGGAOTATATGT^ 

TAGTITGCAOTTACACTAAAAaOT0ACCAATCAT0GTCACCV^AATCAGCTGCTACT^ 

GAAGGTTAATGTTCATCATCCTAAGCrATTCAGTAATAACTCTACCCrOGCACTATA ATC^ 

CTACTGAGGKKTGTOTTCTTAGTGGATGTICraACCCTGmCAAATAmC^ 

CTTTCAANGGTATAAGGAATCTTTTCTGCTTreGGh^ 

SEQ ID NO: 2940 GGTACTTITAAGAAAAAAAGCAGCOCCITGGAAGTmOGTTtJl J I U ICCTC 
CCCTGTIXX^JU^TTCTCATGGTTTGGGTTGGGTGGTGGAGAGCGCGTGTCAT^ 
GCCCACGGTGGGCGGGCGGGCCrCT CTACT(XAAGGTGA CC ACg 
TGGAGGGTGAATAGGTCACGG<XKjCtnTlTrrmAGTrTAACTTTTCC^^ 
TCCTCGTCHOTCTrcrGCTTCTTOGTATCGACATCOTCATCCrCATCAT^^ 
COGTAGCTGACTCAGCITCCrCATCTTCATCrCCArcCTCTTCCTC^ 
TCCTCTTCCrCCCCA a;! ICl I CCTCnCTrCGTCTACCTCATTGTCAOCCTOCTGCTCC CCA 
CTCATTANCATTCCCGTTAACAAGGGGCGTCTCTTNCATTTTCTGNCrCTTCC^ 
nTAAANCCTrGGGGGGAATTTCQAACTGGNGTCT 

SEQ ID NO: 2941 OOTACAAGCATCXJTAOOGTTCCCCrAAACTraCCCTGTITrraT^^ 

TOTTATCCCCTTACTGAGCGGOn-CTACTAGGTGGCTGTGATTAAATGTCCCAAGCAAGGATAGGG 

AAGGGGAATGGTTCAGCCTCTGGAGATCATICTAACCAATOCTGCCAGACC TC 

GOGAGCAAACCTAGATAAGGACXriXmTGGOOCAOCAGOOAGCAAAATCTXXT^ 

CAOTnXrCATrcACATCAACAGAGCGAGGCTGTGATAACITAGGAGGCAGCAAT^^ 

CTTCAGTQCATTTTAOTCTGTCTCCAACTOOACACCAQTAOGTAOTOTCAAOCC AOAN An^ 

CAGTANATAAATGTTCATTrTACTaATGCACTTrAimTITTGGTCTGm'ACC^^ 

GTGGCCTrrAGCNGOAGTTAGGaXX^ACCAKrGAGAOXXAATCCTGAGTITG^ 

GNGAAAGCCTATGGGACTCCACrCCTCTGN 

SEQ ID NO: 2942 ggtacaaagatgactataaacaagatocagccctcggtttccatg aacag ca 

CACTATrACAOTAAACCAAGrrrATATTCCACCATCAAGnroTGGCTCTCCCATGACrrCGC^^ 

atgoatcattaagaatatcctcaaatccaatagtctcatcattacccctcaaaacatccagtoaaa 
gatrrgagctrgaaagaaatggaagacgctgaacctccrgcactg<x t^ 

TTAGCGGAGCAAATAGACCCraAATGrrTCTXIAOTOTOGAAAAATTCAm 

GGAAATTTITrTCTGATAATrCAAGGGGATOACrAGG<XAAACTTCAr^^ 

CmCCGAAGAAGATCATGACmCAAAAaGTCCACTTGCTQAAAGTTCAGTAACTGGAATACTG 

CCmAGCTCAAATOCAAGTCCTCTGGCATrCATCTTNCGCAC^^ 

CCGGGCGGGNCGrrCGA 

SEQ ID NO; 2943 A Cl 1 11 ill ri ' i U 11 1 1 1 U U 1 GNATOTAAAACAAQTAAACTTTATTTGGGA 
GATCGGGTGAATCCATCACTGGTTACTGGAACCCTXjAGTCTGCATm 
TGAAATGOAOTGGGCTGTGmGGCAAGGGTTGTAGTOGTTTGGAATCrcCnGGC^ 
GGCCTCAGGCCTGTCTCCCCANAGTAAATGCCCCGGATCATrGAGOAAGCGTrOOCTGCNCTOOC 
ATXTrrAGGCAGGTCTGTACITCAATTCTACCTGGATAATCTCACTATTCTCAAGACT^ 
QTAAATAGTOACCATCCCTAACACTTCTATCTAOAGCTQTaCITrC^ 
GCXTGCTrOTCTTCTAGGCAGTTATAATTCATTOTGTTACACATTAAAATTTCCCT^ 
CTmACATCCCCCAAAATCTGOTrCATCrACrGAATTCTAATCTTCA<XCTGCC^ 
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mrrCCAACCNCATACTGGCTAATTATCCTGG 

SEQ ID NO: 2944 ncgcncngacgcaaaacaaattgcaatataatgtgataaottctttaaaaoa 

GGTAAGANCNACXmKnTItKK}AGCANAGAANANGGAOAAAGCANCATCTTGCCT 
ANGOGACACAGGAANAGAAGCCCACTATCTCATnAATCmACAACTCTCTTOCA^ 
GTTGTGAAAATACATGA0ATAAATCAT0A\NGCCACTATCATCCTCCrrCTOCTTGCACAAGT^ 
CTGGGCTGGACCG^^^TCAACAGAGAGGCTTATTTGACTTTATGCTANAAGATGAN(^^ 

angcccagaanrrcctgatgaccgcnacttcganccctccctagcccactgtocccc™ 
atqccatcttocattaootcaccotthrrgamoggtctggaca^aaagccaaagggatcntrccc 
ctggcacaactcttgtggacctxk;caaancaacaaaaataa<xcgnaatcc^^ 
ttatgaaoctcaaaaaacctttccccmrgaatittttgaacaaatnaa^ 

seq id no: 2945 acgcgoogctrgtccagtgaaacaccctcxjgctoggaagtcagttcgttcrct 

CCTCTCCTCT LIUJI 101 1 i GAACATGGTOCGGACTAAAGCAGACAGTGTrCCAGGCACTTACAGA 
* AAAGTGGTGGCTGCTCGAGCCCCCAOAAAGOTOCTTOGTrcrTCCACCTCTOCCACT 

TCAGTITCATCGAGGAAAGCTGAAAATAAATATGCAGGAGGGAACCmTITreCGTGCC^^ 

TCCCAA0TGGCAAAAAGOAArrGGAGAATTCmAN0TT0TCCCCTAAAGArrCTG>^^ 

ATCATrATrCCTGAAGAOOCAGQAAGCAGTGGCTTATOAAAAGCAAAGAGAAAAGCATGTCCTTT 

GCAACCTOATCACACAAATXIATGAAAAAGAATAGAACTmnX:ATTCATC^ 

hrrGTnACCCTOTATTCTAGAT0TAAATTTA(>TNAATGTGriTGGTCCAATT^ 

OCArrAATAAAAAAATTANGGTTAAATrTAAANTICAAAANNAGGTNG 

SEQ ID NO: 2946 GGTACCrACATCAGATXTrAAOCrrGATXXXAGCAATGTGGATTCCCTX^^ 
CGCTGCCCAGGCCAOCTAGGCCCTCTCAGOATOTGAGATCTCTATTTCAAATGAGACC^ 
GCT TCTG CCAGCrrOTO\GTGAGOACIX:ATCTGTrACCCAGAKTAC^ 
TOGCTTrGGCCTTCCCTTOGCyLTCCCAAGAAGCACTCAOTGCCCT^ 
OGAGACTOTGCTGQCAACAGTa:AGGCTCTGCAGACAOCATC(XACCTGTCCC 
TGAGGAGCATCOTGGAGGAGATTGAGGACCrrGTTGCTCGCCTGGATOAACTO 
CTX;CAQTTrGAAGAAGGACTGGAAACAACAGCGTTATTTGTGGCTGCCAC^ 
TCATGTGGGGACTOACCATCCATTAAGGANGATCAGGTCATNCAGCTGATOAACGCGATCNTTAN 
CAAGAAGAACTTTAATNCTmCOGAGOCTTAAOGOGNGCCTrONA 

SEQ ID NO: 2947 ACCTOCATCAGCATrAGTAATCAACCTGTrAATCaUVGGTCTTTAGAAAAACT 
TOAAATTATTCCTGCAAGiXAATrrrGTCCACGTGTTGAGATCATrcCTACAATGA^^ 
TGAQAAGAGATGTCTGAATCXIAOAATCGAAGOCCATrAAGAA'nTACTGAAAGCAGTTAGCAAGG 
AAAGGTCTAAAAGATCTCCTTAAAACCAGAGGGOAOCAAAATCGATGCAGTGCTTCCAAGGATGG 
ACCACACAGAGGCTGCCTCTCCXATCACITCan-ACATGQAGTATATGTC^ 
TAOTTrOCACrrrACACTAAAAGGTGACCAATCATCGTCACCAAATCAGCTGCTACT^ 
GAAGGTTAATOTTCATCATCCTAAOCTATrCAGTAATAACTCTCCCrGGCCTATAATGTAAGCTCT 
ACTGAGGTGCTATGTTCNTANTOGATGTTCTGACCCTTGCrrrCAAAT^ 
TCnrCCAANGGGTATAAAGGAATCmaXKnTTNGGGGTrATCAA 

SEQ ID NO: 2948 ngtacgaagttctcagtttcactttagtagaaagagctctagaaatoagoct 

GATAAACACATCTAAGAACACTOGTIXKTITrcrAAAArrrCCAAAGCTCCAC^ 

TTAGTOTTTCAAATGATrGCATmAAAGTATATAAATATGGGTTATCCAATATCAATGCTATAOT 

AACATCCTGAAACAAAACAAGCACAAACGTATAAATGCCTAAACTOGAGGAAACTTQAAAOCCTC 

ATGTTAAATCrTAAATGTAGTATrTCrAACTTGTGAAGACAGATTGGTAGGCAGOC Al 11111 I GTG 

TCTrAAAATAACTGGGGGCATAOTrAAAArmATACATCAAGTOATraCTATTATTOAATOlTGC 

AOaTQAOATGTGOTTATrrrrAGTTTATTTGAAATOmGACTGGAAAGOGGGOAGC^ 

AATATrrGAAATTTGGAAAACCCTAAACCTTTTGGTAAGAAATTGTAATTTTCACT^ 

TTAAGGQATTAAAANGTrrATAATTOTrGTAOTTNAATraACAATN 

SEQ ID NO: 2949 GGTACXTCIXKKCCAGCAGACATOTCTaTACAGGCTGAGCCCAGAACAAACA 
AAATGAGTCCTTTITAAGCATnTGAGGTGGGAACTACATGAAGCAGGCrrACAGAAGCGAOAAC 
AAAAQOCrOTTOTGACACTTCTOCTACATGTCTTACATCTCTGTGAGAACTCTO 

AAAATATGCAOAGGATAGATATGGTTAATGmcrrcOGACAOACAOTTAATACTCrCT^ 

TTAACTTCAAGGGGGGCTACTTAAATTCrmCAGCCTrGG TrAAT ACAGCAAT^ 

CTATTGTTATTTCTCTATTATTACTTATGCrATTATTCTGTTATrr^ 

ACCCCTATCAAOTCCATATTrrOGCTAATrriTCAATGACTTGGCTGGACTACCCGCGTACA(^^ 
TTGTTAAAAAGCAAOAAACCACAGTGTCTCTTCACrAGCATTTAG 
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SEQ ID NO: 2950 GGTACATCOTGCATGGamXAaCAAOACCTCCGTGTGGATGGCCGTGGCTG 
TGAGGACTACC0AT0T0TCX}AAGriXH3AAACTGATGTGGTGTCCAACACTAGTGGC^ 
TCAAGCTGGGTCACACAGACATCn'OOTGGQAOTGAAAQCAGAAATGGOOACGCCOAAOC^ 
GAAACCAAATGAAGGCTACTrGGAGTrCTTrCTTGACTGTICAGCCAGrGCTArc^ 
AGOTAGAOGAGGTGATGACCTTGGCACCGAGATCOCrAACAaXn-CTATCGGATAnTAACAATA 
AAAGCAGTGTCGACTrAAAGACCCTXn'GCATrAGTOCTCGGGAGCACrGCTGC^^ 
ATGTGCnXSCTTCTCGAATGTGGTGGAAATTTGTTraATGOCATTTCCAT^^ 

CrrcAATACAAOOATACCAAGGOTTCGAOrmCGAOGATGAAAAGOGOTCGAAAOOACATTGAA 
TTGTCAGATGACCCITATGACTGCATACGACTAAOTGTGOAGAATGTCCC 

SEQ ID NO: 295 1 GCGTGGGTCGCGGCCGAGGTACCTTAAAAGTGTCTCACCTAGAAGGCCTCTA 
CCTOTAATCACATTAATTTITCTAAAGACAAriTOOTOTTTTOA^ 
TAATAGCATCATAGGACAATTAGCCATTTTAGACTTGACCATATTT TCTCr mT 
TCTrGATATTTAGOTOGQAGACTACTCCAATGOAOCAACAGTnrATrrrACATOAT^ 
AAATTTACAAATTTTAAACTCATAAOAATTCTAAATAATTTGAAAATGGAAACATTTGACCCAC^ 
TCTAGCAGCATAAATACAmATAAAATACrrCATTGTTGATCTrAGGTCATTGAT^ 
ATTTGGTGACTATGGOCAGGTOGAGGGGGCCAGTOAOOAAGGTATAAAAGAGAAATCTTTATGAA 
TTGTGTTCAGATTGATTITGTATAAACATAATATAlTCATGGKrGNATCTCT^ 
ACTACATGAAGOTGGCCCAAOGGAAGGACAATATTTrAAATAAATATnGCTTAA 

SEQ ID NO: 2952 ACAOCATCGTAGGGTTOCCCTAAACTTGCCCTOTTTrraTTTTmAGT^ 
ATCaXTTACTGAGroGCCrCTACTAOGTGGCrGTGATTAAATGTCCCAAGC^ 
OGGAATGOTTOAOCCTrCTGGAGATCATTOTAACCAATCCTGCCAQACCTQTTTO^ 
AGCAAACCTAGATAAGGAa7rGTrTGG0GCAGCAGGGAGCAAAATCT<XTrrAACAACC^ 
TTCCTCATTCACATCAACAGAGCGAGGCTOTGATAACTTAGGAGGCAGCAATCCTAATAGTCCTrC 
AGTCCATTTTAGTCTCTCTaZAACrGGACACCAGTA^ 

AGATAAATGTrCATTTTACTOATGCACmAOTnTTGGTCTOTrACCTGT^^ 

GCCrrmANGCGCGGAGTTAGGCOACCAAACCAGTNAGAACCCCCAATC^^ 

CAAGTGTOGGTGGACAGGCCTAATGCGGATCrCAACTTCC 

SEQ ID NO: 2953 GGTACCX^\GTAAAAACCAOAATGACCCATTGCCAGOACGCATCAAAGTTGAC 
TTTGTGATCCCTAAAGAACnrTCCCrrTGGAOACAAAOATACOAj^ 

AGGTGACCATGTTAGGTTTAATATTTCAACAGACCXiACGTGACAAATTAGAGCGAGCAACCAATA 

TAQAA0TrCTGTCAAATACATnCAGTT(>CTAATGAAGCCC0AGAAATGOGTaTGAT^ 

TGAGAGATGGTTmwmXjiTCAAGTGTCn^ATCXjrGATGTTCGTATC 

AATTCTGGATGGGAACCAGCTCCATATTGCAGATGAAGTAGAGTrrACTGTGGTCCTGATATGCTC 
TCTGCrCAAAGAAATCATGCTATTAGGATTAAAAAACrrCCCA AQGGC CGGTTr^ 
ArrCAOATCACCXJriTrCTGGGCACGGTAOAAAAAAGAAOCCACrrrT^^ 
CAATAANGCAAAOAOAAOGAOCTOAGGATGOCITATTGC 

SEQ ID NO: 2954 GTACGCGGGGAGTTCItrnaXKSGACTAACTCCAACGGAGAGACTCAAGATO 
ATTCCCrTTTTACCCATGTTTTCTCTACTATrGCTGCn-ATTGTTAAC^ 
ATTATOACAAGATCTTX3GCTCATAOTCOTATCAGOGOKX3GGACCAANGCCCA^ 
TTCAACAGATITTOGGCAOCAAAAAGAAATACTTNAGCACTrcTAAGAACT^ 
ATCTGTGOACAGAAAACQACTGTGTTATATOAATGTTGCCCTGGTTATATNAGAA^ 
GAAAGGCTGCCXiANCACTmGaXATTOACCATGTITATGGCACTCTGGGCATC^ 
CACAACCCAGCGCrrATnm}ACG<XTNAAAACTNGAGGGANGAAATCAG^ 
CCTITNCOTrANTlTraGNACCTGNmTANTGANNGCTTTG 
CCOT>WGAAGGTrK}GGNNAGCNANCTTGGAAKrGGTTGNAATTh^ 
NCANTAA 

SEQ ID NO: 2955 ACGCGGGGQACCCOACCCrnrrreCAOTCTCAGGACGGGCGCTTTOO^ 
GGCCXCAGGCAGOiTGTCTCGGTCGCCrAGTCTXjGAGAACTAGTCCTCGAC^ 
QATGCTGAAAGOAATAACAAGGCTTATCTCTAGOATCCATAAGTrOOACCCTOGGCCTnTrTACA 
CATGOGGACCCAGGCIXXjCCAAAGCATrGCTGCTCACCrAGATAACCAGGTTCCAG 
CNANAOCTATITrmJCACX:AATCAGAATGACCCGG0CAAGCATGGGNGATCAGCACGNANGGT^ 
ATANACTOCAACATCnXXXXX:AGNGATITTGGAGACrcTATTTaXCAATGGCC^ 
TITCTTAATGCANOOTGAANACATI>)NNmXjAAGCmGC^ 
CTATAACrCCTGCATTACCTrQAAAAAACAOCA>rriTIT0NrrAATCCAOCTITrA^ 

SEQ ID NO: 2956 ACCACCGCAAAGCCCTGTGAGCGTCTACAGACA GCTCA CCA I'l 1 1 1 GTCCTGT 
ATCTCTAAACA Crr r riUl I C l IAGT CI ' ITI I tl I GTAAAArrGATGTTCTTTAAAATOGTTAATGTA 
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TAACAGGGCTTATOTm:A<mTGTTrrcCGrrCTOTTTT^^ 

TCCTTrrCTCATITCAAAGTTGCrAOCAGTGTATOCAGTAATTAGAACAAAGAA 

AQAACATTTTATTGCCTAOTTGA<>ACATTGCTTGAATGCTGGTGGTTCCT 

ACAATITIX:TAATATQTGTrAATXXn'ATGTGACAAAA CGCCCTGATTC CrAA 

CrrAATGTATATACCTGAAAACCCATGCATTTGTGCTCrrr 1 -n i l I'lT ATGOTGCTTTQAAGTAAA 

ACAGCX:CATCCTCTGCAAATCCATCTA TGTTG TCTrANGCAATTCTATCTTrGCTCAA 

NAAGGATOGGGOATrraGNTTCATNGGrnTNGATTTGAA 

SEQ ID NO: 2957 GQTCGQCCGAOOTAL'l-lirn ri I J'l 1 1 1 Tl'l'lTl-nGAAAOGTA ATTGnT ATT 
ACTGGAGAAGAGAAAAATGTATACAAGAAAGTCTAGATTCTTr CrrCCAC ATGCACTACi 1 1 iGGA 
ATAAAOrrCCAAATATCAACnTGAAOATACRACAAGOATAAATITTrCTTAACAAGTGACCAAT 
TACTCTGTTCTGGACATnrriCATAGAATCCATATACTGTAAGCTAAAT^ 

AGGAAGCTTGQAGCAATTATGTATrAACAGAGAAGAKKrrATrATATTITACTGCAAAATATTATA 

AAAGTGATACAATTTrGTGGCTCTTAAATGTrrAAT rACTI XiATTTCAACAO^ 

ACCAAAAATCATCAAGACAGG<XTGGCmAAGGTITITrGTATTCGATrcGT^ 

ATCrcCAArrAGCTaATACACATAGGTGAC(>CTOTAGAAGCCTGOATTCCATCAACACAAATaAT 

GGAATAATXjTmXiTTCXWAGCATTATATOCCCANTOGCA 

SEQ ID NO: 295 8 ACTnrCATNTATTGACACTGAGAGAGGGGCAGTGACCAGGCAGCCTGGAGAT 
OCCTATrAGGAAQOAAGAOTAGOAGAAAGTramCTOTCCCCACCCCTCTACCCCTACAGGaOA 
AAGGAATCTCm'AGTGATGAGGOTGAGAGACTTGAAATACATCAGCACTGCAOTGAGCAGGAC^ 
GGTTTCATTACTGCTTrmACTTCTXjAATAAATTATTTTTO 

AAimXMTTGTGTGGGTTGCrGGGCTTCAANGTTCATGGTAGAAAAAATTAACTrc 

TTNGG<X}GNTCATrGGAATGGTCCrGAGCTGCNCGGTCTraCCAGA^ 

OGCrrCrrOCrhrrCAAAAACNACCAGQGCCCrcrACTm:ATATCACAAT^^ 

CACGGTCANAGGNCmATTmT^mKrrAGANCATrrATNmTTACTT^ 

CTTTNG^^XT^•A>m^■AT^G^r^ITGAAGACCTANACCTATACCGTCTAN^ 

SEQ ID NO: 2959 ACCAAAOGATAGCraTrUIXnTrAAOTAGOGACCTCTCATOOCCTACAOGC^ 
TGACATCTCAGAATCAAACTGGAOAACATIXX^AAGCCGTTCrTATAAGTGTCTCCAT^^ 
GGGCTXiAAATGGAATGTGCAAATOTAGCCCAGCCTGGTCCTTOGGTGTrGCCAGTTGATrG 
TGGGAGCCAAAGTGGCATCTOCmGACCTAAAajGGCGATGATGAAATAA AACTC AACAGC^ 

COTCCAAAGa30CCATGaXCATGTTTCCACTAOATGOa}CTaACACTrCAOGCATCAAC^ 
GGOCTCTCAG(XnTGCAAAGGCAGCCACTTTAAAGTCX;GTGTCCTGTgrGGGGCACCAA^ 
GCTOCA0ACACOCAGTAN0COCCAGOOCAAATGCOTCCCATTrrAAAGAAO>nTrGNAm 
GCTCrntKrrTCCTTCCTTCCCACTAACTTTAAAGAAATG 

SEQ ID NO: 2960 GGTAC'l'lTri ITU 1 1 i 1 ril J 1 1 1 1 IGGAGTTTCTAAGTCATTAC'lT rriATn T 
GAAAGATTnTTOAAACTCTrCACATCATGOTOAGAGTrroTATGATTAATAAGAAGC ^ C 
ATGAAATGCTTGGAGGTGAACGAGTTCTCAGCCTGTGAGATCCOACCATCCCATTAACTTTG^ 
TTXrrCTTCATTAATAGAANAAAAAAGGGGAGGGTGAAGAAAAGGAGGA ACATGC TA AAAAC CT^ 
ATGACAATCATCCAAATGTCAGGAAAGAACAACCGATTCACCAACTCCACTrrritrrA 
CTTTCTACATCrCACIXTTCATTITGGCCTrCCTGGCTXiA^ 

QAGAAOAOCCCTOGTTXjrCCAAAAGACAGAOGAGGAGAAOCCCTXKrAGGATGCGCTQACCACTT 

OCAGANAACraACAGTaXGTG<nXXX;AAAAGTTraACCAAC^ 

NCTOAAANGNAAANGAGGGAATGGNGATGAACCTGGGCTTArGTN 

SEQ ID NO; 296 1 ACACQAACATGATCMQOGTGTrACACTCraQCTTCTOTTACAAGATOAGarrC 
TGirGTATGCTCACriTCCCX:ATCAACGTrGrrATCCAGGAGAAT^ 

TrrOTGOOTGAAAAATACATCCOCAGGGTTCGGATGANACCAGGTGTTGCrroTrCAGTAT CTCA 

AG<XX:AGAAAGATGAATrAATCCITGAAGGAAATCACATroAGCTTOT 

GATTCANCAAGCCACAACAGTTAAAAACAAGGATATC\GOAAATTTTroGATGGTATCTATGTCT 

CTGAAAAGGAACTGrrCACAGGCraATCAATAAGATCTAAOAGTTACCrrGCTACAGAAGAAOAT 

GCCAGATGACAmAAACCTCITGTGATAmAAATGATCATAAAAOACTATrc>nriTGGAAAAT 

SEQ n> NO: 2962 ACnTAATAG li 11 l (J l lAGAAAAAAAATTTCCAOACACTrAACATTTCACAA 
CATTrcAACAGCAAAOTATTAGTraAGAOAGGGGTTTTCAGGAGTTOOAGArrATAGAATATTAO 
GAAGAAATGTTGGTATCCTCCATTATAGATGGATGGCATAGGTCAC AAAT OGGAGACTGGCAGCT 
AAGCCAATATCAAAACCCAOTGQAATOACACTrCTATGGAaTTTACri ri CI I CCTGCT ATCTTCCC 
TATCCCACGGAAATGTCTGTCACCATOTAAAGCCCAGTAGCAGGCAGCT TAGG CTCCAGTCTTCCC 
OCTTGGGTAGGAAAAGGAGTGAAGGGAATGTCACTCCTGAGTTrcCATGCM'riUl IL'l 1 ICCTCTC 
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CTTOAGGNOGTCTTCIXjAATIXrrcCTCTrCrTCCTCrCCTTO^ 

GCCTTTCTGAACKrrGTTCTtrrCACTCCTCCNACKn'AGATAmCANAAOC^ 

CCmCTCGCCAAGG 

SEQ ID NO; 2963 ACAAATTGTATCAGAAG™TrmAGAAATGATAGGTAACCAGGTCCAATC 
ACTAAAAATAAGCTGCTTATAACTGGAAATGGCCATTGAGCTGTTTCCTCACAATTGG^ 
CATGOATQAGTAAACTGTTrCTCAGGCACITGAGGCTrrCAGTGATATCTTTCTCAT^ 
CTAATTTTGCCACAGCGTACCATTTCCAAACTAACTATCCTAGCAOCGTt:^ 
CAAOAGCTCTGAOOOCATCACTOAACANATAOCACmATOAOTTATTATOATrCAAAAATCTOT 
TTG<nT3TNGGATITACCAACACCGTANGCTTTTATTrCTTCCCATTACA 
ONATCGTQOOCATCTCNCTGCNAANATAAGACTTCCTCAAAATCTTATrTGrrAG 

SEQ ID NO: 2964 ACTGAAATOTGTGTTGAAATCAGTTACCGTGCTCAGGCTGA GCAGC AGCACA 
ATCCTGCTGCCAATCCCACCATGATCC«AGCCAAGTGCTATCACAACCTGGA^ 
TCATTGCACTQCTCOTGAAACACTCAGGGOAGGCCACCAACACTGTCACAAAOATrAAT^ 
AACAAGGTCCrTGGTATAOTAGTGGGAGTTCrNCTrCAGGATCATGACGTrCGTCAGANTGAA^ 
CANCAACITCCCTACCATCGAATTmATCATGCncmm^GGAACTX>ATG(^ 
OTTTGGAAACCATTAATTTTCCNGANACT^ACAGC^T^^^^tJCAA^r^AC^^ 
GCCTACCNAAANCTX^ACTGGGCmA^^NAT^^■GGCCIXKK;^m 

SEQ ID NO: 2965 ACAAGACTCTTOACAGTTGTGCITCTCTAGGAGGTrGGGTITmTAAAAAAA 
GAATTATCTGTGAACCATACOTGATTAATAAAGATmXnTTAAGGCAGAGGCTGGTra^ 
GCTGTTATC^K^G<X^t:AGACAGACAGTATAAATGGTC^G^Tr^^■AAGA^TCCT 
ACTTTGGGCCAAGTATCCACATCCCCnTGCGTATGGGAGGTOOGTGAAOAOTaTrNOGATGCATA 
GTGOTTATTATGGOAAGTAGCTNAATGGTAAAAGGACAAACACCTAT<J 1 1 1 CIJ 1 GAGCTTAANCC 
TGGTTGTGCTTNTTNCCAAGGGAGATANTAGG 

SEQ ID NO: 2966 OTOGTmCCTCTATTTrGAATTnTOATCAAAAAACTGATrAaCAQAATATAG 
rmG0AGTITGGCTIX:ATCrmCTGGGGTraXCTCACTCCC7r^^ 

TCTTCCTCTACTCAGGCAaTCAACCCGCCACOATOAGAAGTGGGACCAGCAGAGGGCAC^^ 

TCAGGAGTCCttCTTTCCCACCAGGCTTCATTCACCCAGTCGACCTGAA 

CGGCCCirCCTTCCTCArTGOTGTrrGGTATGCGCACAGTrCCTGTGGGACnXJ 

CATTGQAAAQANOTTCAGTGGCCCATTGTrAACTNAGCCTrAAAATCT^^ 

AGAAAATGGAGAG<XTCTTCTGGNGGiTGGrrGCrCCTCNGA>rtaGCCAA(^ 

CACCnT 

SEQ ID NO: 2967 ACrCAAQTCACITAATOAOOAA OCTOT GAAOAAAOACAACTCTGTCCATrGG 
GAGCGCCCTCAGAAACCCAAGGCACCAGTGGGGCATTmACaAACCCCAGG<nx:C^ 
GGTGGAGATGACATCCTATXnaCTCCTCGCrrATCTCACGGiXCAGC^ 
CCTGACCTCTGCAACXIAACATCGTGAAGTGGATCACGAAGCAGCNGAATGCCCAGGGCGGm 
NCTOCACCCAGGACACAGNGGTGGCnTrCCATGCTtmntXIAAATATGGAGCAGCCACA 
AGNACraOGAAGGCTGACCAOONQACTATTTCAOTCTTAKOGANATrrrTCAGCAAATrcCAAGT 
GGNCAACACANCCCGCrrGTNCTGTAGCAAGGTrrNAmGCAGAACTTGamiGG^ 
^TOAAAOT^QACAAOANAAGOAT^mOT^^'ACCTCCAANATTaTNAATACAATATTC^CCCANAA 
AAGGAAAA>TO<XCnTNCmATGAGNGCTGACriTGTCTrWACTGTGh^ 
CANCrrCCAAATCrCC 

SEQ ID NO: 2968 ACAOCATCOTAGGGTrCOCCTAAACTTGaXTGTTnTG i 1 1 1 i 1 lAQTTTOTT 
ATCCCCTTACTGAGCGGaTCTACTACOTGGCrGTGATTAAATGTCCCAAGCAAGGATA 
GGGAATGGTTGAGCCTCTGGAGATCArroTAACCAATCCTGCCAGACCTGTTTGGGGCAGT^ 
AGCAAACCTANATAAGGACCTGTTTGGGGCAGCAGGGAGCAAAATCTCXnTrAACAACCAAGCAG 
TTCCTCATTCACATCAACAGAGCOAGGCTGTGATAACrrAGGAGGCAGCAATCCTAATAGTCCTrC 
AGTGCATTrTAimrraTXrrrCAACTOGACAOCAGTAOQTAGTOTCAAC^ 
NATAAATGTCATTITACTGATCCACTTrAGTTrrcGTCTGTTACrcT^ 

TITAGGCCGOOAOTTAGGCGACCAACCAOTGAGAGCCrCAATCCrGCAGTTITGTGGCTTAAGm 
GGGTGGACANNCCrAATGGGOATNTCACTCCrrCTGTGGGCIXKJNCAO^ 
AATGGTGGGAGTGGTTGTGGTCTNAACTXnNACANAGCTTCAGTOGGAGAGGATN 
GCNCTTACTGAAGGAirrAATTmGGTTNOCTnTrCAAAC 

SEQ ID NO: 2969 acgcggggagcggatagaggacacgaccaagatogcggcggtgtctggctt 

GGTGCGGAGACCCCrrCGGGAGGTOACAGTnXJTGATGCTATAAATCAGGGTATGGATGAGGAGC 
TGGAAAGAGATX3AGAAGGTATTTCTGCnGGAGAAGAAGTTGCCCAGTATQATGOGGCATACAA0 
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GTTAGTCGAGOOCTGTGGAAGAAATATGGAGACAAGAGGATTATrGACACTCCCATATCAGAOAT 

GGGCTTTGCTGGAATTGCrrGTAGCiTGCAGCTATGGCTGGGTraCGGCCCATTTG 

CnTCAATTTCTCCATGCAAG<X:ATrGACCAGGTTATAAACT 

GGTGGNOTAA<XTGTCCTATAOTCTrCAGGGGGCCCAATGGTCCTCAGCAG^ 

ACTCCAmtKnrrGCTGCTOGTATCGCACTGCCAAGCTTAAAGTGGCAGT^ 

TCTAAAOQACTTATTAATTCNCCATnXKjGNTACAATCCAATG GTGG NG 

GAATGGGGTTCCTITTGANTTTCCTCGOAACTrANTCAAAG 

ATANAANGCAAGAACAATm'AACTGrGGmCCTrcAAAACTGNGGCCAATC 

NTOCTTTT 

SEQ ID NO: 2970 ACAATGACTCATTATTIXnTrATAAAAATCTOTTGTTCTGAAAATCAACAACC 
rrANAAGGATTTTGTCTTANAAAAmCCTrGGCTNTA>WTm 
TCAAAAGCATCCCCAAAATTOCAATGATGTGAATmTAAATTACCrmATT^^ 
CCATTGATrGTTACTTTAAATTTTTtACCTGGAATNATGGNTrTAAAATTTCCC^ 
AAAAANTAANOCAOAA 

SEQ ID NO: 297 1 A CITIN 1 mi H I 111 11 1 1 1 IN TTTATATCACAACATCQnTATTATOTOAAT 
TTITTACAATACAAACAAAAAATACANAAATGCAATATATGAATACAGCTAAATGCANAATGGTG 
ACITTTrrCnmX>AAAGaCCATOATTCa:ATITCT 

AAACAGOTrGGTCArrAACrrCACAArmGCCTANAAATGATCTATAAATGCATITCCC^ 

CTACTTACCATAAAGNGT>j^AAAGGGAGTTAAAGGAAAGTITCCTTGTIXX3Tr^^ 

AOATGmATTXn'ATmAGCAGNGCCAAATATm'GaAAAATATtnmATTAAATGG 

AAANTOAAGCCA>rrA>rrGATATTTrGGCTAAAGAGGGCCCTAATTGAGNAATAATATrr^^ 

AaAA>ri<>AANCANACAAACCAAAAANOACGTTI>rmAAAAAAACTCTAGGNC^ 

NAATATAGGGTrrTrrTnmrcATCNGOATTrrAAANAANNGN>mNGGTNC^^ 

CAACTGGTn'AAANCTAAATrACraTA>mnTmGCCTGOCCCTNNATTOGAO(XTr 

SEQ ID NO: 2972 A Cl ' l ri ri ' lTl ' l ri ' l 111 IG AGCAQTAAOOTATTAnTATTAAQATCTTAAGCC 
TCACCCXTTGAACTCAAATGGGATGGGATAAGGAGTAAOGAAAGAGGTTAGAOGAGACAGAAAAC 
AAAGCCCAGCTCTTCCAfGCTCACCAarrAGCCAAGGTCTTCOAATTCCTGAOGOAACC 
TGGAGGCTGGGCTGAAOGAAOGCANCATTCCTGGGAGGTKXIAAGTT^^ 
ATTATATAGGCATGAG(XACTTGAGCCTGGOCCAGAAGCGTrTrrCTCAAAGGCCCrNAGTGAAN 
ATAAAATTAAAATrnXTCAmrTCCTOTCCTNOGCCAOOOATNTCT^ 

TTGmTGGGNGGCCACAGTTTTrAAAATTAAGGAGGAGGGNGGCNGAAAAAAAAAATGTi^ 
GAGGOAAAhTITTTTCCCNANQCCOGNAChfNTTTtmJGCACTCCATO 
GCCAAACCCTTNAAACCCCTTGATNTTTrcAAAAGTTCCAAGTTGAAGG 
TNANTGGTCCAAAAANANGGGCTGCrrCTCCAAAAAAGNTAAAATNNAATn'C 

SEQ ID NO: 2973 ACAAAGCAOACTOCCNGCAAATCOACCGOTQGTAAAOCACCCAGNAAGCAA 
CTGGCTACAAAAGCCGCnXK:ATGAGNGCGCCCTCTACrGGNGGGG7GAAGAAACCTCATO 
AGGCCTGCTACTGTGGaXnXXTrGAAATrAGACGTTATNATAAGTaiAmGAACTm^ 
CAAACTTNCCTTCCAGCaTm'GGTGCNAGAAATTGCTNANGACmAAANACANATTO 
ANAGCGNAACTATCGGTGCTITGCAGGAGGNTA'nTOAGGCCT 

SEQ ID NO: 2974 A CrrrrrriTn ri in ' i ' l ' l 1 1 I GGNCCTTTAAAATATATTTAATnTnTAACA 
AGNGGAAAANAATGTTTCTTAAAANACATTTAATTTTTTAGTGOAAATT AATAT^ 
T^mnXX:ATAACAA^TroAATAACAA^ITIT^ATCTTCAANAA^ 
ACATGTAGCACTGAATGCCAAAGTGATGGGT^r^CCATGOTCANAATTCAAAAT^AGA 
AAACCTGTCTGGrrTGTOCCTGAGTGAAGAATGATCTCGAGCTGGGGAGGGAGG^ 
ANCAAOTOCTTmAAOONOAANrrCCTIXXnXKJAAAAAOCNTrOOGm"A^ 
TTCATACTGNCGGGCCCNTGGA^frTTCANGG^TmTGNCAGGGNCAGGOATGNGGT^:GACATNA 
TTATCCTCAAACANGCAAAA 

SEQ ID NO: 2975 ACCAGAAAAmACATGOACATATOCAGCAAATGTTQGGOAACATGATCTGT 
TAAAGGATGGATOTGGACAGTITCXyLTATGATGCAAACATCCAOATACACTGGGTATCAm 
ATGGGCGCCAOAaAGTrrTCCTTrrCACCGATGATXrn'OOCrrcCTr^ 
AAGAAATGGAACACGCTGATTATGAAATAACCrrGTXnxnXXIACAGTCn^ 
ACAATCAAAGCAAGCAOGAAOTrrCCTATArrGGGATAACCAGTTCIGGTGTTGTTGGGAGGTO 
AACCAAAGCAQAAATOOAAGCCAriTAKrCAAAAOCAOATAATTCTrA TI^ 
GAAACATCAAATTATCAAGAGACCATGGCTGGGATTAAGCTAGATATAA'rr rriGAG GTCAATTTT 
GATAAAGATCCAATOOAAATOCNCTNCrATTCGTAaCCCrATr/U\ACGAQACTITITN^ 
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TCTANATGGAAmAAGCCGTCrrm"C(XTAAAACTrAAGGGCCNGG 

SEQ ID NO: 2976 ACTTGAri fl 1 1 1 1 J 1 1 1 i'l U 1 Hi 1 1 i 1 1 111 1 1 1 1 lAOAATm CAAC AGGATC 
TGCTOOCAOGaTTTTTTTGrrrTNATTNOTTTOCTrATTT^ 
CTTAAG<XnTrANAO<K}ANAACaXATTTTCAATTATOhrrGGCTTTITATA^ 
AANATTTAAATAAANCnrrGCTACCAANATOATTGOCTTATTOAATAOOTCNCI^^ 
TAAANOm'GATANCTGCCATTKXJOGAAACAAaWAAATTCTCTTAAGGGTAAACAAGGC^ 
TCNNCCNCCAOTAAANTTCTCCTTNGGGATAAACAANATTTmCCNGN^ 
mrmmTTNCCCTTTNACAAGTTONATAAAANAAAAAGGNATTTT^ 
TTTGGCT 

SEQ ED NO: 2977 ACAGCTATTGOAATCAGATGCAAAGATXXnxnTGTCTTTG^XKjTAGAAA^ 
TAOTCCTTTCTAAACTTTATGAAGAAGGTTCCAACAAAAGACTTTTrAATOTTG^ 
GAATG0CAGTAGCAGGrrrrGTrGGCAGAT<K:rOGTTCTITAGCAGACATA0CAAGA^ 
TOIAACTTCAOATCrAACmGGCTACAACATTCCACrAAAACATCTTOCAGACAOAOTOaO^ 
TATGTGCATGCATATACACTCTACAGTCCTGTTAGACCmTGOCTGCAa 
TACAGTGTGAATGACGGTCGCAACnWACATGATTGCCCATCAGGTGTrTCATACNGNTA™ 
GCTGTGCCATCGONAAAOCCAGCrAAOCTGCNAAGNACNGANNTAaAGAANCTrCNOATGAAAG 
AANTGACCNGCCGTGGNTATCGTmAAAAAOTTCCAAAATAmACK^ 
CACTCTTAGOGCOAATNCCACACACTTXlOGOOCGTrACTAGGNGATCCTAOCI^ 

SEQ ID NO: 2978 ACCAACTOCCAGCATTTCrcTGGAGGGTAATCCTGCIXnTAATCGTC 

GCACCCATTCTCOCAQAGAGATTGCrcTGGGTGATTCTCAAGGACAGATTGTTATATACGATG^ 

GOAGAGCAQArraCTOrrCCCa3CAATOA7<JAATaOGCACGOTTTGGCOT 

TAATGCAAACCXJAGCTGATCCAOAGGAGGAAGCAGCTACOCGAATACXnXjCrrAGTTC^ 

GOOGAGTGTAACTAGTGGATTTGOOAAAGGTrCTrAAOTAGATCCTGAGACTATTNOCATGCTTCT 

OTCTAAATGATAATTAA.*AGGAAATnNATGGATTAAACCATGGGTTTAATTGCATCAAGGAAAC 

TTACAATTGTCCCCTrATATATTACANGCATCrTGNTrrGGKriTGTGGNATrmAATATO 

NACrrcACAGAAAOCACTriTTTNAATTCTAATACATAOCTGTATATTNGGAT^^ 

hrrCTTrrCACTTNNAACACTGTTACAGTTTTTNOTAAACNCATATO 

AATAAT 

SEQ ID NO: 2979 ACAOOCrGACAGAGAAGATTCCCGAGAGTAAATCATCTTTCCAATCCAGAGG 
AACAAGCATOTCTp^ 

tccaagcatcacccnwksagtttcctgagggtttictcataaatgagggctgc^ 

tgcttcnaagtattcaataccgctcaqtatmaaatgaaggtcattctaagam^ 

tcaataggaaaacatatocagccaaccaanatgcnaatotitraaantga 

aagtanggaaagtcaccnaacctttgcntmctraagggcixksccconatact^ 

gatctngtactgnganattttaaaaatccagtcctcgocx;gcgncacoct 

actngcggcgtactattggntccgacrcggtccaactgnnaaacatgggtatagctgttct 

aatngtlxx}gtcncaattccnntanttmgcccgancananogtraacctgggg 

nccaanccataaotcgttgoctactgccorntcatncggaaac 

seq id no: 2980 acagctattggaatcagatgcaaaoatggtottgtcntggggtagaaaaat 
tawcctitctaaactitatoaagaangttccaacaaaanacttmaatgto 
gaatgocaotancaggtntgtnggcaaatgctckrtcmaccaaacatancatgakaagangct 

TTCAACTTCAGATCTAACTTTGGCrACAACATTNNACTOAAACATC^ 
NaTAAT^fITCAAACT^^^^TACACTWACAGTaCATGGTC^[AT^^ 

SEQ ID NO: 298 1 ACAGnTCAGGGCAAGAAAACGAAATTTGCTAGTGATGATGAACATGATGAA 
CATGATGAAAATGGTGCAACroGAOCTGTGAAAAGAGCAAGAGAAGAAACAGACAAAGAAGAAC 
CTGCATCCAAACAACAOAAAACAGAAAATOGTQCTGGAOACCAQTAOTrrAOTAAACCAATr^ 
TATTCATTTTAAATAG G 11 ! lA AACGACTriTGTTTGOGGCOCnTTAAAAGGAAAACCGAA'ITAG 
GTXX:ACTrcAATGTCCACCTCTGAOAAAGGAAAAATITnTTXnTC 
GCAAATGAGATnCmGAATGTATTCTTCTGTrGTGTTATTTCAGAT^ 
AAGATTCTTCmAAATTGCCrrTGTAATATOAGAATCTTTTAT 

SEQ ID NO: 2982 ACGCCGGGTOCTrCATCCAGGOCCCrOOAGACAAAOOGGAOTOTTrGACGA 
AGAAGCAGAC0AGT(XiCTCCTGQCGCAGCG0GAATGGC>GAGTAACATOCAAAGACGAGTCAA^ 
GAAGGTTATAGAGATOGAATAGATGCIXKK^AAAGCAGTTACTCTrCAAC^ 
TTATAAG/UVAGOTGCAOAAOTCATTTTAAACTATOOACGACTCCGAGGAACATTGAGT^^ 
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CTCCTGQTGTCACCTTCATAATAATAATTCAACTTTGATCAATAAAATAAACAATC^^ 

CAGTrCXKX:AGTGTGAAGAGTATGTGCTCAAACATCTGAAATX:AATCCTX^ 

GATnATTGGACTCCATTGAGGATATGGACCTITGTCATGTAGTrCCAGCTOAGAAAAAGATTGAT 

GAAGCTAAAGATGAAAGACTCnxn'GAAAATAATGCTGATTTAACAAAAACTGTAGC^ 

GTQOGATAGATTGTCATATGTNQAATGTTGTAGAACACAaOAOCATGCCATrcAGAAAAC^ 

CCCACATGGATmGGAACAGACACCATTTATTAAACANTOGGCTAT 

SEQ ID NO: 2983 ACAOCCAACGGTnCCCITGGGGGCTITOAAATAACACCACCAGTGGTCTr^ 
AOOTTGAAQTOTaOTTCAQOOCCAOTGCATATTAOTaQACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGCTGTGAAACTCTTAAOTATATCTGGAAAOCGGTCTG 
CCCCTOOAGGTGGTAGCAAGGTIXXACAOAAAAAAOTAAAACTTOCTGCTGATGAAQATOATGAC 
GATOATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTITGATGATGAGGAAGCTGA 
AGAAAAAGOGOCAGTGAAGAAATCTATACGAGATACTCCAGCCAAAAATGCACAAAAGTCAAAT 
CAGAATGGAAAAGACTCAAAACCATCATCAACACCAAGATCAAAAGGACAAGATCCTTCAAGAA 
ACAGGAAAAAACrCTAAAACACCAAAAGGACCTAOrrCTGTAGAAGACATTAAAGCAAAA^ 
AAOCAAGTATAGAAAAAGGTGQTTCTCTTCCAAAOTGOAAGCCAAATTCATCAATTATGTC^ 
ATTGCTTCCGGTTGACT GACCA AGANGCTArTCAAGATCTCTGGCAOTGGANGAAAGTCT 
NAAAATAGnTNAACAATnTGTTAA 

SEQ ID NO: 2984 CACCAACTGCAAAAATTArrrCCAGAGAAGTATCrGATGGTATAATTGCCCC 
AGGATATGAAGAAGAAGCCTTGACAATACTITCCAAAAAGAAAAATGGAAACTATrcTG^ 
AGATGGACCAATCTTACAAACCAGATGAAAATGAAGTTCOAACTCTCTTTC 
AGAAOAOAAATAATGGTGTCGTCGACAAGTCATTATITAGCAATGTrGTrACCAAAAATAAAOAT 
TTGCCAOAGTCTGCOCTCCGAGACCTCATCGTAGCCACCATTGCTOTC^ 

m^rirnrnNTCNTTACTAm^OACAATociGc^ 

TCACTTTCTGNANGATAAATTCITANAAANAGACrGTTGOGACAAAAAGTTTWGCC^^ 
ANACATTGCAAACTGCOCTCCTAANAAATrGGCrCCANrmXC^ 
NGNCOONACCACOCrTNGGGGAANTNCACNCAm'GGNGGCGNTATNNTGNTOC>^ 
OCTTOGNGTrNTNTGGGTATAACTGTTCCTGTGN 

SEQ ID NO: 2985 GTNCG(XJGGGAGGCATOAOGCANCCAGCGCM*GGGCrrNT0CrOANG0OO 
CCCAOCGQAOCTTOCGQAAACCGCAOATAATGl 1 11 1 i ICTCTTTGAAAGATOGAGATTNATACAA 
CTACITAAAAAATATAGTOJATACAGTTNCTAANATATTGCTrAGCGTTAAGhnTTTO 
TTAATAQCrraWGArmAAQAGAAAATATGAAGACmAAAGAGTNTCTTOAGGAACGAA^ 
TANANGGTITCTAAAACATGACNGAGGTTGACATGAAGCTCCTTCATGGAGTNAAAANNGTA 
AAAATANAATCTGGGAGAAAGGACTACXX}OGCCCNCCATTAATACX:AATAATAAaOOCACT0CTT 
TTAQATrAAAATGACGGTGACITATAC7VGGC>rrATAAGrn'AC>nTrAAAAGTTGNGNGTC 
AAAAA>nrTGAANGCGATC 

SEQ ID NO: 2986 ACGCGGGGATTGCAGCCTCOGarrrTOTAGAAGAOaAOCATCTGCTCCAQAT 
OOAAAATXKHrrAAGGAACAATCAAATrCAATTCGGGAAGAATATAGAGGTOGTAATAAmraTA 
AAAGTCTGCAGGGTTTrrmAAOCAAAGGAAACTGGTnGAGAAAATTGTGAAATCAGCATTGA 
AATirOTTAOCTACTXX^AATCAAGATTrGCAAACTACAAAAAAATCATItrroAAGCr^^ 
AAATATATACCAAGATAATrrCAGATAAGAATGTCCAGCAAGGAAGTGAAGACTGCTCTAAAAAG 
TGCTAGAGATGCAATCAGAAACAAAGAATACAAAGAAGCTnGAAACACTOTAAGACAGTGTTAA 
AGCAAGAGAAAAATAACTATAATGCCTGGGTrmATTGGCGTIGCrcCAGCTOAACTAGAACAA 
CCTGATCAGG<XX:AGAGTGCXn'ATAAAAAACrGCTGAATrAGAGCCAGACCAATTCTACCTTO^ 
AGGGGTTAOCAAACTTGTATOAGAAATATAATCACATAAATGCTAAGGATGACTrGCCrcG^ 
ACCAAAAGCT CCTGG ATCTTTATGAANAGTGTTGACAAGCANAAATNGGTGTGATGTCT 
AACTTGTGGATCTTTNTTTNCC 

SEQ ID NO: 2987 ACAGAAATACOTOOGTGAGGGGGCTCGAATCGTTCGTGAACrCTTTGAAATG 
GCCAGAACAAAAAAGCCTGCCTTATCITCTITaATCAAArroATGCrATTGGAGGGGCT 
ATGATGaTGCTOGAGOTGACAATGAAOTGCAOAOAACAATOTTGOAACTGATCAATCAGaTGAT 
GGTrrrGATOnaiAGGCAATATTAAAGTGCTQATGGCCACTAACAGACCT^ 
GCACrGATGAGGCCAOGGAGATTGGATAGAAAAATTOAATTrAGCTTGCCCGATCrAOAOGGTCG 
GACCCACATATTrAAGATTCACGCTCCrrCAATOAGTGTrGAAAGAGATATCAGAm 
AACAC«ACIGTGTCCAAATAGCACTGGTGCTGAGATTAAAAACCGTCItKX:AGA 
TTGCCATCAGAACACOOaJAAAAATTGCTACCCANAANaATTTCTr^ 
TTAAGTCTTATGCCAAATrCANTGCrACTCCTCGTTCATGACATACAACTXJAArc 
AAGTCAAAACmAAATrGGAATCCTAOCTTATATAACTnjGTAATAACCAATrCNTAACrc 
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AAAAAAAAAAAAAAAAAANO 

SEQ ID NO; 2988 ACTGTrrCXrrcAGCAGAGGAGAAAAACrCAACCTAOTrATGAGACCAACCAC 
ACAACACAATGAAAAGCTGCACTAACTAOTTCAOAATGTTAGrrAAGATGATGCTGGTGTGAATA 
ACTXXrnTTTTCTAGAGCCCrrATAAATAAAATOOCXXAGrrAG^ 

GTTAACATGTGOTAOAATOAOGACrrATOCAAGGTATAAATOCOCATAOCATnTACrACTATOAG 

AACAAOTGCAGTCAGAAGAAAACCAACTGGACTCTAAATTACACACACCTTAATGACAAGACTrc 

a:ACCACTTGTGAATGTAAAACArnAATTrAAAAATOTrcATACTACAATATATAA>^ 

TATAAATGCACATAGTOTATTCTATAGCTGCCAGOTTrA CI 1 i i 1 J 1 i 1 1 1 1 1 AAAGGAAACTGTAA 

GTTACACTGTGGTTAAGACTTGTATCTrCACCCTrGAAAAAGCCCACArrCTATCACAGTGATC 

TOOTCAOACTTAACAOiXCCAAriXnTAAACACTIXKWATCAAOTCATAACC^^ 

AOGACCCTGT 

SEQ ED NO: 2989 ACTACCACAGCCTTrAAGTOACATTGATTTATAACnGGTCACAATrCACTGC 
AmAOOAAAACCAaCATrCTrATCTGGT(>GTGCrCOCTTCITAGCAACCCCTAATTAAATT^ 
TCATCTCTAAATCTTAGCTrCAACTITATTCAATTACATrTOGCTGAOGGCTGTT^^ 
TAAOTGTTGACCATAAATGCAAAACmX:AGTATCTGTrGGQTmATTAGCAOATOCTO^^ 
TTAAAAAAAACCGACAQTATAACTGTCATAATTATGGAAGGCACTtXTrCCGATAATTATATTCTA 
TTAAAAAAACACCATrrATAGTGAACTCTGTCACTGATAAATAAACAATAAATATCTCAOTG(XAA 
AAGGACAGAAAGCTCTOXXn'AAGATTAACACmOOCCAAAATTTGQCAG 
AOTCrrGACAACTOAGTCrGCAACTAAACACCrGAAACTGOTTCTCT^^ 

ACAAAAATAACAAAGACTAAATGaAOGCTTATOGOOOAAOOGACAGAOGAAAAOAAAATATACT 

AAGGCTntiGCTICTt3GTaXXn"CTrrCATAAATGCACT^ 

GCATCACAGTTGAACT 

SEQ ID NO: 2990 ACCAAGAAAAATAAAGAAOAOOCTGCAGAATATGCTAAACTrTTOOCCAAG 
AGAATGAAOGAGGCTAAGGACAAGCGCCAGGAACAAATTGCGAAGAGACOCAGACrrrCCTCTC 
TGCGAGCTIXn-ACTTCTAAGTCrrGAATCCAGTCAGAAATAAGATTTITO 
OATCAOACTCTOAAAAAAAAAAAA 

SEQ ID NO: 2991 ACATCAATAACCGGGOAl J U ICi 1 J H1GCTATACTATTTTCACAACCACGG 
GTGTGCTTGCTAGGCAATATAACCATCACAGAAGCAAACGTTAGGCAGAATACTGOTAAGAAAAA 
ACAAAGCAAGAAAAGAGCTACCA0ATATA(>GACAGGCTIt30TTCCACATTCACT^ 
CAAGCGCCAAGTArrAACTGCACATITrrGTTCAGCrAGAAAGQAGGGAl 1 i Ti l l'i'n<J'i'l-l-l l'l'J 

CTrnTrrrGOTTrGTTTrAAATCAOTOCATAAATTmCTrrr^ 

GATGGACTCTACAGCTAAGTGGAATATCAAAGGTAGAGGGGTGATTCTONGAGACTGATAGGCCT 
GACTATIt7rcAATT^mXXX:ACTGCANTGT^CACGCAAC^^C^^ 
CATCTQACACTGACCTOGATGAGTOCACTTGGOAGACCTGGTGa^GCACAAA;^ 
GGC^AOCCTAATACNGGGAGAAATNATGGAT^^^rAAAAATAC^^^T 

SEQ ID NO: 2992 AcmrmTrnrmTTTrrTrmGAAGGATT^^ 

OTCAOAGrrTGTATGATTAATAANAAGCAGCnrrTTrCATGAAATGCTTGGAGGTCAAOT 

AOCCTOTGAOATCCGACCATCCCATTAACTTTGAAGTTrCTCTTG 

AGGQTaAANAAAAGGAaGAACATOCTAAAAAarrrATGACAATCATCCAAATX}TGA(^ 

CAACCOATTCACCAACTCCACTITTTCTATTTTACAACTr^ 

CCIXMCTGAAACAGCCTGCCACTCCCTANAGCCCCTX^ 

AGGAGGAOAAGCCCTGCAGOATCCGCTGACCACTrCCCAOAGAACrGACAGTCXXSTGCT 

AGTTTGAAOCAACAOCCTAATGTGAAAAGAAACTrGCACTGAAAGGTAAAGGAGGAAATGGNGAT 

GAACTOGGCTTATOTaAOAATOTCTATATmCATTAACACAGCCCCAAAANTrmTCTCT^ 

TCAGCCrNAGGCAAACCCCGTTCraOTTGGTGGOTTCTOCAGGGATCNCTCCCACC^ 

ATCCCAAAACCATNCnr 

SEQ ID NO: 2993 ACATCAGTOAATTTrrAAATacrAAAAATTTATGATAAAAQAATACTGAATC 
AAAACATCAAAGAAAGAAAATAGAGGCTCAGCAGCATGATTTCAATATArnrCCTGGAAAAAGT 
1 11 1 1 iTi ri rriJ ' lA AAGTQTATGACnTITATCCAAOAACAAAAOCTTrCTAAATGCAATOTTrA 
GACTTGA l H 1 m 1 1 lA AACTAAGCCCAOAAAACTGGGQCrCATAAAATAAAGGCACAATGTGGG 
CAGCAAGAGAAAAAAGnAOAAAAAATGOGGAAAAAATTOGTTITAATCTCACGTGTAACATAGO 
TTGACTGGTTTT Gl I I l ltil l 1 1 AGACAATrrAAGGCAGAATrATCTCCCTGCAACACrrGCAGATAA 
ATCCTAGTGrmAGCTATAAGTTTCAAGAGTTCAGATATACAGATAAAGCAATTTAAAAAAAGCT 
TTTCTrATTXn'AAOGCTnTOATGCAmTATAAATCTn'AAAATra^ 

CATACATCTATACTTTAATAAGGCACIXm'AAGATATAAAACAGTnTAAAAACAATGCATTCTAC 
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ArrAGGGCTAACCX;ATTCCTCTOCCTrAGOGGATGCKiAGTQAATT<XCGC^ 

SEQ ED NO: 2994 ACA(jITrAACrATGTAACTATGTrTCAGGAATAAACAAAATCCATGATATTTr 
AQTAACrCATAOTQTATTTATAGAATOAAAAGTTCrCTATCAAAATACACrriTCAC^ 
TAAATAAAATAGACAAATXKiATCTACACAAAGTAAACATTAACTlTCK3TAGATrrCAGTCrrACnTC 
ATAACAAGCATATTTGCCCrrATTraCCAGAGCTGCrCAACTACCAAGAATITnAAAATAT^^ 
AACTOAGATTRATTATCnTGACATITGTTTCTXIATrCCACATCATCTrCAGC^ 
TTACAATTCTCTGACTAATTGTTGGGAGCTrAGTCAGAAGCATCTGGAACACTGGTCGTGGAAGAG 
rrTOCrOTAAGACCTXJCAGTAACrOCTGTOGCTGCCACACACAaCAATTGCAT^ 
CGaXrrrCTCATATKVtCCTTQAGCTAGTAACTCrTCACCAAGCIXn'AT^^ 
CTGAACAG 

SEQ ID NO: 2995 ACTACTrGOTrCCCGATATGOATOATOAAGAAGaAGAAGQAGAAOAAOATQ 
ATGATGATGATGAAGAGGAGGAAGGATTAGAAGATATTGACGAAGAAGGGGATGAGGATGAAGG 
TGAAGAAGATGAAGATOATGATGAAGOOGAGGAAGGAGAGGAGGATGAAGGAOAAGATGACTA 
AATAGAACACrGATXKXATrCCAACXrrriC CI 1 1 1 1 1 1 lA AATTTTCTCCAGTCCCTGGGAGCAACTTG 
CAGTCTrriTTITITTITi mxXCICITG^ 

CCATOOTTCTCAAmATnXKjOOOOAAATACCTraANCAGAATACAATGGG 

CCCCTrTCTGTrCGAAGTTCATITrTATCCCTTCCTGTCTG 

ACCGAOCTCTOTGOOAAAAAAOAAAAACCTGCTCCCriTCOCTCTOCTGOA^ 

GGCCCCrcTGTAGTANTGCATAAAATIXrrAGCrnTrCCTCCTI^^ 

GTAOCTGCCGGGCCGGCCGCTCNAAAGGCCGAATTCCANCACACTGCCOGCCCGTrCT^ 

CCGAGOCrCGGTNCCATCrrGG 

SEQ ID NO: 2996 ACCTrrGGAGATAATGATAAAATrCAACAG<m'GAACOGTCTAACAA^^ 
TTTAATGTAATTGTGGAAGCATTGAGCAAATCCAAGGCAGAACTCATGGAAATCAGTGAAGATAA 
AACTAAAATCAGAAGGTCrcCAAGCAAACCXCTACCTGAAGTGACTGATGAGTATAAAAATG^ 
TAAAAAACAOATCTGTrrATATTAAAGGCrrCCCAACnXlATGCAACTCT^ 
GOTTAQAAGATAAAOOTCAAGTACOCGOOGQAAAACAATGAAAAGGCCCCCAAOGTAGTrATrc 
TTAAAAAAGCCACAGCATACATCCTGTCCGTCCAAGCAGAGGAOCAAAAGCTCATT^ 
AACTTGTTGCGGAAATOACGAGAACAGTrGAAACACAAAmGAACAGCTACCGACTCTTGTGCG 
TAAGGAAAAGTAAGGAAAACOATTCCTTCTAACAGAAATOTCCTCAGCAAT^ 
TrTCAAATGCATGATCAAATGCACCrCACACCrrCCTGAGTCTTGAGACTGAAA GATT AGCCATAT 
OTAACTXjCCTCAAArraACTITGGOCTAAAAaACriTTTTATGCTT^ 
TTNAnTAANAATGCmT 

SEQ ID NO: 2997 ACAGCTAACATOAGATAAGTCAAA AGTTCCTATGGTT rA AATG AACTCCTAA 
OACTATOATCOTTTTTTTITrrAAATCTGOGTATTGOKnTTTTTCTI^^ 
TCAAGACTTGTAGTGTIGTAAACCTGCCTCAa^AAATACATGGTAATAACTr^ 
AAAAAAAAAAAGAO^GCCrn'ACAOCATITCTAGTOGCACACTATnTO 
TTCAATTTCCOCATraTGACCCCTATCACTTCATTrGATATaXT^^ 

TATATGGGC^TGTCCATAOATTGACAAAGAAAGTrrACACTTTTGAATAAAGATGCAAAOTATC^ 
AAAAACATTAATACrOATGCGAAAAAATAAAAAATAAAAOAGAACANGGCAGAGGAAGAAGGT 
GTTTAAGCTCT(XTCGACCTGrKK3AATGGTGGTTAACAGAATGAT^^ 

AGOOOAQAAAAAAAAAAAACAACAAANTTGGNGCrrAAAAAAAAGTAAAATAAAAAAAAQAAG 
GT 

SEQ ID NOl 2998 A CJ 1 1 1 1 1 ' i 1 riTriTl ' l - n I ' iTl II GGAACAGACGATAACTTTATTGGANATTT 
ACTTGGCTACAATTArTACAAAAAAAACAACTAAAGaTAATTGTQATCCATAAGONGCANACrOC 
AAATITCTGCAGCATGATTACTGTATGAATCAATGACATCATGTTCC^ 
TGTGATGGTAAATATTTTATTGTCAATCAAATAATATGCATTAGTITrAC^ 
CCTTCACTATTTACAGCANAAAAGCCAGAAATTTACTTCCTGTrcACCTI^ 
ACCTCCTGCrcCAGTGAATTCTTAGTGCATCTATAAAATnAAATTGTCTGT 
AAAAATOGACTAGAAAAAAAQAAOTAATGATTAAAATATAriTACAAGTTAOTTaTGAAAAATCA 
TTTrCACTGCTAGTGGAAAATCAATCGTCTX^ATGCAAATrrC^ 
AACAACTTTAAAAATCAmTAATAOTCrrGGTCTrAAACATATCTrrAA^ 

SEQ ID NO: 2999 A Cl ITl l 1 1 1 n ri l ' l ' l I I 1 11 1 H OGGAANAATCANAATnATrTCACrATGTO 
AACATTAAGAATTTACXrrACATAGTTGAAAATATTCACAAAGGACTrGATCArrCACACTCATACA 
CAGAGAAAGTCTXSCTOAATAAAAAAATGCTCCTITACCCATTCQACC^^ 
GGGTTTGCCAATTCCACTGTGGGGCnCV^GTCTTCATGAATOTrGAATAGTrrCGCT 
GATCTTCATGCTGCTCACmrCCrrAATGATCTTCCCTGCrATTGG^ 



459 



wo 02/29086 



PCT/USOl/30732 



TAGCACAATCCCCACGACCACTTTCrTa;(nX}AGGTAGAAGATGACA(XTnGCCGTATCCrCCCrc 

tggacg<xiagcctgtggaactgccg(kkn'gcrg<k}aogaatagtaatttctoaao 
troctcactctctgatcggataccagttcctgactgcrctgtggow 
gttgcrmocaaaaacaccactctgggcaaactactgtccacaaagacc^ 
catoggggcccaaatactcx:anacattqactoatgccaqt 

SEQ ID NO: 3000 A Ct L 1 1 L i II 1 ill 1 1 J 1 ill 111 ill l ANATGCTAA>mT^ANTNTTTA-nTNAN 
N^^TNA^WANNNNC^WACAAACNTATACAAAGGGC^mTCAT^^ 

AGOTTrGGACAATOTCAAATTCCCrCACCATAhrrGCrATAOTTATACATTAAATATAATTACTATC 

TATACATATGCAAAAATATAAAGTTCTATTTCATACAATAAAACCCTTTATANAAACCVlTmAAA 

ATTAAGCAGAACTTCTCAACATTAATATGTGAOOTCTAAaTCCrrCTAAAGGTrrCTITAA^ 

TTNAAACAAAATGCTAAAOCTAAAAACATTGTCCTGTCAGTTCCCAAATrAAATCTACTTAGAACA 

AAAACAAAAATTTATACSCTGGOTCACATACTACTrAAATAATATrGTTCAGCCATCr^ 

TCXTiTOrnrCAAOTATOOAAATAOAACTCAAATATTCCACAATACAG 

SEQ ID NO: 3001 actacaocagtcaaagaoatctocactagagatcagaaagaagcaccactat 
trrctrctatgacgtgtatgtgttggtcatgagcatgctagtatgaataaggcaato^ 
ctogcatacaaatocaoctaaaogtqctgaaggaagccagtggggtggtgcaggcacacagcag 
ggagcrctrccccgtgacacgttagtcatcrrctccacagagcagc^ 

CCCAGATGTATCTCCCTTAATCATOGAATAAAGAGAGOTGGCAAAATTCTTCCTA^ 

GATGTTAAACAGATCAATCTCACrCCTGGAAACCATGACTCTOATGAGGGTATGATCATCTGTCCC 

AGCTCXX7rrCATAGCATAATAOAGGGTCTC7XK:AAGCTAOGCAOGTATACTTC^ 

AACAOCAAOOAOTAOTTOCICrAAATTGCCyiOAAGTCTCX3CGOT€AATG(^^ 

TCCTOATATAGTCATGT 

SEQ ID NO: 3002 ACTGGCATTCTCCCAGGACATCCCCCAOOOTGTCAAAAmAOCCAAAGCAA 
ACTTCAAATGAACACAAAGTCCTTATmCTCAAAGAAAACAAAACAAAACAGAACAAAAAACCC 
COCAGATrTATTTAOCTTAAACATTrCCCCTGATCTAAAGAGAAAGATGGCAAAAGATACCTAGCA 
TATATGCACAAGCACCCACTCCAATOAACTCAOATAACCTOAAAGAAAAAAGGACAAAGTTCTGT 
TCAACn-AGAAAATTATCTTCGCTTrroAAAGATGrrTCCACGTCCCATTrc 
TGCGGCCACTrGGGCCTTOAGAATGGCAGCTm<XTCGGCAOCCCCrGAOCCTro Ori 1 1 1 ATTIT 
TCATXKTCTTTAACATGGGTGCGrrCTTCAGCATGTCAGGCAAAATCAGAAAGC^ 
CACGGATGTATACCTGCIXX:AGCTGTGCCACTCGGCCATaXr[Xn'AT^ 
TCTGGCAOTTCATGTTOTCCTaXKnTCAATGAGCTrcCCCGATATACCT^ 
CATGTCACAATGTGGCCCTCGGCCTCATOCAGTACCTGCC 

SEQ ID NO: 3003 A Cl i - l ' lTri 1 1 1 1 1 1 U 1 1 i 1 1 1 1 1 IN GGnTTATAGATTTATTTTCAAAGAGTA 
AAACACATACAAmAAATACAAOT<XATCANAATC>CTCTQTCAGAGTCAGCAAOAATXK^^ 
ATGTGTCCTCACTGCCAGACATTCGTCACrATCATCTGAGTCCTC^^ 
TGATACTTTATTCATATACATCCTTCrrATCTGCAATGTAATCCACAATT^^ 
CTrTTCAGCATCTATATCAGGAAmCAAACa:AAATrCGCITC^ 
GGTCCAAACTGTCTAAGCCCAGGTCTTI^TAAAATGAGANTTACTC 
TTOT 

SEQ ID NO: 3004 ACTGAATGGAAAGATGAGCATrCCTAGTTCTACACr lUrn IT IXXXXCTCAT 
GTGTAAAATGAAAAGAAAACTAAATTraCCCTAATACCAAGGCXKn'ACGTrrATroCCT 
TTCACTGACXTm3TAAT0ATACACACnX}AATTCriTITGACAAAGAGA^ 
AGAGCTGCTGTrrTAATGCCTATOCATrrACTCTrTCCTGATTTAGGC^ 
TTGCATTTCTCTATTTTTrTAATGTACCCTTCCGCTTACCTATXKXX^ 
GCCAAGGTGTTTTTCXXKjCATOGAGCCXXGGAATGGACCXJ^ 
ATCTrrOATGAGCTTCCGGATCTGCTGACGGGAGTrcGCATTGGCQAm 

otctaaccagaccttctrcttgccacagcggaggacactagaggcgagccrc^ 

gcatactcatcgctgcggcogcagcaancgaaaggaaag<xxxxk:gtaotgccccg^^ 

cgaaaggcg 

seq id no: 3005 acaatcaataagtcttaaatctctcttccatggattrcccccatc^^ 

AGCAOTAAAAGGCATTTTTAGGmATATAAACACATIXnTrACAACrcCCTrCT^ 

TACTOTAOTATATAAATAOCACTOAGCATnAOTAAATAAGTTACAGGCCAATAAGTTCAATTGTG 

CAGrnTTGAAATCTAATGTCAAAGAOTTTrrcAAACCTTAAGACTCGGTGATCT^ 

ACATOTCTTCGACAOCCTTAAGACAATAmOGTTTCCTrcrrACTrOOACCTGCAAACTOAT^ 

CTCTGmATTTCACTAGCTGCCCAACTAGCATAmCATGTrG TC 1 1 1 1 CGTmTGCACTrGCT 

TTTGGATTAGAAGCAATATGACTTGCCGACATATAAAGTOCAAACAAAGCTCTCATATTTCTG 
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rrCAGTnCAATGCCTGTGCAAAATACTITCTTOAAAGTTCOAGG^^ 
ACTTAACTTCAGCATACTOCTGACAGTATAAGTGGOTraTOTGGATrAmOkTCATTAQT^^ 
AAACAAAAGGCTGCTnGCATAAGCATCTTCATrGGATGTAAAGTTCTGCAGTTCATGCCAGGCrr 
NTTTGaCrCCA 

SEQ ID NO: 3006 ACAGCTCATCAGCOTGGCCAGGATGGCATCAGTtKK:AAACACATTCCCCrGA 
GTTrrroCCAGCTTGCGGATGACAGGGTCGTCTGTGGTGGTGACAGTGT^ 
CTCCGCAGTGOCrrcrO^CTOCTCOTGGTCATOCGGTCAAAGGCrrT^^ 
CCACAACACrCAATGTCCTGTGGCTCTGATACTTCCAAGTAGCGCATCT^ 
TCCAmCCTCmCACTfOCCAATCACTACGAACTTCAACTGAAGAGTCTCGGGGTnCTGTGAT^ 
TCTCATCCCATTTCTGCCTAACXXCAAATTGrrrCTGGAACTTTTTC^ 
CTCrmnXiTTIXXKIACTCTrAGGCAGGATCrcCAGGT^ 

GCTGTATCCACCANCTGOAAGCTACTTTCATCCTCCTCATOOAAATAAGCAT^ 
CCAAACTGANAQGAOTAOCT 

SEQ ID NO: 3007 ACGCGGGGCGCTACCCTCCCOCCGCCCGCGQTCCTCCQTCOGTTCTCrCC^ 
GTCCACOGTCTGGTCTTCAGCTACCCG<XTrCGTCTCCX3AGTr^ 
CQGCOCGAAOAGGCIXKJACTCGOATTCOTTGCCTOAOTAATOGCTOC^ 
GATTaTrGOTOATOGAGCCTGTOOAAAGACATGCTTGCTCATAQTCTTCAGCAAGGACC^ 
AGAGGTGTATOTGCCCACAGTOmGAGAACTATOTGGCAGATATCGAGGTGOATGaAAAGCAGG 
TAGAGTTGGCnTGTGGGACACAGCrGGGCACGAAGATTATGATCXXXrraAGGCCOT 
CAGATACCGATGTrATACTGATGTGTriTCCATaSACAOCCCTGATAGTITATAA^ 
AAAKTOGACCCCANAAAGTCAAGCATTTTCTOTCCAAACXrrGCCCA^ 

GAAGOATCTTNGGAATOATQAG<>CACAAGG<XiGGAGCrAGCCAANATGAANCAGNAGCCGGT 

SEQ ID NO: 3 008 ACATGGCAATrAGAAGTTGTCATGGCAAAAGAAAACCACAGCTGGCCTGCCA 
CAGCCAACACAAGAACCAGAAAATOGTAGATOAAATOAAOOAATAAAGGTaOOQ-nTATrCCTTA 
TTATAAAAGAAAAAAATAATTCTTCAGCAGTCrrAACAAAGACATCAAGATACAAAATTACAAGT 
GTmGACrCCAGCCCTGTCCCCATCrCCTCCAAGAGCAGAGGTAGGAGACAGTrGAAGCA AACA 
AGCAATTCTGTAAAAATrACCTAGAAACCCTACAAATnX}ATTAAAATCTAAACTTCTATAAT^ 
GCTTTTTAAAAAATTTAATATCAAAAGGCCTGCTrrAGrcACATGCTATr^ 
CCCAATACCOCCTCTGTCAGTAACATOCTCAAOTTQACCAOCCAACTCTTATCTCTA^ 
GGACAAGTGTGTCTTITAAACCAAAGCCACGGAGATCAAGTGACTGCrAGTAAOGTGTTGTCTGTT 
AACTAATCCAGTGCCCCCTTCTCCAQAOGGTGGOCAGaOCAOAAACCCAAAAOGGCTTOAGQGCC 
TAATCCCANATCACXATCTAAGAACOVCITCCCTTGCICrCTACTCAAAATGAGAT 
AATCCGCTATTTGCTr 

SEQ ID NO: 3009 ACGCGGGAGTCCAGTCCCAAGATGGOGGCCACCATGAAGAAGGCGOCrcCA 
GAAGATGTCAATGTTACITIXXiAAGATtVUVCAAAAGATAAACAAATrTG^ 
AATCACAGAGCTGAAGGAAGAAATAGAAGTAAAAAAGAAACAACTCCAAAACCTAGAAGATGCT 
TGTGATGACATCATtKTIXK:AGATGATOATTGCTTAATGATACCTTA7<:^^ 
ATTAGCCATTCrCAAGAAGAAACGCAAGAAATOTTAGAAGAAGCAAAGAAAAATTTGC^ 
/VAATTGACGCCITAOAATa^GAGTQQAATCAATTCAQCOAOTGrrAOCAGATr ^ 
rrGTATGCAAAATTCGGGAGCAACATAAACCTIX}AAGCTOATGAAAGTTAAACATT^ 
TTTTTTATTrcTTTAATAAACTraAATATrGTTTAAAATOATAATTI^ 

AAAGCAAAA Cl i ICH 1 1 1 l AAAAATTTrCATTTATTTAATGGAAA CrroCX XZArrrrCACATG^ 

GCTTATITATTITATATITITAAAAGAAGACAGTATTCACCTATGTATTTrrGNATAAC^ 

TCAAAOTCTAGGGGCTTCAT 

SEQ ID NO: 3010 ACATOGAGTGTrCAGCAAAGACCAAAGATGGAOTGAOAGAGGTrnTGAAAT 
GGCTACGAGAGCrGCTCTGCAAGCTAOACGTGGGAAGAAAAAATCTGGGTGCCTTGTCTTGTGAA 
ACCTTCCTGCAAGCACAGCCrrrATGCGGTTAATrrrGAAGTGC^^ 
TTACTGGCCTTITIO^TrrATCTATAAmACXn'AAGATTACAAATCAGAAGTCATCTO 
TATTrAGAAOCCAACTATGATrATrAACGATOTCCAACCCGTCTOGCCCACCAaOGTCCT^ 
ACTGCTCTAACAGCCCTCCTCrGCACTCCCACCTGACAC ACCAG GCGCT 

AAcrrcTracmrmcTAGAAAGAGAAACAGrrGGTAAcrrrr^ 

ATAACTAACATGTCCTGCCTATrATCTGTCAGCTGCAAOGTACCCCAGATCACCACTrc^ 
TGCTTCCAGGTAGAAATXKiTTTTrrGCrrcGACGAATAAAATTOGGAAACACIGCAGGC^ 
CTGCCCTTCrrAAQAACCACAATOCACATTraCATATCAAAGOCTXrraAOA^ 

ATCTATTGGcrnrcnT 
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SEQ ID NO: 30 U ACGCGCKmJAACAAACTGCTGAGCCCrGTajrCCCCCAGATCAGTGCCC^ 
AATCCAACAAGGAGAGGCTGAAGAACATGGCACTCTCCATrGCCGAACOGTATA GGGCTCA AGGA 
ATAAGCGCAAATAAATITGTGGACTCCACXnTCTATCl'1'Cl 11 ICKjACTTGATCACCTTTnTGACG 
AGTATCATAOTGOTCATATraATAOAOCrrrraATATCATrGAaCOCTimAOCrGGTCCCCCTaA 
ATCAOGAAAGTGTOGAAOAGAGAGTGGCTCKTITCAGAAA'nTCAGTGATGAAATCAGGCACAAC 
CTCTCAGAAOTGCrrCTraCCACCATCAACATriTGTrCACACAG 

AGTCCATCCTCGTCATCCAOGCCCCAGCGAGTCATCGAGGACCGCGACTCrCAACTCCGAAGTCA 
AGOCCGCACTCTGATTACCTTTGCTGGAATGATACCATACCGAACGTCTGGGGACACCAATG^^ 
GGCTGQTGCAGATGOAGGTCCTCATGAArrAAGTGCCATOCnTraTaGGAOTCTGO<^^ 
GTCAGT 

SEQ ID NO: 3012 ACCATTTArnxn-CTGCCOCrmAAAAAATACCCATrGGCTATGC^ 
AAACAAmGAGAAGTTTrmGAAGTITTTCTCACTAAAATATGOGOCAATTQTTAGC^^ 
GTTGTGTAGACrrACTrrAAGTTrcCACCXnTOAAATOTGTXIATAT^^ 

CAAQATTAGCAAAOQATAAATGOCQAAOGTCACTTCATrCTGCACACAGTTGGATCAATA CTGAT 
TAAGTAGAAAATCCAAGCrrrGCTTGAGAACrTTTGTAACGTGGAGAGTAAAAAGTATCGGT ^ 
TTCTTTGCTGATGTCCrnCTGCnGAAATAACAGTCACCATA 

CCTICrAAQTAGGCAGAAATGGTATCATTATGTTGCCGCTCTCCAATCTCCCAGAGCTCOCTC^^ 
AGAGAATCACCTTCTTTCG C ' t 1 1 ' L ' l I ' l l I ' l ' l 1 11 GAGGTANAAGTCT(>CrATGTTGCCCAG ACTAG 
CCTITOAACTCTTGGaCTCAAGTOATrCTCCCTCrCACCnXXXXlAOTAOC^ 
GCCCACTGCACTrGGCAANAATCACCTrmATAAAGCOTNAGNCTOCTTNCACAA^ 
•mANNCATNGGnr 

SEQ ID NO: 3013 ACCACAaTATCnXXriXXrrrCTO<XOTCICrCTOTTTTI^^ 
TTCAACTCTGCGCCTGGATAACriTCATGTrAATXXATTC^ 

AAGACA<XCAAAAAAGGCCAAGCTGTTCACCCAGGGAGCCATACTOGCACATTCCTTCTGCGC^ 

GATACTATCTGTTAATrCCCTTCAGCCAGGGACCAGTCACTrrAGGCr ATTAO CCT^ 

AGAAGAmAAGTAAATATCTGATITGAGGAACCTGGGATAAGAGTCCTTITCCAT^^ 

ATGACrnTIGTOCTrCATCAAAACAAGTOGGOOTrOOTOCmA^ 

OAarcCGGAAGTCAATATTGATTTGTTTAGCAOCATCTGAATGCACAAATGCm 

CTGCmACAGCGCAAAAGATCAAGACTCTG'ri'l iUi l ATAOTC TTCA CAAGCCANCCAGAACTCA 

ATATTCTCCrCACTGAATTCAGACTrrAGGAACnXXlAAAGACATTTT^ 

AGAAAGTTnXX:AGAGATTGANAACCNTrGCATTACTT 

SEQ ID NO: 30 1 4 ACGOGOGCCTTGTCCAGTGAAACACXXrrCGGCrcGGAAGTCAGrrCOTTCrCT 
CCTCTCCTCTCTTCTTGTTTCAACATGCTGCGGACTAAAGCAGACAGTGTTCCAGGC^ 
AAAGTGGTGGCTGCTCGAGCCOCCAGAAAGGTGCrTGGTTCTrcCACCTCTGOCACTAA 
TCAOrrTCATCGAGOAAAGCrOAAAATAAATATXKIAGGAGOOAACCCXXJTITOCOT 
TCCXrAAGTGGCAAAAAGOAATTGGAGAATrcmAGOTTGTCCCCTAAAGATTCTGAA^ 
ATCAOATTCCTGAAGAGGCAGGAAGCAGTGGCTTAGGAAAAGCAAAGAGAAAAGCATGTCXnTT 
GCAAOCTGATCACACAAATGATGAAAAAGAATAGAACTTTCTt^T^ 

CAGGCAmAATTAAAAAATTTAGGTTTAAAmAGATGTrCAAAAGTAGTTGT OAAATTTQA 
TTTGTAAGACTAATrATGGTAACTrAGCTrAATATTCAATATAATGCATTGGTrGO l 11(^1 1 1 1 ACC 
AAATTAAOTGTCrAaTT 

SEQ ID NO : 30 1 5 ACAGGTIXnGTCTOCCAGTTCAOTCCACAGCTCAGAOTATCACCTT^^ 
TTCCATGGTATAAGCKnTGOGGGGGGGCAGOTCTGOGGGTCGTGOATTCACTGGACTGGATGG^ 
ACATGATCCAGAACTCOGCTCOOTTrOOCTTCCCAAGOATCCCACCAACTCATT^ 
CACTGAGGAAATXX:AnGTATrCCTATTCACTATTIX>AAOAT<>GGCCT 
AAGAAAGTTTrCIXytAGTATATTrAGTGTrrATX^rmACTATAGTTCrrC^^ 
ATCTTrnXXTACCTCTAAATTCCl 1 JCi 1 1 1 ICACATTATCnTCTTCATTGCrrrri AATAG AAAA 
ACAAACAAAGACATGGATTTACTGTGCATATTAGCAGATCCATACTGGAAAATGCATGGAGGTTr 
CATATACACCACTTACAGTAAGTAATAACTCAOAGTATAAAGTCGAAAAGAAAOAATCTOAAATA 
TTAAGACrrGriXTGAAATAAGCTTACCTAGGATGATACCACTTTCGCTTAATCAAGA^ 
Tr<XAACTAmAACAAGTOOCAAATATAAAAAATCrcGTAGTTAAAATACACA0CAATCACTTCA 
TATTACTGCCTCCCT 

SEQ ID NO: 3016 ACAGATGGGGTCnTGCTATGrrCCCCAAGCrOGTCITAAACTCCTGGCCTCAA 
OCAATXXTTCraCCTIXKJCCCCCCAAAGTGCrGGGATTGTOGGCATOAGCrGCTO^ 
CATGTmAATATCAACTCTCACTCCrOAArrcAGTTGCrnGCCCAA^ 
AGAAATrATTOOGCTXTmAGOGTAAGAAGrrrOTGTCTTTXjTCTG^ 
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TGTCTACrCTGAAGACCTITAATGGCTTCCCTXrmCATCTCCTGAGTAT^ 

GCTATCCAGTCACITGrrCTGAGTAAGTGTGTTCATTAATXnTrAriTA 

ATATACTCCAGGAaTAOAATAGTGCnTAAAGTGCTXK>GCCAAAGACAGAG<XKiAACT^ 

AGTGGGCTrrGAGATGGCAGGAGAGCTroTCATrGAGCCTOGCAATTTANCAAACTGATGCTGAG 

GATGATTaAGGTGGGTCTACCTCATCTCrrGAAAATCTGOAAGOAATGGAGQAJnX^^ 

TCTGACACAAGATCCGNGGTirGTACCTGCCOGGGCNGOCCGOTAANGGC^ 

SEQ m NO: 3017 ACTCAAAGOTOATATTTGCTITmCAATCCTrCAGGGGAAAAATCCTm 
TTACAAACTTCCATCAOTTTAOGAGTCAOTCTOTATGCCTTTAGTGAGAGAGATarrrC^^ 
mATGGGATCATAAATGAGAAOGACAGATTCTrCAATGGCATGCrGGTAACTAAACTGAGAGTC 
CAGOAOTGCCCGOOTAACOAATaAGCCATAGTATGTOOACTQATACCAGCCCACOTOAAOATGAT 
CAATGTTTACATGGCGAAGGCTCCGCATCATITCCATCTGATATrGGACrrCATCAAACT^ 
CATCCTCTGTGTGCTGAGOOAAAGGAAAGCAGTrGGTAAmCAAGCCGATCTTCrACAACCAGA 
aXAAAAGCACTCCTTCAACAACTTCAOTTCCTrGTCCi'lCriUlltUT^ 
ATACCACAAGCCCATCrATCTGCACTrGCTrCACGGCrGAATCIXX?COAG<XO<^^ 
CriTCXXTOCrGCOCCOOCOaTOOAGCrOaAAaAAGTaGCAOTAGAOCCGGT 

SEQ ID NO: 3018 ACTACTGCTOCAAGAAGOACCTGTGTAACTrrAACGAACACCTTGAAAATGG 
TGGGACATCCn-ATCAGAGAAAACAGmriXnXjCTGGTGACTCCATrr^^ 
0CTrCAItXXn"AAOTCAACACCAOGAaAG<nTCTCCCAAACTC0CC0TTCC^ 
CTCTrGCroCX:ACATKrrAAAGGCITGATATTTTCCAAAT^ 
AGCTTGAOCAAOCrrOGCTAAOATAOAOOOGCTCTOOOAQACITroAAGA 

GGAAGCCCCACTTGAAGGAAGAAGTCTAAGAGTGAAGTAGGTGTGACTTGAACTAGATTGCATGC 

TTCCnXXTTTGCTCTTGOQAAGACCAGCTITGCAGTGAC^ 

NATTATTriTCCntnXXKrrCCTrOOATGTAAGrrCAGTTAGC^ 

GACCATGAGGGTGGCAATGTAAAGTGCCCATACCAGCCAmCGTGGGCAGTCCCTTTC^ 

CTTACCTOTCCTTTOCANCANCCATOAACOOGCrraGCAOGOCCTCTCCAW 

GAAOTGCITGTCTNGCTA 

SEQ ID NO: 3019 acgcggactgtggacaggaaaagaggaacacttttaatatggcagatcatot 

rnTATTTCCAT^^ 

TTTTAAGGGAATTn-CTCAAGATGCAGGTGTrACTrGTCCAOAGAATTTTATCrGAAAAGG^ 

CAQAOAAGOAGGAGAAAGAGGAATAATGGL-J "iUJ-l 7 1 CAGCTAAACATGTCTCATATCAAGAGCT 

GTGTCCACTCTGCCrCCTGGACTGCCATGTGGTCArrTTAGTATGTGAGTCAAAGCAGAATAATAO 

GOAAACATrAAATCTCT<XTrrACAGmAAaAOCTraAAAOCAAAAOGAAAGTCrGAAAAAA^ 

ACAGGGGAGGnTGGGTTGGTAATGl i ! riGGTAGAACIX)GTTATXXTTGTTCGTATTTAGTAGGT 

GCCrrrTAAGTCTTACGAGAGTAGCACCCACAGATGGCCNAOTTCAAATATCAAAGAATCAAOAA 

TGCAAAGT 

SEQ ID NO: 3020 ACAOAOArrTAAATGAAATCTTCGAAAGAATAAATTrOCTmCVVGTCCACTO 
TATTrrCAAAArrQATTATCACCAAGCTItK3ATXJAAA0CTOTOAACC^ 

AATAO/WUU^GAATGTGTAGATTATTAGCAAAGTAATGCCTTAAAATGTATCTTCACACAGTTG 

AAATTTTAGTATAAACTTGTATATCAAGTTGCTTrCCAmTTrATTCTAC^ 

ACTATQATGTrCAAATATOTATTCTOAOCCATTATGTTCAAACATAAATATCTGGGAAATTC 

TGCTGCAACAAGTTAGGAAAGGATTAAGGAAAAATGATGAGCTACAAATTATGTAGTTGGAGGAA 

OAAAAAAATtnTACTTAOCAmATOTCTGOATAGGTATtn'ATTrTCTAATTTACA 

AGTTOAGTATAGACAACCATCAAAATGTAACCAGrrACACAaAGACTAGACTAAGCCAACACrAT 

TTTCTATAACAGGTAACAGTAGTGATTCAAAAATTnAATATCTCAATAGTnCACCAAAAAA^^ 

TTTGNGGTAATATGCTAATATlXTraAAGTTTOAGAOGCGCAGArrAAATGAGTGCNCTATCT 

CAAACTCAGCAAQTC 

SEQ ID NO: 302 1 ACCTTrTGGTGOCAOCXJTnx:AAOQAa<XXTCACCATX:AAACA^ 
CAGCAAGCGTCTAGATCATrrGCAGCGGGCTCGAGAACACrn'ATAAACTACTTAACK^ 
rrGCTATCATGTGGCAGAGTTrGAGCrGaX:AAAACCATGAACAACTCTGCTGAAAATCACA 
CAATTtXTCCATOQCTTATCXTAOTCTCGTrOCTATOOCATCrCAA^ 

ATACAAGCAGAAOAAGGAGTTGGAGCATAGGTKmrrGCAATOAAATCTGCTGTOGAAAGTC^ 

AAGCAGATOATGAGCGTGTTCGTGAATATTATCTTCTTCACOTCAGAGOTGOATraATATCAGCT 

TAGAAGAGATTGAGAACATraACCAGGAAATAAAGATCCTGAGAGAAAGAGACTCnTCAAaAOA 

GGCATCAACTTCTAACTCATCTCGCCAGGAGAGGCCTCCAGTGAAACCCTTCATTCTCACTCGGAA 

CATGGCTCAAGCCAAAGTATTTGGAGCTGGTTATCCAAGTCTGCCACTATGACGGTGA^ 

GTATGAGCACATCGGAAATATGGAGCATTACCGGATaWGGAATAGCCAAGGCAGCACCAGANG 
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AATTCANAAAACAGCTCANC 

SEQ ID NO: 3022 ACTGmATTAACCAAOCAGCTTAGAAAAATAATCATGGTAOACACCTTAGTr 
CATTCTrCTAATAAG<XTQTTOATCTOCrrCCrCCCTOrraC^ 
GTG<nxnTTTTCTTCATTCCACCTCCirGOAGAAGACAATTTOAAGCKjC^ 
TCTTTOAAGCQTTTTCCAACAOTATAOATCTCATOAATCAAATCCTCCATGCAGATOA TGCTO 
TTACCAAGAGATCGAGCAATCAAAGCOTTATCKTrCAAAGCAATTOGCTrCTTAT^ 
TAAOCACGCirCTAGATTAGrrCATTTACrGAaTCAGATrGGGGT 

SEQ ID NO: 3023 ACrmTmGCTrOTTOOTGATGrrrrOCrrATTrAATACATTCCQT CAGGOA CA 
GAAATTACATGCTTTITrrTCrrrCTAGGAAGTGTGTrcAGTrcOCCC 
TmTQGGTGOTGGTGATGCTAGTGGTTATG TCTOTT TAAATGAAGTTGCr^ 
ACTTAATCATCCAriTCCTATATAAAGGTAGCnTrraCATAGACCTCAAGTATATTGTAAOTOT^ 
AGGTGGAATTrAAGGAAAGGTATTAAATrGAGGCTGTGTnTAGCTTACAGGCAAGTAATAAATT 
GTATCArrTATCTTGAATQTATCATAQATAAGCTGCTATATAAOGATTOCCACTrCAGATAOCT 
GAAATrAGGTGATTAACrAGTTGTTATTTAGCCTTCrAATrrCrGTATAAGTCTAArTACATGAATA 
GAAATTOOOOrrmGATITmACTTTGCmTCTQTITGOAGTGT^ 
ATGGTGGAAAATAATTGCATITGrrACTriXjGGGTGTGrrATrrGCATC^ 
ATGGTTGGGGCTCATCACTOCATATTAAAAAAACTrGGATGTATCAGTGTAAAA 

SEQ ID NO; 3024 ACCCAOAACATCnTCOCCTGaAAGTQTTAQAAGCTQG OATrA AATCTGQACG 
CTATATCCAGGGAATTCTGAATGTCAACAAACACAGAGCOCAAATAGAAGCrTTTGTTCOACTTCA 
AGOAGCCAOCAOTAAAGATTCAGATTTAGTCAGTGACATCCTAATCCACGGGATGAAGGCTCGAA 
ACCGCTCAATrcATGGAGATOTGGTAGrrGTGOAOCTGCTTCCTAAAAATGAATGGAAA 
ACCGTAGCCCriGTGTGAGAATGACTGTGACGACAAGGCTTCGGG<X^ 

GCCTACAOGTCOAOTOGTOOGCATACnTCAGAAGAACTOGCGOaATTATOTOOTOACATr^ 

CCAAAGAAGAGGTCCAATCnCAGGGCAAAAATCCTCAGAAAATCCTGGTTACACCTTC^ 

AOAATTCCCAAAATnXiAATTAGCACTCAGCAAGCAGAAACCCrrCCAGOACTrCAG OGTG GTCCG 

TGCGCATCGATTXXTTGGGACnXIAACATCTGTGTATXXAAAT^ 

GAATCGGAGATCrGGAAGGGGAAATTGCACCATNCTGGTGGAAAACAGTATTrC^^ 

NTNAAAANCTAANATOTGTOA 

SEQ ID NO: 3025 ACAACTCrrTGAAQATCCCACTGTrGACAAGGAGGTTGAGATCAGGAAAAAA 
GTGCTAAAGATAT ACAATAAAAGGG AAGAAGATTrrCCTAGTCTAAGAGAATACAATGA l l iu n 

ggaagaagtggaagaaattgttttcaactrgaocaacaatotogat^^ 

aaatgoagatataccaaaagoaaaacaaagatgttattcagaaaaataaattaaagctgactcg 

agaacaoqaagaactqoaaoaagctttaoaaotggaacoacaoqaaaatgaaca aagaaoa tra 

titatacaaaaagaagaacaactgcagcagarrctaaaaaggaagaataagcaggcttrntaga 

tgagctggagagtknxjatctccctgttgctctgcr^^ 

attagaaatgcacttgagaaacccaaacctgtaaaacxylgtgacxstttrccacaggcatc^^ 
gggtcaacatatttcactggcacctattcacaagcrrgaaqaagcrctotatgaa^ 
gcagatagaoacatatgqacccatgtrcctgaocttoagatgctaoaaaacttgggta^ 
catgtcagagctgcctcccacaoga 

seq id no: 3026 accaatggctctggagcrrgoaggaagactaaagoaatg tctag tgattcto 
agtaagatgtagaoctacgcagcagagcratgggggagaagattaacaaagtccmcttccaat 
atcaggatagtcatgagtixk:agtcccatccaaaaggtcattagggctaaaagg ^ 
croaactatoagatrcttgcttxxctcxxsggggagccaaggagctr^^ 
aatoagcaccatgacacaotcarrtctgatccctctatccagctgttgtoaaaagatgaagcaw 
goaggcaagaaaatoctraamagcagacaagagaatcgacagtgtgatcctrgtttgtgctac 

CATTGCGTGATGCACCACTTTTCAGCTCCATGATGCTACTTOTTTCCOT 

ATCCCCTTCAGTrATGOTCTANAATCAAACAATrCTAAACCTtlAOT 

CAAACCTGCAGAAGCAQQTITOCCTrraAOOTATTrAGTCACACCAAAOT 

SEQ ID NO: 3027 ACCGGAGCCTAGCOiACCTGGAGAAGGATGTCATXKnTCTCTGT^ 
TCAGACGTTCAACCTGGAGGGATCCCAGATCTATGAAGACnXCATarrCrrAC^ 
GAGTGCCCGGCAGAAAATrGCCAAAGAGGAAQAQAGTGAGGATQAAAGCAATOAAGAGOAOGA 
AGAGGAAGATOAAGAAGAGTCAGAGTCOGAGGCAAAATCAGTCAAGGTGAAAArrAAGCTCAAT 
AAAAAAGATGACAAAOOCCOGOACAAAGOOAAAGGCAAQAAAAGGCCAAATCOAGGAAAAGCC 
AAACCTGTAGTOAGCGATnTGACAGCGATGAGGAGCAGGATGAACGTOAACAGTCAGAAOGAA 
GTOGOACOGATGATGAGTCATCAGTATGGACCTmTCCTrcGTAGAACTGAAT^^ 
TCTCTCATTTCTACCCAGTGAGTTCATTTGTCATATAGGCACTGGGTTGTTrCT^ 
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CTATAAACTAGCrrTAGGATAOTGCCCAGACAAACATATGATATCATGGTGTAAAAAACCC^ 

TTACNCAAATATTrrOTACATATnmiACCAAATGGGCCTTNAAAGATTCANATraAACA^ 

AAOCTnTQATOaAAAATATGTOOGGT 

SEQ ID NO: 3028 ACXSCGGGGGGAACrG Cl 1 ICl I GGQAAGATGGCGGCTOTGTCGGTGTATGCT 
CCACCAGTTCGAGOCTrCTCTmGATAACTGCOGCAGOAATCCCGTCTrGGAAGCC^ 
AAOAOGQGATACAAOCnTCCAAAQOTCXXJGAAAACTGGCACGAa^TCGCTOGGGTG^ 
GGATGGCATAGTTCTTGGAOCAGATACAAGAOCAACTGAAGGGATGGTTGTTGCTGACAAOAACT 
GTTCAAAAATACACTTCATATXntXrTAATATITATrOTTOTGGTCCTXK^ 

GAGTIXrrcACAGCCAATCGGATG<XX;AAGCAGATGCrmCAGGTATCAAGGTrACAT^ 
GCCCTA0TmA0GOOQAOTAGATaTrACIX3GA(XTCA(XTCTACAGCATCTATCCT^ 
ACraATAAGTTGCrrATGTCACCATGGGTrCTGGCnX:cn"GGCAGCAATGaCTGTAm 
ANTTAAOGCCANACATnKJANGANGAGGAACCAANAATCrGGTOAGOOAAACCATC 

SEQ ID NO: 3029 ACTTOATTGGTCAmGAAAACACrGCAACAGTOAACrTTTGCATCTCAAOAA 
AACATTGAAAAATTCrATGAATrcTIX}TAG<X0GTGAATTGAGTXX5TATrCT^^ 
TGAAGAAAACTTGGCTOTCGAAACATTITICrCTCTGACTGCTGCTTOAATGTrOT 
TCTTATGTATGGG l 1 1 1 H 1 1 AATCTGATCCeTrCATTrOAATATTAATGGCTrnTOCATTAAAGAA 

taaaatattttogacaatgccxjataaatgtatgaagttagtatccacatcataaatrcagagt^ 
0ttta0ca0taaatokatatmgaa0tgatacaca0atotctttcctxxx:cacaa^ 
caaaaaacaagacct c i j 1 1 c i' j i agatgctgccacctatgcccaccacaacagagatntacatg 
gaaaccggcto\gtgAgaactgatttcctxk;ccaatatttxktitooocto 

ATTAAGGAATCTAGCroCrrATACAGTTCAAGGCTrrCTATOTTGTTAATQAAaTNAAAATAGCC 

GTTAANACATCAAATACAGCAGCAGGTACCAATGCGAACAGGrACrKXiCATTTATOT 

ANAAAATQAAAOT 

SEQ ID NO: 3030 ACCATICTGAACOATGTTAAAGCAAGTOTGOmAmATGACATGAACCATC 
TAACTTGAAATATGAACnACAAGGAGGGGCACTCATrAGGTAACAAGTTmACACCA^ 
TimTOAAATAAGCCAAAATAATCCTAAAATTCATTAGAAGAACTroATAAAAGACTCAAA 
TGTTAGAAAGAGCCCATAATTITAGGACTCCTATAAAATTCTTCCnGTrr^ 
CrCAGATTCAAGGGAAATACCAGCTTCCACTrGAGTCACTTTOAAATAGTrAATrCAACA^ 
TGTrAGAAATATATTGGCAGCCAGGACTCrGAACTCTGCAGAAACATTTGTTI^ 
AACTCTAGCCCTCACTATGATGCCCCTOTGTGCATTrACAATAAAOACTOCAACGGAGGGAGCCTO 
TTGGTCTTAAATTGTTTACArrrCTTTATACAAATAATGTGCTRCAGTGCmAG TTAC^^ 
TTTCTGTTTCTTGTCCAAGGATTCTGTACTATGTAGTGTGTTTC^ 

GAAAGGGAAATGGCTCTAACACTGGTCACTOTANCAOOTAAACACTACTCTAACOTGGAOA^ 
AGCTTCATGCTGANG 

SEQ ID NO; 303 1 acagagagcataoaataaaagcaaagatgtgaatotctctacx:agacagag 

GATGAOCTAOTCAGCAOTTTGOAOOOAAATCATaXMGGOTGCrGOCTCAGCAro 

AAGGTCrOTCATCTAAAGGAGAGAGGCAGGCTCAGCrCXrraAAGGTCGCAGAGCCTCAGTAGT^ 

TCCTOAGTGTGTCTAGCrGACrGTTATCGAGGGACACGTAAAAGCAGCATCACAAAACTGQACTG 

CAAGCCATCACXACGGCACCAAOTCTATGCrTGGGCTCTrcCCTGCTC^^ 

CAACACTCTAAAAGCCAACAAAGTCCCTTCAACATGAGTAAAGGAAACAGTrrCAAGCACTGACA 

OTTTrrACGAQTGACTATTCAAAOAATOCACAOOAGCCAOAOAGOCAGACTCCACAGAGAGGCCA 

AAGGCTCreAACACAa;AGTCAATGTTCATX3GAGGTATAGACAAAGGATTCrACCTCAC^ 

TGAAOACAACCTGCCTrTCACCCACAACTACTOCTTrCrATQTCACCCTAGAAAGATCANOTCTO^ 

GCCCIXKXZACQAGCCTACrrCTGAAGCCATCTGGAAAAATOAAACCAC^^ 

TATGOCTGAATTnTAAATCTGGT 

SEQ ID NO: 3032 ACrnrGGATTAGAOCCCrrcATAACATCTTrGOAAAAACTICriT^ 
TITTrAACrrCCCATTCCACTrCTAAATACTCrCTrAAAArr^ 

CTCXKJAGTGAGTCCATAAGTAGGATACAAGGTTGTrrACAGATAGTAGGCTTrAAATGCCACTGTC 

CTATTTCTOAACTOCAOrrGTCATCAGCOAGGAATCCATCOGTCACTGCTAT^ 

GATCTTCTGAGAAGTCGACGAOTTCATCTrCAAGCATTTTACCAGCTrcAGGTGArrCATTO 

AGTTrA<nx:rcATrrrGCn'AAGCCATCTGTATGATGTGTGGAACTrAAA<^^ 

ATIXnXACTTITAGTCCTATCAGCAACrrRrraGGATGTAGATTCT^^ 

ACTCGCAGTATGATrN Ni lU 11 IN ACACTGTATTTACACTGGATATOTTTCTCTrATAACOAGOGT 
CA Cl ' l 1 en N CTNCCCAGOATTOOACTCAATNTC 
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SEQ ID NO: 3033 A(XHXGCCCCACAAGAGAAAGGAGGAAAGAOCTAAAATraACTCCCTAACA 
CCACAATTAAAAGAACTAGAGAAGCAAGAGCAAACAAATTCAAAACCTAGCAGAAGGCAAGAAA 
TAACTAAOATCAGAGCAGAACTGAAGGAGATAGAGACACAAAAAAACCCTrCAAAAAATCAATG 
AATCCAGOAGCTaOTTTTTrOAAAAGATCAACAAAATAaATAGATTGCrAGCAAGACTANTAAAQ 
AAAAAAAAGATOANTXi\AAGAGACACATAAAAAATaATAAAGGGGATATCACCACTGNTCCCAC 
AGAAATACAAACTACCATCAGAGAATCTATAAACACTCCTACACAANTAAACTAQAACTTCTAOA 
AGAAATXJGATAAATTCXTGGACATGTACGCGGGGTATOGTTAAACAGCCAA'nTAGGGGAAGGAG 
OGAmAAACnTAmCX:AGCNAAGCATGGGTGTGTCCGGGOCTGTTANAAGAATTCTAAA^ 
CCGTNGNAGCrrCGGGAGAAAGCATCAGGGAAAACrrAGGCrGNTTANAAArrGTAAGCrACAAA 
TTCATTGNTGTCATC 

SEQ ID NO: 3034 ACAAAACATCAGAAACCAAATATAATCTOGCAATCrrnTACGAAGGGGTOT 
CTTTAAGATGGACAACGACTCAGTAATGCAATCCACTATrrCTTCAGCAGC^^ 
ACAGAAAACCATTGCATCTOCAATATCATTITTCCTrGGAGTT 
TTATCCCrGTGTrCTrCCTTAAOTGCrCCCTTTTTACTAGGTTC^^ 
CIXnTCTGACATIXXATGCX:AAOTACOCCTCACCAAGNCGAACCC^^ 
AQTTAAANAACCCTTrGOAANCCXJTCOOAnCCTrAGGaaACGGCAAOTCATrGCTI^ 
CCCTCAGAANANACATnriXMTTAATATCAGACCNATCTCACTCATCCTATTAAAC^ 
AGGNTNCTCTCANATWANAANATATOCCITGGATTAATCATrcrACCCTATGGCCAT^ 
rrACCAACTATOTAOCACCTCrAGAAAaAGAGAGCCITCGAATTCTAGGAAOACCCACrTACTAA 
ATOCCCAGCNGCTCATOACCGTTITrAACTCGCCCOGNTGANAAAACCX:^ 
GOOGOGO 

SEQ ID NO: 3035 CGCGGCGAGGTACTGTTrGTGTGTAAATTGAACCCAGTGACCACAGATGAGG 
ATCTGGAAATAATATTCTCTAGATTTGGGCCAATAAGAAGTTGTGAAGTTATCCGAGACTGGAAG 
ACAOQAGAOTCCCTCTOTTACGCTmATTGAATnGAAAAGGAAGAA OATr GTGAOAAAGCATT 
CrrCAAAATGGACAATGTGCTTATAGATGACAGAAGAATACATGTGGATTTTAGCCAGTCGGTrGC 
AAAGGTTAAATOGAAAGGAAAAOGTGGGAAATACACCAAGAOTGATTTCAAGGAQTATQANAAA 
AGAACAGGATAAACCACCTAATTraGGTCTGAAAGNTAAACTAAGCOXlAACAGGATAC^^ 
GATCTTITrrAGATGAAnAGCCCGAAACrCAAATCAGTrACTTNCNCCCAGTAAAAACN CAGAAO 
AAAACCATTCCTGGTCTGAAAAAAAOAAANGAGCrrCTrcCATCAAAATTCTAT^ 
AGAATGCGGTTGGNNhrrAGAANAAAAAAAGTrOTNaOAAAACAAANANGNAANAAGAACGACN 

TTAAccTUOTnJcaiCKrANNNNoooGGCJ'rrrri'ri'iurriATAANCccccc 

SEQ ID NO : 3 036 ACATAGTGTCGCGAACTCAAATCGGCArrTAGATAGATCCAGTGGTTTAAAC 
GGCACGTTITIXKrrrATAAAAAAAGTGCAAAAAAGATGTGGTTTACAAGTTA AAGC ^^ 
CCTirTrOCTGTAATTOCACCAGTmAAAGCCTXrrOOACAOAOCAGTAr^^ 
TTTTCTTAAAAGCTTACAGTGTITGGCTAATrCrCCrCCCCITmACAAGACGGG^ 
GTGGACACTGGTGGCAGGTTAAAGGGATACTGTCACTTTAAGAAGCCTGCAGATTGAAOTCT 
CATGGAGAAATTAGGGGCTOArnTTAAACTGGTGAGATATTAACCAGNCCNCCTOTANTAAATA 
AGAAATCCAACAGCGATTACACGNTNACACCCCCriTN I I i ATATmrrANAAAATNACTGAAAA 
ATANTCAACG^mCNATCT^^raGCTTr^rTOGTIT^•AA^ 

ATrANAATTAACTTTNCCCTNCANGAAGNAAATAAAATGGNGGGTACACACTTCAAAAAAAAAAA 
AAAAAAAAAACNTO 

SEQ ID NO: 3037 ACGAGTCAAGC^CAAACTGCTGCGCCAGGAAAAGACAAOGCTAATTGOOCC 
CAACrGCCCnX!GAGTCATCAATCCTGGAGAATGTAAAATTGGCATCATGCCTGG<^ 
AAAAOGAAGOArraOCATTOTOTCCAOATCTGGCACCCTGACTTATOAAOC^ 
CGCAAGTTGGATTGGGGCACTCTnxntKXrrrcGCV^^ 

TTATTOACTGCCTOGAAATCrrmGAACGATnnXDCCACAGAAGGCATCATATrc 

TTGGTGGTAATGCAGAAGAGAATGCTGCAGAATTrrTOAAGCAACATAATTCAGGTCCAAATTCC 

AAGCCTGTAGTOTCCTTCATTGCTGGmAACTGCTCCTCCTGGGAG^ 

CAATTATTGCTGGAGOAAAAOGTOOAOCTAAAOAGAAOATCTCTGCCCTrCAAAOTOCAOOAOTT 
GTGGTCAGTATOTCTCCrGCCAGCrGCGAACCACGATCTACAAGGAATT TGAA AAGAGOAAGATG 
CTTTGAAAOAAAAAAAAAATTCCTAAAACTmGGAATGGATCACOTAACTTO 
TNNTTNTOTTGNCCCTNNTTA 

SEQ ID NO; 3038 CG<XGCOAGGTACAC(>J^TCAAACGrrAGAmGGTCTTTTCACATACnCTCAT 

ATTrrrrGOAGACTTTorrcATTCcrriTCArr^^ 

TTAAATTCATCrrCAGTCTCTGATATCCTrrCTTCAGCTrrATCAATTCAGCTACTGATAC^ 

TOCTTCATGAAGTTCTCATGCTGTOTTmCAGTTCCATCAGGTCAmATGTT^ 

TTATTCTAGTTAGCAATrcCXICCAACCnTITrCAAGGTTCT^ 
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ATGCTCCTrTAGCrCAOAGGAGCITCn"ATTACCCACCTTCTOAAOCCT^ 
CCTCATTATCT0CCA0Tn"OT0CCXnT0CTGGACAGGAC7rcNGACAAnAANG^ 

SEQ ID NO: 3039 GTrCGCGGCOAQQTACAAGGTOAATATCTTAACCAOACTTaCCOCAOAATTG 
AACAAATTTATCCTCGAAAAAGTGACTGAGGACACAAGCACmriTCTGCGTrCCC^ 
AGTGOTOGTOOCCOTCTCKmiAAGCCTOOAOACGCOOTAGCAOAAOOTCAAOAAATTrGTGTC 
TTGAAGCCATGAAAATGCAGAATAGTATGACAGCTGGGAAAACTGGCAaKlTGAAATCTGTGCAC 
TGTCAAGCTGGAGACACAGTIWAOAAGGGGATCTGCTCGTGGAGCIXM 
CCmCAGTCATCACCCAATTTAATTAOCCATITGCATGATGCTTTCACACACAATTGAT^ 
rrATACAGGACACCCTGTGCAGCTACGmACGTCGTATrrATTCCACANAGTCAGACCAT^ 
CCAAAAATACCATGOAAATrrCATTOTOTAATAaGNNCTGC^^ 

SEQ ID NO: 3040 ACCTrGTTGGAGATG<X>CCTCAGAAGTTCACACrGTGCAGGAAAAAGGTTT 
TATTCTClXXrrGGCATACATrAGAATGTCAGATGCITGCATCCA 
AAAArrGGTGGGCAGGGGGTTTGCrrATGAGTITItnXTGOAAACCGAT^ 
TGAATOCCCCTTGAGCmATGAGATACGAGTCCACATGGATAAAATOTTAGAGAGTGGAGTrCTA 
CAQAGGATTCCACGAAQAOOCCATGTCrGTOCAOTCCTAGTTOCAGACAOQTGAGANGCTC 
AACTACTGGCrACCrrGACAAGCTX3GGTAAATAGTTATCATTCTGGGTAACTGGTTC 
TTTGGACAAGTAATTCCTOGGGTrCTGCTTITGGTANCNTCACCAGGGTr^ 
AAACACACAGTGCNTGrrxrrCTCTrGCATCATGTrNGCCCAT 

SEQ ID NO: 3041 CGCGGOGAGGTACl n 11 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 ICGATAGNQAAAATATACT 
TTATTrmAATACAATAGCTGCCAGCAATATACTGGTGCroATOTrCCAAAGATAAAAGAAAAT^ 
CATGCATTCTATAATAAGCTTTCAmGCCTGTTCAANAAATTATAAANAAAATACTC^ 
TTCAACATTACGGCTTGAGGAGTTGAAATnTTCCATGATAAAAATATACTmGTGTGGC^ 
CrraACrATTTATAAAGOATGOAGTITrrAAAAGCCCACATOTATCAATAATGGATOCrC^ 
CTTraAATTAAATGCrrAAAnCAAATTAATGCAAGAAATGGGTOAATCArrAAATGATGAAT^ 
TATCAAAATOTCATOAAAAAATCATTCNT^mTCCX:m^ATTITACT^GAC^ 
AAGGCNCAAAATAAAGNCNT^^NATGTAACT^r^OGGAmTCAAACCCACANAGTTAC^^ 
TCNAAATGNOCCrrTAGGGAAAANAATGGCAAAANCTTAArrGGTCCATTG 

SEQ ID NO: 3042 AOTCCXKXKKX}A0aTACCrGCATGAa UOA0a jCCTTOAACAGGCGGCACA 
TCATTGATAATAGGGTGCATrGTTGCnTTACTRATTrcACCnTI^ 

AGATGTGGCGTTTATGAAGGCAATACACAACAAGGTGAATATrGTGCCTGTCATTGCAAAAGCTG 

ACACTCrCACCCrGAAGGAACCGGAGCGGCTGAAGAAAAGGATrCTGQAT OAAA TTGAAOAACA 

TAACATCAAAATCTATCACTTACXrrGATGCAGAATCAGATGAAGATGAAGATnTAAAGAGCAGA 

CTAOACrrCTCAAGGCTAOCATCCATTCTCTOTGOTTOGATCCAATC^ 

GAAAAAGNCAGANGCCGCCTTTOCXXTGGGGTGTTGNGGAGTGGANAACCANANC^ 

TGAACTGAAAC^^^^CTA^^Im;CCATOCAGACTCAGAGGNGAOCCAGAC^ 

TTTGAAACTAAAAAAGNGGGCGGAA 

SEQ ID NO: 3043 ACAAGOCCATATTTAAC»GATTCTCAGTrACACTrAAAGAOGATGGTGTrCG 
TGGTTIXK)CTAAAGOATGOGCrcCGACTTrCCTraGCTACTX;CATGCAGG^^ 
CTmATGAAQTCTTrAAAGTCTTGTATAGCAATATGCTrGGAGAGGAGAATACTrATCTCTGGM 
CACATCACrATATmjGCrGOCTCTGCCAGTGCTCAATrCTITGCrcACATrC^^ 
GAAGCTGCrAAGGTTCGAATTCAAACCCACCAGGTTATGCCAACACTITGAaGGATOCAOCTCCA 
AAATGTATAAGGAAGAAGGCCTAAAAGCATTCTACAAGGGGTTGCTCCTCTTGGATOANCAGATC 
CATACACATOATOAAGTI<X3CTX3TmAACQTANCTNOQCCONACCACQCTAANG^ 

SEQ ID NO: 3044 CGCGGCGAGGTACGCCrrCrrCGTGCAGACCTGCCOQGAAOAOCACAAOAAO 
AAACACCCGGACTCTrCCGTCAATTim:GGAATrCTCCAAGAAGTCTrC^ 
CATGTCIGCAAAGGAGAAOTCOAAGTrraAAGATATGGCAAAAAOTGACAAAGCTCGCrATOACA 
GGGAGATCAAAAATTACGTT(XTCCCAAAGGTGATAAGAAGGGGAAGAAAAAGGAC^^ 
TCCTAAAAGGCACCATCrOCCTTn'a:NNGTrrGGTCTAGACA>nX3W 
CCNCXrrrGGCimCCATTraGOGOTACTGCAAAGAAATIGGOTGAAATaT^ 
CAAAAGATANACAACCATATGACCAGAAGCACrrAOCTTNGCGAAAAAATANAAAATNAANANA 
AATXnWCrOCCGGCGGCrbnrCAAAGQCAANTTNCACANCTGCGCC^^ 
GGACCAACTTGGN 

SEQ ID NO: 3045 ACAACAAAGCAATGTTACCrTACCATAGGCCTrAATTCAAACITTGATCCATr 
TCACTCKAATGACOGGAGTCAATGCTACCTOGGACACTTOTArrrGTAAATTCTOA^ 
GTAGACTIXnGCCTACTTTGTCATOAGGGTTrcAmCTG<>T^ 
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TTAGGTrrGCTAAAGCTAGAAGATTCAATrGCTCrTTACAGAC^ 

ACGCAGATGTCACnCTCATGCCAGCCCTGOCTCTTAGCTCTmG 

AAATACnTATTTGGGAGOCTCXrTCAGTGTCGTAGGAATrGAGACrAACACAATT^^ 

CACTGAGNATGAGTrA>rrAGACTCACmOATOTmO\NTATACTAGAACATITGCCAGACCn 

GACTT^^TCATTTCTACTGNGAANATAAAACTAGAOOCTTATAATAAQT^^TO 

AAChnXjAGTAAAAAACTGGGOAAAGGAAACnXn"OAAGTACTTATTrGNA 

SEQ tD NO: 3046 ACAATra3TATrrGCTrrCCTCTITCCTTTCTO 

GCAGGTGAAAGAGATOAACCACOACTAOAGOCTOACn'AGAAATTTATOCr OACTC ^ 

AAAATTATGTTGGTTAATGrrAATCTATCTAAAATAGAGCATITTGGGAATGCT^^ 

TCAAGTAACAOTCATACAOCTAOAAAAGTCCCTGAAAAAAAOAATTOTTAAGAAOTATAATANCC 

TmX^AAACCCACAATGCAGCTTAmrn'CCTTTATTTATTrG 

TtmXATAAAATCTCCXntATCTtKnXJCATTATGGNACAAAAj^ 

NOCCAAANTTTCTOATTTTAACCANGNGCTNGCTATGGTTAATQAAACATTGGTATNGTO 
NNAAAGAATCCAAATNTmrCCGOOT 

SEQ ID NO: 3047 ACTtX}AGCTGTOAGGTCATCGOAATCCCGACACCrGTCCrCATCTGGAACAA 

ggtaaaaaggggtcacratggagtrcaaaggacagaactc<ntkx:rtx}gtgaccggga 

ccattcagaccx;ggggtggcccagaaaagcatgaagtaactggctgggtgct^ 

agtaaggaagatacrxxjagaatatqagtcccatgcatccaattcccaaooacaoocncagcatc 

aocaaaaattacagtogttoatgccttacatgaaataccagttcaaaaaaggtgaaggtgcto 

ctataaacctccaqaatattattaqtcnxk^toottaaaao tahrrc ^tggata actac atta 

TTCTTNCTAATAAO"rtUl'iriNATCX:AATCCACTACAC mAr rTrrA 
AATTa>AATTAAGATACNCTrAAGACT>nTACAAAAATmAT^^ 

SEQ ID NO: 3048 AL' l 't r f I L I'l i rrrri - rri ' l - | - ri4 GGTNATArrnTATnTCANAAAAACAGA0T 
TCACTTGAATACAATICTCACTAGTTAAATCACAATTCACCTTATAGT TAAT ^ 
GATAGTCCTrrOCTOCTATATAATTTCATTTrCTrAAAAAGCAA^ 
ACATATGAAAGTCAAAAACAATTATTrCAGO\AAAATGGAAATCAACCACC0TATVl"I"l"^ 
TTCrrTCGACAGAACTATCCriX>ATXXnXiACrrGTCTATGTGATTAAATAATX^ 
TrrCTCTXJrrrCTOACATAAACATCACTGCCATATCAAACCirmAC ^^ 
TTCAAAOATGAGAATGGCTTTCTTTCTCAATGNIAAAAG<XTGC^ 
TCTGO 

SEQ ID NO: 3049 A ciii HI 1 frri 1 1 ' l 1 ill 1 1 1 m i n i acctnaaaaaotgacatitattcaa 

CCCCANANArrOAACCXn'GOCrOGGGCTOOGTQQCAGQACAGCCCCTCA 
TNGAGGGGCAmCCTCAA^XK?AGCKG^^roAANAAGOTCTNAATGCCTCGAAGA^^ 
TTTTTCTGTCACCATGTTAATAGCCACACCTnACGGCCAAACCGTCAC^ 
ATATAGTTTTCCCTCrrQOTGGGAAGGTCAATAGTNGATTGACTAAAGAAACCrG^ 

SEQ ID NO: 3050 A Cl 1 1 1 Uf 1 1 1 1 1 1 1 1 1 i l ill 11 1 1 1 1 I GAGACAGAGTnTGCTCTGTCACCC 
AGGCrGGAGTGCAGTGGCATGATCTOGGCnACTGCAACTTCTACTTCCTGAOT^ 
TCTGCCTCAGCCTCOU^AGTAGCTGGGATAACAOGCATGCACCACCACGCOCGACTAATT^ 
TTlTTAGTANAGATGGGGrnxnt;CATGTrGGCCAAGTTGGTCrCA^ 
CTCCCACCTAGGCCTOCCAAAGT 

SEQ ID NO: 3051 cscggcoaggtacgcggcgaaattcagaaoaggaaaatgtgcccaocctgc 

CTGGAGAAAAGCGTCTGCTXXn"AGCCAAOATCTCCTC:ATCACAAAAGTAATGTGG^ 
. CAGGCCACCKXTitrraGGCTCTOCTGTTCATGCAGTC^^ 

CTCGAGTCTACTACCrGGGCATCCGGOATCTGCAGTGGAACTATGCTCCCAAOGOAAGAAATGTC 

ATCACOAACCAGCCTCTGGACAOTGACATAOTOGCn'CCAGCTrCTrAAACT^ 

GATAGGGGGAACCTACAAGAAGACXATCTATAAAGAATACAAOGATGACTCATACACAOATGAA 

GTGG(XAGCCTOCTGGTrGGOCITCCTGGGGCCAGTGrrG<>GOCT GAA 

TATTCACCTGAAQAATrrrGCCTCWTCCTATCXIATCACXrmAT^^ 

GTCCCITACCCAAANONTCrrGGGC 

SEQ ID NO: 3052 AOCTTGCTCACCTAGAGCAGCTAAAGGAGGAAGAGCTOAACCCTGATrTCAT 
AGAACAAGTTGCAOAATTTTGTTCCTACATCCTCAGCCATTCCAATGTCAAOATrC^ 
CATTCCAGTCAATGGGCCrOCTCTAGAGAGCCTGOTGCTGACCTACOTC^ 
GGATCTA(XCnK:ATOOAOAACGCAOTOCTOGCCTTQGCCCAaATAGAGAACnrAOCCQ<^ 
AAAAGOCTATTGCCCACTATGAACJ^GCAGATGGGCCAGAAGGTCCAGCTGCCC^CGGAAACCCTC 
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caggagctcknxkiaccixkacagggacagtgagagagaggccattgaagtct^ 
ttcaaogatotooaccaaatgttccagaggaaattagggck:cagttggaacaa^ 

TrcTAGCAOAATTOCAAAOCATCATrAGATGTTCATOGTITCTTANGATnm^ 

AATGTNANCNGGGGANAT^T^AACCGGNNGTACOG^^TI>^ATmAAACNGNOGNNT^^ 

ATCCTNGCGGACCCT 

SEQ ID NO: 3053 CGCGGCOAGGTACI 1 1 1 illCl 1 1 1 1 1 1 1 1 U U 1 1 IGGACTTAGCCTCCCTCTG 
TGOCCAGGCrGGAGTGCAGTGGCATGATrrCAG<XCACTGCAACCTCCACCTCC^ 
ATTCTCATGCCTCAGCCTCCTOAATAOOTGGOATrACAOOCACCCGCCACCN^^ 
TTOTOTTGTITGTrrGTTraTITOriTnGANACGGAOCCTTGCrCTGTC^ 
A^tKnXH}OATC^CQOTT^^CTGCAA«n^JCGCCTNCCGGGT^^ 
aX}AAGTAGCTGGGACTACAGGCGCGTGCCACTACTCCCAGCTAATATTn^ A 

SEQ ID NO: 3054 A L ' lTlTfl ' lTl 11 H m U 1 1 1 1 1 Ml I GCCATACAAATGGCTGGrnTrAATnT 
TmANAGGAAATACTATAAmAAAAAAAAAGAGTIXXAAAATATTTAACAGAATCrCCAGCAA 
TGATTATTTXXAAAATGTAAAGATTTGAAACATAATTTATACAAAACTAAAAACCACAAGGATTC 
ATTCTTGCTITITCCrTTmAAAAAATOCAGACAATTTGTCAC^ 
AGCrGTANCCTCAGTCACCCTCGGAATCGCTGTCCCTCTrCATaAOOACAGAOCOCa^ 
GACAGCAACACGTmrC^ATCGGCrrCnAGGOTTTrCXTCCAGOTC^ 

CAGACAGTrCCATTCCAACTCATCTCmKAGOTTCATGCCGCCAAACACCAGAGACCAATAAACT 
GAGTmGrATOTTCTTCANOAAACAAATrCGhKKX:ANATTa:ATNANGNAATGG^ 
hWANNGNrriTrAANTTNCTmaAACTrCTNGGANN<XCXXK3GN 
mTrGTAAGGGNAAAAAACCC 

SEQ ID NO: 3053 ACTCANATTGTGAACAGGCATATTTCACTGATTTAGAmAOTATACTrGATG 
AGAATGCTCAGGTrGAANAGATAGNT^^GGCAGCNATCCyAC^mr^ATAOCAATGTGGAAAAAOT 
AATCAACTCATATnCACGAATTNGATGTATGTTGTOATTTAOAGOGCATGAGATAAAOm 
TTGAACTGNGTCGGGTAGGGGOAAGAANANGTrGCTTANNCAAATAN CGGGG TC 
NAOATOmCTAANATGAOAAOTTATTtn'CrrOCATCATANAAGCACTCTrmACChWGAAGTC 
TTGAGTAACTATNAATCATTTATATCTOTACCTGC 

SEQ ID NO: 3056 A CrriTl ill 1 1 1 1 111 1 11 rCrTGATGGCTACTTATAATTT ATn 'ATAAAACAT 
rnACCAGTGAOTGATGTCTCAAGTGAGACGTCGTAACAGATACACAAAGCAGTrTATAGCATC 
AATATrrCAATGCCTCTGAGAGGTAGAAACAGCrroTGCTCTCANATGGCC^ 
TGTCATCTTTCTGTCCTAOACTGTGTCCTAACTGCTCACAAGGGTTC^ 

TTAATTG O 1 I ' l l CI l CAAAGAGGCATIX3TTGACTGGAAAGACTG0CmrrAmAAACTCTGCTCR 
TTTGCAAATGACAGACrrrCTCAACAGTATITCAGAGGAAATmG T^ 
CTQACCAGACAOCACCACCCATArrACTCAATOCrAAACACANCATCrrrcAT^ 
ACATCTGAGCrrGNGTTACCGATGGACCATrnNAACNCCCTCXn^C^ 

SEQ ID NO: 3057 ACTGAAAGAAAATGTCATGCTGCAGTCACAAAAGGCGGGCrCTCCTCCTrCA 
ATTOTCrrGOOCCCTGCCXIAGOTrcAOCAOOTCCrrACTAGCAATOTrc 
ACCGTO\GTGT0GCTTCCTCTtXATCCTTCAGTOCTACTC^ 
GrrrCACAGCTGGTrGCrcACCCACCTXJGCACTQTAATCACrrcAGTTAT^^ 
AAACTCTrACACAGGAAQTAGAGAAAAAGGAATCTGAAGATCATTTQAAAOAGAACACTGAOAA 
AACXKiAGCAGCAGCCACAGCCnATGTGATGGTAGTGTCCAGTIXXAATGGAmACT^^ 
AGCTATOANACAAAAOJAACKKrrGOACCCAACrCTrTNTATrAATATCC^ 
G>nTOTAArrGACArrrcAATATATGCTGACTGCAGATTmAGATAAATrTAAGOAGGTTCTATTT 
TOATTNGTAAAA 

SEQ ID NO: 3058 AOCAAOTGGaATOTrrCAACAACCAAAGCTCOAOTOOCACCTGAaAAAAAGC 
AAGATXnTGGGAAATTraTTOAGCTTCCAGGTGCGGAGATQGGAAAGGTTACCOT^ 
CAGAOOa:AOTGGTTACTTACACATrcGGCATGCAAAAGCTGCiriTCTOAACCAG^ 
CTrrAACmAAAOGGAAACTGATO^TCAGATTnifcTOACACAAATCCTOAAAAAQAAAAGG^^ 
TTnOAGAAGGTTATCTTGGAAGATGTTGCAATGTrcCATATCAAACC^ 
TTCQGATCATITrQAAACTATAATGAAOTATXKIANAQAAGCTAATTCAACAAGGGAAGQCT^ 
TGGATGATACrCCTCTGACAGATGANAGCANACGTGNGCNGAGGNTANAATOTANCATAGAANA 
ACCCTATOAGAAOAATCTCCNATGGGaAAGAATGAAAAAAGGANCCnT 

SEQ ID NO: 3059 ACTAAATATTGCTOAOAOCATOCA(XCX:AGGAAGGACmACCTTCCAGGAG 
CTCCAAACTGGCACCACaXX:AGTaCTCACATGGCnt}ACTrrATCCTa^ 
GCAA0TaG<>OT0TCTCX:ACX:ACCTATGATGOTGATGCAGCCCCTAGAAaTX}GC^^ 
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ATCCATGAGAGCTTTCGrrCCCCGGGCAAAAGCTTCXXZATTC^ 

CACAATCTOCnAGCCCGAOTGACAOCCrcAGCATACTTCTTGCrGC^ 

GCCCATCCAGCCAGCAG<rrATG<XAGAACK:CACAaTOOCmKXAOTC^ 

CTTOCAGCAGNGACAAAGTCAACAGGCAAGGTAATCm:ACACCATTCTTCTCAGCn^ 

AGGTCrrrOACAATCrTGGCrCCrCTCATCAACAOAAAAGTQCA AT^ 

OAAGGAAAAGtJCATrCCCCCCCANATTTITAATTGNNTrmCCACT^^ 

TNGNCnTANTCCNCCAQAGOOCAOAAOOT 

SEQ ID NO: 3060 ACATGGAGTGTTCAOCAAAGACCAAAOATGOAOTGACAGACOI J J i lGAAAT 
GGCrACGAGAGCTGCTCrOCAAGCrAGACGTOGGAAGAAAAAATCTOGGTGCCTTOTCTTGTGAA 
ACCTTGCTOCAAOCACAGCCCnATOCGGTrAATmGAAGTGCTCTrrATTAAT^ 
TTACTOGCCTTirrCATTrATCTATAArrTACCTAANATTACAAATCANAAQTCATCT^ 
TNTrrANAAGCCAACTATGAm-ATTAACGATamCACCCrGCTGGCCCACCAGGGTCC^^ 
CTGCOrTAACANCanCCTCItK^ICTCXXCCTGACACACC^ 

CrrCATGCTCTGTrAAAAGAGAAACAGNNGNAACITNNGNGAATAATGTTGCACTNCITAT 

SEQ ID NO: 3061 ACGTCrOCATCOATTATCITACXjrGGGGCAAATGATrrCATGTGTGATGAGAT 
CKiAG<XKnCTITACATGATGCACTITGTGTAGTGAAGAGAGTmO0AGTCAAAATC^^ 
CGaTGOGGGTOCTGTAGAAGCAGOCXriTrCCATATACCnGAAAACTATGCAACCAGCATGGGG 
CTCGGGAACAGCTTGCGATTOCAGAGTTTOCAAOATCACnTCTTG rrAT^ 
TTAATOCTGCCCAGGACTCCACAGATCIXKnTCCAAAATrAAGAGCrmCATAATGAGG 
TTAACCCANAACOTAAAAATCTAAAATGOATTGGTCTTCATTTGAGCAATGGTAAACCT^ 
AACAAACAAOCAGGGGTOrrrGAACCAACCATAGNTANAGTAAGATmGAAATTrQCACAOAAO 
CTaCAATCANCATmajT*ATTCNGArnTrrAAATCm^ 

SEO ID NO: 3062 ACGCGGGGGAGOTGAOOTITGTrACCGCNATrCrGAGAGGTGGGCTnTAGT 
CrCTCCANAOCTCGGNmAGNGCrGTCTCCG C - i U ICl 1 1 CACCTrCACAG AGOTrCOOGTCTTCC 
TAAAANAAGOTTTTATTGGOAGaTAAAGOTCAATGCGTAGGGGTAGAGTAATGATGTCTrATOGT 
GAAATrGAAGGTAAATrcTlX}GGACCTAGANAANAA0TNACOANT0A0a>COCT^ 
TNAAGTCATCCACAGAGTCGTANGT^^^r^WACAATX:ATAGAAATGCTGATTOTCAC^^ 
OAGAAAACrGTAAATCANNOGOTCCTOTGACCNTA'nTOOT 

SEO ID NO: 3063 ACACCTGTAGTnriTCTGACCTGTATOTATCTraAGGTOGOTaX}C^ 
TCGGATCACTAAAAGCCTTCCTACAGAAATCACACrrGTGGGGCT^ 

AGTGGACATGAAGTTTGGAAGGAQAOATAAAAOCTTOOGOGCACATTGAGCACrrCCACTTCCTT 
TCmGCTXn-GACrrXKKXX:ATGGCrGCXXXrrATGTCCCTX3^ 
CTGGTCV^GATGGGCmGAACTCTOTGTAAGAATrGCACTCCTTGOCACA^ 
TCrcGGTXnTCAGGACACCAATCTGTrGAGCATANTCTCGGCTAT^ 

AOOAOOOATATCTmGAA0GGGCAGAAAAGATT^TTCCATCATGAGGATAACCACCAAAT^^^^ 

TTCX;GOTCCTGCmGCGCACAACATNATCAATACATCATnCATCAOTrGNAAT0ATC^ 

AGOACCCrrrGGGGTTATCTrcAANATGGGAACTGCIXKiTGGCCATrcOCNC^^ 

GCAATNAAOCCAANCAGTCCNCATNNTGNNNTTTmCTAACANAATGT^^ 

A 

SEO ID NO- 3064 A Cl ^ t ^ ri ' l ' i 1 1 n 1 1 1 1 1 1 1 ll l ll I GGCTGCCCTAAATrGTrrATTAAGTATGAA 
TTrrACAAACTrrACTTATATrAGCGGTAACGGNGQAOCtGOAOAGTAnGCXXXrrr^ 
GaXXSGCGAGAGCCACCAATAOTGTGGTGGAACTTGTGGCCCITrCCAAGG^ 
GCCTGCAAATGNAGCCCACGCATmCCCrOrrOCTTGTGGACTGGATTGGAGACCACTGG^^ 
GOATTTCTrCCGATNGCTrrATGG 

SEO ID NO: 3065 TCGCGGCGAGGTACTTCGCTTTnTOTGCACTICTATGACATGGAAAT^^ 
>U^OAAGAAGC^TTUIT0OCT^GQAAAaAAaATA^AACCCAAGAGTTTCCGGG 
TTCTTXXAGGTGAATCAGTCOCrAACCrOOTrAGAAACTGCTaAAOAAOAAGAATCAGAaQAAGA 
AGCTOACTAAAOAACCAGCCAAAGCCTTAAATrOTGCAAAACATACTXnTGCTATC^ 
CATTTOACCTAACCAaXjCGAAAATTCATTtXGCTGTAATGTr^^ 
ACCm:A0TrAGGATTTCCTrCT0CATAAGG>nTITrrcTAOTC 

TCAAATATTrrAGGAGTATCmAATGTTrAOATAGTATATTANCAGCATGCAATyW™ 
AA>mrrCAAGCAGAOCANNCTTrG>WAGOACTTTTTTNCTGOAGTATrATAGGT^^ 
ACTGCATrcACCTOATCn-GNAATOCCrrrOTNANANNCrGN^ 
GTCAGAAAATTTACTNATTO 
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seq id no: 3066 actocgccrrcctgcactgaaocacccctgcagggcrcagtxjaccaac^ 
aatcagaoaaagack:aaaccoccgtogagacaaagaaocaoctgtocccct 
aga0tgccgatacgg<k3agaactotgtgtatctccacggagattctrotgacat^^ 
agctcctgcatccaatooatgctqcccaoanatcocaocatatcaaatcotgcatto 
gagaaggacatgggcrcix^titoccogtgcagcgcaacaaggacatggtgtgtc^^ 
ggaootootctatoaaaaaccaacccccagtgagctccgttrcogatcictccaacrcnaacaca 
cctatotxtnaagtocmazaaagtggaoakngctawcaatttgaoagcaaa cata aagngaoct 

CCTCOCANNCTTCnTGNG C - 1 rrr r i 1 1 GGGGAANAATrAGAACTTGGG OCAACn TNACCAGAAT 

GGACGGATTAAATOATG>WTTmmTAACTGGANNTTTTATaOQGATT^^ 

TAGGTNOTGGGCA 

SEQ ID NO: 3067 ACCAACCTATGCAQCCAAGCAACCTC^GCAGrrCCXATCAAGCCCAC CTOC A 
CCACAACTOAAAGTATCATCTCAGOOAAACrrAATrCCTGCCCXn'CCTOCTC 
TATAGTTCCCTCACTIXUTTTTnTAAC CI 1(JJ 1 U 1 GCAAATGTCTrCAGGO AACTGAGCT AATAC 
1 i 1 i 1 1 1 1 UCl IG ATGTTTTCTTOAAAAGCCTrrCTGTTGCAACTATOATrGAAAACAAAACACC^ 
CAAAAACAGACTTCACTAACACAGAAAAAACAGAAACTGAGTGTGAAANNTrGGTGAAATACAA 
GGOAAATGCAGTAAAOCCAaOOAATTTACATANCATITCXXSTTTCATrATTGAATAAGTC^ 
AGN>m3JGGGAGGTAAT0GACTAATATGATTTlTGGAChrrGTATTGNA>TOAT^ 
TTOONOTAAGAATTrGGCArrAGGNGTrAAGTCTATTrtKiATTrTTT^ 
TC^^Tra/VNAGGNAAATCNAAACCGGNAAAC^^OATAAACANGGGCAATATATTAil^ii^^ 

SEQ ID NO: 3068 CGCUGCGAGCTA Ci U 1 U 11 lU m m L H 1 Ul 1 HI 1 1 AGTTCAAQTTTA 
ATACAAACTACAAAAGATrAATGGGTTGCTCrACTAATACATCATACAAACCAGTAOCCTGCCCA 
(>ACGCCAACTCAGGCCATrCCTACCAAAGGAANAAAGGCTOOTCTC TC^ 
GGCCTGCCrrGTAANACACCACAATTOTGCTOAATCrGAAONCTrcT^^ 
AAAATCNGAANAGG 1 1 11 U 1 tL 1 1 ATOOCTOCCACCOCAGCCTGGCACTTAAANAGCCCAGCGCTC 
ACTT>ITGGTTGNANAAANATrhrriTGGTTCrrraGAATOAGGCTr 
TTCANCAGCTCGGCCAaTCCCCKrG>rrrrGTCAAGGNACTGAANG<^^ 
NA'rCXXAAANGCGNCCACAGOGNGGTATTTNANGGATTTGOCGAAAAATCCCTT^ 
AAAANGCCNTGNACNAAAGCTmroCI^ 
TNCAAONAAGTCTTGGCCCCCCNCTrri'riAAa 

SEQ ID NO: 3069 ACAGCAATGAAACACCAAAGGGACCrriTTCCTCCAAATTGTGTATAAGC^ 
TTTGATATCACACAGCGCrGTAACCAACTGCrCACAAGCTATTGGCTt^ 
CCTTTGAGCAATGAGCCTTAGTTCATTAGTCAOAACATrAGCATCAGAAGTTATCCCTOCCAC^ 
GCAAGOIATGTCCTCATTGAaTTTATAAArrrTrrCAGAAAAAAANACTrCATC^ 
GATGTTGCGTCTCTCTOCrOCAAGCAAACACCATCmTrGCTAAAATrcCCA^ 
GTCCAATAGCTTCCATGGCATATTCAACTTGGTATANCNGACCrnTGGANAAAATATAGNGOTCC 
TOOAGrrCATATCTTOGAACATlXJTTTCTGACTrTATATAGTnXANATGGAAA 
AGATGTCAaKjAGCCTTACm^COGAAGAOTAAACACCACCGTTACO^ 
COGGACTNGGCOGAC 

SEQ ID NO: 3070 ACCTCTOrrCTGGATCTGGGCAGTCAGCACTCTITrrAGAT^^ 
CTATITITATAGAAGTGGAGGGATGCACrATTTCACAAGGTCCAAOATTTOTrm^ 
ATGACTGTATIXjrAAATACTACAGGGATAGCACTATAOTATTGTAGTCATGAGACT^ 
AATAAGACTATTTITGACAAAAGATGCCATrAAATTTCAOACTGTAOAGCCACArrrACAATAOT 
CAGGCrAATTACTGfTTAATnTGGGGTrG AAC f VV 1 1" V I IGACAGTG AGGGT GGATT ATTGQ ATTGT 
CATTAGAQOAAOGTCTAOArrrCCTOCTOTAATAAAATTACATrCAATrcATTm 
GAAAAOTNCTTTCrGAGAAGTAGTGTrAAGOCTTCGAATGTGAAC ACAT^ 
AriXXnTCCCAOATTTrACTTACTACTGGAAATCCTACCATATAAAGCl-l H 1 ICl i N 1 11 AAAATG 
ATTC 

SEQ ID NO: 307 1 GTCCGCGGCOAGGTACAGTreCAOTTATTTACACTCACAGA^ 

OTAATAOOTAQAACTAOATCACTCACTGOAAATCAGAAAGCArrCAGTCAGTCTGATAATC^ 

CAOTTTACCATTCTTATCACTTTAACTGAAGTGTGAAATAACAAAAQAA TOCATA TAOTCCOTTAT 

TTAAAAATCCCTGTTACACAQAAAAAAAAAAAAANGAAACTTQTrrGAATTm^ 

AATCTGGNGAAATCGOTCGTAAATOrmONACACNCTGOCACArrTGTAAGCCA^ 

G ITI 1 1 i 1 1 AAAANAANNGATOANGANrrATTGGTCCOTATATITTAAAGGGATGAAAGONGGTAT 

G 

SEQ ID NO- 3072 A Cl 1 IL ' l m ' l ' H 1 IIICIJ J 1 1 1 rAATOOCAGCTAAAGATATACAQATTACTO 
TTAAATTCCAGTCCri rr rri ll'lAAAGATATrriCTrcAGTTAriTAGAACATW 
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TrrmAATCAAACAAAATArn-ATGAAATG<KmTrcrCTrAATTCTG^ 

AATAa:AATTCn-AATAmACAATATKyiCCAAMCTTAGAATmGC^ 

AOTXJmCrrrGCTAAGCCTTGCATGCAAAATITOAAATTTrAACATr^^ 

GGAATCTATXTrcroOAOTATrrCAAACmACATTGAAACATAArrCCTrGGAAAAC^ 

SrcAGGAGCmTTATCAACTtWAATtKnTATATTAGTrGOmC^ 

TAAQOCN 

SEO ID NO: 3073 ACAACACCGAGGTGGGAGAC^GTGOATCTGGCrGAAGTGAAC^^ 

TTcrocrccAGCT^ 

GCTrTAA0ATC00GGGAGGGTAAATAATGCAAAAATTGCACA(riXK3AAGAAGGGG^ 

S^GciTTCCATCCrGTAGTATAGGTAATGOAOTrGGaOGAAGCAGC^ 

>^SmAGCnTCnTrrGGAATGGCXX:ACCArrCTCACTGGAAAAC^ 

TTTTmTCrGACTTCCANAAATAAAANTGrrTCCATGGGAAAAAAAANAA^ 
TCCTNGGCGGGANCAC 

SEO ID NO: 3074 ACAACCCTTOTCACCATCTCAGGiXiCAGAGCACTCAOGTGC^^^ 

TOTOAGAACACACCAGAGAAAQAArrGCCTGTAAGTCCTGGTCATCGGAAAACCCCATT^^ 
AGACAAACATTCAAGCCGCmWAOGCTCATCTC^^^ 

NTTTNGGNGm^ 
CGTTATAGGTCC 

SEO ID NO- 3075 ACAGCAGCAGTTGTATrcmATTAGCTrOOTAGATCArrrTCTCra^ 

^^I^^TACTAGCAACrrnCATCCIT^ 

TGAAAGrGCAATArnGAGTATa^CTGCGAGATGATOTraAATnCAAGTATC^ 

OACTOGGAAACCCATrGCTCTTAAACTGGTGAAGATAAAACAAGAAATCCTCCCTOJ^^ 

TCAATCGAGAAGAAGTCrmATCTOAC^^ 

NOAAACTGGAGA>rrAAATAAACTTrGGNArrGATTACAATAAANCTCTOGTGC^^ 

CACCTrmSanxwTOAAAA^ 

(^T^ATrcK^AANAGGGNATCGTGGAAAAANANT^CTITCCT^W 

OTCANCCNONGATA TGGGGA TrCCCATNANOGNCGAANGGGNAATAAGTNCACCAANNTCAAN 
TTTGCCTCANGAAOiCTTnTGAAA 

SEO ID NO: 3076 ACTrGGTGAATATGAGGAGTATArrACTAAACrmCAACTAai^C^ 

r^TC^^GGGCATTCAGAAATACAAAGCAAAOATTOT™^^ 

GACOTIOTCTOCrATCTOCAGTrCCACAGACOCAACCAGTTACGAT^^ 

GGGArrCOACATCATTCCCTATAATGATCnXKXrGCACTGG^^ 

GCraOGTTCATGGTANAArcAATTCAOOGTGAANCANGCGrraTTGTCCCGOATCCAGGTrc^ 
TGGGAGTCOKKiANCTITOGCCAAGCNCCAGfnxnNrr^ 
CiCJKrGGGAGATGGTrGGCTGTGAKrT™AAATGGCANan-G>n^ 
nTGGGOCTATCCTGQKT 

SEO ID NO: 3077 ACCCnAAACTOOCAGGACATrrTTGAAATCACAAATITCCACAT^ 
GT^OQAAC^^ 

TT^AAAACCTCTAATrrAATAG^ 

TTT/^GAGGmATrATCAGTCrGTGCATAACTAAAAGTrCAAAOC^^ 
AACATTGTAAAOTAACAATlXnTCGTAnACATGCCTOGTATQAT^ 
TACACCTn-OTGTCACrGTTTCAAGAGACAAACAGATITOAACAOCTA^ 
a^AGCTTATGAAOTCrcAAACAAACTraAATTTTCTG^ 

SEO ID NO: 3078 ACOCGGGGAGCAO<y.GGAGGAGGCy.GAGCACAGCATOOTCGGGA 
OTCT^GGCCAGTrOCAGOCrrCTCAGC^ 
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TKKlAQTCATTTGCrriTGCCTXXn-AGGCATC 

XcStSSaSaagg^gaatx^ 

TGyCSATGATGATOACCATGTXXJACAOOCAGGACT^ 
ATGACACTGATCATKn^ACCACnr^ 

SS^SSS^^atagS*ggottat^ 
catccagaccctcggccggaac 

«!FO ID NO- 3079 ACCACGCTOGTCTAATGCANAAATXKIAOATrcCTACAAAGGACCCmAAA^ 
CCTA^i^CAAOAlSTC 

Statctcgaactatcgtcccatccct^^^ 
ctgoctSt^gtgacaatgacccaattc^ 

JSISJ^^SS?tg6gStaa^ 
iSSaSJraAcSTTCGG^ 

OOATGCnTACCACCACCCTNT 
SEO ID NO- 3080 ACAAGTOCCAGAGAACACAmATGATTTACGATCCCAGOAAAATTA^ 

AACNTCCAAAAAAATCCCCCCCCCCrrrmOGAAAAAAW^^ 

GNAATTCAAAANTAAAAGGGAAACTGNACCAACAA<XTTTT-m 

CCACACTCAGACACAGG^ 

ASiGSSA^GOWGTXX:AAATC^^ 

ACTTCAAGGGiXTACANNAAAOTC 

SEO ID NO: 3081 AATAAAGAACCnCTATCAGTGAGACTTCTCATTITATA^ 
C^QCTTAAATTrrCTTGAATTOCA 

?A1TTT^rACATT^^ 

OTmACTL^ScCGAciTtCrTGAAAAOCATOTtJ^ 

SSSiS^^^SfrACAGAGCACTATAACAT^^ 

ACTCTIGGATrrrCCAATCATATUITCCTCAAGGCATGT^^ 

ATAAAANTTCCAAACCACAC^mT^ACAAG 

CTJrt m Vn- 5082 ACAAGATGTOTTACTATCOCrrTGGACAGGTTTACACAGAAGCCAAGCGTCC 

^Sa^^^^acxctctcS^^ 

S5ST?§^cSGAASTTOTUnG^ 

ctaacaSSSo^ 

XijuiSGCCTrATTrQTITroCT^ 

jSJi^o^ASrnTnAAAj^cc^ 

GnTGGAAACCCrNTACCT 
SEQ ID NO: 3083 CCCGGGGTATTOATTCCCGCCCNACCCGQAmT^^ 
^SSG?S?TGACcSGTGAalcCAT^^ 

ISSS^gXSToaagtcccc^ 
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COTGAAAACCCTCCTNGCAAAACATNACCCTrGAOGNGGGANCXX:AGTGACCCrrCGAAAACGTr 
AAACCAAAAACCCNGOACANGGANGGCrrrCTCn^ACCACAAAAGGTTATT^^ 
CTTOQAAAANNGGGCCXX XI 11111 l ANNTOAAN(XCAAAAAAANATTrCCCNTNCCCG0GOQCC 
NOCCrCAAAA 

SEQ ID NO: 3084 CACTmxmACAQTTCATATATCAATAGTTAGCAGAGGGAGAAAC rCCTCC G 
NTGGTCCTACNCCAAGGGAAAAAATOTAAGGNCTGGATAGCTCATGAATAATATT AACAT TnTCT 
ATCACAAAATATTGGATAAATTGGAATTGNATCAATTCATGGTATGGAACCTAACATTnAAOCCG 
AATATTAAAATAATATAAATAAGGCrTTrAACTrCATATCAAQACCTATTTGAACACTTCTAOAAA 
AGAACATTAAAAGNTrrTTCCTCATTrATGGGTTGATITANTGNAN TG 

GGGGGCnrTAATATAATQCATAATTATrANAATATAATTATrAOGN'tmTNCANAATnTATGGGGG 
hWCCTTGCAAANAATrrTNAGNCCAAACrrCAAATTAACCCrCCNAAm 

SEQ ID NO: 3085 actcacacaagttgtttcaaaoatgatatttctgtgaacagaoagc^^ 

GAAGATrrOAAAATTATrAAAGAAAAAnCCTACAQArnTCAATGCAGAQACCATAATCAAAAA 
GTAAACTITCmAGTAOTATGTrCAATACATCAmAArrrrrTAAGl^ 

OCmAATTATTATAOTCTAAACAAATITATAAGArrACTGTrTGAAGTAAATAATACGAGTGAAT 
ATTTTCAAATGTOATAAAATAGCACJ^OTGOCTGGTGATAAAATTTGAAAOTATaGTTAACCTCA 
ACTGNGATCTTATGrATrn-AAAGGGAAArmAAATATGATTATINTAGGTraATTACAAA^ 
A 

SEQ ID NO: 3086 ACATGTTItKiAAATOAGTTAGATAmGAAAAGTCTAAACACACTGA'nTAO 
GAGTG<XTATCnTGCTGTCTCAATGAACAATGGGTCAATAGTrCATATGGACTAAATTAATCr^ 
GGTAGCACGTATGTCTGTAAATATAAAATTATAmaACATTAOCACCAOTOCAOAACATOCTCA 
GCATCATAAGATCCAAAACACCAGCACAAGTrTATOCATCATTAAAAACAATTACCTCCTTCCTAA 
AATaTCAQAATAaTAOGTAACTOAGATTAATAAATCTTATCCAAAAAQTAACTCTTATAACATAA 
AAO ri 1 C 1 1 11 AGAAGTATATGTGAGAACCAATCAGCTAAATAGAAATGTTTAAGATTGAAQATGC 
CAATArrrTTAAAACTGGCATTACCATATCVkOCTACAACAriTCTNAAAATGACGTGA 
TCTTAAAAATTTTTTAAAAAATrCCTTCACTGGTTATCAATAAATGNOGC^ 
CCTTWAAAACTTTTrAATTCATCATrNTATGGGGGNmAANAAAGGGGGGGNAGC^ 
NGOAATNC^^T^rrTTANCCAAT^^^TAAT0OaOGGANGAANAAAAAATA 

SEQ ID NO: 3087 ACTrGCCCCTrCCCCAGAAAAGCOOGACTTOCTOCTAAGGGTOAAOQACC^ 
GGCAGTKrrCCCTGCGTGGTCTGACACCCrraAAACGTGGGTGTATAA^ 
AATCATTAAACACCAAGGGAAGG<nXK:CTrCCCAGTCT0TGACCAGCOCCOQAOTITIX3GOT^ 
CGGATAAAACGTGTCTCriTTGTCKrrACCAGAAAATGAAAGGAATTGAAAT^ 
AGATrGAAOTOTAOTOCCAAOATrGAAAGGAGAAAGTGOTTGAGGGATAGTGAGGGAAGTTGGA 
GAAGAGAGTAAAAAGAGGCTGCrTACCAGATTraAAATTGGTGAGATGTTTCTTGG^ 
TCrOAGOACCTGAGGTOOTAGGTGGATCTrTCTCAGOGAGCAAAGAGCAGGAGGACGGAGGATTC 
ATCTXXXIAAGGGAGGTCCCCCOATCCOAGTCATGGCACCAAATTrCATOTOCQrrCCAT^ 
GACCACCAAACAGGCrrrcTCTGAGCAACATGGCrrGTrrATTTCACCTGGOTGCAGGC 
GTCXXAAAAGAOAQTCAOCCCCCGCGT 

SEQ ID NO: 3088 ACAGATGTCAGTGOAAGAQAGTCTrACTGACACrCAOTroOi Vivi 1 1 iCAGG . 
TCAGCACTCrrGTATACATCCANCAAAGGCGAAAGAANAATTCGTGTTCATACTrnKK 
OTAGTTTCQACTCroAATGATOTCTTTCnGOANCTGATGTrCAANCAATT^ 
Am-ATCGNTGTTOACAGATCTATGACTGCX>GTCrGAGTGACNCTCOGGATGC^ 
GCCATTQACTXX^irTTXyiGCTACCGTrCTrCAGTCTrAAGT^^ 
TTnCTITGCGGCTTTCCX:ACTTTTTGTOTNGCTCTOCTrAAACAGAA^ 
AATGCACCrcrANATGAAOGCATnrraCTATGTGTCAAGT^ 

TGCTCACAACTCATCCCAGfmOGATAAAAGTrGACAATCTCTCyLNATGAGOGANCCC^^ 

AGGGGATAGAACX:ATAUn>lAGCCCCCCATTrm'AAGKmCAGNGGGAAAAACTGAA(>AAAA^ 

TQGANCrrCCTNATOGATGCAOGCNTTraACCTTGGNCGG 

SEQ ID NO- 3089 ACTTT NTl rn ' n 1 11 H n 1 It 1 l OOGATrAGTOGGCTATnTCTQCTAOGGGG 
TCGAAGOGGATGAGTAAGAAGATTCCTGCrACAACTATAGTGCrrGAGTGGAGTAGGGCTOAAAC 
TGGGGTGGGOiXTTCTATGGCTXiAOGGGAGTCAGGGGTGGAAACCTANTrGGGCra 
CTGCTGCTAGGAGGAGGCCTAm-AGGGGGGTGAGOTTrGAATTACCOTrAAAAAGGNC-ll li lOTT 
ONOOGNCTCATGAGTTGGAGTOTAGGATAAATCATGCTAAGOCNAGGATNAAACCCAT^^ 
TACGOTTGTATAGGATTGCITOAATGGCItKJrGGGTTGGCATTraCTCOOC^^ 
GAACCANAAGGATTTAATTCCTACCCCCTTrrAACQJATOAACA^ 

AACTAAAATTACmn-OGNAATTAOOAANATOAATrAArnTrTAAAAACTGATTAATGCrrGGGTC 
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TQAAATrmrn-CCAAONGGNAATTrrn^ATCGACCCTOTTACC^ 

^T^^^■GGGAAAATmc^N^^TAAAC^^■AAOOQAAAAC^wooT^^ 

CAATTOCAAAAAANAAA 

<?Fn m NO- 3090 A Cri 1 i 1 U 1 1 1 1 i n I GCGTrrTGrmGANACAGNCTCGCTCTOTCACCAAGO 
CIWAGTCCAhWOGGACGANCTraGCTTACT^ 

GCCTNACnCrriTGAGTAGCTGGGACTACAGGCATGCACCAa:ACACTrcGCT^^ 

ATTS:rAOAGACAG<KnTITACCATACTGG^ 

TCCATXXXKXnXJAGNCrcCCAAAOTGCn^GNATTACAG^^ 

SS^TAGGAATrATAATTACAAATrmAArnGTaiAANAAGG^^ 

AANCCATNAANCAAGAAANCCATANAAGGATrCTTnTrAAATTNATOACCNCAATACAAT^ 

ATACNTTrrrATmOCnTANATAAArrAGNAATAANACAhnTrCTT 

J?S^^ArT??gSS^T^^SS^^^ 

cmtStcccacgggggoiaaccctixkjanatgnot^^ 

TTAmrrrGGAACCGACCATNTTACAACANGGGCTrcCAOGGATTmATAATGCCN 
GGNAACrAAGTNCANAAACCCAATTmx:CATCNAA(X:NNAAAAANGATKWAAA>^ 

nccttnogccogaccnccctt 

ATTO-nnTCTATAAGTCGACnTTCATGA^^ 

GAGAWrrAGOAAATOOTAAmGCCnGCTACACATTAAGAGGGCTATO^CT^^ 
?i^2CTCAGATAA(nGCAGTGTCrrrcCAATG^ 

ATAAAATTACTCATTCAAAACCTCTGGCrAAGGNGATrrrcGAAGTTCTTAACAGT^^ 

ttiSattSJS 
cctgcccngggcggco 

SEQ ID NO: 3093 ACTGGATrcrrcATCTrTCn<K:ATCATCAAGAATGGi^^ 
ATOOAOCTAAGCAGTATCTCAGGATnXJAAGAAGGTTCAGAGCT^^ 
CATGAij^GACATGAOQCrcOAAOCTOAAGCAGTrGTAAATGATGT^^ 

SAAACAGATATraCCTAOAACICACTGAAGCAGGGCTCA^ 
GTAGATGATCATITACAGACTCCXTACCATGAAACyLGTCTAC^^ 

GCCTAOCOAGAAGCATrTGGAAACGCACrccrrCAAAGACTCGAAGCmGAAAAOAOATG^ 
GTCATGACTACACTTTITCCTrrCAGAOGOGCTGGTCCTGGT 

SEC ID NO: 3094 A LTlIiUll lUl HUlIl nilllllLl lllti ii iii wt^^iijrriMiM w 

n/jIgoattttaaaaatn^ 
GGAAS^ISSaSra^c^ 

?IISSS^^Sttotcagt^^ 



ATTrnGGrrrATCnTrrAACTTCAGTOTQAhLVAACraACAA^^ 
TTAAACATCACTAATCNGTGTrATAAAATAATTCAACAAATACGTrANAATAThmT^ 

GCTGATGGT 

SEO ID NO: 3095 ACTCnWANGTCACANTtmATtKrATCAJTTGAC^^ 
TCCANAG>^^ 
AACTTCAAGAATCCT^ 

5i2S^AGSTOrAOAGTATACACAGGTCAAACX:AAAAAATAC^^^ 
^XbcTTCCAOAATAATATATACATC^^ 
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AOOGTAAATCCAAAAAGTAGGT 

SEO ID NO- 3096 ACAOAACrrATCrGGGTGAGGGCCCCCCX:\TG(m:CGGGATXmnT^^ 
GCCAAGGANAATGCACCTGCCATCATCITCATAGACGANATTOATGCCATCGCCACCAAGAaATT 
CaATCCTCAOACAOaOGCCOACAGGGAGGrrcAGAGCATCCTGCTGGAGCTXKTGAATCAGATXM 
ATCOATTrcATCAGAATGTCAATGTCAAGGTAATCATOGCCACAAACAGAGCANACACCCT 
CCGGCCCTGCTACGGCCAGGACGGCTGGACCGTAAAATTGAATrTCCAmCCTGACC^ 
OAAOAGATTGATTTTCTCCACTATCACTAOC^GATGAACCTCTCTGAGOAGG^ 
CTATXntjGCOCXKiCCAGATAAGATrrCAGGAGCraATATTAACTN 
T(yrK}aCraNCCaTGAAAAOCGCTACATraCCTGa;AAGGACT^ 
CATCNAQAAGGACTANCANGANCATraAOTmACAAOTmCCmCCTmCCTrCA^ 
O^AOOGGCTGGGGCTITGThrmGCACCCNCANCACCTCTN^ 

SEO ID NO; 3097 ACCACCAATCAATGCCAGOAAGAOAGTAAATGCACTrAQACTCATTTTrG^^ 
yUUVTGGTrrGAATCmACTGTCTTGACACTGCTTTtXrrTA^ 
AAaAAKX>CCAATATATGCrGTGAACTCrcTCAGGAOCC(XGCGTACrnGGC^ 
CTATOTCCCTCCTCTrGGAAACTrTOATOTGGAAACnrrTAGATATAACACC^ 
ArrrCACCAAAAATCAGAAAGAAAGGAAAAATAGAAAGGAAACAAAAAACAOATOOCAOCAAO 
ACATCXTCCTCTOACACACrrrCAGAAGAGAAAAATrCAGAATGTGACCCT 
GG<XAGCTAAACAAGOAGTrCACAGGGAAGGAAGAAAAGACATCATrOTrACTACATAATTCCCA 
TOCTTTITIXXXiAQAGCTGGACATTGAGGTCTTCTCTATrCTA 
ATCTTAGATACTGAAATGCCACTGAAGCTACAOAAGTrGTGCACTTGGAC^ 
CTrGCTOGAAOATCTCTCCAGAGCTGGAGAGTATGCrGAACCTCCTATTGCC^ 
CrCAAGAACAAAOGAAGC 

SEO ID NO: 3098 ACCTrAACAGCrCTGAAAGCTTCTrCCOATnTGAOAOTCnXXmrrATT^ 
AGCTTTATOTITrrAGAGAACTGCTCATAGAATnCITGTAGTTCTrmAT^^ 
TAAAGAGTTCTAAGCATrrrrrGACX:AAATrmCCTQATAACTTrCAAAATr^ 
CATCTGAOKKiATATOTTrAGAGGGAGATCCTCCGAGTCrACCArc 

TTCAGGGATTAOCTCCTCACAGTTATCCATGATGAAAACTXnXfCGTACTGTOCCAATGTCL^i in.ii 
ri - l - 1 1 1 1 n 1 1 1 1 I CCmSGANACAGAGTriNTCTNTTgrCNCCCAOGCrGOAOTGCAATGGCNCAA 
TCTCOOCTACTOCAACCTNCACCTCCTGGGTTCAAGNGATNCTCCTGC^ 
TCGGATTACNGGCGCGAGCCACCATGCCCGGCCAOTOTCC'l-i l'Cl lATACACGT 

SEO ID NO- 3099 ACGaKSGTAGCATTOAACrrCTAAATACAAOATTraCTnTOAACAG^ 
TCATTACAAOTGAOTCTCATCTGTrCATTAGAAGACCATTCAGAAa^^ 
TTTGCATTTCGAAGTTAATATGGATGCArrrcTTOTTrnOTrr^^ 

■TITOAQACGGAGTCrrCACTCCAGCCTGGOCGACAaAGTGAQACTCTGTCTCTAAAAAGAAAAAAO 

ACAGGTGAArmATGGrrATGTOAATTATAACTCCATAGAAATAAAAAAGTGACAAAACCAAAAT 

AAmOCCAGCCTItrrcGTrAGAAATAACTTCmGGGCCGOGCACQOTGGCAC^ 

CCAGT 

SEO ID NO: 3 100 ACACAGATATn-ACATTTAnATCarmATATATACGTTTATGTATmrrC^ 
CTATOAAATACTTCAnAAAAAAAAGAGNGAAAATGTAAGACrNCAAAAATOAAOACIV^^ 
ATOTATAACAATAAAAmGTOTAAAAAAGTATACAAAGAAAATGTTCAAm^ 
GCTCTATATAATCAOTATTAATCTAAAGACCATAATCCTAQTATOAAAAAAATAACAACCTT^^ 
ACAAAATCTAAOCATCTGCmCCTGGCACrCrrAAGTTAGACAAGAOTAA^^ 
AATIXJIXKXrCITCATUrntATCTATACTrAGAi'i 1 J iCl 1 i ATCTITGGACrrCTrTGGACTrCTG 

CTTCaoTCTCTCTrmACTccrrrcTcr^ 

ACTATAATGGCCATCCCnrrCTCOAOATCGGCrACGACTrC^ 

CT Cn i ' lGl I I CTCCCAACAGCrmnTCTTmCATAGTGACCAAACCCCATTTCTCTATAGATATC 

croATTAGTArnrraATiXKJCATtrrAAGCCTCATCTrcmcrc^ 

TTCTTGGOTT 

SEO ID NO: 3101 OTACCGCTOAGGGAAAGGAOCGOGACTCCGQACCTCCAOOAOTGCAy^ 
GATCCTOAAAOOAATAACAAOGCTTATClCTAGGATCXrATAAGTTGQACCC^^ 
CATWKJGACOCANOCrCGCCAAAGCA-nXKriXKnCACCTAOATA^ 
CGAGAQCrArrrCCCGCACCAATGANAATGACCXXraCCAAGCATCGGGATCAG^ 
CACTACAACATCIXXXXXCAGGATrrGGAOACTGTATTrcCCCATGGCCTT^^ 
T0CAO0TGAAGACATrCANTOAAGCnGCCrcATGGTAAGGAAA(XAGCXXTAOAACT^ 
TACCTOAAAAACACCAOTrrTGCTTATCCAOCTATACa^TATCrTCTGTATGGAGAGAAGM 
GGAAAAACXXJrAAGTCmXKXrATGTrA'aCA-ITrCTGNGCAAAACAOOACTC 
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ATTCCAGATCCTCATCmGCWTGAAAAAnGrrcGGGATCITCTGCAGT^ 

CGCmOATCAACCnTrAGAGOCTTCNACCTOCTQAAOAATrTNAAAACTACAAAT^ 

TQAACCAGATAAAAGTTCAAGA 

SEQ ED NO: 3 1 02 ACGCX}GGQACAGACGAGATCrOGATCGAAGGCGAGATGGCGGACGTGCTAO 
ATmCACGAGGCTCKSOGOCGAAGATTTCOCCATGOATOAGGATOOOOACOAOAOCATTCACAAA 
CTGAAAGAAAAAGCGAAGAAACGGAAGCGTCGCGGCrriGGCrcCGAAGAGOGGTCCCGA^ 
OOATaC0TQAaOATTAT0ACAOCOTOGAGCAGaAT0GCOATCAAC(XGGACCACAACGCT<^^ 
GAAGGCKKjATrCItTnGTAACTGGAGTCCATGAGGAAGCCACCGAA 

ATTCGCAGAATATGGGGAAATTAAAAACATTCATCTCAACCIXXiACAGGCGAACAGGATAT^ 

AGGGGTATACTCTAGTTGAATATGAAACATACAAOGAAOCCCAOOCTOCTATOGAGOOACTCAAT 

G<}a:AGGATrrcATGGOACAGCXX:ATCAOCCGTrcACTGGTGTmCTnXK^^ 

GCAAGAOOAOAGOTGOCCCGAAGACXiCAOCAAAAOTCCANACCGOAGACGTCGCTGCAGCTCCr 

CTGTTGTCXAGGTGTrCTCnCAAGATrCCAmGCCATGCAGCCTTGGAC^ 

OaAACTTOCTONGnTATATTTAATCr 

SEQ ID NO: 3 1 03 ACACAATOGCTTTTOOTTCrrcAAAATTGCTOCCCTTATIX^ 
GCrCmCTACATCCCTGGGGGCrATTTCAGCTCAGTCTGGmOTTGTT^ 
CCTCTTNATCCTCATTCAGCTOGTGCTGCrraGTAGArrTr^^ 

AATCOAATCGAAOAAGOAAACCCAAGGTTGTGGTAT<Krrecrn-ACTGTCTrrCACAA 

TATATCCTGTCAATCATCTGTGTCGGGCTGCTCTATACATATTACACC^ 

OAAAACAAGTrcrrCATCAGTATTAACCTOATCCTTTOCOTTOTOGCrT^ 

CAAAAATTCAGGAACACCAAGCCIXXSCTOCGGCCTCTIGCAGTCC^^ 

ATGT 

SEQ ID NO: 3 1 04 ACAGACATGTTGCAAACTGACTnTAAAACAA m ri' 1 AAAATATATACAAAC 
mill ICi I CTATTCrrCTCAAAGGCAmGAAAGGOATACTTTTATGAATATTCrr^ 
CAATGTAGAAATAACirCTGGGTATAAAACAGTAAAAATAAAAATATrCTACCTGAGTOTGTrAA 
ATCAAOTGATTTGTAAAACAAAACCTIt^CAAOTGTGGGCmCTACATGTAACT^ 
AGGCTTACACCCTCATCrrCTACAACACAGATCACTAATOATGATAACATOAGTTAAATr^ 
(nTGCCCTTCTOTGTGGCTITrcGCnCTAOGTTCTATOACCAAACTATTGACAT^ 
TATGCAGTCATimxmJATAAATCTACnTrACAGCnTGCTTCTACCTC 

TGCTGTGAAGTCCTACATATGCTrCATACAATATGrrOCaX^ATCTOA TAATAAAAAT ACAAAGGT 
GCTCmAAGCTAAACXATAAACCTTATTAGAGAATTCTAOTTAAGTia 1 1 1 1 U 1 11 1 1 CCCATACAT 
OTAAATACCTrAAQATCAATAOGATCAATAAGGOATAATATTAAGTTATCAAAATTTANTCTr^^ 
ACTTGGAAGAATTTG 

SEQ ID NO- 3 1 05 ACANCCAOTGTGGGGATGTGATGANGGCCCTGGGCCAGAACCCTACC AACG C 
CGAGOTaCrCAAGGTCCTaGOGAACCCCAANAGTGATGAGATGAATGTGAAGGTGCrc 
AGCACIKrCTGCCCATGCrGO^GACAGTGGCCAANAACAANGACCAGGGCACCTATGAC^ 
0TCGAAOGACTrQ*GGT0TrTGACAAGGAAGGAAATGGCACCGTCmXK3GTGC^ 
TGTTCTTGTCACACTCKKiTGAGAAOATGACNNAGGAAGAAQTNNAaATOCrOGTaOCAGOOC^^ 
AGGACAGCAATGCrrrGTATCAACTATGAANAGCTNGWCNCATCGTGCrGAATGOCra 
TNCCAAGTCTXXCCAGAGTCCOTOCCTITCCCTGTONOAATTTTONATCT 
GGCTNTC^TOT^^■CANCAACTTTCCCATC^ 

AAATAANCTTGCTCThrrOOOCAACCATNACTNCATGGAAAAAATTGJ^ 
CCGCTTNAAAGOOCAAATTCAA 

SEO ID NO: 3 106 ACGCGGGQAGTCAOACCCAGTCAGGACACAGCATGGACATGAGGGTCCCOG 

crrCAGCKx:raooocrccroCT<}crcroGcr^^ 

CrCCTTCCACCCTGTCTGCATCmjTAGGAGACAGAGTCACCATCACnTGC^^ 

ttattaoctggttggcctggtatcagcacaagccagggaaagcccctaaactcct 

G03TCTAATrrAGAAACTGGGGTC(XATCAAGOTTCATCOOCAaiXKKnXT 

CTCACCATCAGCAGCCTGCAGiXTGATGAriTTGCAACrrATTACTGCCAAC^ 

CGACQTTraG<XAA0Q0ACCAAGOTGOAAATCAAA0GAACTGT0ACTGCACCATCrc 

TTCCCGCCATCTGATGAGCAOrrcAAATCTGGAACnXjCCTCTGrroTGTG^ 

ATCCCAOAGAOGCCAAAGT 

SEQ ID NO: 3 107 ACACXnTra^aTTTCGAOCTIXTrCTCCAOT^^ 

AGATGAAGGAAATTCTOACATGTTAGTGGTGACCACAAAAGCAGGCOTCTTGAGTTGAAAATTG 
AOAAAACXyiTOAAAGAAAAAQAAOAACTGTTAAAGTrAATrGCCGTrCTGOAAAAAGAAACAGC 
ACAACrTOjAGAACAAGTTGOGAGAATCGAAAGAGAACTTAACCATQAQAAAGAAAGATOTGAC 
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CAACTGCAAGCAGAACAAAAGGCrrcTTACTGAAGTAACACAAAOCnAAAAATOOAAA^ 
ACnTTAAGAAGAGOTTCAGTOATOCTACATCCAMGCCCATCAGCrTGAGG 
GTAACACATAAAGCAATrGAAAAAGAAACCGAAnAGACAOTTTAAAOOACAAACTCAAGAAGG 
CACAACATGAAAGAGAACAACTTGAATGTCAQTrGAAGACAGAOAAGGATGAAAAGGAACnTA 

TAAGGT 

SEO ID NO: 3 108 ACGCGGGGGAOTITCATGTTATCCCrGTrOCA OOCAAATO T AAAGT CTAGAA 
AATAATOCAAATOTCACOGCTACTCTATATACrmOCTrGGriTCATnTrr^^ 
CATGACnTAGATGGGAAGCCTGTGTATCGTGGAGAAACAAGAGAOCAAcri 1 J i CATTCCCTGOC 
CCCAAnTCCCAGACTAGATITCAAGCTAArn-lC-l 1 1 1 ICTGAAGCCTCTAACAAATQATCTAGTT 
CAGAAGGAA0CAAAATCCCTTAATCTATXrrGCACCGTnK30Aa:AATGCC^^ 
AAAAAAGTTGTAATAGAGAATATrnTGOCATTCCTCTAATGTrGTGTGrriliri 
GAGGGAGQGOATTTAATmAATmAAAATGTTTAGGAAATTrATACAAAGAAACTnTrAATAA 
AGTATATTGAAAOmTAAAANAAAAAAAA 

SEO ID NO: 3 109 ACAACATGACITAAAAfi 1 1 1 1 1 1 1 U ICTATTAAAACTTAAAOGGGAACAAA 
ACTrGAAAAAGCCCIQTrCTTCAOAAGGTCAGTGGGTrcAGGGAGGCAOTAATATGAAGTGACTO 
CTOTCTATTrrAACTAOCAGATnTTTATATrTOCCACTGTrAAATAQTTOQAA^ 
ATTAAGCGAAAOTGOTATCATCCTAGGTAAGCTrATTrCAGAACAAGTCrAATATrrCAG^ 
TXrrTnCGACrn-ATACTCTOAGTrArrACTTACTGTAAGTOOTGTATA^^ 
TCCAOTATCGATCnXKn-AATATGCACAGTAAATCCATGTCTTTGTTTGTrmCT 
TCAAGAAAGATAATGTCAAAAAOAAAGGAATTTAGAGGTAGOGAAAAGATOAATGTCAGACArr 
TGAAGAACTATAOTAAAATGATAAACACTAAATATACTTGAGAAAACITIXriTAATATOC^^ 
AGOTAGGCCraATCTrroAAATAOTCAATAGGAATACAATOCATnXXnt:AGTGATCACTGATT^ 
AATGAGnGGOTGGGA(XnTGGGAAGOCAAACGGAGCGGAOTTCTGOATCATGTCCCATCCAGTC 
CAOTOAATCCACGACCC 

SEO ID NO: 3 1 1 0 CAAOTGGCAOCGCTGGrGCT00CAGGArr ACCGC CTAATTCATCATAA\TAT^ 
OTTTCCACTGACGGCGGGCTGCTATTGTTTCATATCCTCCCAOTrnTO 
CCAAAOOTTAATCrGmAAAACCTAAATAGGGTATTCGTIXTrATCGGCGTrr^ 
TArrrATAAAGTGCCACCAAGAAGGCrrorrCATCnKXXTOCACTCTTCACCAATGGCAACC^ 
0 <jrt 1 1 i Cl r CATITGAGACTrTmACAGTrATGQTTATTm-CGCCTTGGTCAAGGCTOACCrQC 
CTCACATITCACXTrcGCAACOGCrTTOCCATXXjOAATTGTTGTrG^ 

TGAATCTCTTCnTGTGGGCATGG l J ICl 1 1 1 I GCGTGGTCTGCCrrTAAOAriTGGCGCAAACTCA 
TOjCATATACTCTCGTmCTATGAAGTCGGGTGGCAAAGGTCTaXGACAGT 

SEO ID NO- 3 1 U ACTGCACACCAGGAAGGCCAAGACAAACACAAATCAAGGAATGAAOTTITC 
CCAAAGCTGCAGTCTGAAAAGACTATAAACAGTTGATTCCATACACATGAATGGGri i^-i i iGCTA 
TAGGAAATCCAAOTGGAATAAOGAATGGAGATGTGTAAAAAGGTrrCTTGAAGGAAAGAAGGAT 
GACACCCTGTATGGATITAGTTITCAGCOCCTICTGCCOCATCACATTC^^ 
TGTCCAAAOTGTrAGGTTGTCTCTAAGCAACTGCATGATGAGGGTGCTGTCTT^ 
ATIX^GTGTATCAAGTTCAGCAATGGCCrCATCAAAAOOCGTmAGCCAGCGTGCAGGC^ 
TGGGTrATTAAGAATCrCATAOTAAAATACAGAAAAOTTAAGAGCAAGCCCCAOGCGOATTGGGT 
GTGTCGGTTGCATCTCrTrcrroCTTATATCAAATOC^^ 

SEO ID NO' 3 1 1 2 ACT t - 1 - 1 1 m 1 m 111 H n H I GOAGTmTAGnTATTAATGTTCTTGCGAA 
AAATAG^CAGTCGCCACAGCTAACCmACTCClTCGGCrGTGATa^^ 
rrGCCAOCACCAACATTGGCCTrrGCAGTCCOCCTGACnTrCrrcATT^ 

rracrnxnTCAGGTurrrrTCTTCTCATACAGGCCATOT^^ 

TCTrrGCATAATCCAGGQAATCATAAATCATGCCAAAGCCAGrrGTCTrG<XACCACCAAA^ 
TKrraAATCCAAATACAAAGATGACATCCGGTGTGGTCITOT 

SEO ID NO: 3 1 13 accataaggagacacaagaaoaaaggtgacactaaggctacagtgcacaga 

AAACAGACCAGGltnGGCnXXJACTGTGCGOACCTGOCCACT^ 

GTCmXIACITraACATCACACACGGTTTATAr^^ 

ATAASraSASoCCrcCACACAAAAAGATGATGA 

CAAAAAGGGAOTATTTAAAGOAAACr^ 

CCAAGGCCTTAOGGCGGOAACACrrTTCAAC^ 

/JcOCAATTrCCACAQOGGAGGCAGATCTTCTATACCTACAGTGACAGAAAATACACT 

AGTATAAAATATAAAAAGGTTTGATrCTQAATAGACCAACTGCTAATrmCTT 

AATTWrGAOTAAAAAa:AAArrAGrrcACrGAATCTCATTrr^ 

AATACGAAAACKX}AGCrrATGGCrGTlTrGATmCTCTGTAOCACANGATAACCAGTA^^^ 
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GGAGAACCTACNAAANGNGGC 

SEQ ID NO: 3 H4 ACTTGrroCCTAGTrmCAAGOTATrGCKrrGrrCTATAGATGCAGTGATTG 
CCAGCTAGCTCTGTTACCAGCCITITGOTOTGTCmATOTTCArrTO^^ 
CAGOTGATGTAGCACrrCTOTTITrAATAATTATTOCTTAAAATACCTArrAAT^ 
TrAAAGGGACTTGAGGAAGCTACCC>.OGArrACAGAAOAGTGTCCAaTAACAAGATGGTCTGGC 
AGTTTXXn-AGTTrrGTATCTGGTTCAATAOAAATATGTGAAAGTGGTAATOTCATCAmGATC^ 
GAQTXXXSQOTrrCTCTATAATAAATiCCCITTXXX:AAATXX:ATGAGTrGCAGAC^ 
GAGTOAAGCAAGrrGGGTGAGTAAAACTATTTraACOTOOQAGCCQTnTCAGATAGGAGTTTAGT 
CTrOACGAAAGTGTCCGTGCAGGAATTOGACTCCGAGGAGGGTTACAAGTATCTCCTGAC^ 
CTGCCCTCXiCATCTGGGCAATaTTGACATTTOAaGTGGCAAGCANOATGCCnX^^ 
TTGGGTGAGTAACTGACCCAO^AGGGAAGGTGAATGATTAAATCANAAATGGGATCTT^ 
CTGAAAACnTATTTGGAA 

SEO ID NO- 3 n 5 ACAGAGTCOCATCCATTCTrrTTOAACAACATGrrAGCATGTCTGCAACAGTG^ 
ATGATTCACAAACCATATACAGTCOCCATAATCCTQTATCTCTGTAGGAA^ 
AGCTATGQCAAAGATTOCCATGACAAOTGAGCTGGOCXAGCTrGCTAGATAAATrc^ 
CAAAAGAGCGATCCCAOTIXKXAATCAGCGTGTrrGCAACCATGAOACAOATraTATCTXJGATCT 
OCOCAACCAACAGCrTCAACAGCrATIXK:AAGGTGCGCCAAAGGCATCTTGTCATCCCTCAC^ 
AATCTCACTTCCTGTGAATnXjCAGGOAGGCAOAOCrOGTATrrCT^^ 
OTCACOTAAATGAAACTTTGCTAAOTCAAGCAATTCATX^TGGOAAACAaiTCCAGC 
GCACTGTTCrrGGCCCCrrATAATCnXJTGaTrATATAATCCACTAAGTCCnAC^ 
GATATmCAGTrrGGTCCCAAAATTGCCGTCCAAGTGCAGTATITTQATAAGCT^^ 
ATAATCAAAAACAACrrCrraTAAAriXKJrrrCAACTIOT 
CACQNTTCATTCTCTGC 

SEO ID NO- 3 1 1 6 ACGCGGGGAGGCArrGAGGCNGCCAGCGCAGGGGCTTCrOCTGAGOOGOCA 
CKHXKiAOCTraAOQAAAaXKAOATAAGTTnTrrCTC^ 

-ITAAAAAATATAGTCAATAGGrrACTAAGATATTGCTrAOOTTTAAaTTmAACXiTAATTTTAAT 

AOCTTAAGATnTAAGAGAAAATATGAAGACTTAGAAOAGTAGCATGAGGAAGGAAAAOATAAA 

AGGTTTCTAAAACATGACWGAGGTrGAQATOAAOCTTCrrCATGOAGTAAAAAATC^ 

GAAAATTGAGAGAAAGGACTACAGAGCCCCGAATTAATACCAATAQAAOGGCAATOCrnTAOAT 

TAAAATGAAOGTOACrrAAACAGCTrAAAGmAGTrrAAAAGTTCTAGGTGATrAAAATAATTTG 

AAGOCGATCTmAAAAAGAO^TTAAACaXiAAGGTOATTAAAAGACCTTGAAATCCATGACGCA 

GGaAGAATTOCGTCATTTAAAGCCTAGTTAACGCATITACTAAACGCAGACGAAAATGGAAAGAT 

TAATTGGGAGTGGTAGOATGAAACAATTTOOAGAAOATAOAAOTTTGAAGTGAAAAA 

SEO ID NO: 3117 ACACACAGGACCGCCrGGGGTTNAAGGAAATGQACAATGCAOOACAGCTAG 
TXnriXrrOGCTACAGAAOGGGACCATCrrCAGrKnritJAAGAATGGT^ 
CATTCCTTGGATGAAACX:COTATAGTrcACAATAGAOCTCAOOGAGCCOCTAACTC^^ 
CATCGOAGACAGTnCCTIty^TGCCCAAGCCTGAGCTCAOATCCAGCm;^ 
CATCrAACATXKXXn-ACTKK)AAAGATCTAAGATCTX3AATCTTATCC^^ 
TATGGTCnTGAATGCAAGITrAATrACC:ATGGAGATTGTmACAAACTrrraATOTGGTCAA<^ 
AGTTTVAGAAAAGGOAGTCrcmtXyLGATCAGTGCCAGAACrcTGCCCAGGC^^ 
CTAACTAAAOTAOTCAGATAGATTCTAAGGGCAAACATITrTCCAAGTCTrGCC^TATITC^ 
AAGAOGTGCCCAGGCCTOAGGT 

SEO ID NO; 3 1 1 8 AC0CGGGOG0QACAAGATGGTTTACATCrCGAACGGACAAGTarrGGA(y^ 
OKSAGTCAGTUnXIATGGAGArrATCITraATAACAQATTTCri^^ 

GU 1 lUI n 1 ]C AAAAaxnX3CnXAGCAA0ATGTGAAAAAAAGAAGAAGCTATGGAAACTCATCT 
OATTCCAGATATOATGATGGAAGAGGOCCACCAGGAAACCCTCCCCGAAGAATGGGTAGAATCA 
ATCATCTGCGTXX3CCCTAGrrCCCCCTXX:AATGGCTC 

GAAGCAGACAACOGOACATGCOCATTCATAGCAGAAGGAAACCATCAAGAAGTGGAAGGCTGAC 

CATGATGAGCAGTAGATGAATGTGTAT0TCTAAACAAGGACrGCrcTaTOTCCr<X>GATG 

AGG-TCATOCn^GGAATCCCKTCCANGGAACrGCCTGACTOACATGCAGNTCCTTAATG^^ 

OTmxnXIArrACCCnTrGTATAGTTrATTAAAGTATrAATATAGTmAATAAGTAAATA 

OGTTGCNAAAATGGACTCC^A■ITmAT^mx:N<XNAAAAAGCCATITOAAOAAAC^^ 

NCTGNOnrnTAAAANAAAAAA 

SEO ID NO: 3 1 1 9 A LI 1 i i 1 1 1 1 1 1 1 1 1 n i 1 1 1 1 U 1 n ' N GGTAACTrAATGGATCATCAATnTGT 
CTCACTACCrACAAATGGAATITCATCrrGTTIXX:ATGCTG^ 
TCAT/^TAACCTACATCAAAAGAGAACTAAGCTAACACrOCTCA 
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ATAAATATATQCACTCrANAAT<H:ACAAT(KnTTAGTCACrAAAAAATn>AATG<W 

TGCCCGGCGGOCOTCOAAAAGGGCO 
SEOIDNO' 3120 ACTrrrGATCTCAAATACTrmTGAATTGCGCX;CCOG>£^^ 

nrra^CGACCATOOACATCATCCACGATGCATrrGGATT^ 
^SSS^TTATNClTCTGAACCTrrGN^^ 

AATGCCnTCCGGAAAAATCCCAONOATXXAAAGCTGCNNATrAAAAAACCTrrCTT 

SSS^^fiS^GSSScCNATGNGO™ 

CATCrrGNCn>INCNCCCX}AAANCTCGGACANANACC^ 

TCCANCCANTTONGQCCOTT 

SEO ID NO- 3 ! 21 ACOCGGOOGGOCAGaCGGAGCTTGAGGAAACOGCAOATAAUi « ' • ilL*^^ 
TTGAribATAGAGAr^ 

Stagcatgaog/^^ 

STOQACTAyuJAAAATOTAmAAAAGAAAAATTCAGAG^ 
?^S??r^^i^55^TC^AAAArrAAATCGAAGO^ 
TAAAAGCTTTAAGNGAATAAAAmTTrOAAAGNCAACmTT^^ 
ATTAAAAAANCrrOOAATTCCTNGNCCAA 

5;iJ^S?5oAS^ 

CCCNCACCTGGCGGCOOrrCITA 
SEO ID NO- 3 122 ACAAACCACGGATCTTGTGTCAGAAACACATGTrGAGACKXn'CCATT^ 
ciSAATm^^GAQ^ 
GCTCAlTOACAGCrCT^ 

?^J?SiSI^^CT?^CTSA5L^ 

iS^£^??^iS^AlSA^A^^ 

AS^JrCcXACCnTOGGGGGGaJ^^ 
CTOOGQNCCGOTTCCTAANGGT 
SFO ID NO- 3 1 23 ACATATmOGTTGAAGACACCAGACTGAAGTAAACAGCTGTOCATO^^ 

iJnTA^CA^^ 

NGANGGGOTOOOGOAQANACNCTrGGG 
QTin m NO- 1 1 24 ACCTATTCTrrGCAOTCrCAGGAATTGTrcACATOCTCACCTATCrc 
rCTCCACAATCGGCCTC^GACCAGCy^^ 



SS^a?ctStcattcttoa^^ 

Cm^GCNCTANC^GGCIXKX:ro^AAA^TrNNGCCCGCNACT^ 



CCWmCNNANCnTGGAANNAAATXKKlAGTTrn-CCCCnXXWC^ 



wo 02/29086 PCT/USO 1/307 J2 

TTNAOGGCNANTANNTTT 



*°"*"S^ilJ^ccrCATGAAanTrATrAT«^ 



TTANCCnTAGITNCC 

AGATACCACATTCAAGGCC 
AGATACTCAOCTOCAAGAA 



I5,"^SJSS^^SSSJSS^ONTTOACCCCTOCACCT^ 

CCTTNTCNTTNATTAANAAC 



ocr. m xin. 11 7ft A t - n m n u n L 1 1 II 1 1 1 1 m n 1 i ggatgaaaaanagcctaaacgcitct 

ACTCCACCraAACAAACCAATnTTGAAAAT^^ 

SEQIDNO:3130 ACAOnTCTTGTTGTCrOGCrcGG^ 
TGGTrATGGAGTCGOAC 
CTGGAAGAACAAOGCG 
GTCACCCGGGTAAGTTI 



^^CAACTTCAATGOANATCrnG^ 
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tccaagaccx:anc«^ctnaaaanccacccttaaagcttt 

SEQ ID NO: 3 1 3 1 ACTGOTTCTTAAACAGCCCATAAAAACCCATTiMCCTGAAGCrrATATCrCAG 
GCCTATGCCCATCTTATAGTCTrGGAAOACAAAAGGCrcGTAGAGACAGTXr^ 
OATGCTCTGTAGAGGCCAGQOTGTCrrGAGrrGCTGTAACTCCCAAOCACTGGaCTAGCCTGACTrC 
TGTATCTCCCTACCCCACXXrCrrAAAAAATAAGGGTAACAGCCAATCTATAGTAAAACC^ 
TGCATAGAACGNGGTCAAAATCCrCTGTrrrcATTAAATGTAAAAAGATGCTGTCT^ 
OAATATTrrGGAATrGaGAOAAO<XAmGATTATTATrrraAGTTTCTOTA^^ 
AAGTAAOATOCrrATTCAAATTTAAGAATOAAGQCAACTGAAATATGCArrGrrGTAGTrATTTAT 
AmCAAACTAAAATAAGCAAAAAAAAAAGGCTrGGTTTQANAAAAATCANGGTrAAATrrGATG 
AAACCGATGGTGGGGTTCTCTlTCCATCATCTGGTTmACCCAmCACTCAATAGGTAT^^ 
AACCACTTATmGANGAAAGAAANATCTATGCACAATTrrANATTrTATAAAGGAACAAAnG^ 
GGAAAACTTGAAACT 

SEQ ID NO, 3 1 3 2 A Ci ' l 1 1 1 1 1 IH 1 1 1 1 1 1 1 til 1 1 111 J 1 IN AOGGCTGACCATrTATTGGGACTT 
GACroACCACGGCTTGTAAGGATGGACACAAAACTTCAAGTCnTCGTAATC 
AAAATAAATATTCTAAAAATTATGTCAAACACTAATGAATAAGTGACATrTACAAATAGTTTATAA 
GAGAATCATTTGGOTGAAACAATmCATrrCACAAAAATAAAATAGCTCATATC 
CCCCTNTCCTTAACCANOGGAAAAAAAANCOCACAAAAATAAAGCATGCC^^ 
TGT^AOTr^rAGGGGGGOAAACCCCCACC(XTGGAAAC^yCAAAAG^mTGG G0AGAAT T^ 
TGTTmAATTTGGCCTAAACCCNGGGGGGAGTCANGG GAAA CAAACAGCTTGGTrT^ 
ANGTirAATmAAAACrrTCCCNTTANAACCAAATrOTCCrrm^^ 
CCCCCCAACCT^^C^TITAAGGTITCANCAAAT^mT^^ 
TCCANCNG(XAC>rrrGNCATTAAGGGGCCACCAGOTCCAN>rr^ 
TGCTTTCAAACCX:ATrA 

SEQ ID NO : 3 1 33 A CriT I ' l ITfl 1111 IH H 1 l OGGAATGCAACAACnTATrGAAAGGAAAGTO 
CAATGAAATTTGTltiAAAOCTTAAAAGCOGAAACTrAQACACCCCCCCTCAAGCOCA 
TQCAGAGTGOACTCmCTOGATGTrCTAGTCAOACAGGGTGCGTCC ATCT ^^ 
AAAGATCACCTCTOCTOATCAGGAGGOATO<XnTCCTrATCTrOOATCTIT(^^ 
GGTGTCACn^lGGCTCCACCTCGAGGGTGATGGTCTTACCAGTCAOGGGCCTT 
CA(XTCrGANACGGA0CACANGTGCAGGGNGGAC7riTrTGGATGTTGAAOCANAAAO0<K;C^^ 
ATrmtCACTTmrrCCAACNAANATAAOCnTNGTrOTt^NG^ 
TTGCCTrGACATTCTCAATGGGGG<>CCTCXK}TCCNCTTCGAA^ 

SEQ ID NO; 3 1 34 accaootccxxxttcaccatcctgggaoaaggatogaogacagaggaaagg 

CAOATGCAGCCAGTTCAOATCTrCCCGGGAAACGTGCCnCA(XCCnTGC^ 

aXANGATCrrGGCGCAGAGGGACTGGTACCCATGCACAGCAGCraTGGGGTGTGOGCATCnTC 

GTGGTGTKnGGTAAAATCGTTAAAATAGCTOTATQAAGGAAACCAQTGaQAOQAAATOAAa 

TCAGGATGGTGGGATGTGATITAACAGTGCACATGCTGTTANGQ<nTOTAC^ ^ 

TOOGAOGGTGOTNACAOaXTNA>rrCCCAOCCAAAAT0TTTAAAAAACT<>0TCT^ 

ANGAATNGTr^^TAAGGGATTGCNCAAGTTTAAAGGCCXr^XTrcNAAAANAAATGAAAAAA^ 

GCCCnrrrcAAAAACTCTITIXXXnGCGAGGGTCNCA^ 

TATT^TrcccCAAT^^f^^mAAAATTGl^axAAAACccrGGAm 

NANATTAAAACCCANAAGTrGOCCANCCrrmAKAKrmGGGGAACAANAAAAAAOCT^ 
CCnTGGGANANNNTGOGTT 

SEQ ID NO; 3 1 3 5 acgotogggaccgaccttcagcagggctgtggctaccatgttct^^ 

GGTXnCCCTGGGCrGTCGGCCTGOACCrrGCAG<XXKAATGGATr^ 

TTOAAAQATATCACCAGGAGACTAAAGTCCATCAAAAACATCCAaAAAATTACCAAGTCTATGAA 

AATGGTAGCCGGCAGCAAAATATCCCGAANCTGAGAGAGAGCTXiAACCAGCTCGAATATATrGGA 

TrOGOATCTTAANCTCTOTATGAAAAAACTOATATa^^OGOGCCTG AAAC^ 

TATCOTGTGTXXntiAGATajAGGACKnGGTGGGCTATCAmCTI^^ 

GCGANGGT^NT^ACCTTAACAOCAGCTGGGAAAAGAAGTT^r^CCTraTOGAATrc 

GGAGGCATCTTTATAGGACTCATTCTOAaiAGTTITrGGGOCTTC^ 

ACTTirGGANATNCOCAOTCATrGCCTroAATTACTAAATCTGOrmAAri^ 

TTCTTTATAAATTAAGNCraNCATCnXnTTANACANAAAAAACCCATTT^ 

AOTOCTGACNGCATGAT 

SEQ ID NO; 3 1 36 ACCCATTCAOCTAAGAAGCAGAACrrnTrCACAGCATTGTAATAGGAATOG 
CAGGGCAGAGGaAAAOaAAAGTAGTGGGAGAACAACTroCTTrrAAOATACCCTOGOTOACAOO 
TATCAAAGCCTTGGGCAAAACCTGTCACAGTAAAGCTGTGACTCTCC^ 



482 



wo 02/29086 



PCT/USO 1/30732 



NCA1TANCCCTGAA00AACTTC 



S^?SS^^(^cacSSSaa 



CTAAOGCCGAA 
TCCTTTGGGCrOO 
AAAACCNGGGAAC 
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TTTKKJACCACAAKHMAACAnGCraAAGMCTOAGCATTCrCAATGGAC^ 
CCrrtWCAATOAAATmACNGCmCCTAOAAAANAAmAAGTTTTO 
SEO ID NO- 3 1 42 ACOCOGGGCnCACOTGAAGmATATTmATCCTACCAGGCTTOaA^ 

ANGATTCATCTTTCirm^ACCXiNAAGTGGOCmACTO 
SEO ID NO* 3143 ACACaKKm]OCAnAAOGGGTGAAGATGTCCCCCTTACGGAG<>G^^^ 

S^^^SI^SSJSSgggSCt^^ 
?JJ?SACTOcroJSbccnCTAr^^ 

SEO ID NO- 3 144 AOCTCrrrrrn^TTAAGAATTCTGCCTGGAAGTITAGGTCA^^ 
TrrC/U^GAGT 

cm m Nn- 1 45 a ccCCTGCrenriXjCTrrrGGTAATGTGATGTraATOTTC^^ 
ArcriSoXnTOA^S^CAS^ 

^^SSgtcggtcag^tcg^ 
ctgt^Jcwaaatggtg^ 

SS^^TOOCACC^CTCTGAANCCC^ 

gccocqacaccctt 

qpn ID NO- 3 146 acagaatcgcacagggaatocatatgaaoagoaagccaacaagcagtcatg 

S^S^S55IJS??^S^AAAOCAGCmACACAAATA^^^ 

nttttccnaggaccitgcca 

SEO ID NO- 3 147 ACAAATCTAGGCAATAAACCGTTt^CCTGGGATCAGTTCAGAJ^ 

a5SagataatccajSj^ 

S5S^^SSi?S?ScrATTTC^^ 

GAGONTGAANC^rGCT^mGCT^AC^CTAANNGCTAAANCCCTT^^ 

ttaaccac 

SEQ ID NO: 3148 AOTGAGAOGGTO^CCAAGCGTOGATC^^ 

5???SSSSg2aggat^^ 

^S^S^SSa^^CAAAGOT 
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CTGAAGANCTOCCCCOO 

SEO ID NO: 3 149 ACCCGAOnrCTATOCnGOOGGTCTTCCTCACTCTCCTCTra 
AGACITCAAAAAACCAACGAAGTCCmCAATGTGAGTAAAOGAAACAGC^^ 
TTmACAACTaATTATTCAAOOAATGCACAOTAGCCACAOTTCA^ 
CATTAAAAAGAAGAGTAGOGAGGGTCACTGCrrATTAAAAACAAAAG^^GOACA^ 
TCAj^J^AACAOTtKKJCAOAAANCTCTGCaXXn^ 

SJ^ggggccSJSggatgtxk^ 

ATOCCTCGCTCTTmNCTACAACAAGAAACTGGATTrCTAAAAOCTTr^ 
GGGGTGAGTGGGGG^XTOATCT^OGNAGANC^^^mTGTTCCCAATTAAAGAA 

SEO ID NO- 3 1 50 ACGCGGGGGAGCAGAOAA0AOOONGAANGCANCATCTrGCCrrXK}ATCA^ 
Ac£o<3A(iolTAATWGCCC^ 
GGTT 

SEQIDNO:3151 ACAAAAAACACAAGGAATACAACCCAATAGv^u^^ 
TCA0AAGCAAA0GCCTX3AGTGTCT^^CTCAA0CGTCK1^^ 
GAAAAGGATCCGCrACTCAAAAACCAAGAATTTAAAGGAOTTIXTr^ 

XXocrcicrmcAGTCCCATrGAT^^ 

wSATTCTTAAATAOATAmNGGTITGGGGAAAGTTGAAT^ 

NAGATCGGQAGAGGGATTATACTGCAGGCAGCTTCACCATOTTGGGAAmGATAAAACCACrrA 
CraNOTCCCOAOAOTAAGGAGANAAACTACTATraATTANAA^^^ 

AAGCOOOATACITCACTITCATTCAACmGATGCATAAGCCCATOTAGNCAKriCTAAANCA^ 

S^agctaciSStcc^ 

NNTGATGTGCCCACTT 

SEO ID NO- 3 152 ACGCGGCTIXnTGTTCTAATCrrGTCAACCAGTGCAAOTGACCOAC^^ 
^AJ^AmArrrCCAAAATOTTIW 

TTGCrGrnXACCAAATACAATTCAAATGCrnTrGTmATmrrrA^^ 

CTCTSC^TOOGTOCTATAATAAATAAACrrrCAACACTUmATG^^ 

TTTCAATCXrrANCCCATCraCAGAGCAATrOACTOTGC^ 

TTCTATCAGTCCCAAAAGATOAAAAAAA 

I^^CrcCCGGAAGAATGGTTAAAAAAAAAAGGGTNATGC^ 
CCNCAATTATTTTNAlWnKKiATATaiACTtXTmNAAAAGNGCT^ 

sss^^IAG^cS3ATv^ 

NGGCCOOCCONTOO 

SEQ ID NO: 3153 ACAOTCATCAAATOTTOTOCCATCAAATACTAOA^ 
%TITGAGACAGGGTCrrGCICIGTCACOCAGGCTXKl^^ 

ISircStrcrccrAGGTTCAAGTO 

CATOCri^STCCCAGGCrAATmATATrm 

(KK:AN0TCrCGAAACTCCTOGCC7rrCAGATCATrCTGGNCAA<XX}TANCC^^ 

SEQ no NO: 3154 acaaaocagcaactgcaatactcaaggttaaaacaya^^ 

TOACAGGTATATTACACyrATTATCAAAATATrACATmC^^^ 

CAGAOCTrAAATOTTAAATTATrTCCATAGTCTrAAAAAATATGTAATGTCAGAAATGC^ 
AAAAAAA^^ 

JS^^ctgcStojictgtxxt^ 

oSjoSjTOCrmGGNCOGG 

ttccgacctcogtacna 

SEO ID NO- 3 1 55 ACATAAAGTTTTATTAATATTOXiA-nCTCGTGTCATAGCrmA-rGGCATO 
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TAAAACCnXKXJGCATTTTCKKXnNGGTATrrCTACANOCCNAAGG^ 

CCGGNACNACCCTAAGGCG 

SEO ID NO- 3 1 56 ACCAAAGGGCAAACCCACCACTATOGCTTmX}ATGGCnTACAJ^ 
Gi^STAGCmAAGA^ 

^SNTrfAASOTCTTCT^ 

5£jJrrAc5v^cavCT<^ 

oSoiAA^SiTivGi^ 

rrriNi 1 1 i ocatoaata 

SEQ ID NO: 3157 ACITGTKn^AArcAAACC^TC>^^ 

CKn-ACACTAGGGaATCTGAAACTTOTGGAAAAOCCGTCTCCTTrGACTCTT^^ 

GCAA^^ATT/uCScTAACGTCAAAGTA^^ 

SaSnqSatatSaac^^ 

i^AAr?G^rTA>STCrmOANAA^ 
TTAACXIATAATGTCCCTNa 

SEO ID NO- 3 1 58 ACGCGGGTrAOCCTATAAATCAATAACAGAAAGACAACTGGAj^a^^ 
ii^AAAVG^ATnAAAAACACACTrCTAAATW 

^^^^^^uS^^STSAAA^^ 

iJGS?crSAATOAATATGCT 

CAAAATTAGCAOAAGAAAATNCATTATTAGAATTAOAGCNGAAATCAATGA 
SEO ID NO- 3 1 59 ACGOGGOGCTCCTCTnXTrTCTaXKXATC^ 

^^%^^ATJSrm^ScAAOAcmx:AGGA-^^^ 

COTCCCATTCCCCAGTGGATTCGGATOAAAACTOaAAA-rrAAATCAGGGACAG^ 

oataStatoo/uuktaaaatc 

?^iSJT?^TGAAOTATAAlTrraATmATAC(n^^ 
TT-rrisrrrGGATGOGATGACCCTAAGOTTGAAAAGTTTrr^ 

TTAAAATOCAATCCCTrrcGCCTrCCCCGNG<XnXX:AAANGO-rTCT^ 
S^^AAmSTrrSAGA^ 

AANTOTAACC 

SEO ID NO' 3 160 ACAAAG<>GCAACTGCAATACrcAAGGTTAAAACATTAGAAAA^^ 
TCACAGOTATATTA^ 

So^^iAArTC^mAATTATncca^ 

AJU^mATfflJwAAGOQAACCrA^ 

ATCrCTNANAATTGTTGAAATAAGT 
<;PO m NO- 3161 ACTTXXXXXTTCCCCAGAAAAGiXGGACTTGCTGCTAAOOOTOAAOOA^^ 
cS^^ATTAjicACCAAGOGAAGGCTG^ 
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AGATTCAAA7UTAmX3CX:AAOATrGAAAGGAGAAAAGTGGTrGAGGGATrrOAGGGAAC^ 
NAAAANAGTAAAAAAAAOCTOmACCAAATTTOAAATTGGTGAGATGTrTCrrGOGCTCGTCGG 
TCrNAAGAOCTGAIWTCNTANGTGGATCTrTrcANOGAGCAAATANCAGGAOT^ 
TCTCOCAANOGANOTNCCCCNATCCCAOTNATOOCACCA^ 

ACACNAAACANGCrmmjTCANCACATrNTNTG 1 U N i 1 1 1 ACCTaNTOCAOCGOaCTTATTCCC 

AAAANAAGATTNATXXCOCCCmCTn-GNCNGCNANCACCCTONNGGNGAAAT^ 

NGGOCCGTTTNTANTGNATTCCA 

SEO ID NO- 3 1 62 ACCCAGTAAAAACCANAATOAC(XATrGCCAG<l^CX]CATCAAAOTTGACrm 
GTGATCCCTAAAGAAACrrrccrrrOOAGACAAAOATOCOAAAATXXAAGGNGACCCTCCTOG^ 
GGTGGCCmorrAGGGTrAATATTTCyU^CAOAACCGACGOTOACAAATrAAAGCGAGCCACCCA 
TATTGAAGTTCTOTCNAATACAriTCAATrcACrrrATOAACCCGAKAAATGGGGTaTGANTGC^ 
ATNAAANAA>^^OGTTmKmTCANCAAGTGTGGGC<lATACNGGATA^^TC^ 
TTAANTGAAATrCTGGATXKKjGAACCCCrCCATAnOCACATAAAANTATAAOTTTACTOTOQGTC 
CTOATATGCrcTCTraCrO^AAGAAATOmiCTATITNGGAm-AAAAAAAC^ 
Nil IM 1 m 1 1 IN CATTrNAATANCCGGmTITIXMOCCCNGNTOAAAAAAAAAACCNui 1 1 n iC 
AATCNTATAAAAAC^ATCCCAATAAAGGCAAAAAAAAANOAGGCCNCAGATGGC^mAT^^^ 
TmATAACTGNNQQaGGGAAANANArrATrmTnTNANCANCGA>mmXiNAAGANGATAC^ 
TCCTCAAAAAGOAAGAN 

SEO ID NO: 3 163 ACCCTCA(XCNCTCCGCCCTCCCX:iCrCCTAGATGGTCACC^ 

AGTrrAATCA<nTGAGCACaOAQCrrCAGGOCGCTGGCACX;CAGACTATGATGOAGAGAAGGAAA 

GAGCAACAAACAAACAGGCACGTTGGTAGTTGOCTACAOCCTGGGTATAAGCCCATGOC^ 

TIXXXXXXlAQACTOGOaATCTCCCGGCAACroAGTAAGGGATGGOO^ 

ATAGNGAACnifcATCrCCTTGCCTrUCAANAACATNAAAQAGGTCCTCraTI^^ 

ATTXTrrrrCATrOGCTAAATOTAGGAAACAGGCAAAAGCCAQAGGTArnjGAG^^ 

ATOACAGGGGGCAGGCTCCTtrraCAGOTCCTTCGNOACKXXXJCrAACArhnT^ 

mGGCAAOGGCNCGTITITrrNC>ACTKXX;CTGNGCTn^^ 

AGCGCTTNCnCAAQATCCACATOCNCTrGCTmANTTrrrrCAT^^ 

SEO ID NO- 3164 GGTACAAQATTTACCAGAAAOAGAOTOQTOTOTTGA CATGCCT GGAGCAGAC 
ACmCGAGCCGCTGACAGAAGGTOAAGCAGTCCAAGAAAATGTGGAAACTTrrCCGCTGCT^ 
CACAGTCCACAAACCTGTCCATTTrArrrCGrrcAAGCTrTGTC^ 

TCAAAOTAAGTTATCTCAGCCACATATGGGGAGTGGATGCTGCTGAATrGTGATrAATTGOGOQA 

G<X:ATATAOGTAmCAG0CACGCTGCCrCCT0GTAACAGCTATGCAGGGAGGGAGGACCCACAC 

TCCTACAmCTGATCXXXTn'GGTTrrACTACCCAAATCTAAATAOATACTmGATAAT^ 

CTOCTCTTrTACTAAGACATAGTXnCTACCTATAGAAATGTATTTTC 

CAATmOTATCCATrTAAACTAACCTmATCAATAAAOCanTrGGTTA^^ 

SEO ID NO: 3 165 ACC VCrOTGATAAACAATGAAAAGCTOTrCICCATQTCCTXKCATTGACAO^ 
CAOOTCCAOCAAGGCTGAATACCIXTTTrrGAACCCCTrc 

GCACTAGTAGCGGCAGCAGCCCATXXTTaACOTAOACATATO<KnTCAATAT^^ 

AAGTCTATrATCCACTCTmKTTCAATCCXAAGAACTAAAGTGCAGGCAGTGA^ 

AGTTCATCACTGCTITCACATGCCAACAAAATAGCrTCGTGGGAAAGATCTGCTAT^ 

AAA(mnTrGATAAGTGTGTT0CATGOTCTATCK3AGGTATX>TGGAACTCCACAT^ 

TCnXKnrATCAnATAGCAG<XfAATAAnCCAA^ 

AOCAACXKa^JGTAAGAACCnTSGACrGAAATGOCTrmONCOOGOQAOTroaGCATGGGGTOC^ 
ATAAAATGGGCrrrrGGGAATGGTNCAAAGTNGGGAGAAATGGNGAAATGCTGCC^^ 

SEO ID NO- 3166 OGTACTCTOTAAAAGTrnXX^OACAAAATANAAAGTATCATAATCCQTCAG 
^^GTATGAKrrAAAACTGGAATCCTCTGTATnrn-AAATGr^^ 

TATTAATCArrCAOTArrACTAATGGAATAGAAATTCATACrmGTATOOACAACAATTCA/J^ 

GATATTGCAmATAGCACTGTAAGAAACTTrCATCTrOAQ(^\ACTnGTAGATGATGGGT^ 

ATTrrCAATCGCCATATTTTGATCACrCATTCAAAATrGGCCCC^ 

TCTAAAAACCrOACAGTOANACACAACGTTOCTOAAOTOTGAGGGTGTNCCCANGAAA^ 

AANACAG>K3AATACmAACACATTAAATAQAAAAAAAATGTrrrrrraTTITGNC^^ 

GGONAAATAJUAAAOCNTmrnrriTnTrANGACAAAATCNCAAAAAAAAAAANAANAAAAAA 

AANTNCCTGCCCN0GGCGOGGCNCmAATNaOCTAAATATC^WCANANT^ 

AOON'NC 

SEO ID NO: 3 1 67 ACG(XiGGGGACTTTGTTCAATACAAACTGGCAOAGAAATTTGTGAAACAi^ 
OjAGGAGGAAGTGGCCrcTO^Oa^TOAAGCAGCATroaXA^^ 
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GGGAGCTCCACC(XAGAGTGG<K}GACCTCArrCriTGCTCATCTACATAAGAAGTGTC^ 
TTCCTITCTATCCCACTrrOU^OOAOGGAATGGCTITGGAAGACTATCA 

AAGTAAAOOATTCCAAAGTGGAOCANCAAGACAACTTrTCTAAAACOCATOTCAOOOATGATCC^ 
TCItrrACQCntiCTATCATa:AGCrCCGGTGGCCATATGOAAACCaACAGCAGATTCACWr^^ 
CTTAAATCATCGATCGCGCTtKirrTGaCAAAAATCrrAAAAATaOAOCCCTTOTC^ 
GCCACOC Ci ' lUl ICI 1 I GANTIXXn'GGAGGGGTGTNGGGAATGC CQ^Tr NTNAAACAATAACCAAO 
NTTAATmrTTGaAANANCTTTATThnTn-CNAAAAANGOACrACl i 1 1 CCCAAAAATTT AAAG 

SBO ID NO* 3168 OCTAOiCXHjGGCTGAACGOGAAQCTCACTOOCATOGCCT^ 
TXjCCAACGTXnX:AOTGGTGGACCrGACCTGCCGTCrrAGAAAAACCTGC^^ 
AQAAQGTGGTGAAGCAGOCOTtXWAOOOCCCCCTCAAGGGCATOTGGGCTACACTGAG^ 
GGTCOTCTCCTCrOACITCAACAGCGACACCCACrCCrNCAC^^ 
CCTCAACOACCACITrGrTCAAOCTCATnCCrGOTATGACA^ 
GCnXSGACXnWATGGCCCACATGGCCTOCAAGGAOTNAOAOCCCTG^^ 
OCACAAOAGaAAGAGAGAGACCCTCACTGCrGGGGAAGTCCCTaCCACACICAGTCCC^ 
ACTOAATCrcCrrrCTNCAGTnKXIATGTrAQACCCCTTNAAOAAGGGGAGGGG^ 
CANCTTOTTATnANCTTGNCCXXMNGGNCGCICTAAAGGGGGAArrcCACA^ 

SEO ID NO: 3 1 69 ACGOGGGGATCACAGTTrAGAGGCACAGTGGACCAAGTGOAAGGCGATGCA 
CAACAGATTATACGGCATOAATOAAOAAGGATGOAOGAGAGCAGTGTGGGAGAAGAACATGAAG 
ATGATTGAACrOCACAATCAOGAATACAGGGAAGGGAAACACAGCTTC^ 
CCTTraOAGACATQACCAGTQAAOAATTCAGGCAGGTGATGAATGGCTTT^^ 
AGGAAGGOGAAAOTGITCCAGGAACCTTmnTrrATGAOGCaJCCAGATCraTGOA^^ 
GAAAOOCTACGTOACTOCTOTGAAGAATCAGGGTCAAOTGTGOTICrrOTTGGGti 1 1 1 i^GTGC 
TACTGGTGCrcnGAAGGACAOATGTItrGOAAAACTOOOAOGCITATCTCACrQAGTGAGW 
ATCTNOTAGACTGCTCmGGCCTCAAGGCAATGAAAGGCTGCANTOGTGGNOT 
TCJmrCNNONrrarrCAOGATOATIXXIANGCCXnXKiAC^ 

SEO ID NO- 3170 GOTACATCATGGCTCGGACTrOGOTCAAGCTCITGGCACEAATGTCCT 
GAGTGrrGGATOCCAGCAATCAOGTAAOGGACAAATrrGTGGATTGACXXriTrGTCCT 
ACCANACACTCCCrOOGCX:ACrrrGATTTrcTCAGCTTCACTGAAA^^ 
CrrGCTTXntxrATGGCATCGAGAGAAOCC^TAOOGOGATATrTCT^ 

GAAGTATTCACCAGGGGCCTCAGTGGTGGCAGCCAGGAGAGAGCCCATCATGACTGTOGAGGCrc 
CCCGCGTACCAGAAACATTATCNTrrATTGnACTTGCrrnTAAAL-l-l-lUl 1 1 AGCCACTT AAAAT 
C7G^^^^A7GG^>CAAT^^GCCTCAAAATCCA^rcCAAAGT^GNATATT^ 
AAATTACTATTTACCCAANAT^ITATATTA^OTATAAAAA^TOT^«Xr^TCCCW 
nAATGGNCNA>nTTCAATCXX:ACrrGGCCGNNCCGGOTCTTATTaGNTr^ 

SEQ ID NO: 3171 ACACTTGAAACCAAATTTCTAAAACTrOTITrTCTrAAAAAATAGTT^^ 
ACATTAAACCATAACCTAATCAOTOTOTreACTATGCTTCCACAC TAGCCAGTCTrC TCACA^ 
KTGGTTTCAAGTCrrCAAGGCCroACAGACAaAAGGGCTTGGAGA 11 1 1 i i i iv-i i tACAATTCAG 
TCTTCAOCAACTrOAGAGCTTTCnXIATGTrGTCAAGCAACAGAGCTGTATCTGCAG^ 
CATAGAGACGOTrTCAATATmCCAGTGATATaKKTCTAACTGTCAOAGATOGGTCAACAA^^ 
ATAAlXXnxJOGGACATACTGGCCATCAGGAGAAAGffrGTTrcTCAGTTGTrrCATAA^^ 
AGGAGGACAAACTGCTCTXKXAATrrCTOOAmcmATriTCAOC^ 
GACTGTGTGGGCACrCATa^AGTGGATGAAATAATCATCAAGGOTITGOTGGCTTGTCI^ 
TATATAGAAGCTTCTrCATATOTCTGAOTCCAGATGAAGTNGGT 

SEO ID NO- 3 172 ACGCGGGCTirrrCGTCCTmOCCCOOTTOCTCCTrOCTGTC 
CKm3ATACGTGQGTGAGAAAGGTCCTOnXXXKXKX>0AGCCCA0CGCGC^ 
CGGAAAArroAGGAAATX>AOaACTrCCTCCrCA<>GCCCGAaiAAAGGAT^^ 
OATCAAOAAAAATAAGGACAACXrrGAAGTTTAAAGTrCGATGCAGCAOATACCnTACACC^ 
TCATCACTOACAAAOAGAAGGCAGAGAAACTCAAGCAGTCCCTCnXXXXCGGTITG^ 
GOAACTGAAATGAACCAQACACACTGATrOQAACTOTATrATATrAAAATACTAAAAATCCTAN^ 
NAAAAAATAAAAATTAAATANAGGTTCC 

SEO ID NO: 3 1 73 ACTC AAAACCATAGCrAGCCAACAGAACrrAAAAAAAATA^CrrTCCAA-^ 
ATAATACATGCATTGCCCAAAAGAGTAGGATAOCCACATrAOACTrACAAGACTrACAGCA^ 
CACCAGAAAAAAACTGCTACACAGCAAGTCrGTrAAGTATGCrarnSGTAAAAAAAACrOC^ 
i^TCTTACCCACACTKTOTAAAAAAAAOAATTraCAATTO 

TCTGTATAAAATrACAACAAAACCAATAAATITCAAGCACXAATrCAATAATCTAAA<^A^^^ 
TOTAmCrcrrAmCIGAOGOAACAAATTATGATOAAGOC^ 
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ATCNCTGOTAONNAANAAACGNGn 1 1 1 lAAT 
GAAOTTGCOGCOOTACC 

OAGOOACTCTTOATrGGGAGAACTrAGGACT 

CTTCCCATCCANGGCnTAGTCAATO 

TnTGNAATrTCATTTGAATrTNT 
err. m vrri. 1 1 7R X cnTl H 1 1 ill fll m Ut l m I QOGONOGOCTGTnrCTOAATrGTAGi^ 

^^^^^ 

AOGnTTTTTO 
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^^mAAACTGNGTACTACCT^ACAACCAGNCCr^Cn■GTOTTATAAAGCTGGTAOTATAC^^ 

NAAOOCTOCTOGCTOTGACTIXXATACAOAGACATGCXMOTGTraOGT^ 

TTTTGACCTAG<XXMCGGGA^r^AAC:AG^^TmACCGCACAC»^ 

AAATTTAT 

SEQ ID NO: 3 1 85 ACAGTTGnTGGCCTACCrGATGCTATCTCTAAACTACTnTAAAATGAAGAC 
ATITOGGTGTrrCATGTt>GTGAATrATCTTTTCTCrrGGAC^ 
TAGTACC 

SEQ ID NO: 3 1 86 COAOOTACTmGTOCATAOAGTOCOGCATrATCACAAATCTCGG ATTITAAG 
AGCrCCAATAAGCCOGCTTCCnrTrCATCATCAGTITGTAAAAGCTTATTATCCAATGT^ 
QTATGAAAArTATTCATaV^GTrrCCATATTATCITCAAAAAATTCAGGGAGATC 
AAACTArAOAACAATrrrOAGATCAQQATCAOGCCaXXJT 

SEQ ID NO: 3 1 87 ACAGATAAAAGATGTAAATCrGGAGGTrACGGCCAAGCCAGTTCCATTAAAT 
TCAOOAGTCAGATTCCAGOmATOTANAAGTrTCTAAAATGAAA ATCAA TGTTACTGAAATTC^ 
GACACATTGCGTOAAGATCAAATOANAGACAAACTAGACCTGAGCrrr^^ 
AGGCGGANAGOT0OACCGCGT0GACTATGACAGACAGTCCGGGAGTGCA>nX>TCAC0TIT^ 
AGArrGOAGTGOCTGACAAOATrrroAAAAAGAAAQAATACOCTCTTTATA^ 
CATAQAGTTACTGTTTCTCCATACACAGAAATACACTTGAAAAAGTATCANATATITTCAGGAACA 
TCTAAGAGOACAGTGCTTCTGACAGOAATGOAAGGCATrCAAATGGATGAAQAAATrCT^ 
TTTAATTAACATTX^CTTTCAACGOGCCAAGAATOGAGGTOGAGAANTANATGTGC^^ 
TCTAGGTCAACCCTACATAGCATCTTTGAAGAATAGACTTAACAGAATCATGAAAACr^^ 
TACCCCGATTCTOTAATGTraACAAAAATAATATCirncmAAAAAGAAACTTA^ 
NrmTTANANCAAA 

SEQ ID NO: 3 1 88 GGTACAAAACCAAATGTTTOTTACTATAACnTCroCATCACAATrAAAATCCA 
AACAOTTTrrTAAAAACAGTCAACT(>ATCAAAACCCACTACTTCA0AATC^ 
G<XACAarAACACrTAAATATGGTTAAGACTCOAATGCAOAAATTTG CrTGGT rGGAAAGCrAAT 
TAAACTTCXAACrroCTCAAATAQAATTACAAAAAGOCAAAATTGTOTTrr^ 
CX:ACTGGAATCACCAACACTGGACAGCTGTTAGAGTATrrAGAGTCCnT3AGATAACAAGGAAT^ 
AGGCATCCmAQACAGTCTTCTGrrOTanTIXriTCCAATCAGAGArr^ 
CACCACCACXAGCAATTOTAGCCTTaATGAGAGAATNCAATTCT^ 

SEQ ID NO: 3 1 89 ACrACrrGGTTrcCGATATOOATGATGAAGAAGOAGAAGGAGAAGAAGATG 
ATGATGATQATGAAOAGGAOGAAGGATTAGAAGATATTGACGAAGAAGGOOATGAGGATGAAGG 
TGAAGAAGArGAAGATGATGATGAAOGGOAGOAAGGAOAGGAGOATGAAOGAOAAOATQACTA 
AATAGAACACTGATGGATnXAACCrrCCl Mill iAAATTTTCTCCAGTtX XrrGCG AGCAAGTTOC 
AGT C r n TrTr i TrT r TTCCCTCTTOTOCTCAOTCOCCCTOTTCnGAOOTC^^ 
GGTTXn<:AATTrATTTOGGGGGAAATAOCTTGAGCAGAATACAATXK3GAAAAG 
TrCTGTKX3AAGTrCATTmATCCCIT0CT0TC^ 
NCrCTCTGGGAAAAAAGAAAAAOCTOCTCCCITCCTCTGCT 
TQTOTACTAGTGCATAAAATTCTAOCrrTTTTCCTCCnTCTTGGATATt^ 
CCCCNACCACNCTAAGGCGAATTCCACNCCCTGCOOCCOTACTANTQ<^ 
CTTGGGTNT 

SEQ ID NO: 3 1 90 QGTACGAGOACATTrrGCCCOGCGGCTlxnTCGGGTCTCCTITAOCCATC^ 
ACAGATXXXKXiT0CACCTGOTGCGAGCnTTCCTCAGAGTCCCGCAOAG<^ 
AGACGCirCCCTCACCGGGCnXXKXXKGAACTGGTITATOTCTAACGCAAT^ 
TTTGCAOAGTTCTCCOCGCCACNGNCCCCOCOT 

SEQ ID NO: 3 1 9 1 ACATAAAGTAACTGGTATATGTGCACAAGCATATTOCAl 1 1 1 1 U I'l 11 lAAC 
TAAACAGOCAATGGTATOTmXUTTGACATCAAGTGGAOACGGGATCGGQAAAAAT^ 
TGTOAAAATACCOCCTITCTCCATrAGTGOCATOCrCATTCAACTCTrAT^^ 
TTArmOCTCrCACTOTITrAACAAAAAAAAACAACAACATAAAAATOCnW^ 
ATTOOAOAATTTTAATOGTriTCAmATCATrcNAAAACCCAAOGACAATmATACrr^^ 

c 

SEQ ID NO: 3 192 ACGCOGGGGGG0AT0GCTrGGTAGTaGACTnCTOGaOTTrGCCTGTTACX3C 
CAGACTCGGACTrCTAAOCTTrAAOTGTGOCCCAOOAOGTITCriCT 
(XAAGAAGTCCCAGGGCAGOXJAGGCCAGCCCrGCCTGGOTIXKIAGAAAC^ 
AOTCTACTCAGTGCCTXXnGAAGCCACCCTCAGCCCTTCACAGGCCTGAACCAOTAG^ 
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GGGCCA0OTAA0CCCrAQAGCCrTGAA(XAG0AATATCCAGGAAaAaGAAATTCC^^ 

CCAOATOOTATrOCACKnTCGCTCKXrra(XrriXOGGOAGCOT^^ 

CCACATCATTXnCICraAGCAOAGOAOCAGQAATOCCTCAAGCAGCAACCrGGT^ 

GGCAGATGCAAATAGTTriTOCTGnATrAATOAAAQTAATTACTAAATGCACTrAAACX:AA 

NOAANG^TCX)AAA00ATOQA0CTA0AAA<KrrCAAAAT0OOCCAAACAA00<rrOTaAC^ 

AAAGACANOC3CIWTACmATCCTCCANGGAGCTTCGANACCATCCA^ 

OTTANOQAANAATTrNTTT 

SEQ ID NO: 3 1 93 ACOCGGCKJOAGTCAGAGQAGOOOCCCOATOTOCTCCGOTOOCTGOACCQAC 
AACTCATCCGACTCTGTCAAAAGTmjGACAGTATAACAAAGAAGACr AG GTTAT 
CAOATrCCTrrraCTATATCCTCAGTITATGTTOCATCTOAGAAGATCTCCAr^^ 
AACAACAGTCCTGATGAGTCGTCATATTACAGACATCArrrrGCCCGGCAGCACCTGACCCAGTCC 
CTCATCATGATa:AGCCCATTCrCTACTCTrACTCCTriX:ATG<^^ 

SEQ ID NO: 3 1 94 ACAATOTATGAAGCTAOOTGGGATTAAAACrOTCTCCAOTAnTACCCAATA 

cx:agatacacaaaagacattaatcaatagatcctaccaa<totaaoccctccaaaaaaaaaaaaa 

aaaaaaaaaggt(xx:aagaaggatgtgggaaacacritrgcatcacccaacaagctgj^ 

tgccaaogcccacgaoggctatxjtatgtcaaaaaggatottonttgtggc^ 

aacitctoaaacatgtgagagaaacccataaagaggaaatactatgtgaagtatgccggaa^ 

atttaaacracaaagan'acctraaocaacacatoaaaactcatgccccaaaaaoooatotatotc 

gctgtccaaoanaaggcngtgoaagaacxtatcaactgtgtrtaatcrccaa^ 

cttccataagoaaaoaxxkxttittoototgaacatcctgcc^^ 

aacaaagtctcactaggcatgcrgttgtcx: 

seq id no: 3 1 95 cgagctacagatitctracatgcatatatrgcatcctggtgaactgggggca 

GTTCCTrrroATraCTOAOTAQTATTCCATGOTATGaATOTACAGGTCCAOGGTATAGCTrC^ 
ACTATGGATTrCCTACTTATCOTGGGATTACTTrOCATCCTOGAACrACT 

GAAGCATGGAACCATGGACACTGAATCTAAAAAGGACCCTGAAGCTTGTOACAAAAGTGATGAC 

AAAAACACTGTAAACCTCTITGOGAAAOrrATTGAAACCACAQAOCAAaATCAGQAOCCCAOCOA 

GGaV^CCGTTGGGAATOGTGAGGTCACTCTAACGTATGCAACAOGAACAAAAGAAGAGAGT^ 

GGAOTTCAGGATAACCTCirrCTAGAOAAOOCTATOCAOmGCAAAOAGGCATGCCAATGCCCT 

TnTCACrACGOTCTGACAGGAGATOTGAAAGATGCTOCrGGCCOTCCAGCO^ 

GTCCAGGATGAGAATGGGGACA<nGTCTrACACrrAACAATCATCCCCITCANTCTC 

GGATCTACTAGAAGTCACATrGGTrrOATrCraATGACATATCACATGAOAAATOATCTO^ 

CCGGCXKSCCGTrCGAAAGGCGAATrCT 

SEQ ID NO: 3 1 96 CGTACCTGCACGTCTCATCGTmCTGCCGAAG(>AACACTCTACGAAATCAT 
CATATTCTATCITCCACTCnTrCTCTGCCCGAGTATAACOGATT^ 
TirrrCAAAAGCATGGCATCGACCAGCCATCTOTAGGGCKnTCA^ 
TCaATCTAT0TTAAGQCCGAACCrrmnW3AT0TCCAA0AAAOO<:yLTGGCXX}ATG 
TOACTCTIXnXTGGOCXjCOGCTTCAGAAOGACTCCCGCGT 

SEQ ID NO: 3 1 97 GGTACCaTTAAGCCGTGGAAGGAGAGTTTCTGGATCAATTAGAGTGAGTTr 
TCCTAGACATTCAGC/J^.CAACATTTCTGGTTCCTIXXnxrroCACA^ 

GCCCAGATGTmCAACATATGGTITAAGG<XCACCACTGATGCAGAGCTAATAAmOCrr^ 

GAATOAAQTAAAAGATACTOanTITOOOTTOAmGTTATTrCTTGCAGGA^ 

TCAGGAAGGTTCCCCACACrAATGCTGCCTAATGCATAGGATGCAGCrcATrTGACl lUl ICACTA 

OQAOATGAGAAACOTCTAOTATrACAGATmAGTrCCAACTGTCCAmAAGTCAATATGATGC 

CCAACTICI<X:AAGAGAAAGTAGAGCrAAGAGACGAATOOAATCTGTAQACCTrGA 

ATCTTOAATAAACTGACCrACTACAGCTOGTCCCttmAGGGCATGCTCXIAGTAAGGGGC^ 

CACATnxaaCAATGGAATAATAAGACTGCrrATGAQTAAGAGCTOTOCTCTraAaAGTAA^ 

ACCAGTCAGCATGaK:AACAAATCATaTATCCTAAATrATTCOTCC:AGTGAC^ 

AAAAAOTTACNTaGCACrAAANCOCCCCC 

SEQ ID NO: 3 1 98 ACACGAGAAGCTCCaA0GATOG<nX3AAOT(X:AAOaTCTCT0AT0C^ 
CAGAGCACCCaTATCATTTATOGAGGCTXTGTGACTGGGGCAACCTOCAAGGAGCTGGi^^ 
OCCTtlATOTOOATOOCTTCCTrGTGOOTGOTGCTTOCCTCAAGCrc 
TGCCAAACAATGAOCCCCATCCATCTTCCCTAOXTrCCTCCC^ 
AAGCCCAGTAACrcCCCnTCCCrGCATATGCTTCTCATGGrTGTC^ 
ATCCAAACTCTATCTItrTTrACTGTTrATATCTrCACCX^ 
TltnXXACTTACTATAATGGTrcGAACTAAACGTCAOCAAGGTCOCTTCrC^ 
GAAGGCOTOOTGGOAmaCTCCTGGCnTCCCTAGGCCCTAOTaAGGGCAOAAGAGAAACCATC 
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TCrcCTiriTACACCGTGAGGa:AAGATCCTCANAA0GCAGOAOT0CTOC 

TOCCTCTOTGCTQTaTATGTGAACCACCCATGTGANOGAATAACmGGCNAAAAAAAAAAAAAA 
AAAAAAAAAATNCCTO 

SEQ ID NO: 3 199 CGAGGTACCA l ' IX} ' r rrrGCXmjGTGCCACTCAACAAATTTTAGATGAAGCAG 
AAAOATCATTGCATOATGCTCITnjrGTTCTrcCGCAAACTGTAAAOGACTCTAG 
GAGGAGGCTGTTCrGAGATGTTOATGGCTCATGCrcTGACACAGCTTGCCAATAOAACAa^ 
AAAGAAGCTGTTGCAATGGAGTCTrATGCTAAAGCACTGAGAATGTTGCCAACCATCATAGCTGA 
CAATGCAGGCTATGACAGTGCAGACCTOOTGGCACAGCrCAOOGCTGCTCACAGT 
CCACTGCTGGATrGOATATGAGGG/y^GG<>CCATrcGAGATATGGCTATCCrGGGTAT^^ 
AGTirrcAAGTOAAQCGb^CAOOTTCmrrOAOTOCAGCTGAAGCAGC^ 
GGACAACATCATCAAAGCGGCACCCAOGAAACOTGTCCCraATCACCACCCCTGT^ 
CACOTGCTOTCXJATCrrTOGACCAGmCTAGCAAAGTTGTGTnGAAAGATCTCTA^ 
ACTGTGOAATCT0TrATCGGTGCa:ArrATATNCnAAGTn3OATATrAACTO 
AGGTCTAATTATTGCCGGCCATTrC 

SEQ ID NO: 3200 ACOCGGGGTCTGaXJTrGGGTGAGGCGCQGAGCOAAGTGAAGOGTGGCCCA 
GGTGGGGCCAGGCTGACTGAAGAaAGCCAAAGGAAATCAACATITOAATraACTACCCAATGTCC 
TGCCCCTGTTrCTX}TCCTXn'AGCAmAGGAAAGATGGATACrATGATGCraAA 
GTnGAOCAOCrraTGCOaMGQTGQAOATICTCAaTQAAOOAAATOAAOTCCAATITATCC^ 
GGCGAAOGACriTa\0GATTnXXJTAAAAAGTOGCAGAGOACnX3ACCATCAGCT 
AAOaATCrnTtJATOAAAOCAGAOACTGAGOGAAGTGCTCrGOATOTTAAOCTGAAGCATGC^ 
TAATCAGGTCGATGTAGAGATCAAACXKJAGACAGAGAGCTGAGaCTGACTGCOAAAAGCTOaAA 
CGACAOATTCAGCTCATTCGAQAQATGCTCATGTGTGACACATCTGOCAGCAT^^ 
GGAGCAAAAATCAACTCrrGG<nTITrTAACAOAAOCCAACCA TO 
AAACTATCAACCATTGTGAATCTTGCnCArrTATCANATN^ 
ATGOGACTTTNTTTGG 

SEQ ID NO: 3201 GGTACTTGTCATCAAAOGCCCAOGCAOTCCTCTGGAATAOG CTTTCC AOCrGC 
TCATCCTTXKJTGTATTCTAACACCTXAGCAACATOACCAAGAATGCW 

OTOAATTTOTCTTCACATTroATTGCTIXXnxnXXiAGAAACT^^ i 1 lUACAAA TCAAT ATATC 

Clll 1 I C t 1 l GTCCACCCTAATGACAA(X:ACACACTCATTCCTGCC>ATrCGGATGAGTIT^^ 

AGAACGGATACGCCmntHMTAATTCACTAAGAAGAATCATGCCTI^^ 

CAAGCrcACATAAGCCCX:CATrTCAGCAATGGATCTX3ACATTCACCATCACTACATCTTC 

AGGAAATTIXjrGTTGATAAAATCTACAACnAGACCCGGCATTCTOAGGT 

CACrcGAacCCAGAGCCOACTrCTTt>ATCACAOAQA<XAOACI^^ 

GATCCTXHTAGCGTCCCGOGTACCTGCCNGGCNGGCGCrra 

SEQ ID NO: 3202 GGTACGTCCAGGCACCOCCAGCCACTGTCTrCATGCAGGAACCACAGTGCCA 
GATCCCCACAOCraSTCTCTTCATCTTOGTTrrQCCACAGA^ 
TrCCACAOGAAAAAKXrrTACTTOTCTACAATTACCTGTGAATCI^ 
OGGAOCAOTrACTTCTGrraAmCTATCCCTrcATrrTCTAOAGGTT^ 
TAGACTRKrrATGTTACATCATCTOTATCATCGTCCAATCTTr^ 
AATCTGGATTrrcCTGCTCAGACATCATCTCTKXIATATCXrrGATr^ 
TTCCACGTGGTAACTATGCKXKlTTirm^TOAOATACTrcAOOTTCCAGGarrCTCA^ 
TCTACATCAAGTCGOZATAAGAAACTCATCATCCIXrrraGTGTGGOTCn^ 
TGQTOCAAOTCITCTCCTCAACTGCTCCTOCTONTOG<^^ ICGG CCOTGAA 
GACACCATTOTNCACrrCACAGCrATrrTCTrCCnGGCCAATOCT 
GNCCCG 

SEQ ID NO: 3203 ACOCXJGGOArrcCrGAAOCTOACAGCATTCGGGCCGAGATGTCTCGCTCCGT 
GGCCTTAGCrciXKTIXXKXSCTACTCTCTCTrrCTGG^ 

OTAGAACCroTAGCAAGGTGCrCTTGGCAGGCACCCGGTATAGTIXTOCCCTGTOT 
AQTAGACXn'CCAAArrATCAOGGCAATATrrrTGCTCTAGGTCCCAAaAGOQT^^ 
TCA<XATrAGATGATCAATAAACCTGGAGTCCTCATGAAAAGCAGAQATGAAGTCCGACTOGGCA 
TACTCTGGGTACC 

SEQ ID NO: 3204 ACTTCATAAAATCCTCTrATAGAGTTACTCTIXKXXTAQATrOTAA^ 

TTOGCATTATTGTCAGACrGGATOGAOGGTGAAOTAAAATAGTATGAACAATTAAGAGGCTCrcC 

CCCTCTTGTCTTTAAOCCATArrCTCCTACATaTATrrrATAAOAAAATGrnAAOTCAAA 

GGCTCTTTAATTCCTQACCrCTTCATrcrCCTTTTCAGTATAACCTCrc 

AGACAAAAAAAACAAAACOAAATACACACAGAAAAAAGTCrrrCCAAACTGTITAAGTATTrA^ 
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CATCTOAOCCAAAAAAAAAAAAAAAAOTACC 

SEQ ID NO: 3205 ACTCTATGCATrCTATCTATACAQAAACACCrATTrATrATrACAGTrGATTTA 
TQCAGGATTAACATnTTGGTGrrrAAAATTGTTAAAC^TCATQCTTACra 
ATOAATCTGAACrrCACAATGATTTACCAAACACTTTACTTAAATTACTAATT>^ 
ATGCACOGAGCCAAACAAAACTrAACTOGGCTrATAAAOAAAAAAACCCITCAAAACAGTOCATAT 
ACACATCrATrrCAAAACTACCTm3CrAGCTAGCCCCmCCCAAAGTnTAGC^ 
GTAAAATCTACACAAAATTrrATrGCAATCATACAAGOGTTACATrAGGTCAACAAATACTATGAT 
GC\ATTTrACAmATTAAACTACAOTrCAAAOCACAAATTrACACAriCrAAATACACTAAAOT 
TATCTAATGAAGTCACACTGOTCTrCTAACATrrGATATATCTGGGTGAAAOACATGAACTTTACA 
AGACTITAAACACAAATNCTrAGTATAAAACraOOTCTGaATOTTAACGArrAATGOGGAAC^ 
ATAATXKKKnCATCrCACTAAGCATXXTrmK3TCK:ATATTTAmTrAATO 
ACTGGTAGG 

SEQ ID NO: 3206 OGTAOCAOTGOAOGAAOGCCITCCGaCOOAACATaGCAOTOAACTOCTCCGA 
GATOCOCITGAAGAG<nXXnXK}ATGGCTGTGCTATKKX:AAT^^ 
ACGAGGTGOGATGTCACACACGGCTOTCTTGACATTaTrOGaOATCCATTC^ 
GTrcTTXmxrrOCACQTTAAOCATCTaCTCATCCACCTCC^ 

AGCAGCCACGGTGAGGTATCGGCCGTGGCGGGGGTCACAGGCAGCCATCATGrr^^ 

AGACCTGCTGGGTGAGTTCX;OOCACTGTQA0AOCTC0ATACTGCTGGCTTCC^ 

GGGGCAAAGCCAGGCATAAAGAAATGGAOACGTGGGAAGGGGACCATGTTOACTCCCAAC^ 

GGAGGTCAACATTQAOCTGGOCAGGGAAACGGAOGCAGaTOOTGACACCACTC^TOOTOOC^ 

GACAAGGTOQhrrCAAAT0CC(XTAGOTTGGTGTGGTCAACTrrCAGA>mKXM 

TANAAGGCCTCGOTGTCNATGCAATAGGTCTCATAATATTCrCTACCACTGATC 

GGCATTGTAGOGCTCOACACOGG 

SEQ ID NO: 3207 ACAOACAATGTTrO l HCl J 1 IG TAAAAAGCAGTAAGTTATGCCCAGTAACTA 
AATGAATrCAAAATX}GCCAAGACAAAGAAAACTAAGAAAGATITroC(TrCCCTCT(XTACC^^ 
TATGGAGCA CAG<L^ TOTTGOOAOATOAACAOQOAAAAOACCAAGGTAAGGAGCCTGGGAGOOAA 
GGTATCAACATTITAAACTOAACTAAAA ATAA AAQTATAAATGAGTTGGAmAGGGrrAOATCA 
GTAAGACATCArnTTACTGAACACAAGTITTTAGTATCTOT^^ 
TCTraATOTAACTAAGACACACTrCCACAAGAGCCACTAOGATAAaXX:ACTCAAG^^ 
AGTAAAOTOATGTAAGTOACCAGCAAGCAGTCCACTGCTCCT 
TTrCCTTCCTTCCTTTtrmrCIXHrCTtXCTGAGAAGCTACAT^^ 

ACATCCAACCCCCTAAAGTGlGGGACGGTGGAGGAACAATOGTGGGAATtXKIAAGAAGTCTCAC 

CTAAATCCAACANCCCGGGAATraAGNTGGTTATCTIOTCmGNGATGAGGACrAATTTC 

GAGAAAAAAA 

SEQ ID NO: 3208 ACmAATAGCTCAAACTCAGAGTCATCGTGCn«X>ATrCCA^ 
T AAAA GAGGCAACITCGOCCXnTTOAGAAOCCAOCGCTCACCCACXCOGOOT^ 
CXrmXjGGTXKnXjACTTCGAOAAAAGCACAAACACXJACCAQTCCCATCCTGGCT 
TCTTCTATCTACGCATTGTATXX3ACTGCAnA0TTaCACTAAGATGATOACrcA<^ 
QACAAATOCTOACTGrrCTAAGCAAGAATQGCCCAAGCTOOCAAGAAAAAGCA 
GGATACAGAAGGGCAGAGCTrcrGCCTGCGGATCrGCAACATTAC A TTIXj 
TCAAOAA0OATTCrGTTGG<XAOGGAGTCTCCACTGAACAAACA>^ 
AGTGTCCTATTCCAGCAGCCCAGAGTCCTCATCCGTCATOCACGGGAGAGTC^ 
OAAOTCCAGCTCATGCCrCTGCCTATOOOTCAATTTCTTCGOGAATA 
ATTACCCCTGCGGACCACXATGTIX>GGGTGCTTTCCTTTTAATCACGTC^^ 
GAGACCACGGCTOTCCATTGATGCTT 

SEQ ID NO: 3209 ACT0G<nXXnX>TCrcACT0GGOCrrATGCCAAAGATGTAAAAT^ 
ATO<XCGAGCCTTAATGCTTCAAGGTGTAGACCTTTTAGCCGAT<^^ 

CAAAGGGAAOAACAGTGATTATroAGCAQAGTroGQOAAGTCCCAAAGTAACAAAAGATGGTGT 

OACTOTTGCAAAGTCA.\rrGACTTAAAAGATAAATACAAAAACATrGOAGCTAAACTrOTTC^ 

ATGTTOCXXATAACACAAATOAAGAAGCTGGGGATGOCACTACCACTOCrACTGTACC 

SEQ ID NO: 3210 ATGCGOGCATCACCATCGCTACAOAAATrOOrrCTOnCCTCGrrrr^ 
ATOCX:AAGOTnXAGCA(XAGGCA(XTCGACAGCrcTnTATAAGCGACCTCAT^^ 
CAAGCAATCCAACAGCTTACrrTOJATGGAAAACGAATOAOAAAAGCTXn'OAACCXJA^ 
AOACTACAATCCATCTtrrAATTAAGTArrTGCAGAACAGAATATGGCAAAGAGACC^ 
TGCGGGCAATTCAGCCTGATGCAGGTTATTACAATGATCTOGTCCCACCrATAGGAATtr^ 
ATOn-ATQAATGCAGTAACAACAAAATrnmCOGACATCAACAAATAAAGTAW 
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TTTGTTGTTAGOTOOACTCCAAAAGGAAGACGCTTGGTCACTGGAQCITCTAGT^^ 

CTCTGGAATGGACTCACTTIOWTrTTGAAACAATATrACAGGCTCACGAC^GCC^ 

ATGACGTGGTCACATAATOACATGTGGATGTraACAQCAGACCACNGAGGATATGTGAAATATTO 

OCAGTCCAACATGAACAOOTCAAGATGTrCAAGCCCATAAGGAGGCNATAaAGANGGCAGGTrAT 

ACACAATAACCANTnTGGA 

SEQ ID NO: 321 1 ACAAGGAATCCTTTATTGGTAACATCTTGGTGGCTGGCrAGCTAGTrrCTACA 
OAACATAATTTGCCTCTATAGAAGGCTATICTTAOATCATGTCTCAATGGAAACACTCi-lCl 1 Jti I 
AGCCTTACnTQAATCTTGCCTATAATAAAGTAGAqCAACACAC ATTOA AAQCTrCTGAT^ 
CCTCAAATTITCATCTrGAATOTCrrrcTATrAAACTGAA'rrriUr r I'l 
ATTnXX:ATaATrAGCCGTGTAACTCCTOCAATaAATCrrTTATGTCATTGAAGCAAAT^ 
TATTArmAAAAAOTOGCAGAOTGAmAACTQATCATGCATGATCCCTCATCCCTOAAATTGAO 
TTTATGTAGTCATTTTACTrATTrrATTCATTAGCTAACTTTGTCTA T^ 
TAGTOTAATCXDATTATAAAOGATATnATCAAATCCAOGGATGCATTTTOAAAT^^ 
CTraCTOAAOTATTCAT^O^IAAAACATACAAAATAAACAT^^mAAAAC:A^ITG^^ 
>WNCTONTNTAAhM^AAAAGGNCWCGGOA0GNaAGCGGCACATGaCOTC^ 
G 

SEQ ID NO: 32 1 2 accgacc^taoaocaagaatcaaoattctgctaactcctgcacagcc^ 

CTCmxnTrCTGCrAGOCTOOCTAAATCTOCKrATTATn'CAOAO 

AGTGATAAGGGCCCTACTACACTGGCTITTTTAOGCrrAQAOACAOAAACTrrAGCAT^ 

TAGTGGCTrCTAGCTCTAAATOTnXK:CCOOCCATCCCTITCCACAOTATCCT^^ 

CTGTCTCTGGCTOTCTCGAGCAOTCTAOAAOAGTGCATCTCCAGCCTATO/^ 

GGCCATAAGAAOTAAAGATTrGAAGACAGAAGOAAOAAACTCAaOAOTAAGCTTCTAOCXXX:CT^ 

CAGCTTCTACACCCTTCTOCCCTCTCTTCATTGCCT 

i ITl JCH I GGCCATOGGAAGOTTACCAGTACAATCrrGCTAGGGTGATGTGGGCCATACATTCCT 
TTAATAAACCATTCNGTACCT 

SEQ ID NO: 3213 ACGCOOACACACAOTGAAGCnTAAAGAAAaTGrrTGCTGAAAATAAAGAAA 
TCCAOAAATTGGCAGAGCAGTTTOTCCTCCTCAATCTCGTITATGAAAC^ 
CT<XTGATGGCCAGTATGTCaXAGCATrATCnTraTrGACCCAT^^ 
TCACnGGAAQATATTCAAACCGTCTCTATGCTTACGAACCTGCAGATACAGCTX^^ 
ACATGAAGAAAGCTUrCAAGTranOAAGACTGAAnGTAAAGAAAAAAATCrC^ 
OTCrGTCAOOCCrrGAGACTTGAAACCAGAAOAAGTOTGAGAAOACTOGCTAGrGTC 
OTGAACACACTtiATrAOGriTATGGTn'AATCnTACAACAACTATITmAAOAAAAACAAGTm 
OAAATITGGTTrCAAGTGTACC 

SEQ ID NO: 3214 GGTACTGATACACCATGTItKSCAAGCAahrnGGAGGCAGCTOCrACAGAOAT 
GOTTTCCITATITCGAAGCTCATAGATrCGACATTGTCGAGCCAACAOCCGTrc 
ATXXX;CTCCGCOCCCAG<XATOGTGCCTAGCAGCTATGGGTrCATCT^ 
CTGOGAOOCAATOTAAGCACmXrTGTANCCCTGGAGTCAGCTGCAACTATGACTrc 
ACTTGAAOGCCAGGGTGGTTGTrCCATGAAGCATTTCGATTC^^ 
TGGCGCTKJNCAGCCTCANACCATCWCTGAOACTNCCIXKSACCT 
CCCCAAGTCOACCCGCOGACCTGCCOGGGOCGOCTOTTCNAA^ 

SEQ ID NO: 32 1 5 ACACTTGAAACCAAAmCTAAAACATGTTmCTTAAAAAATAGTIX3TrGTA 
ACATrAAAOCATAACCTAATCAGTGTGTIOCTATOCnxrCACACTAGCCA^^ 
TCTGGTTTCAAGTCIX^AGCCCrGACAGACAOAAGGGCITGGAGAl-rri-I i TlUi i lACAATTCAG 
TCTrCAGCAACTTGAOAaCTTIXrrTCAT0TTCnt:AA0CAAC^ 
CATAGAGACaATTTGAATATXnTCCA0T0ATATtXK}CTCrAACTOTCAC^ 
ATAATCCTGOGGACATACTGGCCATCAGGAGAAAGGTOTrroTCACTTGTT rCATAA^ 
AGGAGGACAAACTGCTCTOCCAATntTOOATTTCmAT^^ 
GACTGGGOTGGGCACTCATCCAAOTGGATOAATANTX^TCAAGGGTrr^ 
TATANAGCrTCTrCATrAT<nTraNAGTCCAGATOAATrOOTNACOCCAANCr^^ 
TGGGGCAGTTTTGOOTCNGGCO 

SEQ ID NO: 3216 GGTACTGCTCax:ACCnAGTTCTrCACAACTAACATAGAAAATrGTa3AAAA 
GTAGGGGCAAOCATTTCCAAAAACAAACAAAAAATCCCCAOCrrATTATAAGCATOAATATOTA^ 
GATGGAATITCTTCCCAGCAATAGACTTCCAAACCATCAAGAAATCACC^ 
AGTTATTTTGCCTATCTCCAACAAGAOATGCACnATATGTCCAAC^ 
rrAGTOTTTAGTrTGOGOOTGGGTrOGGAAATTCAGCCACCATTTTAAAATGAC^^ 
AATGACCACCAAAAATAGGTTCACTAAATTrATmAAAAATCATAAAACGJiiViiACAAAAGAG 
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CATTACATTCTGCACACTCKTCrcAACAGATOCCAGGGACATGTGO/VCTATTGTTAC^^ 

GTCCCACCCCCANATGOTACACTGCCACAAAGCAAGGTOTTCACAAT^ 

TTAACCACCACAATTCCWAAAATAAANTCCNCTCCTCTGCTCKrrGTT 

CACCCC7TC 

SEQ ID NO: 3217 ACaCGaQAACCATCATrOACATTCTAACrAAGCOAAACAATOCACAOCGTCA 
ACAGATCAAAGCAGCATATCT«>GGAAACAGGAAAGC(XCrG<UTGAAACACTa^ 
mACAGOTCACCTTGAGGAG<nTCnTrrAGCTCrGCrAAAAACTOCAGOGCAArrrGATOCTGAT 
GAACTTCGTCKrrGCCATGAAGQGCCTTGQAACTOATOAAGATACTCTAATTOAQATTTrCH^ 
AGAACTAACAAAQAAATCAGAGACATTAACAGGtnCTACAQAGAGOAACTCAAaAGAGATCTOG 
CCAAAaACATAA<XIX:AOACACATXrrO0AOATrTrC0OAA0OCr^ 

A<XGATX7rGAGOACTTTG0TGTGAATCAAGACTTGOCTGATrCAGATGCCACGGCCTTmAT^ 
CAGGAGAAAGGAGAAAGGGGACAGACGTTAAACGTGTTCAAATCCATTCXrrTACCC^ 
TTNCACAACTTCOCAGAOGGTTTCTAAAAACAACAAATACCrrNGNC^^ 
■nTNTCAAANNTGGNNGGCGCT 

SEQ ID NO: 3218 GGTACGCGGGCCATCCATC<XCAAGGAGAACTK>GrrTGCTTGAC^ 
ATCACAACCGAOCTAAAGCTCAAATTGCTCTTAAACTTGGTGTGACrGCT^ 
TCATTATCTGGGGAA.\CCATTOCTCOACn:>GTATGCAGATGT^^ 
AAGGAAAGGAAGT^GOTGTT^ATGAAaCIX^GU^AAGATGACA0CT0aC^CA^ 
A<X3ACTGTGCAGCAGCGTGGCGC7rcCraTCATCAAGCCrcOAAAACTATCCAGTGCW 
OCAAAAOCCATCnm}ACCAC07CAOOOACATCTGG7TrOGAACCC^ 
CCATGGGTGTrATCTCrGATGCCAACrCCTATGGTGTCCTGATGATCTGCTCTACTCAn 
GTAATCAAGAATAAGACCKKiAAATrrOTTOAAAONCTCXXn'ATTAAT TGAT^ 
AGATGOATCmACrGCAAAGOACrTOACAGAAAGAAAANGAAAGTGCTTTrOAATTrC^^ 
GOCNTONTATACAATOATOTTANT 

SEQ ID NO: 32 19 CGTACAACA<XttOGAATATrcCCAOAGTrAACCTGACCACCAACACOATroC 
TGTGACTCAAACTCrCCCTAATGCTGCCTATAATAACCGCTTTTCATATGCTAATGnGCrrC^^ 
GATATTCACTnXXrrGTGGATCAGAATGGATIXn'GGGTrATTTATTCAAC^ 
AACATaOTGATTAOTAAACTCAATOACACCACACTTWGOTOCrAAACACTItK} 
GTATAAACCATCTGCTTCTAACGCCTTCATOOTATOTGGGOTTCTGTATGTCACCCQT 

SEQ ID NO; 3220 ACGC0GGGOOCCAATGT0<XAGCAGATCrGAGCTGAGAATCATCCTGGTC 
CAAAACAGaAACIX3QCAAAA0TGCTaCAGaGAACAGCATCCTCAaaAAOCAAOCATT^ 
AAGCTGGGTTCCCAGACCTTGACTAAGACrrGCAGCAAAAGTCAGOGAAGCT^^ 
AOATrOTCATTATrGACACACCAOATATaTTnCTrGGAAOGACCACTGTQAACCTCTGTACCAG 
lllllUlJlLUlLliilL l GCCrrAGGAG<nTAG(nGAAAGTTCAGCTAACCrrTGACACTACTAGT 
GTCA^^^AGTAACAGTAATCT^IGACACTAC^AAGTGTCAmAGGAGC^^AGC^Xyu^AGT^ 
TAACCrrrOACACTGCTAaTGCArrCAAGAAAGGOTAGGGAATOAAAGTACCTOOGC 

SEQ ID NO: 322 1 GCTA Cll n m 1 i H Ml i H 1 1 1 U 1 1 1 111 ] agoactcaactcatggaanaa 
AAACmAATTCCTGTn'CCAAAAACATCAOTATGAAGTAGAAATrGGTn^ 
AGAATATGTTCTANAAAGNGGAGGTAACrGGATCrAAAArrCTGGCAGCAATTTAATAN AACAG T 
OCTAGAGGGTTAGGCTGAGGCCAACACCTAATGAGGACXJAAAGCCTraTCKn'GGGGAAT^^ 
ATOATACTTGOAOAAATArrACACTONGCATAAAC(>AArrOAGTTGCmCACAAGTOTTOTAAA 
0TTTCATATAGTAAAAOG>rrTTTTCTACATGCACTTCAATTT^ 

CTAAACACAOAAQAOAaCATTCATO<yj^GATATCTAACTCCTrGATATAATAATOCATACAAT^ 
AAAATOATTACACTATCATrACAThrrAGGGCrillCl^GCAACTCCCAGTOGTG GTrNTOO GAAAG 
CACKKXXXX^OCATOGACATTTAAAAAGCATCCCC^KXnTCTAC^ 
TTTCCTTCTTTTTCCTAANAA 

SEQ ID NO: 3222 ACCCrCAACm^AAGGAAAAAAGGTTATATITGGGAATTTAAATATCrrrrT 
arrcAOraCAATTACACACTATNAAAACCTAGAATraCATACAACGCCTrGGTC^ 
CTTGTrrGATOATOATATAAOOGCAATrACATrTAAAGCAAAAriTCNAAAAAGTGCACCCrcC^ 
TGTGAANATATCANACrrAGCANCCCACCTAGAGGATAAGTrGTTCAOGAOTGGTTCTGATrAAA 
GCCCAOATTTCAGAGCTNOCATTTrCTATTACAOCATCrCANAAQATAGCOCTAAATOCTCACAOT 
^X^X^NAAGAGTATTTTTTTTr^r^CTTCXXAAC^ 

WAACTANOAAACAGATCAGAACANNGATCmCTNACTATGrrTTAGCTGCrrCC ATm 

AAOAAAATANTTOTATGCNCATCGTTAATGACTG NCATT GACG GAAGAC ATGAACGTTTNOAT^ 

NTOnTJAANANAANCCTGNAJmJGANNAANATCCTITrAAACATrrrCTOCrATA 
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NATGANA U l 1 1 CI l U I I NNAAAANACC 

SEQ ID NO: 3223 ACAACHJACCACTTCTTCAaAAOCAGAOCTCTCCTCnxykO^ 
GAGCTCCACX7rcA0GCCCKnx:TCTrGCTTTATCX^ 
AGQAATTAOCTCTTOAOTAGCCTITKXTtaTriTCGGATC^^ 
OGTCrriTITAGGATCTAC C TT Cri 'CrrTTriCOAAOAOQTT^^ 
OAAAGACAACAATGAOGCrCGCTCX?r(UCTCrCrCTAAGCrGan<^^ 
CGCTAGTCGGGCOCAGGGCTAOCOAGATACTIXXjCAGCAOOOAOOaXm^ 
CGCTTCCCCCGCOTACC 

SEQ ID NO: 3224 ACGCAGTITrCATGTCmGCOTAAAGAGCTCTCTAGTCTAAOGGTCrT^ 
GTTAGAOATCTAAATGACArmATCATCrnTrCCIWAGCAGGTGCATAaT^^ 
TCACAGCTGTGCCAGTAATAAOOATGCTAACAATTAA TTTTA TCAAACCTA^ 
ATnX3ACACGTTTrAAnGCIX>00TTAAATGAAATA<TITriXXXKK;aT^^ 
ACTOATAAAACAAAAACAAAAGTATGTTrrAAATGCTITGAAGACrGATACACTCAA^ 
ATK>T0AGCrCTC AATm >T0OCAGOCCATAGTTCTA^ 
GACTATA(XACTATTTTTrCTOAOATTAATGTACC 

SEQ ID NO: 3225 ggtacaaacgacogagcaccatcaacccgtccaaggocagcacaaacccag 
atcgaotocagooaocaaoaooccaaaacatcaggoaccgggccaccatccggcgcctgaatat 
gtataggcaaaagoagcgcagoaacagtogtggtaaaataattaaacccctgcaatatcaat^ 
cggggocitctggcacactggcaanagtagagccaaatattaaatggtitggaaacac^ 
attaancaotcatca-n-acaaaaatrtcaaoagqaaatgoatacaottatoaagoatcxatacaa 
agttgtcatgaagcaaancaagttaccaatgtctcttctcx>tgat^ 
oaaootocacatrcttoatactgaaaotrmgaaactacattrggccctangtcacaga 
c0accaaacttatttx3a:agt0atatccantctmatcn>^^ 
ctatgaaccagggcnangattcgtoaatttgqtnacctgaa^ 
tcannaaaaaatcrmtanaggganangtcanagnaatttt 

seq id no: 3226 acaoiagaaoaaattcaogaagtaagaagtaagagnrgaccctattatgcttc 
tcaagoacaggatcgtgaacagcaatmgccagtctggaagaactaaagoaaattg^^ 
agtgaggaaggagattgagoatgctgcccagtttgccacggccaatccroaoccacctttgga^ 

AOCTOGGCTACCACATCrACTCCAGCGACCCACCrmGAAGTimrcGTG^ 
AGTTTAAGTCAGTCAOTTAAOOOOAGQAaAAOOAGAOGTTATACCTTCAGGOGOCrACCAGACAO 
TGTTCTCAACTTOGTTAAOGAGGAAGAAAACXX>GTCAATGAAATTCAATaAAATT^^ 
TTCCATTAAOTGTOTAGATTGAGCAGGTAGTAArrCCATGCAGTrTGTACC^ 

SEQ ID NO: 3227 ACTTGAOTCAAAGACGAC^TTTAOATTCTT^^ 

TCATTGTCTAGTCAGGCAGAGGAAAGAAATTCAAAAGCAUrri CrrrriUl I CTOTCAGTrCCnTG 

CAGTAAOATOCATCTTCTCACOTOAGAAATCATTAATAGGGAGACCTTCAACA^ 

TAmTTGATTACAACAGGGAATGAGTAOAGCAGATCATCAGGAACACCATAGGAGTTGCCATCA 

GAGATAACACCCATGGACACAAACIXntrarTGGGGTTCCAAACCAGATGTCCC^ 

ACAGATOGCnTItK>GCAGACATGGCACTGGATAOTmCGA0anTC 

GCTGCTXK>CAGTCGGTGA(>AATTCTCCXnTGAGC^ 

CCAACTTCCTTIXXTKKiVAmCACXnTOOCATGOTTOACATCTO^ 

TrnXX:AAATAATGACATTCTITACATCATrAGCAGGa;CACCAAOTTTAAGACCA^ 

TANCTOOOGTGNGG 

SEQ ID NO: 3228 ACCTAGAAOAOAGGOOOGTCAAAOAAOTA0TOAAOAAGCATTCTCAC7ITCAT 
AGGCTATCCCATCACCXrmATrrcOAGAAOOAACGAOAGAAGGAAATTAGTGATGATGAGGC^ 
AGGAAOAOAAAOGTGAOAAAGAAGAGGAAOATAAAOATGATGAAGAAAAACCCAAGATCGAAG 
ATGriGGGTlX^GATGAGGAGGATGACAGCGCTAAGGATAAGAAGAAOAAAACTAAQAAOATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGACCAAGCCTAmGGACXIAGAAACCCTGATG 
ACATCAaxyiAGAOOAOTATGOAOAATTCTACAAQAOCCTCACrAATaACTOOaAAGAO^ 
GCAGTCAAGCACTTrrCTGTAGAAGOTCAGTTOGAATrCAOGOCATTGCTTAT^ 
GGCItXXriTrOACCTrrrroAaAACAAOAAOAAAANOAACAAC ATCA AACT^ 
GTrCTCGTGOCCAGCTTOTOATGACnTQATCCAGAGTAT^^ 
TTrrOAOOATCTTGCCXrrGAAC 

SEQ ID NO: 3229 CGTACGaJGOOCCCOTtXiGAOCCCTroCACGCCTCCrC^^ 
C^GCCTAGCCCAGCATCACTATGGTOOACGCTTTCCTGGGCACCTOa 
AATTTCGATOACTACATGAAOTCACnXGTGTGCGTTTrGCTACCAGOCAGGTGO^ 
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AAGCCTAa^.CAAl-CATCGAAAAOAATGGGGACATTCTCACCCTAAGAACACA^ 

OAACACAOAOATCAGCTTTAACnTGGGOGTGGACnTCGATGAGACAACAGCAOATGACAGGAAG 

OTCAAGTCCATTCTGACACTOGATGGAOGGAAACTTGTrCACCTGCAGAAATGGGACOGGCAAOA 

GACCACACTTOTOCGGGAGCTAATTOATGGAAAAACrCATOCTGACCrCACC^ 

OTTrGCACTCCACTTATGAAGAAAANAGCATOACCTOACrOCA CrrOT T^^ 

GCCATXXXJ>nTCCCCTC»ATTOA>WCCACATITO(^^ 

OGCNOOCNGTrAAGONCAAITNC 

SEQ ID NO: 3230 AOCTACAGAAAAOCCICTIXrrGTCTACQOOGTGrCAQTITQACACCC^^ 
COACAGAGOCTCTCOTAAACATCCCICrcAGOCAGGTAOTTCCACAGGOCrCTCAT^^ 
GOGGTTOCTAGCTCAOCTKnXSTCTOATrraTrAACOTTT^^ 
AAAGTACC 

SEQ ID NO: 323 1 AOGCGGGGGGGTCGACTGACGGTAACGGCOCAGAGAGGCTGTrcOCAGAGC 
TOC0GAAOATGAATOCCAOAG0ACrrGGATat3AOCTAAAGOACA^ 
TCAGCAAGTXKlA(XTTTrcAAAGTCATGATCTTCTTCGGAA^ 
CTrrrOCCTAGTCATCCCCTTGAATTATCAGAAAAAAATTrCCAGCTCAAC^ 
TTTTCCACACTGAGAAACATTCAGGGTCTATTTGCTCCGCTAAAArTACAGATGG AAT^ 
GTGCAGCAGGTTCAGCCQTmCCATTraTrCAAGCrCAAATmTCACTGGATGTT^ 
AATCATCAQACTATIXWATrrraAOOATATICTTAATGATCCATCACAAAOCaAAGTCATOOGGA 
GAGCCACACTTGATGGTGGAATATAAACTTNCmACTIXn'AATAGTNNGCT^^ 
ANraoan^JCATCrrXJOTTATAOTCATCnTrGTACCAACCAAGNroANGNNG^ 
CNTTCANCCANANAANCCCCAAT 

SEQ ID NO: 3 232 GGTACXJCGGGOGGAAGGTGOaKTOOTGAAGOTGCAGOaxrrro^ 

TCAQAOOCAOOTXMCTATOAAAOOCrrATATTTOCAACAOAO TTCCAC AOATQAAGAAATAACAT 

TTGTATTTCAAGAAAAGCAAGATCTTCXnXTrTACAGAGGATAACTITOTGA^ 

CTroTGCTCTGAGCCAOATAAATACAAAGCTrcTGGCAGAAATGAAGATGAAAAAGOATrrAm 

CCTGTTGOGAGAGAAATTGCTGOAAT1OTATTAOATOGTAAOTATA(>OATGTGAOTATATOTATT 

TATTITCTAAGGTAAAAGAGATTAAATGTAGCrAATTGOTrACTGGT 

SEQ ID NO: 3233 OGTACATGQAGGAGAATGACCAGCTCAAGAAGGGAGCTGCrGTTGACOGAG 
OCAAGTTOGATOTtrGOGAATGCTGAGGTGAAGTTGGAGGAAGAGAACAGGAOCCTGAAOOCTGA 
CCTGCAOAAOCTAAAGOACGAGCroCCCAGCACrAAGCT^AAAA CTAOAGAAAGCTGAAAACCAG 
GTTCroGCCATGCGOAAGCAGTXnXUGOGCCTCACCAAGOAGTAC-l 11 Tl I I I l l Tl 1111 11 1 1 1 U 
CCATCAACAAGTGTnATTCATCACCTACTGTCTGOnXKK^CTC 

CAGANAOGTCTAGGATATOOCCCCCACOCACCOAAOGCTTTACAATNTACTrOTGAGATTO 

CACACACACAAATAACGATCAATCAAAAATTGTCAATCCTAAGCATCAAGAACAATTTATACATT 

GAGGGTTGGGGGAOGGAGGGGTAOGAAAAGGGATrGGNTNATCANAANAANGGCTCAATGAAG 

GAGGTGACCCTTATGAAGTTCTrcCAACCCTOAOaAGOA(XmNTO 

TTTGCAAAACCACTTGGAACNTTCGAAG 

SEQ ID NO: 3234 CGGTACAAAGATGACTATAAACAAGATGCAGCCCTCGQTrTCCATGAACAGC 
ACACTATrACAGTAAACCAAOTrTATATIXXACCATCAA<7rOT0O<nCT 
OATXWATCATTAAGAATATCCTCAAATCCAATAGTCTCATCATrACOCCTCAAAA^ 
AGArnGAGCrrGAAAGAAATGOAAGACGCrOAACXTOCTGCACriXKX TIXl^ 
TTTAOCGGAGCAAATAGACCCTQAATGTITCTCAGTGTGGAAAAATrC ATm 
TGOAAATTTTnTtTOATAATTCAAGOGGATGACTAOGCAAAAGTTCAr 
arrrTCCGAAGAAGATCATGACTTTCAAAAGGTCCACTTGCTTO 
TGTCCrnNGCTCAGATCCAAGTCCTCTGGCATTCATCTTC^^ 
CCX^lGTTACCOTCANGCCGANCCCGCCGTTCACrrrCCGTTTO 
GCGGTOGAAAGGGCAA 

SEQ ID NO: 3235 atcaatttacagtcgcatcaataaataaccaaagagtccca i'riu ri ccrcAC 

ATICTrraOCAGTATAOGAAAQTAOAGArrACCCrCCACCCTrrAT^^ 

OAOAAGAGAATCGTGO O 1111111 IL CCCCTrAAAATtrTAATATITrnCCCACATGrn-ACTAGTC 
ATTTGCTTnrrrCTGTATITroAAOTrorrCTAAATCAr^ 

TQTCTrTCTTACAACATACTCITrOATAGGCAAGGGCATGAAAGCTG CATGTCrr rCC^ 
TTCAAOTrAAaCAGGAATrCCCAAAATGACTGGAATrAGAATGTCriUl 1 1 ICTCCTGOGCAGCCA 
TCCTACmAATGTrtnTGGTAATAAAACAGAAAACTGGGCCATATTCTTC 
CTGACCATTTCrGGGGQACCXCTATXTTGAAGGTATTGCHAAGTO 

TTTArrAANCOTTTCTmAAQONCNTTAAAAACCCaaQiXACCQAAACrCGTrNCTOGaCC^^ 
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TCCACCAGTCATNATTTGO 

SEQ ID NO: 3236 GCTACAAAAATTraAAACTXmiACAATaACAATTATOAAATCCTGTOACTCA 
AAOTCCCCTCGAGTOCACTCTarroOTOCACATGCOCCC^^ 
TAAACrAATOCAAACCAOIXKTACCCAGAAGCACCAACACGTQTaTn^ 

AOAOCAOTATCTACTCCAAACATCCAOTAAOTAAAACTATOCKATCTTCCCAGGAACAGCAAGGC 
AGGCrrCTTACTCACGATOAACXirAOCACGAATAAACCCAGCAAAAAGAGAACTGCATACTTA^ 
TTAOGATAGTCATTCATOAGGATCXnXiAC^ 

ACAACATXrnrnTCrCTAGCCAATGTTGTanTffmATAGAGOCCTC^^ 
TATXTOCTIT GGTCAA AAACrTTOATATGCOCArriTGCTri^ 

ATAOGAATCTCl Jil CriCTATCTAAAACAACAATTrCTCCCCTCOTATGOOATmrrAAACTCGAT 
TTQOTTAaAAAAAAAAG 

SEQ ID NO: 3237 ACACTTGAAACCAAATTrCTAAAACN'IXjrrrriVlTAAAAAATAOTTGTTOTA 
ACATTAAACCATAACCTAATCAOTGTOTn>CTATGCnXXAC>CTAO<XA(^ 
TCrOGrrrCAAOTtrrcAAGGCCTCACAGACAGAAGGGC^^ 
TCTTCAGCAACTTGAGAGCTTTCTrcACGTTGTCAA(KAA 

CATAGAGACOATTTGAATATCrrCCACTOATATCGOCTXn'AACTCTCJ^GAGATOGCr^^ 

ATAATCCnXKKJACATACTGGCCATCAGGAGAAAGOTGTTTGTCAOTTGrr^ 

AOCAGGACAAACTOCTCTGCCAATTTCTriGATTTXnrrATT^^ 

GACTXnmXKKXIACTCATCCAAOTGATOAATAATCATCAAGGOTr^ 

ATAGAGCrTCTTCATATGTCTOAOTCXyLAATaAGTrO<yrCACCXICA(^^ 

ACACG 

SEQ ID NO: 3238 GCGlXKKnXXKXiaXJAGOTACGCOGGGGCnACrcGAGAGCTAGCrGAOOGA 
ATGGGCCGCCOACTOTCGAGTrAGOOrCCTCAATOTGGACGCCCTGAGC^^ 
TOGCTGCGQCAQCAOOGGACrAGCGTOAGAGTTrCATGAGGCAGOAO Cl 1 [ I 1 1 1 IC OTCCAnTT 
OrnXCTAATOAATCTOAACTGCXXAGAATOOAACCTAGTATATAAGGCATrCAAGAAATACn 
TCAOTGAATGACTACATTOACrrOAGACATOOaGGAACATCTrGCAATCACCrOGTGC^^ 
GGCAAAOTTGOCTAAAAAAAAGAAAAGAACATGGAGGCAGATATAATCACAAATCrrCOATGCA 
NOCrcAAAGAOGCTOAAGAAGAGOTACTAANAGCTGCACAATATOGTrrACAACTAGTNGAGAG 
TCANAAATGAATTACAGAATCAAT^WGG^f^AAATGTCGTAATGAAATOATGACCATOACTNOAOA 
GTnmSAACAATAAAAhTITCCCCTrcAAGAGAAGTrGAACTX:^ 
NAacrOCCAATQTAAACmNTITAACCrCChfNCAAAAA'rra 

SEQ ID NO: 3239 ACATOATGTCAGTTtXnTTOCGCOrraACroCATCATC^TGCGG^ 
GTOTCAAATGGATAGGAAGTCAAOCCGOCAACAGCAOTGACAGTCTGTGCGATCATC^ 
QACGATOTGAffnTITCTTGGGATXXXGAAGCATItXXTTrGCAGT^^ 
GGCTCGOTAOATGATAATACCCTGCACAGACACGTTAAAaoaTOOTACGCOOGOCTT^^ 
OCAQCAGCCOOGCTGAGAGGAGCGTGGCTGTCTCCTCTCTCCGa^ 
ATATCGGTGTACC 

SEQ ID NO: 3240 GGTACTTCACTOCGGACTTCACTrCTroAOCAAaAAOOCTOT 

AAGAOAATCACAGAGATOAATCTCACAATGCAGGAAAACTAGGTCATAATOTCCAG^^ 

AACATCTGAACTTCAGAACXXKKrrTTCCGAOOACTGaXATTCTC 

(XKnX}ATITCAGCAOCTOTITCTOATXUTQAAATACTn>CAAGGI^ 

GTGGCATAGCAGTTCCTCAACACCAGGTTAAACXXXKL^GGTOTOCCC nCT 

CACATACAGCACGGACTCAAC^OACAGrrrCAACrOCATOCCCTrCOTAAGG 

GTXnT0GAAGAOGG<X:ATCCTQAC:AATGAACKnXXATTCCCGTCCACACTGAOT 

TACAATGGOCTOCAAGCCAGCTTGOANGCroACTrn^TaTCCAGT^^ 

TTGATOTrOAAOaATOOTOTCTCTNATCATaAAATCTTTGGNCC^ 

TNGGCATNGOnTGGATTCTCT 

SEQ ID NO: 324 1 GGTACCCmAACCOCTTCTCCTTCAOCXTrAOCAOCAAOTCCCACTr^^ 
GGOGOCAAOAAACCCCAAACCCCTTCOCTCCGTGTCTTTAOG C i L 11. 1 U 1 C rCnKHJnTGCTrCC 

TrcAcrATXKx;cAACCTrcc>TC(TO:ATrarnxnTCTOc^ 

AACCTCrrCAACrCACACCTOACCTAAAACCrAAATOCCrCATTrn^^ 
CCCAATACAAACT TGACA ATGOCTCTAAATGGCCAOAAAATGGCACTT^ 
AAGACX^•AAATAATITITGTCAAAAAATGGGCAAATOa^CTO^OOTOCCraAT^^ 

TCCCACCTGTCCOCITCA^mXX;AACCCCAGOCOTT^ 
AACOIATCTOACCTnTCCCTrCTrrrCAOGCCAAGCTANGTCCC^ 
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CTCCACOCTGTAA 

SEQ ID NO: 3242 AOOCOQa<10TTCCOAOOCO<XOCCGOGAGCrocCA(^ 

CAOCCACCGCCGCAATCATGGTGTCAOTAATTAACACTGTGOATACCTNCCATGAGGACATOAT^ 
ACGACGCW:AGATOQAC:rACTATGGCACCCGCCnGGCAAOaXKrrC^ 
ATCTITGATGTGCOCAATGGAGGGCAQATCCTTATCO<XX}ACCT^ 
TGGCAAGTGGCCTGGGCTCACCCCATGrrACC 

SEQ ID NO; 3243 ACTGGA<?rGAAATAAATrCAGCCAACATGTGACTAATTGOAAOAAOAGCAA 
AG<X3TGGTGAC»TGTTGATGAGGCAGATGGAGATCAGAG0TTACTAGGGTrTAGGAAACGTaAAA 
GGCTGTXKjCATCAGGOTAOGGGAGCAmrnKXrrAACAOAAATTAGAArrGT^^ 
ACTCTATACTTAATCrCACATTCATTAATATATGOAATTXXnXTACrOCrcAOaX^ 
TTTOGCCCCTGGACTATOGTGCTGTATATAATGCTTIXKiAOTATCTGTnj 
TITIXKiATAAAACCTITriTOAACAOANT TNAATAA TTATAATATNAKrAAll^ 
ATATXANTAATAKirNAATTAhnTOTATAhrnTTrrAAArrTTiy ^^ 
GGCCGGNACCXXnTAGCa>UANTOCNACCNCCTGGGGNa»TTATITr^ 
CNmCTTGGCGTNNAANGOAAAANTTmxrmOGNQAAANTTTTO 
A 

SEQ ID NO: 3244 ACGOGGGGGGAOOGGGCGGCGGCGTTGGCGGCITGTOCAGCAATGGC^ 
ATAAAGGCTCOAGATCTTCmKjGAAGAAGAAGGAGnAGCTGCTGAAACAGCTGOAaSACCTGA 
AGGTGGAGCTOTCCCAGCnXXXXXyrCGCCAAAGTOACAOOCGGT^^ 
ATCCnAOTCOTCOGGAAATCCATTOCCCarGTrCTCACAOTTArrAACCAGACICAG^^ 
CmAGGAAATTCTACAAOGGCAAGAAGTACCT 

SEQ ID NO: 3245 ACAAAQAAAGTrrrAAOTCAAGGarrCACCAAmCTACAOTAmGTATTGT 
GTCTCAATTXnXXAAACTAACirnAAAAAOCTTAAACTTAAO^ 

l AAACTAGAATCAACAAACAlXMOAAATATriXTrTGAATCAGGOAGCTAGCACCTTTO 1 1 IC 
CAAAAAAGCACCTCrCCaytGTGTOTTCACTGTCATCTGCTGTAAAA 

TAAACTACTTAAACTTAGATAACATCACTCTGAAOTATACTAC CAAA ATGTTAATTGAGAAAAGCT 

GAAAATAGTTTrAGTTTACTCATTATCACATGCTAGAAGAAAArrmCATGAGA^ 

AGOTAATTITITAATCCAaATTTTTCACAAACTCATOQTOCAA^ 

TATAACnjCIXnTAATTGCTTGTTGGCTGCCTCTOAAAATOATTGA^ 

CAAAACTAarCATAOCCCCCATGCrOTTCrGCCCACTOTAGCC^^ 

OACTOGCTCTTrAGGCCACTOTAAGTQGCCrGGGCAGCAOACACCCCC ATGC CrrG^ 

CnXKn'ATACG<XCX>TTGGTGGGC0CTGGTOGTG OAATCA AGAAGAGTCT^ 

ATrOGNCCTGTTTnNGACATANCTOGCCCANUITIIlir 

SEQ ID NO: 3246 GOTACATOTTrGOAAATaAOTTAOATACTrGAAAAOTCTAAACACACTGATr 
TAGOAGTGCOTATGTTGCTGTCTCAATGAACAATGGGTCAATAGTTCATATXKj 
CTGGOTAGCTLCOTATOTOrrmAAATATAAAATTATATTAGACATrAQCACCAGTt^ 
CAGCATCATAAGATCCAAAACACXAGCACAAGrn-ATCCATCATTAAAAACAATTACCTCCTT^ 
AAAATGTCAaAATAOTAOOTAACTOAGATTAATAAATaTATCCAAAAAOTAACTCTrATAACAT 
AAAAGTrrCTnTAGAAOTATAVGTGAGAACCAATa^OCTAAATAGAAATOTTTAAGATTGAAGA 
TOTCCAATATTmAAAACraGACATTACCATATACAGOTAaAACATITCTAAAAATGACGTGACA 
AACAATICirAAAAATmTTAAAAATnXnTCACTarTTATCAATAAAT^^ 
CAATCCTAGAAAACTrrmAATTCATCViTrCrATGTGTOmTAATAAGaGGGTOATOC^^ 
ATAGGAATCCATTATrrACCrAATTITrCTAATGCGGGOAAGAGAGAAAGAGATAATTACCACTIT 
CTGGGGGOAATACTCAACrrACAACCCAAArrAATAGa:AATCTAGOGAAATTrAOAAACCAaO^ 
AAANOGTTTTTOGOmATANirrCATOOTATAOGOGCIXKrroOTC^^ 
CrGTCrrOGTTTCCAAAATTCCAAAAATTTNAAAATACITCC^ 
ANGGGGCCCCTTTNGOGTTTrraOTITnAAAAAACAAAOGGAANOGGGT^ 
TGGAATTA7TGGGGAGTTANCCroGNCCCGOOGGNGOGCCaGriTTTNAAAAN0<^ 
CCANGNCCCCCTTGOGNOGGGGCGGCTrAaTATTmCGGGTANCCCGGACC^ 
ACTG 

SEQ ID NO: 3247 ACACOTCGTCCTCCCGGCrCAGOCCCTCAAAGAAOOOGATQAGOTCCAOCAG 
CTCCGTGTCCGTCATGTCATCOAACCAOGACTOCACAaGCACTGCATIX^ 
TGAGGCAGGGGAATTOTCAACAATGATCACTITGCrcACCIXXXXXXOLAGOCO 
CACGTAGTTCOCACGATGAAAAACACATGATICTCTGAAGAGiXGGGCCCGGAACA 
GOTCTAGOAaGTCAGCCACAOGOTCTGCATACITOGCCAAOCroGCAGTAAAGAOCACACA^ 
AAAAGCTG<XCCATCCTCTGOAGOAACTCGTCCACATGTOGCCGC1TCAGCACATAW 
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ATACTTCCAT0GATITCAACm3AACAATAAAATCAGCATrACrAATAG<KTrAAACGAACT^^ 
ACCAATGTTTCATClAAATtyLATOACCACACATTIXTn'CCCOOGM^ 

SEQ EDNO: 3248 GGTAOOCOGGGGGCAACGrrmCTCATGCrCCGTGATGCATGAGGCTC^^ 
CAACCACTACACGCAOAAOAaCOCTCCCTKnCOCCGGOTAAATOACri^ 
CCCGCr(XrCCGGCrCTCCHX0TCOCACGAO0ATOCnX3OCAC^ 
CACACCCACAGTOACCCTITTOACOTOOCGTCAGTGGCAT<X!TTC 
(nX}OACTOAOATrCCUrrO(UOO<TrgTCTO<K30AAOTOAAOACIT^ 
TOAOOGTOTAGCAAAOGCTCAOAGGAACKJGGGACCITCKjGAGGAAGGAAGGCT^ 
AOOOTraCOTOGAaACTOGCTXrrAAAATTACnTATGT 

SEQ ID NO: 3249 ACAAAAAOTATATOOCAOATAAACTTOCAAOATTOOATCCAAOACTATTTTG 
TAAaGCTTTGmAAAGCTTTGGGCTTAAATOAAAATCACTAC>AGTrrGGCT^ 
TmAaAOCTOOCAAOTTTGCAGAATTTGATCAOATCATGAAGCCT^ 
GTTG<nTAAAAOAGTCAATCACTCXKnrACATGCAGTCGCnK}AA^ 

TCTCAGTCATCAAATTGAAAAACAAAATAAAATATOOAOCTGAAiKXnXSCArrAAAATXKAAA 

ACTAriCGAATGTGOCnTOCAAGAOOAGACACAAACCrCOCATTOATOOTa^ 

CCCACTGGAAAAACOACTTCKlTrAAATrAATGAAGGAhfrCAGTGNOar^ 

CCGOATOOATTAACCOaTOCAONATCTNGOAAATTCrAmXJOATCXTn^ 

TNCNCTTIT0ATGACCCCCM3AACCAATTCCX}GAANNAAAAAN^ 

CTTTGGGOOOGAAOCACNmANGGGCOAANTTCCANCCCACroOONOOO C^^ 

ATNCCGAACmX}GOANCCAAACTTGGNCXlNAAACAATQGa<)CATAACTO<^^ 

AAATGGTriTTCCG>m;CCAAATTCOCOCCAAAAATACCAANCCGCGAA^ 

AACCCNGOOOaOOCCCTAATWmONOOOOCCTAACCOCCCATT 

SEQ ID NO: 3250 GOTACroOCTOAACTOTOOAAAACATACAATTCTOTOTTCC rCAOTA AATGA 
GATTACCOTCTAATOAOTAGCACCCCmACTAACTrAOTAGTAOTATAAAATCATTmATITAGT 
TAATTACCAGAOAOATTTAaCATAATriTGTrCTGGATTCAOTAAATCAAg 
CACmAACrrriTCCTTTAOCA<KX>TTTCCACTAGT^^ 
TCCAAAGCAOAATCAATGTCrnTCCATCTCGTOACTTAAAGTrCI^^ 
TGTTCCGACm^TCTCTTCCTCTTAACTACGOTOTmX^ 

OAATGACTGOXAGAATGAGAATrrGTCCAGATTATTCAGATAAACATCATAAAOCAGAA^ 

TATAAATAAOTAQAATATQAATAAATAOAATAATAAAATTCCAAAATACTCAATOGOAAATOACr 

AAOTTATATAGGCnrCAAGAGTTGGT 

SEQ ID NO: 325 1 GGTAOGCGGGGAGAGCCGGCGOCOGAGGAGACGCACOCAOCTGACTTTGTC 
TT(nXXX}CACGACTGTTACAOAOGTCntXAOAOCCTTXrrCTCTC^ 
AGGAAAAACTCATTGCACCAGTTtXXJGAAGAAGAGGCAACAGTrcCAAACAATAAGAT^ 
OTCOGTOTTGOACAA0TT0OTAT<K3C(nt}TGCTATCAOCATI^ 
CTTG<nX7nXiTGGATGTTTTOGAAGATAA0CTTAAAGGA0AAATGATGOATCTG<^ 
mATITCrTCAGACACCTAAAATnjrGG<>GATAAAOATTATrCTGTGA 
TGTAOTOGTAACrroCAGQAGTCCOTCAGCAAGAACGGGAGAGTXXKjCTCAATCTO 
ATGTTAATaTCTTCA.*J^TrCATrATTCCTCAQATCGTCAAGT 

SEQ ID NO: 3252 ACGCOGOGCAOCOGCTCCAGCTAACAGGACAAGATGAGGCCXX}GCCTC^ 
TTTCTCCTAGCCCITCTOTTCnXXTTGG<XX^GCTC^ 
ATTCCCAGaXCOGCTTCAOCTCTTTCaiAGGTOTTGACTCCAGCT^ 
G0TOjGGCTCCAOCTCCAQCCGCAGCTTAO0CA0<X0AOOTrcr^ 
TCACCGGCTXXGTGOATGACCGTGGGAOCTGCCAGTCCTCT ^^ 
COQTOOACAGAOTGOAAaKTraOAATTCACAAOCTCATOTTCmcrc 

ACTTrCTAAAGTGAGGGAATATGTCCAATTAATTAAOTGTGTATOAAAAGAAACrOTrAAACCrA 
ACrGTCCCAATTOAC^TCATaGAOAAGGATAOCATntnTACACTGAACTGGACI^^ 
AAGGTANAA^^raAAGaAAATGOAAAAC^OCCmACACTraAAaOAAAGTITaONGaAAOCTAA 
AAATG 

SEQ ID NO: 3253 GOTACCTTCTCTATGTnXX^TATOTnCOATATACAAATAOCACTOn'ACT^ 
TGCCTACAGTATTCAGTAACATGCIOTATATOTCTGTAOCCTAGGAGCAACAOTCTATACC^ 
CATTAGGTGTOTAGTAGCKTATCCXyLTCrAGGATTTTGTAAATACACTCrATGATO 
AAAAATTOACrAGCAATGCATTTCCCAGAAAATATCCCCATATrAAGTGATCCATAACTATATAAT 
CATGGGGGATATGGCTTCAACATAAAATrrrcAGTrCGGT(XATAATACCACCAATrCC^ 
TATAATQTGAAAAAATrCTCACCTCrOAGTCACACATOQAAACAAATCAATGTGAAGAG 
ATOOCAAAGTCCACTXSGAGAGTTAAAATGGTTACATGGGTAATTTAAOAAAAAATITAGTOG^ 
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AAAGCKiAAAGGOCTGAACTGCCAAAATAATnTCTTATCCAGTmACATTTOCT^ 

CrAG^TTATAOACA0CTQATAOTlXXTAGCCTrAGACATAACTrACAGCATCAGC<> 

CACTTTAOATCACArrAOTCAOGGCTAQAGCa^GOCCCATTTACACNGO^ 

GTAOCTGTOOCAGTTCTANCXATAATGCNCCTTTTXjCTTCC^^ 

CATNOOCCCCAATA<^T^CAACC^T^m^TOGNCCCCCAAAA^ 

OGAACAACmKJTCCCTOGATOTGGGGCCCATTTNAA^^ 

TITnTrAAGG<nTroaOATOGNOTACCCTOCCCCOCX3GONGO<)^^ 

CACCCCCCCmGGNGGOOCCGGTTTCOTATNGOGAATCCCAAOCT 

GAAAANAANGGGGCAAAAACCTGGTTTNCCGNOGGOGAAAAATNT 

SEQ ID NO: 3254 GOTACOACAATAOCAATCTmCCTItXrTAGAGGACACOGGAGGAGTCCCrA 
CTCOOCTGCAAACTCTGGATCAGGGCTCOGQATTGAOAGCritXXAGAf^ 
GTCGCACCACTCTCOAAOGCATCCGAAGCAGTCCCOGOACTGmGGCOTTGGCG^ 
CTTrCAOCCATrCCCOGAAOAOGTCrrCATCrTTCmAG^ 
AGOCCTrGTCAAAACCCCTTTCCTOCAGCrrCTTG^^ 
<XAaXjGCTTCTCCCCX>TOaQCrCTGCCAOT 

CA0GCrrAATCCOCAACTrCAGTrC(XGTAACGGTTCCrCCCGCCACO0GO<^ 
CCTCAAOCCACTAGAA CL llV i 1 C CACTTCCOCCGCGT 

SEQ ID NO: 3255 ACATaOCAOOAATTOATOGCGAOAAGOAaCACOCCAATOCCAAOAAOATCC 
TGCrOGAGATGGGOOAOTTCrrrCAGArrcAGGATGATrACCTrGACC^^ 
TOAOCCOCAAAATTGGCACraACATCCAGGACAACAAATGCAGCTGOCIXSOT^^ 
CAACGOGCCACrCCAGAACAOTACTCCTCWCAOTX>CAATCCATCCTAGTAT^ 
TTriTmAAmACAAAAGAGGmAnGGACnACACTTCTACGTGGCTXjCCA^ 
TCATtKKXKiAAOQTOAAAOCCACATCTCACATOQ<>OCAOATAAQAOAAAAAAAGGTAG^ 
TmAOTAAAGAAACCTGAGACTGGTAGTGGGCTIGGAGCCAOAGGATCGCITAAGTCC^^ 
TCOAGACCAGOlAaOOTAACAAATTGAGACCCCCOCAACTITAAAATTAAAAAACGAAAGAAA^ 
AATAGCTGGGTGTGGTGGCTCACACCTGTAATAGCACTTTGGGAGGCCAANGCGGGTGGATraCC 
TGAGCTCANGAGTTTGAOACCAACCTOOGCAACATGOTTTGAAAACCCTTGTC^^ 
AAAAAAAAAAATCACCCAGOTGOTOGOTAOCCGTNCCNCC^^ 
OGGANGGCnXjANGOOCCCAAAAAAATTOGCrTTOOAAACCTITGGG^ 
AAAAAAAATtATTGCXXXTTGGCACTTICCCAAACCCITGGG 

SEQ ID NO: 3256 ACATAAAAOGCTrCAOAAAArrcAAOTTrAACAAAAAATGOAAGTOTAATCA 
AAGAAAGTGCACTTAAAGLWTTTCKXLVGCAA'lTAAAAAGT AGCr TCTATACAGCTTCTATTAGAT 
GACTTITCAATCACAAAAAGAGCATAGTCATAAAAACAATTCITTCAAAAOTAGCACT^ 
TAAAATAAAACAAAAATAACCCCAAATCATGACCCTTTGCATTATCATCTACC^ 
AGCTTTrGTAAACCCTCACTAOGAGTOAATOTTCrCTTAOATATGATroTCGTTGC^ 
TTAAAAATTCACATTAACTCOCAGTAGAGAaATGTOACATaCATOTOTOTACOCOOC^^ 
OCTTIGGCAAGATGGOOOGOAOCaGCGTCCGCCAAGCTACnTCTACe^ 
CCCATITrcAOT0G<X)ACATaAAaUGOCX:AAOC0OAOGarr<KX3C^ 
TCGGOAGGTGCCOAACACTGTGCACCAATrcCAGCTOGACATCACnxr 

AAAOTCCOGAGAAATOTn-ATGAAGAATCCCCATGTCACAGACCCCAAGGNGGGGTGATCNTTCr 
NGGTCATTNANGGGAAAGAACCGAACCTaGQAAGAAACAAATTAAAGQ 

SEQ ID NO: 3257 OOTACCOGATIUICrCTrrAACCCrCOCCTrCGTOTTTTC 
TGTTTOGATGGTITOTtmTCTGCCTGGAQACAAGGTGCTAACATAOAmAAGTC 
C0GTGCTAAAAATGAAAATTCTAAC(XAAQACATOACATTCTrAG CrGTA AOT 
CTITKX^CACGCATTAATAGTOCCATTTTTCTCTI^^ 
CA<>TOGGTOGACACOOATCTOCIXKK}CKrrOCCTTAAACA 
TTAGTOTrCTOTTTGAAACTAATACTTACCGAGTCAOACrrreTOT^ 
OCTCCCTOTGGGCnmXAGOTGGCCnXKLkQGTCGGCAAAGGGAAG^ 
OTCAAOGATOGTTTrGOGACTAGAOGCrCA0TG0TGGGAGAGATCCaXK:A0AACCCAC^ 
OAACGTGGTTrCCCTGANGCTGTAACTGAOANAAAGATIXnXiGGGCTGTCTTA TGAA^ 
CATTCrrCKCATAAOOCCAGTrCATCAOCATTrC^^ 

ATTANOCTGGTGGGGTCAAACTTnTGGGANCCCCaGACTGTCAGmnrr^^ 

CGCATOCTOCANaoocTTCTCXx^txrr^m^acTT^ 

SEQ ID NO: 3258 ACOCMOGOGOOTCGGAOCTOOOTrcTCKnXSGACGCCACOGGTO 

CGGKXXXJACATGATGGCGAGCATGCGAOTGGTGAAGGAGCTGGAGGATtnTCAGAAOAAGOT 

CCCCATACCTGCaGAA(XTOTCCAOCGATaATOCCAAT0TOC^^ 

CCOACCAACCTCCCTACCACCTGAAAGCCTTCAACCrGOTCATCAGC^ 
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TCAAGCCTCCWTGATCAAATTCACAACCAAGATCTACCACOCC^ 

AmOOCTGCOCAT^TCACCAGTOAOAACnXiAAGCCmKACC^ 

GGCCCTCAATGTGCTOGTGAATAGACCOAATATCANOOAGCCXXTGCGGATGGAC^^ 

TGCTGACACAGAATOCGGAOCTGTTCAAAAAGAATCaXiAAGAGT^ 

QACCOCKXKrnXTAACTOiTGrrTCTGACCXnura 

ACCTCATGaACrGAGGCCAaANCCCCCrGT<K)COCATrcC^ 

GTrAOTCAnAATITnm*Cntn"OTOT0NOaOCCOAAGOGAANOOAACTATOAAT(^ 

GGOGNATCGACTACTTCCANGTTACXrrGGGCCAAAGGGGCC 

SEQ ID NO: 3259 GOTACAAGCGGGAGCACATTAACCKKXKn'GCGACATGGATITCG^ 
TGGG<XTrCCATCCGG0GTGCrCTC)OTGCTAQ0TrACQAOGGCrQOC^^ 
ATTTTOAGACTGCAAAATCOMAGTOACCCAOAOCAACTrr^ 
TTCCAGCirCACACTAATOTGAATGACGOGACAGAGTTTGGCGGCrCCAT^ 
CAA0AA0TrGOAQACC0<nt)TCAATCTTGCCTC0ACA0CAGGAAACAGTAACAC0CGC^ 
TAOCAGCCAAOTATCAGATTtjACaTOACGCCIXXrrTC^^ 

ATAOGTTrAOOATACACrCAGACTCTAAAaXAOOTATTAAACTOACACTGTCAGCTC^^ 

GGCAAGAACGTCAATGCT<K;TOGCCACAAGCTrGGTCTAGGACTGOAATTTCAA0C 

TACT 

SEQ ID NO: 3260 GOTA Cl l 1 ITl 1 1 ITITI f 1 1 1 Till r OCTONCCTAAAIT On T A TrAAGTATOA 
ATTTTACAAACTTTACTTATATTAGCGGTAAOGGTCQAGCTGCANAGTATr^ 
TGCCCGGCGAQAOCCACCAATAOTOTGGNGGAACTTGTGGCCCnr^^ 
GQanXK>NATCm:AG<XX::ACGCATCTOCCTOKK^ 

CACKUmCTTCTOATAGCTTTATGGAATGGATCAATGAGGATAACCTCAAAAAATrn?^ 

AATCrrCACCAACCCAGTAAGAATTCAOOACTCTCAGAOCCCCACAGNOOOGr^^ 

TTTOCAACOOACTOAAGOCrrCOAGCAAACTITAGCTCraTrAACACCATOAT^ 

GTAAGTTGCACCCTrAGOAACTOGGCGTmCGGCCOCAOGGOOAACACaAAT^ 

QTAACCrrGCTTGGCCTTOTAGCCCAGTCOGCGCCCmATTAAG<XaKJTGG^ 

GTXKMNANCAAAAAAACTGCKXK?rACCTOCCCCOGCGGO^^ 

SEQ ID NO: 326 1 GOTACrTGTTACAGTAAAACAOCTATAAAGTCCTGTItXXAAOTCCAAACCA 
CTTTnAACTTAAATCTTOAGTrmCTOAArrACTCAATrrGAAGTAA^^ 
AATGGTTTTATTGAAACOTTTGAGATTAAAAAATATGCATTGCAAOAAGCATATOACAAACATT^ 
OAOAGT 

SEQ ID NO: 3262 acqcoooattcatcactttoatqagtocccacacagtcaagctttaaagaa^ 

GTCnTTGCTGAAAATAAAGAAATCCAGAAATraGCACAOCAGTrrGTCCTC^ 

OAAACAACrOACAAACAaTnriXXT<UT0QCCAQTATOTaXX>0QATrAT^^ 

TCTCTa^CAGTTAGAGCCGATATCACTGGAAGATATTCAAATCOTC^ 

GATACAGCTrrCTTCCrroACAACATGAAGAAAGCTCTX^^AOT^^ 

AAAAAAAATCTCCAAGCCCTTCTGTCTGTCAGGCCTraAGACnTCAAAC^^ 

AaACrcGCrA0TGTGGAAG<>TAGTGAAC:ACACTGATrAG0TrATG<7mAATGrrAC^ 

ATTTmAAOAAAAACATOrmAAAAArnOaOrmCAAGGTOTACCCTCOQCKW^ 

TAA 

SEQ ED NO: 3263 GOTAaJCGGOGATTrGTGAAGAGACGAAGACTGAGCnxnXXK^^ 
GACCnX>0CAQCA0TCOOCITCTCTACGCA0AACO00OOAG TAGO AOAC^^ 
TrCrCCCTCCCCTTCTKnTjAGATrrrTTTC 

CCATCAAACACaAT0G(XA0CAACaTrA(XAACAAGACAaATCCKX3C^^ 

ATTCATKKiGAATXnCAACACTaTOTGGTCAAOAAATXnXiAT^^ 

TGGCAAAATTGTGGCCTGCTCrGTrCATAAGGGCTTTGCCITCGT^ 

TOCCC0OGCTGCraTANCAGaAGAGGATXXK>.QAATaATTGCraGCCAGGTm 

TGOCTOCAOAGCCCAAAAGTOAACCCGAGOAAAAGCAGGTGTGAAACCATCr^ 

GTACCTrOCCCGGCGOGCGCTXXJAAAGOGGAArmCACCNCACnaGCGGOCNT^^ 

ACrCOGACCAA 

SEQ ID NO: 3264 ACIXriTCAOACATACTGAAACCAAAGArrrAACTGGACTATATTCTATAATAT 
ACTGCAAAACTCAAATAmOCATrGTTAAAATAATrATCCATOCCIXXnXjOTGC^ 
AAACTAGCmGTATTTtV^CTTACTGG/WGOTrCTAAOACAAAATTAACACrr^^ 
ACCAGCATTAAACCAATTATTGCCTOAOCAATAAAATGCCACATAATCrACCGCTT^ 
ATAGATCATTTAAATCTTnGOGCrGCCTGATCATTACCAACAGAOOCACCCOAGATG^ 
CAAGT(7nnTCAGTTCCCAGarrA(XAGGCAAGCACrGT^ 
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TAATOAAACCQTAACAATCITITGTffmACTAAOAGAATrCTCCAOAAAAAGT^^ 
ATCCCCGCGTACCTCGGCCGOOACCACXiC 

SEO ID NO: 3265 ACCGTOAATATAATACACCTACroAAAATCrTOCCATACATCATGAA^ 
GAAGTAArrrorrATAOCTTOAAATOTCrrCATmAGAATATTATGCACAAAAGAG^^ 
AACAOAAAAGTQG<>rrrrCTGTCrm:AACrACAATITAOAACC^ 

TAAOTATATTATACTrCAAGTTCCAGAGTATGTC0G<K;ATAAAATCa^ATTCAAGTNTTATATC 
CAAAAAATTCTANTAAATACTCmTTAATAAATaGmATCAATATrCTCAGAOTCAGNCTAATC 
ATCCAAGAAAAAGNAGNTTCCACIUnCl l 1 lUI ICTACATCClTAATrGAATTAAaTOTOTTTATT 
CNTGGCCTATAOTTGNTTXraATCTCTICroAATGTACTTTC^ 

CCCCCAAAANG0TTGAG0ACTGTr^GCAC^C^^0r^mAGAT1GGAACANC^^t>NAAACNAT^AC 
CCOATOCGTNQATAOGNGGCCATGATOAATCATOOTrCACAACAAAG 

SEQ ID NO: 3266 GOTACATOATOACAAAATAOTCTAGCTACACTAGAAAAATATAACTGCATAG 
X^AAATTACAAACTAAAOCAATCCATTCACAGTncrACATATGTrCrcATTATACATra 
ACCTATrCTATOTOOAAAAAATATCTTTCGCrW-AAGGTAGAAOCTATCAW 
TACTCGCAGAAGACCAGTGCICTCATCAGATGA(>CACAGCAGO0T0TTTCCTAAAT^ 
OCaAOaCAAATATTCCCTTTAGTGTarrCTGACCCTAGOAGOAOT 
ACn^OWGGOAATTCACTCTTGGTOTTCOAAATTCTATXXTCACACCCAAGGAGATOG/^ 
AGGCAGCCTCTXnXX^aAGmCCCAAOCTTOCAroAATTCATGCTAAATTOCTM 
TCTOCrOTOOOOCAGGTCCOOATOCOCCACATCCCCXSATQAATATCACC^^ 
NCTG<XATACCTCGTGCnT:AAOGGAGCATCAAAQATrAAOCNCCrn^^ 
NCGTTG 

SEQ ID NO: 3267 GOTACOTCirTATCAGCAGCATrAATACAGACTAATACAGATGCITOAAGCA 
AGaXTTX3TCCAATAAGGTATTrAATAG<>CTTAOGOACTATrA 0AT0TC CX^ 
CArrrCItTTATCTCrrClX^CCCCAGTGTAACGAATa^ 

AGTAOTTGGAAATCAGTTATrrrGCATTACTAAAGTOrrCAATCACATrATCACGOGTCACATT^ 
TATATAACCAGGAGATGGCCAAATAACATCTACTGCAGTOATITrOCAAATTCA^ 
TATCCATCATIOTT<n70TCATCATCTCIt:AAAACACCATCT 
CTCATTAGTOGTGCCTGTI^CICtXTrCXnrCTTATGG 

AAG<XATCAAOCAAATrArrOCCATCATAATCATGCATrrTGOAAGTAATGGAGCrGGCAATT^ 
cmXlCGACATCTrCCGCCTTOGGTTOGTOATGACXXXnCT 

GocccGCGrr 

SEO ID NO: 3268 ACAATTAGCITCAGAGTTOATATrAATAGAAATTATTCCAAAATrATrCTrrOT 
CACAAGTAACTACrATATCCCACATAAAAAG<X!AAAAAATXXCACCCAATCACAGAAAAGGCATC 

CTOxrTATxnrrccxntxscAATGCOTrcTrrATOTATrcTC^ 

GmxnCCAATGOAnCATTCAGTnCnXKlAOAACCATATAOACTAATO^ 

ACGGACGTATCAAGTTCATOGTGOACArrCCTATrATAAAAAATACTCAGGTCnCAGAATT^ 

TGCraAAGOATXXXIAAAATOCTTICTAAAAAGCATrAOTa^TGCCTCAAA^ 

GGTCAGTATTAn-ATCCCCAITlTAACAGAAGGAAAACTCAGCKXXnTATAQTrAAaTQACT^ 

CAOOOOCACACAOCAAGTCAOTGGCCAATGGTroOAGAAGGanXJGCCnAACCAC^ 

CATOGGCACCItXKriTITITAACCTIX>TGATaACCrATrOG<^^ 

SEO ID NO: 3269 aocaccccatitatatatgaagtattcaoaoccccccagoagaoac ooatoo 

ACAGACAQACAOCCAGGTTCTCCAGTGGTATCKKXnXXATTI^^ 

CGAAACCAACATCTXJATATGTAAACTGCrCrmGTrrOCAACCCm 

TQATCCTATGGCGC7WlACCCAGGGCCCT0CCAGG(XATCTCIt^^ 

TAGAGTOGGAOAAAGGGAGfn>GGCGCATTO00AATO3rraGTTCCAGTCTG^ 

ACATrTOOCAAGAAATmXXXrroTTTGGAAAGTrT^^ 

TCCCAAGTOTCrGCCGGTCGACCAATCTGOCTOCCACACATTGACC^ 

ACTCOAGGAirCCAGGTTOAAAANTGGGCCCmiAGOOCCTGOOAAAAOACCA^ 

TNTTTNCrraAOAGNCAAAGOTOCCCCCGGAATCTTGCI^^ 

NTGG 

SEO ID NO: 3270 ACGCGGGGGCACAAGTGAAAOCAATGATCGAGACTAANACOGQTATAATCC 
CnSAOACCCAGATraTOACTTGCAATtXJANAGAGACrOGAAGATOOGAAGATQATGGCAOATTAC 
GOCATCAGAAAGGGCAACrTACTCTrcCTCGCATCrrAmn-ATIXKJAGGGTGAC^ 
IGGGGTOTrGGCAGGGGTCAAAAAGCn-ATITCTmAATCTmACTCAACGAACACA^ 
ATGATTTCCCAAAATTAirrOAOAATGAQANaANTAaAGNAAGATTrrGGGTOGGATGGGTNGGAT 
AANTATATTAGCCCAACT^ITANGT^^^mTOAATCCTGACACAATNAAT^AA^ 
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TACTNATGTNGTTACTrGNGACNAANAAAAAAATriTmmJGG>nT«^^ 
GANATGTCCTr<KKXAACCCNGGCCGCTNCTCrcCTAAGGGNNTAOT 
CATTAA>mANTGGATTCGAGNNN0GTNCNAANTrrGNGOAAATrAfWCNTCTAAATT^^ 
NTGNNAATCNGTATCCCCT 

SEQ DO NO: 3271 GGTA Cl 1 111 n 1 11 1 H 1 m i l til U i 1 1 ] 1 N GGGGCTTCCATAACCAATGT 
TG0OCATCXANATCTG0COCTTGAATT«rrTNTACNAAOCCT^^ 
AGTTACNCirAATTTTGACATATCXJGTCTGACTGGNGCCGGATGAACTra^ 
NATCnO0GCTrCACAA0aQOTCT0AO0OCGG<XATaATaCCNANAAGGANATOGCTGC^ 
CCCGCOT 

SEQ ID NO: 3272 A CITl 11111 rrrri i rri 1 1 m m I NGGGAGATCCATGCnTATOAACACAT 
TATKn^rACACATCCGGGOATAGOmGATACACAAaATCXriTAAAOT^ 
ACCAGGTTGTTCAAAACATGAATICTGAAAACATAGTTGGNGTAGCAAATCAATATATC^^ 
AAGAAAAACrrCATCCAAATATTTACATmAAAAAGACAmAAAAAATAOGCT^ 
GTGCrrGAACAAACCATXnXrrGGAATTCAATTCCATI^ 

TACTGATGCrAGAGCAGACCCCIXKX:ATCTGAGGAmAAGA0TTCAACTOTTCT^ 
AAAAAAAAATOACAAATAATAAATTCTACATTTCCOAGGCACTAATTTOAACTTCA 
GGGGCTOATATOTAANGCAAGOTATrTANaAATOAOGANOGCTNACAGCTGTACTCATNGGGG^ 
OCACTOCCTCCTTAATACCTCGOCOGOACACNCrANGGNGAATraCACAAC^ 

SEQ ID NO: 3 273 OOTACACAATOGmATTAAAGOAATOrATOOCCCACATCAACCrAOCAAOG 
ATTCrACrGGTAAACCTTOCCATGO<X:AAA0OAAAAACAAGCAGGACnTGAa7XKK:^^ 
OTOCAOOCAATOOAOAGAGGGCAOAAOOGTOTAGAAGCTGAAGGGOGCTAOAAGCrTACTCCTG 
AOTTTCTTCCTTCrGTCTrCAAATCmACTICTTATGOCQ^ 

AGATOCACrCTrCTAOACroCTCQAOACAQCCAOAGACAGGGOAGGAGGGAAGAAGGAT ^ 

G0AAAGGGATGQCO3G0CAAACATTrA0AGCTAGAAGCCACTACTG0G0CAATGC TAAA0 TrTCT 

GTCTCTAAOCCTAAAAAAGCCAGTGTAGTAGGGCCmATCACTCTTAGT^ 

CTOAAATAAT0AO<X:AGATTTACNCANaCTACANAAATOAAGAAGACCOGGCCrrQTOCrOT 

TAGCAGAATCCTTXriTCTTCCTNTATNCNGCGCTCCT 

SEQ ID NO: 3274 GGTA CriTM TITM 1 1 H rrriTrn ' l IN TTTTITCACACCTrTCCCTAATACTT 
TATTOOTTACCTCTAQGCCTOTOTOCCCCTOGGTOCaCITGOGGOAGGGC^^ 
TAGGTCGAGGCATGAAAAGGCCTTGGCCTATCC^^rcCAGGGNCCCATA^^raNGaAN^^^ 

NCAGGCAChrrNNGCNCGGGCNCAANCIX>(CTATCCGTTAGCCNGCCrAATT^ 
ATTTCTTGCTGNNATa:AC^^TGGGCTTAATC^rraT^^ 

^^^mx>Tac^:mCT0TaTACTOGGAaAAACCAAOGCTTA^m 

TTrrGCTIWTGCNGCTTm^GAN>nmGANAH^L(kANrCT^^ 
CCCATTKrcOGTCANTCNAACTCXACTACCAGGTTAAOCAATT^^ 

SEQ ID NO: 3275 ACOCOOGQACOCrOAOOAGOATCOOCOGCCGOTOAOOGGOAAGCAAOTCTO 
GlOCrCTGATraAAGAAGlXXKKnCIXKKKTIXXAGTOCGGGAATC^ 
CCOGOTCTAAQTrOTAOATTTTATCAACACAAATrrCCrcAGGT^ 
GTCAGATCXIATTGCTGAAAKKKjOGCrrATOTCAOCTTOaWiAATAG 
GATICrrCTrAGTGAArrATOCAGAAOGCOTATCCGTrCTATCAACA^ 
OAATGAGTGTOTGOTTOTCATrAGGOTGOACAAAOAAAAAGGATATATTOATTTOTCA^ 
GA0TITCTa>GAO0AAGCAATCAAATGTOAAGACAAATrcACAAAATCC^^ 
ATTOTCaTCATOTrGCnXWOQTOTrAaAATACACC^ 
CCAAAaGACTOCCTGGGTCnTTOATOACAA0Tarm}GNCGOGACCAOCC^ 
NAACNCACTGOGOG 

SEQ ID NO: 3276 OQTACAACCAAATCm(nx:TOTATAAATCAGCAGATCAAATG AACAA OAA^ 
CTrCCAACAGTOGCAGAAATOTCACCGTAGCTOTGATCTGTT^^ 
OGATAGCITKTOAGACnTrCTCTOGOrrOCACTGTCATCIT^^ 
AAACTGOCATCTTTCCAGGACXTCAOCACmCAATATTTGAaATAACrACAACCAa^ 
TGAACACTrOTATAACCAATCrn^GTTCTIXX>CCACATTATrTAGCT 
TCAOTAOTrACAAG<>AOaT0AaTCO0TATITCT0CACTCOAaTAAAOGTTl^ 
CXIACGCTOATATAAAATOCTGTrGATGCCGAATOAGAAGAACTCGGCCACGAT^ 
GCOCANOGrrGATTCCCTGCrCCOGaAOAOaXSCAGCGCCATGOCCAGOGAC^ 
ACCGCrrrCACTTCGNOGGACAGGNAACCANCCCCGCGTrCTIXKKX^^ 
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OOGGOAATTTC 

SEQ ID NO: 3277 ACTaOGATAAATOAAGAAOAAOGCATAAOGACAATAAACATOQAACrCCAC 
IXKAAATOGATTTTAlXKIAGCTOAOGAAAOTrTGCGCTTATTAaTATTT^^ 
AGTmCTCCATTGCOGACAACGTAACTACCAGCItXriTGGCTCAGTGGTTCGCCTC 
GTirCCACTAGOTTCTOTCATrArrOTrGGCACATAGOCCCT OAAT ACAGOTGATATAGOGC^^ 
ATGAQOGCTCCTCCATTGTGAAACCAAATATAGTATCATTCATTTTCTGOGC^ 
GAGOAAtUCAOAACCAmAGCACAOTGACATTOGTOAAATATGTrTCATTGATrCTCACAOAOT 
AATrOACOGAOATATAT0ATrOTGAOTCAN0AG0TgrCACA<mATAOOCTCATC^ 
GTTTGAAAGTACCTGAAGCAGAOACGCAAGAAAGAGCrTTGTTAATATCCAAGAANGNCI^ 
ATNAAGGOCANGTNAAAACCTCOCTGCAACOTmHJIATTOCTGAATO^ 
OAGCKCTOOT 

SEQ ID NO: 3278 GGTA Cl n i n 11 L 1 1 1 nm ITl 1 1 1 1 1 1 1 I CCATGGATrrATTCCACAGTCA 
AAATAAATCAAAATTTAAAGCThrrAACATTmAAAAGATAAAOGA>LVATITGNGOC^ 
^^^^AACAAAACANACNCCAOTCTAAAGNGCAACACTAAACAOGTNTTNTN^xmtX^ 
ATAAATACNCNCAATTACNCATAANATTTO^CrAAAGATNGGAGATGAGGCAAATAACCXrrTO 
AAmCCTGCCCAACAAATAGAGGCNGGCTACATTAATTTAACATTTTACTGCAAAATG GAA^ 
ATCCCXXjAGGNOACTAACrCAAACTCCTCAmCATOCACATGACCTTGGCri^ 
ATACCCNCATCCAAATCCANAAAOOCrCCTGCACCCCATOCTCAAAAA 
aAOGGCCTNANCACANAC^tKiCAT^AACAA0CCTOGGTTAACCTT^^ 
ATCNGGOCCCAACNCNGA(>AGOCCCGTITriTNAAAANGGTTTAC^^ 
CTTT 

SEQ ID NO: 3279 actgooataaatqaaoaagaaogcataaogacaataaacatgoaactccac 

TGCAAATCOArmATGCAGCroAGOAAAGTrrGGCCrTATTAGTAmGCTCCAGCGAACCT 

AGTTTTCTCCATTGCGGACAAOnAACTACCAOCTCCTrGGCTCAGTGGTTCGCCTCCAC^ 

OrrCCCAOTAOGTTCTGTCATrATTffrrOOCACATACGOCCTO^ 

ATOAGCGCTCCItXATTGTOAAACCAAATATAGTATCATTCATTrTCTOOGCm 

OAGOAAOACAGAACCArrTAGCACAGTGACATTGGTOAAATATOTTTCArrGArrCTCACAGAGT 

AATTOACGOAGATATATOATnrTGAOTCAGGAGGTGTCACAAOTTATAOOCTCATC^ 

TOTrGAAOTTACCrcAA0CAGAAAaX}CAAGAAGAAGrKnTTG^ 

ATCAGGGCAGOTAAAACCTNGOCTrGCACCGTTTCGATTCNCT^ 

OOCCC 

SEQ ID NO: 3280 GGTACTCGTCAATGGGCrCGOTCATATATACCACCTCGAAQCCXXGTrrC^ 
ACTOXnrCACAAAAGCrOAGTTGGCCACCrOCTCITnXrrCTCACCAGT^ 
TTCTOrGTCTCCrrcATGCOAOAAACATACTCTGACAGAaATOTCA 
GTATGATAGCG<:AG<>GCTt:AGACAGG00GCOGaKnTAGTGQAGTXriTTO 
OAOATTTTTAGAGAATGCCTCATAGAATTTCTTGTAATTXritXn^ ^ 
AGCTt:AAGGCACTIOTAACAAT0TTTTTGCGAATGACTTTt>AGAT^^ 
CItX}QGAOATGTTT>GGOGCAGATCCn>aAGTCAACCACA 

GGTATCAACTCATCACAAGCrGTCCATGATGAACACAajGCGGACATAGAGTTOGAGGT^^ 

TTTCTTCTrCTCTCAAAAAGNCAAANGOGGCCCGCCAGGAAT 

OACC 

SEQ ID NO: 328 1 GOTACGCGGOGGGTGCGGAOGTCAOGGACAAGATGGTOCCACOOGTGCAOG 

OCTACyATTACCTAAAACCTCOOGCAaAAGAGOAGAOOAGGATAQCAGCAQAAaAOAAOAAGAA 

0<:>GaAT0AACn3AAACGaATTOCCAGAGAATTXKX:AOAAQAT0ACAGCATATTAAAGT^ 

ACCCTGOTACCCACrCTTTCGACCAOCAGCOGATGAATAAAGCTrCCTGT^^ 

AAAAAAAAA 

SEQ ID NO: 3282 CAOGTACTTACCCTAT C J J J I t 1 J l AATrAACATrCOATTCCATGAGCrTCTrA 
TGTOAAAAAATAAGA 1 1 T J ■ If ITl A GAGAGCAGAAGCAGAACAOTAAAATrTATTCTATAOCTAG 
CAATATTTTmATGCCATCrOTCTCAAATCAAAQAGTCATCATAGTAGOAAATAAC^^ 
GTCATTTGGCATOAGTCTGCATTXXAOTAATIXTrAATr^ 
TAAAACATGCrAaTTCAAAATAAGACIX}CrCAOTTT€CAAGGGrrrTTC ^ 
AAAGGrnnCTAOTCTCTGATTAGCXATQACTOTArroGACTTO 
CIXTTATTCrAAACTAATCTCAmOGOATOTOTAAOTCITrraTAAA^^ 
CAGOACAATTTATTAAGTrTTCTCAGTATTTTCCCAAATATTAGAATATITAC^^ 
aCTG<X:AATGACCXX;ATATGTCTGaGAGAATAGNAGCCTrATCTITATATAATCC^ 
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SEQ ID NO: 3283 GCTACCGACX^TAGAGCAAGAATCAAaATTCTGCTAACTOCTaCACXOCC^ 
OTCCTCTIXXnTrCrOCTAOCCTOOCTAAATCTCKTCATTATTT^ 

AAGACTOATAACKKXXCTACTACACTGGCTnTrrAGGCTrAGAGACAGAAACTnAGW 

CAGTAGTGGCTTCTAGCTCrAAATGTTTCCaxrGCCATCCCTTTCCACAGTATCC^^ 

CCCCTGTCTCTGGCTCTCTCGAGCAGTCTAGAAGAGTGCATCTCrAGCCTAT^^ 

CTrrGG<XATAAGAAGTAAAGATrTCAAOACAGAAGGAAGAAACTCAGGAGTAAGCrTCTAG^ 

CCITCAGCTTCTACACCCTTCTOCCXnxrrC^^ 

OCTTOTTnTtXnrrGOCCATAGOAAGCTn'ACCAGTAAGAAKXrrnKTAGG 

TACATIXXnTrAATAAACATmK3GTACCTGCCCNGG<XX3NCaTrCGAAAGOCGAANTTCCAGC^^ 

ACNG 

SEQ ID NO: 3284 ACTCGOTCAAGTCTGCACGAOACAGCTTCCTGACATTCrcACTC 

ACAGCCrOCGCTAGAGGTGTCaXTCGAATCCTGTGGCATtK^GCrrACrrAAAGACCACATCTCG 

CATAOATGCATCATTCnXXTGCATCTCCCGCAGGATCACATCACO^ 

TCCAGACTACAOTTCTGCACAATGTCACCCAOGAGCTCCACAGC^^ 

CGCCTTGATGTAGTAAGCnXnCTOCnXCGGOTGCTGTAGGCATrAAGATGGOCC^^ 

CACCTCCITCTC«AOGGCACTGCCAGG<XGATTCmOTK;(XT^ 

OTA0CCTOCm:ATTATrCTTCrrcAaTCTCAAAACO0CT^^ 

CCAACTCCOCTGA0ANGACTGCT€GGAOGGCAC>CGCAGCCGTTCTCCAACANOCTAACC^^ 

CTTNCGOCAOOAATOiAACCCCTGACCAAAOGTGCCGTACCTO^^ 

O 

SEQ ID NO; 3285 ACATCTOACTTTATACATTTTACATrrGTAATAAAAAAATCATiGA AAAGTA TG 
CTTTGAACCTGTTOGTAAATTCTAAAAAGGAACTAATTTOAAAGATTTATAAATC^ 
GTOOOCAAACTATTTTTTATTTGAAAATAACAAAATTCTAACrCGAOCAAAGCTGGTTC 
AGOAATAACT0CCTTTTCTAAG<nTrATCAGCTAGQAAGCAGATAGOAAnAATAATTCCA^ 
OAOAAGAAOTTTTATOAAAAOAAATroTAGTtXJTCATGCTGTAOAOTCTQAOCAOC^ 
AGOTTTAAACOGGTCX^TCTCCCCCACAOCAOCCACAACACTGGCAGTCCCCAC^^ 
AGGAGGCCATTOACCACCTGCATGGCGCAAGAOAACCATCTNGATTtXTCCTATO 
TGGAGAAGANGGGTCAQATTOCANGGAACCACATTGAOAGGCrcTajCXnTa^ 
ATCATTGANATAATCCCCGTCOTGOAANGGGTAGCCCCATGTCCT^^ 
OCAA 

SEQ ID NO; 3286 ACTTTCTCCAGAAGOATCAGCrCAATTrGCTGCTX^GAT^ 
ACCATTTGGTATGGAGCAAACTOCGAOCAAOCATmOAACACATGG ATITCXn ^ 
0ACAAGAAAATCAaAGAAT0TAAmATAAQAAAaAATa<X:ATraAATTTTTrAGGGOAAAA^ 
ACAAATTTCTAATTTAGCTOAAOGAAAATCAAG<>AOATGAAAAC<jrAATTTr^ 
ACAAATAAAATOTATTAOTOAATAAATOCmnCrAOATCC ATAT TAATAAACATGA OCAT CTA^ 
CCCTCCrrrcrrAOOCTAGACACCAAGATATITCAGOCAGCCriTATCAT^^ 
TTTCmAAGTATTOGCTGOTX^CTACTATTGAGTTTCTTCC^ 
CTCCOX;AGCTAAAACTriGCATrACTGACTCCCA0CTATAriTC^ 
GAAACAGGGTCTrA^nTGACriX3JACXnTGGCAGCNAACAAGGACm-rGCT 

SEQ ID NO: 3287 OGTACCTGCAGGCCTOCTACACCTACCTCTCTCrGGGCT^ 
GATOATOTGGCrCTOGAAGGCGTOAGCCACTItTrCCGCGAACTaOCX»AG<^ 
CTACOAGanCrCCTGAAGATQCAAAACCAGCGTQGCOOCCOCGClCrCTl^ 
AOOCAOCraAAGATOAOTOGGGTAAAACCCCAOACOCCATaAAAOCraCCATOO 
AAAGCTGAACCAOGCCCTrrKKJATCTrCATGCCCTGGOTI^^ 

TQACirCCnSGAQACTCACTTCCTAGATGAGGAAGTOAAGCnATCAAGAAGATOGi^ 

TG ACX>ACCTCCACAG0CTC0GTGGCm3aAGOCTGGGCItK.QCOAOTAT^^ A AGGCTC 

ACrCTCAAGCACGACTAAAAGCCTTCTOACCCACC OACTT^ 

QGCTrTTONCTAAGCCTCrCCrTCAOOCATrAGOCAOTTTTrm 

AAAGGO 

SEQ ID NO: 3288 GGTACAAAGAAAGTmAAGTCAAGOCCTCACCAATTCCTACAOTATTAGTA 
TTGTmcrCAATTCnt>AAACTAACTmAAAAAOCTTAAACTTAACCTA^ 
AATATAAACTAGAATCAACAAACATGAGAAATAnTCTTrGAATCAGGGAGCTAGCACCT^ 
TrrrCCAAAAAAGCAOGTCTOCCCAOTOTOTrCACTOTGAT^ 

ATACrTAAACTAmAAACTTAGATAACATGACTCTGAAGTATACTACCAAAATGTTAArrGAGAA 
AAGCTGAAAATAGTTTrAGmACrCAmTCACATOCrAGAAOAAAATTriGCATG 
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TaAAQAGGTAATTrmAATCCAOATTTrrcA(>J*ACTCATGCTGCAAAAT^^ 

TCTATTATAACTOCTCTTAATCKTIXn'GOCTGCCTCnXiAAAAT^ 

AACAAAANCaXKXTTCCIXjNCCNGOaXiOCCQTCQAAAGOOTAAT^ 

SEQ ID NO: 3289 ACACTAATTTAOlCAGQACACACTACCATTrACrATOTAACATTAAAGCnTC 
AGTITAAOAAACnXiGTCACCAAraAAGTGACAGOCACCAOGGAAAAAAGCCAACCTGCT^ 
A0ACAAAAGAGG<X}AATGACA0AAQOAA0CrG0ATCrrmATAAT0TAGTTTAOTCACTA 
ATACTGGAACCATCTACCrCTACATTTCXnXSCrATGAGAAATAA^^ 
Tn'AAAAAAAAAAACAAAAOAAAAOAAACTGATCA AAAAO TTTTOCATAAGATTCAT^ 
TCTTTTTTCCrTAAAACAAAAACGATTAAAAACTTGGGTTriCT 

AGTTACCAAOATACACAAACCroAAGTATCnXJGAACCATTTGaCTOiXTOTAaTCT^ 
ACAAAGTACCTCGGNCGCOACCCCCO 

SEQ ID NO: 3290 ACTGTGCGCCAATATGTGAAQTGTrCCACACAACAGGGCACTTGAGAGOAAC 
AGGriTCGGACATGGTOTCTGAAGTCOTAGCJ\AG<XKiTAACCCl'll C^ 
TTAArrraCTQTTGCCAAATCn'ACrmAAACTrcAAGaA^^ 

TATAACATGOAAAGAGAGGAGGTOTGCTOTCAAQAGAATTTTOGC AAAA GCACTGGAGCTCATAG 

CTrATCATTACCAGTTOCCTOAGTCCATGCTGOTX>0TGAQAACAACTTTAA 

AAAGCTTTAOCATACTrCTTGGAACAGATTTrrAGCATGTGCTGTCAAAGT^ 

OOTICICTrCATTrCTAOCCTCCATTACmiAOTATOAAAAAAAAAT ^ 

GCX::ACCAGTC<K>TOTTn3CATIXn"AAAGaAAAATTrGCrrAT0CCTI^^ 

CTAAACAAACrTAAATGCTTAAACAGCANACCTTrCTn^OCCANAAAT^ 

SEQ ID NO: 329 1 ACOCOOGQTGnGCTCAOOCTCAAGTGCAOTGOTOATTCACAGOCaTA^ 
TAGCACAGTATACCCTCGAACTCCTGGGCnXjAOTCATCCTOCrrc 
OaACCAOAOGCATGTOCCACTGCACCrOOCrrAATCTrCCCTTGTTTTrTGT^ 
AOAOOAGCrTrTGTrATOrrrACCX>OGCrO(mn'CAAACTCCCOGCCT 
CAGGCrCCCAAAGTCCTAGGATTACAGOCATGAGCCACTGCACXX^GCCAAAACrrim^ 
CATCTCCAAACAQAOCTAATXrmCCTTrCTOTOCTCCCAAOGACTCAAAT^ 
A0TCIX7IXKriTTCCAGTriTICrCAAAAGA0<>TriTraAGGANGOAC^^ 
GAGAAGTGATCmAATCCACAAOTGGATCCCXLKTTaXXAGCATAGA(XAAC^^ 
TCTGCAAA^^aA^r^CAAAATCAANCACAACTTCACAATCO<:AGACTTGT^^ 

SEQ ID NO: 3292 GCTACTAATAAAACCATCTTTGCAGGAGTCrmAATAACCATATTTTATTTAA 
ATAAATCCTCCAGTAGGAAATGOAOAATTCATTOCCTrrACTTAOAAGaCTATGTAAATAGCrATC 
TGTCAAATTTATATATGCAGCTCACCAATATTCCCTrACTCGAAACA^ 
TCACAGTIXX:CAAAACGACTTTNAAGTCrCA(^TTCCGTOTCrcC^ 
TTa:ACCCA0OCTrTATTAATGQTTCOCTIX>TATC>TCG^^ 
AACATTCATCAATaX7IX:ACT-AGGATCnrGTITC>GTGTCAT^ 
TOCrrTrcAACCraGOOTCAOOTAATCCCACCTnrmOTT^ 
ACTGTATCAAONCnXKjACTTTTTTTGAACTGNCTrCCNCAGAGAT^ 
ACAATCATGGAGGAACTCTTTCCCATTTAGATTCmA(>AAAGANC^^ 
0 

SEQ ID NO: 3293 GCTACCACATTTrTGTATATGTTAACCAAACGGTCAATTGCArmCCCTAAT 
CTCTAACGATOOCAAATOAGGGAGGAAOTCATnaXACAAAGAAGCACATG AAAA CCC^ 
CAATGCrCCTCrC^ACATCAAATGTGAATGGNAGGCTGGCCATrOTO^ 
CACNAAGAACATTAAGCCCAANGAAGATAAACrCTOCrrCT GCAC AAGO^ 
TCATCATGCrTTOCCTTCTTTTCTCTTGOCAAACCTTCAt^ 
ACAAAGACCACATGGTTTGGGCTTGTTTGGTTTOAATrCTTCTCrAAT^ 
CATOTGTGGCWGGCCAAGCATAATOAGATCAOGCATCTOCTCCACATAA^ 
TrGGNOTNATGOTTAOGCrGGGGCTCTriXjCCTTCTAAT GNNAN TCC^ 
CACCAOGAOCACTAOCATCAOATAAAATACCTGCCAAATTTTTCCCCCAOCOCArrri^ 
TCAOOTTT 

SEQ ID NO: 3294 AcrT riT n T ri' im i i ' l i rn 1 1 1 i cggaatoattaaagatgtctttataaag 

TTmTTCAAGACTIX:AnCTAAATACACAGAATAAAAAATGGTGTCAGCTCAC^^ 
AACCANATTTTCCTrATACTOTCTCAAAATTTAAAGATCAATTTCCCC^^ 

NATAAAATGOCCLTl 1 n'LtjAGQATOGOANAGGAAGGOTTOGOCAOOA TGGA ATATrAAATTGTA 
ACATOATAAACmGCAAGACTGOTATCCAATCTAOATAATTTAThrrACATTTTGATGA OT 
AJWAAAGCAATCATTTGIGACAAGCCTAAAAAGCTrOACATATTTAACATACTTANGAACn^^ 
TGTG<XJGNCOGGAATTCICTAATTGTCTCNGGTGGGCCTr^ 
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0TCTGTTGCAACK3NTNCnXKrraAACTrCACATTCX:ACCCT^^^ 

NTTNAGG<nXK>ATNCCmaTrANCCCAAA\AGGAAAANGTrrCCAT^ 

ONTTT 

SEQ ID NO: 3295 GGTACTGGCAGTATTAAGTTTAtCCATCAAGAATGCTTAGTrCAATGOCra 
ACACAOTCXjAAAAOAATACTGTGAATrATGCAAGCACACArrTCKrrmACAOCAATT^ 
AaATATG<XTTCAOGGCTrCC>ATTCAAGACATATTTGCTGaACTG<m 
AATACGATATTGCrrrTCATrATACACmmSOaTrTGCATGQI^^ 
TOCCGCATCrACAAGTXXrrTOmACroaCTXOTOAQCTCACTACTQACOC^^ 
CTGTCAACOGAAAAnTOrrOGCAG Al lUL 1 l GCAGGGTIX]TmOTGGTGACCTOCACACrGTCT 
GCATrCATCAAantKntrrcaTTGAGAGAOCAOATAOTCCAT^ 
AGCATGCTGNCCCACCCGTrCAATOCTOCGOGCATCACCAAAATQAGGCraCACC^ 



SEQ ID NO: 3296 00TACAAAACAGATAAAATTCTrAGAAGATACATX3CAAAAAGCTCTACTAAG 
CAGATGG<X:ACAGAACTAGAGCATrcATAATmACIXXKX}ATGTCAATAGaACTC^ 
CAAACTCAACrraAACTCTWTCTTAGGCTTTOTATrmMrr^ 
CATOATTCAAATCCCraAAGTATTCATTATAGTCAAGOGCATATXXn"ACAAC^^ 
TirCAAATCCAACAAAGTCTOOCTrATATCCAACAmOGTOOOOTa^ 
CGACCnTOACCATCTrTGQATTATACTGCCrOACCAAGGAAAGCAAAGTCT^ 
TGTCAATTATATCTTCCACAATCAAOACATTCnTCCAOTrAAAGTTCAOACATCATCTC 
ACITrTATtmmrrOTTGACTOGTCATTACAATAGCIXnTt>GT^^ 
GAATaGATCTATCCTATTTCTATCAOTGOTGQATCTAATCCANCAGTC^ 

SEQ ID NO: 3297 GOTACACrrGAAACCAAATTTCTAAAACTTOTTm C TTAAAAAATAOTTG^ 
OTAACATTAAACCATAACCTAATCAGTOTOTrCACTATOCTKXACACTAOC^ 
TTCrrcTGOTTTCAAOTCTCAAOGCCTaACAOACAGAAGGOCrnMACA 
CAQTCTTCAOCAACTTOAOAOCTTrC t TCATOTTGTCAAGCAACAG^ 
AAOCATA<MGAC0OTTTCAATATCTTCCAGT0ATATCGGCTCTAACTGTCAW 
AACATAATOCTOGGOACATACTGGOCATCAGOAOAAAOGTOTrraTCAOriUlllCATAAACC^ 
ATTGAGGAGGACAAACTCKTItTGCCAATTTCTGGATriUrriATTTT^ 
GCTTGACTGTGTGGGCACTCATCCAAGTGATGAATAATCATCAAGGGTT^^ 
ATATAGAmCTTCATATOTCTGAGTCCACATGACTTGGTCNCCCAAOCT^ 

SEQ ID NO: 3298 CGTACATCTGACACTATGOCATCTtnTrrTCTAAAAGAWCTOGCA 
TGACrAATACCATTCCACACrGCTCAmAAGAATATCTCTGATCAGATrACTACCACACC^ 
OCCaATX>TCTACAOGCAATATAaAAAG(7ITrnKM3CCAAGTGCGriCr 
OTTAATGCTTCTGTCAAGTXUAOAOAGAGAACAGATTTrcmTrCrOCX^ 
TTCCTAAAOCITACn^TrCTOAGCAOTCTTAGTAAGCTACTATTCnt:^^ 
CATAGGCT0ACCATAAG<>AAGCCCCAGTITTTCCrGAGCCTIXK3C^^ 
CAGCAGGCAGCATXKWAGTrATrATTACTTATrroGQOGGACTAOQOaAAOACTOCAOATC^^ 
ACCATCTAACCATACATTrAACrrAAACAAAAGCGAGCCAACTGACrAATTAAGTCGTCAA^ 
TCTIX7rCAAAANGGarrAACCrrCAAGAATT0TroAA>KXAAACA^ 

SEQ ID NO: 3299 OGTACAOCAOCAACAOCATOOCCmiiAQAAGCAACXXAOGAAAOajCC^ 
AAGOAAAGGOACACTCGCOOCOGCCGGGAGGCTCCATOCCCGGCGaJGAAGACGC^^ 
CCGGAGGAAA0CANAGAAACWJAGAG<XAGCCGAOCAGGGAAGTCCCTTCTGGaTC^ 
GQACOCAACCCCOGCGT 

SEQ ID NO: 3300 ACACTOCCTOGAGOOCCTTCTTGOTGTrAOTOCCnSACI^^ 
TCATAATNaATrNCATTOANCTGCITNOTGACCCAGACIOCATrACrGCT^ 
TTGACCNAAAThmtKKXnATOTGGCATATNTCACTANAa^TAT^^ 
OCCACCTrCIt:AAT^WAmTGACT^>^ACACTAT^mJGCTCC^^ 
TaNATGATCCATOTATCACCAATTAAATOTIt>T0GAOCCTGAAGGGTCCAGGAC>^ 
ACTCCreATCECCTOGOCCGTGCCCATCCTCAGACATCGACT^^ 
OACAGGAAAGCTTCOGCCAOCTNTrGAOGGOTaATNaTACXTCNONCGCOACC^ 

SEQ ID NO: 3301 OTNKrTGCItXTCACACNAhmxnTCTCrTCATAATCNACT 
CTTGGGATCCC^GCrCTITCGCATAGNAOCTCACAGANrTACTIC^ 
CTTAOCAAOTACX3AN>UaCTTICC»GOCNAGAATaAA0CCCAT^ 
GCCTTGGNOTCATCCAACGTOrArroAAATTTMAGCATCnCATCT^ 
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OTNATCaTNTTCTTrCAGAATGCCTGCnmAANCATCNTGCA 
0ACGCrGAATTAACTOG>nTCTCTCT<XTOGAAGCTCnW^ 

OATNOAOGGATCCKAhrrAATmCOTATTTGCACmTJTAATOOCATGGCTCAGNGC^^ 

ACG<XiGCACTrGTACTG<rmAACNrOAGNGANACOAGTGNNCATTGTTAACTTGOTATC^ 

ONCATATTrOTTCCCn-ACrraACXX:NAANCACTACANATTTC^^ 

SEQ ID NO: 3302 ACGCG<X3GGCQGTTCTGAOGACrOGGTTTGGGTGCAGACX3TTaTTGCTTGGO 
CGCTIxrTtXXXnXKX7rOTAGGTCAAOG<XK}CrrCCraACraA 
AAATACrCA0<>TTACAC0<XAA0CCCAAT00ACraATCCTIOu^ 

AC0AAGGCAGAACATmGATCATGTCATGTm;GCATG00ATrATrAGCTQTCXnt3AGGTCAAAA 

CACACAAAATCCACTATAGGAGTCATOGTAACAGCGTCCCACAATCCTOAGCAAGACAATOaTGT 

AAAATTGOTraAKXnrrOGOTOAAATOTTGGCACCATCXrilOOGAOGAACA^ 

AAATGCTGAGGAACAAGATATOCAGAGAGTGCTTATrGACATCAGCGAGAAAGA AGCim OAATC 

TOCXACAAOATOOCTITOTAaTTATTOCTAAAAATCCACGCC^ 

TCTAATAAATGGGO'rQACrGrrCTAlXjAGGCNAOTChrrTAAlATNGNTrGT 

SEQ ID NO: 3303 ACATAGGAAAATATGATTGAAOCATTCrCAAATACATOOCTCrOAGG GTAG A 
TTACTTtmTTTTATCAmAATtXATATACAATGCAGCAAATAOATACATTAATTATAAC^ 
CTTCAATTAATAGTTTATTCACTGTCATAGAAGCrcCATGAAGGGGCATCACITAAGA^ 
AOOTTAOAGTCAAOCAAATTCrraAACACTrOOTCAATOOCAC^^ 
AATGAGTAAATAACCCANAACCTljATATATCTCAGATCAAGACAATCTTAAGCACW 
TTTGAOGCAAGTOGCACCGTAC 

SEQ ID NO: 3304 cccaagcatctaotctogaactoacaoaoataagtagagaaaatgttccaaa 

OTCTOGCACXSCCCCAGCrrAOCCTGCCATTCPCTGCAAGGTTG AACA CC^ 

AACn3T0GrrcGTrAAAGCAGAAGTGAATGCAAATGCX::AAAA0C^ 

GAAAATt^AAOOATATTGCTCTACACTrOAACCCACOCCTQAATArrAAAGCATnm 

CTTTTCTTCAGOAGTCCTGGOGAGAAGAACAGAGAAATATTACCTCTr^ 

TQTACC 

SEQ ID NO: 3305 ACAmAAAATTAOTCCOC mATO CATTITACTCTACATGTGTTATCCTrG^ 
AAGAAAAAGACTCACATCTTTGAOAGCAAOAGTTrrroTCTTAITCACCTCri^ 
ATCOTOCTTTGCVCATTOaAWTrCAAAAAATaTTATACAAOATTO 

TTGTOTGAGOGTrA0GTGCTGAGGCT0AOAAGTGTATQAGGGAGAC(nt3AGATTAAACCTGCCAC 
ATAAAGTCGAOAGAAOTAGCAAGGTCAGGOCrATGAAATAATCCCAAAAACTITAGAATTTCT^ 
ATAATACACTrcACACTACTATTCTCAATAOAGCTGCirrCAGTCTC^AAOO^^ 



NCATATCATmACTAACTCCCATW:AATTTC:ACaTrCCTnCT 
SEQ ID NO: 3306 ACATCCATGTGCCCAAAATCATCAAGCCTOTCCTGACACAGGAGT^ 

crracattgcaoaagagta7tcao0cct0cocag<x:agoatagcatga0ctcagac^ 

catctx;cagttacagcccgaacactcgaaactctgaticgactgcccacagcc^ 

cocatgagcaaqactotogacctocaggatocagaocaagctgtggagtrgct*^ 

ctttaagaagcntctggagaaggagaagaaaajtaaoaagcgaagtgaggatoaatcagagaca 

qaagatoaacmggaqaaaaoccaagaggaocaggaocagaagaggaagagaagoaagactcnc 

cagccagatxkxaaagatgoggattcatacgacccctatgacttcagtoacacagaggaot 

gcctcaagtacctogooogcgaccc<xntangg<xjaattcaaccacroangoc^^ 

TNCNACrCOGOCCCACrmWGQTAANCATNQONATAACTOOTrcTt^^ 

SEQ ID NO: 3307 gotacaaaaaagaaaagaaaaaaaatcaacccacaaagcttctaaaaaagg 

AACOCOCAOOCACTTCCTCTIXrroOAATGmAAAAAGTTAGCCrACTAAAQAAj^ 
CTTOT0AAOOTTTT00AaAAATATOTATCA<nTCXmTrAriTGGOTATTCA^ 
ATAATGCTGACTCCATGGCnCrGAOCCCAaAATTGACCCTOC^ 
TTGATTTTTGTAGCCACOATTGTTTCCTCGTCCTCT^ 

ArnxACCTcraTTcrrAGTTcxxTatrrmAOTAACTA 

AGGGmroOATAGCCmTTrCACCACTncrrcACCAC^ 
TATTATATOCCCACOAATCCAAGOGCnXTNCTCTOOAAA^ 
CTCTTTGGCCCCXXTnNTTAANTGGGTriTOCCaji 1 1111111 

SEQ ID NO: 3308 ACACAATGGTTrATrAAAOQAATOTATOGCCCACATCAACCTAGCAAGGATT 
CTACnXJOTAAAOCTTCCCATaGCCAAAGQAAAAACAAOCAOOAOTTGAGTOGCTOGG 




CTrAAOTAATCTTGTCAl 



^OAAAATCATTACCTrrCCTTTAAAAATCATTr 
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CAOGCAATCOAGAaAGGGCAGAAOGGTGTAOAAGCTGAAOGGGGCTAGAAGCrrACnXTGAGT 
TTUTTCXTTCTGTCTTCAAATCTTTACTTCnTATGGCCAAAGACrc 

TCCACIXnTCTAaACTXKrrCGAQACAGCCAGAOACAGGQGAGGAGOGAAGAAOOATAC^ 

AAGGCATCK3<XKKX}CAAACAmAGAGCTAGAAGCCACrACTGGGTCAAT^ 

TCTAAGarrAAAAAAGCCAOTOTAOTANOGOCCTTATCACTCTTAOTTTQCTAOGTr^^ 

AAATAATGAGCCANAmAACa^CGCTACCANAAAAGAAAAGOACCGGGCTCnx:AA^ 

AAAACrT0AATCTrGCTrATTGGCX3GGNCrrrGOCCraACCCCT 

SEQ ID NO: 3309 OOTAOOCOOOaOCTOATOTGOCAAAOATCKJTCTCX^CATaXXXn^^ 
GOCCGCACIXrrGCCTCX>TCnmXAOGCAGCCACAGTC^ 

OA<nTCCTCCTGAT00CTGACmAAAAOTCAAAAGACJ\TGOA0ACAACCA00TATAOACACA^ 

CACCTAGAAACAAGCAGTCACACAAAAATTGGTAOGCGGGOGCCGTCGGAGCCCT^ 

CTCTCTTGTAGCTTCrCTCAGCCTAGCCCAGCATCACTATCGTGGACGCm ^ 

AGCTAGTGGACAOCAAOAATTTCOATGACTACATGAAOTCACTCOOTGTOOOTTTTOCT^ 

CAGGTCGCX^GCATOACCAAGCCTACCAaUiiTCATCGAAAAGAATaGGGAC^ 

AACACACAGCACCTrCAAOAACACAGAOATCAACmAAATTNOOaONGOAOrrcOATOAGAC^ 

^mCNGATaACAGGAAGGCAA^^^CCATTNGACACTTGGATGGNaGGAACTTOTTAOT 

SBQ ID NO: 3310 OGTACTATGACTOAAAGATTCTTCATGOCTAAAAAGCrCTGCATCAAACT^ 
ATTCAGGAOOCCTITCCCTCCCCACCACCATCAACCTXTr^ 
TCTCAACACAGTATGTCTGOGGCTAGATTTCAAAACCCACGTAATGAAAAAG 
CTA A ITn U riU ri ' l 1 1 1 1 IT ATATCAATrAAajnAAAAATTGCATCAACTATITAATTCA TGAGG 
ATtnTn:ATATTAAAATTrAACCTTAAGATn>ACCGCCATOTlKr^ 
AGAQACCTCTGAGCTCACnrrACATGCraGTGOCTACnKXXnTAA 
AAACAOCCCTQAATITraAAATtn'AOCCTAATTTaOaAriCraCAAC^^ 

CTCAAATCAAAACTA AAAAA AACTrAACCA GCCCTTA CmAAA AOCnTAACCAGCCATGOCATr 
TTTGAACTACAGGGTOGGGOGnTril'ITlACCCACCTNirtili 

SEQ ID NO: 33 1 1 OCOGNNOCCCOONCGNGGNCCCNGOOCANTCTTANGACGGGaKn^TGGAG 
CaWCCOCATGCAGCOTGNGTCGGNNGNCTArrcTGGANAACTAGNCCTOGAC^ 
ATQANGCraANAOGAATANCAA<K}CrrATCrCTAQGATCCATAAAAOOGACCCT 
CACATGGGGACCCAGOCnX5CCAAAGCATTGCTGCTCACACTAGATAA(^ 
GTCChUGAGCTATrrCCCGCACCAATGAGAATGACCCOGCCAAGCATGQGGATCANWACGA GOG 
NCANCACTACAACATCTNCCCCCATGATTTGGAGACTGNATTTCCCCATOGC^ 
CTGATGCANGNGAAGACATTCAGTGAAGCTTGCCTGATGOTOAGGAAAC^ 
OCATTACCTOAAAAANACCyvATtn'ACGCn'ATXX^CTATCCATATC^^ 
GAACAGOAAAAACCrrrAACACCTCANCATOGNATOATITCTrOCGa^^ 
GAGCNACATATTCACA 

SEQ ID NO: 3312 OQTACACAOOCTOCTACCCAAOTTaTICTaAATOTIOT 

CATGTrrAOAAAGTtK)ACTCAGAGTAGCmXXJAAGACnxrrOGGCrCTCAACATG^ 

CTCIXKKTTOATOCTGaAAGTAOATACaAAAATGAGAAOAACAATOQAACAGC^ 

GCATATGGCrrTCAAGGGCACCAAGAAGAGATCCCAGTTAGAKn'GGAACrrGAGAT^ 

TGGOTOCTCATCTCAATGCCTATACCTCCAGACAGCAGACTGTA 

AAGACntJCCAAGAOCTOTAOAAATTCnXXn-GATATAATACAAAACAGCACAT^^ 

GAGATTGAACGTCAGCCOTXMAGTAATCCTrAGAOAOATGCCAGGAAGTTQAAAOC^ 

AQAAGrTOTTTTTGATTATCITCATGCCACAOCTTATO^AAATACT^^ 

GGACCAACrGAAAATATCAAATCTATTAAGTCGTAAGGGCTTAATOOATrATATAACCNCACAT 

SEQ ID NO: 33 13 GGTACATGGCCACAOATCATCAAAOCAACATTTmGTOAAATCATGA ACAA 
OATaAA0TAAAGCTGQAOOT0AaTrroaAOCA0CTGTCATAACAAGACACKnQO<XT 
ITCACGTOGTCTTCCACTCCAOAAAGACGAATTOAATGCTGCAGTGCAT^ 
TCTGTAGAOOATCCCCAATrCACATCTGOTTTrmiTAGGTAACATAAATATACAGC^ 
ATCACATATGTTAOCAATOCAOCCCACCAtmAATGACGAACATTACTATG^ 
TCCAAOAAGTOATATCCACATGTTGTAGTATTTCAATOCAGGACGCCATC^^ 
TOATGCATOGAATACTGAAAAATTGATCAATGCATATOATGCAAGOAAOAAOTTTGAOATAATTA 
TmGOGAGCACTCACTTAGGGATGCTAATGCTOAAQAAAGAGTGGCroAAAATATACCnXjC^ 
AATTAtnGGTOTAAATCCTQACCCXn-ACTX^ATTACrCXn-AAATCrAAC^^ 

SEQ ID NO: 3314 ACTAAAAATACAAAAATTAGCTGGGTGTCOTGOCGTtKXCCTGTAOACCCAG 
CTACTAOOGAGGCTGAGACAGOAGAATTOCITGAGCCCGGGAGOTGGAOaTIXrrACTG^ 
ATCOCACracrGCACTCTAOCCnX)GCAACAAAGTGAGACT^ 
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AAAAAAAAAGTACC 

SEQ ID NO: 33 1 5 ACCTCCCAAATGOATXXACGAG0CCAAAOCTCTCKXjAG<>CACA^ 
GACAAG<>CTTAOACK30CCTrroCTTArrTAA00CTGAOCACTaGAACCOCT^^ 
ATCCGAOVCTTACCTTCTQATGCCATCATTAATGAGAACrATOACrACCTGAAOCKKJT^^ 
OACCTOGCACCTCCAGAGCGCAGCAGCCTAATTCAGOATTGGGAAACATC^ 
OACrATATrAOAaTCATrGAAATtKnm)CCATATACAGCA0GTtX)ATTGCrCAGCT 
GAGCACnTACACArCAAAO^OACTrCACTOTGCAOTOGGATAQAGCAGATTCAGTOTTACACTOC 
TAAAGATaDCCTCGCTCAOTCAGACATOOCCAAACGTCrrAOCCAACCT^^ 
AGTCT0<>TO^TCTCCTCATAGAACCTItXiACTNAACACC^ 
CirCGmTCCCACATTGGGCCCNTnXXATGCCTGAGOACTATTNC^ 

SEQ ID NO: 33 1 6 A CH 1 1 1 1 L Ul U 1 L 1 U 111 1 1 H L 1 l AGGTrGCAOGGCnTATATITCAQCA 
ACAGTCATACAGAGCCACACAGCAGGGGCACCCACAGCrGCCTCACGAGGNGOCAOCGOCWKX 
TCrrTTGCAGCirnxriTmriTGCAGCATCCT^^ 
TGTAGGCCCCCAG<rrCCTGATAOCG<TrCtnt}aTAOOCCTTGOC^ 

AATorraATAATrrcrroGNCOAcrrTOTACTtxxnci^^ 

ATGA7Xnx;CTCXTCCITOCACrCAOTGATQTCT0GCACGCGGNGGTACC^ 

SEQ ID NO: 33 1 7 OGTACCGACCATAGAGCAAGAATCAAGATTCrGCrAAaXXTCCACAGCOCC 
OTCCTCrrCCTITCTOCTAGarnXKn-AAATCTCCT^ 

AAGAOTGATAAGGGCCCTACrACACrGG C ' l 1 1 1 J 1 AGGCTrAGAGACAGAAACTTTAOCATrOGCC 

CAOTA(mK}CnCrAOCrCTAAATOTTlXKX0Oa<XAT^^ 

CCCCTGTtntnXJGCTGTCTCGAGCAOTCrAGAAGAOTQCATC^^ 

CTTTGGCCATAAGAAGTAAAGATnX3AAGA(>OAANOGAA0AAACTCAOGA0TAAA0Cm^ 

CCaXTTAAGCTTCTACACCCTTrnKKXCirrrTCAT^^ 

TGGTTTGTTTTCCTTNGCCTNGGAAGGTTACCAGANAATCCTG>n'AGG^ 

OTAANAACCATOOOCCCrnTrilTl'irrnTnNOOOrrTATOGTTAAACATGOAACNCCATGAG 
ATTCCCrTNAAACTCCCCCTCATTCrrTCC^ 

SEQ ID NO; 33 1 8 ACTni- m ' rrn Tni Tl T rn - r r r OGANGAAAAAGAGCCTAAACGCTTNTGA 
TrKKKUTAAAGAAAAAGGAGCATTAACCTTCACTATOTCTTTAGCr^ 
TAAATTCCTGGGCAGGTGGGGGAGGGCTAGTCACGGAACGAAACTGTAAGCCGGACCAGGTO 
AOGAGOGGAOOTGATAAAAAOATTACAOOOTOGAGGAGTGQAGCCTOAOCAAGAATTGGOACCT 
ANCTTOOCmGaAGAGOAGGGGAGAGOTCAAATGGOTTrGTAGAAAAGGAAGATTANACC^ 
CACCAACNCCTGG0GTrGGGACroAGGGGACAGGKKK3GAG0AAAAAAAGGAAAAmGGAA(^ 
AGTIWXTTGaGCOCAAAANCNAGGAGGGOCTGGTNNNNTAAAAAAATGCCTGGACT^ 
CC^AAACAT^TGCCCT^T^^TGACAAAAAT^mTAAGGT^mTTGG^ 

ocTrnrraocchrrrrAAANCCNTGOCAAATTTTTrrGOG 

CXTTTAOOTTTAAGGCCGGNOTAATTrAAAAGGTrTAATTTTreACCar 

SEQ ID NO: 33 1 9 OGTACCCrTTGGATrrCAAACAGTAACATCGGATGTAAACAAAClTAGT^ 
TTTACTCACTOAAACTAATCAAGCaGCrCTACOTAaACAAATCTCTGAATCmCT^ 
TCAGCTCrACGAAGAGAOCCTATGCAAAGGAATrGGAAACIGTrOACrrCAAAGATAAAT^^ 
GAAACOAAAOGTCAGATXyLACAACTCyLATrAAGGATCTXytCAOATOOC^ 
AGCTGACAACAOTGTGAACGACCAGAOCAAAATCCTrOTGGQOTAATGCTGCCTACTITGTTO^ 
AAGGTGGGATGAAGAAAATTIXCrGGAATCAAAAACAAAAAAATOGTOCTr^ 
AACAOAACCCCAACCCNTGGCAAATGATCACNTTGGNGGCNCCU ITllN MIGGGAACNATNGCC 
GG^mx:A^raOTAAAANNAAAAGNCTTCC^^TmAAAANAAC^^TTTAACCTO 
CCKANGOTNOTOGNGGOTrAAGlXXCAOaTrGOGNAAAATTaAAAACC^^ 
(XCAGGGGATTATrXXCCNOCNTOGCCAAGOCCAAOGNAAAATTTCCT^^ 
AAAAAAATGGNC 

SEQ ID NO: 3320 ACrCOOTaAAGTCTAGOOATAOOAAaATOGTTGOCGACGTGAOMGGGaX^ 
OGOrrATGCCTCCACCGCCAAGTGCCn^AACATXTKKJOCCCriXWT^^ 
TCTGCTCATCATCATOa^GTOTrcGTOOTCCAGGCCC^ 
CCAOOAGCTCTGCCCOTOACCTOTATCCCACOTACAGAAAGriT^ 
CAGTG0GOCACCACCACC0ax:0CCGCCX:ATCAGCCCTCC0mCT0GAATC^ 
CACATTTCCAACTNCntnT^CIGCCCTANAAGACAGAAAAGAAOACTOOGGTATTC^^ 
GCXXAAAAAATGAAAAAGGCCCTTTICTAAAAACroGTTCAAATCAAT^^ 
GCANGOTTmACACATrACT00a::AAGGOCTaACCAACCX;crmQTN 
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AAAACANQATTGNTTGANGGGACAAAAOCrmKjOTAAAAAOCCNTTGCN^ 
TAOCTTOOTATTCCACCOOAATGNCTTaAiKiATCCATGOOAATTimT^^ 

SEQ ID NO: 3321 ACOCaGOGOAAACGGAAaTOAGCGQCGGGOTCGACTOAOOGTAACGGGOCA 
GAGAGGCTGTTCGCAOAOCIXXXKJAAGATGAATGCCAGAGOACTTGGATC^ 
OTATTCCAGTTACTaAACTrrCX}OCAAOTOOACCTnTGAAAQTC^ 
i nU C rigt trroAAAAATGAACTrnXKXTAGTCATCCtXrr^ 
CAACXAAOATAAAATGAATITnCCACA(nXiAQAAACATrcAOGGTCTATn^ 
ACAOATGGAATTCAAOGCA0TGCACCAGGGT^CA0OOT^^*TCCATlTLT^TC^ 
TTCCrroCA T OT l TGCANGGOOTAATCATGAAAAACTATrGGAmAAGGATm^ 
rnt:CCAAAa:OAATCCTnjGAAAANCNCCCTTGTGOGGGGAANA TAAACTM GGTTA 
ANOGOCTTGTTATTGOAAAOCAAGGOTOGCTWmJGOTATTAGCATm^ 
CCCrrAA0O00AATrCCACCCXXTT<KKH3CCaTCnTrNGaATCCACCTC^^ 

SEQ ID NO: 3322 ACTGCAOOCntKiATrACAGAGACCnXlTCICn'AAAAAAACAAAACCAAACT 
TGACICAOACTCCCACrCTrrG(XATTCCCITCTCT^^ 

AACATOACAOaTOaATCCATCTAAAOCTOCAOCACAGCCTATAOAATATTX^AT^^ 

CaXAAAACTAOTAATACAACTTOOGATAGGAAAAACAAAAAAQAGGCCAGOAGCTQT^^ 

CACACATAATCCCAGCACITraoOAGG<yiAAAGGNOGCTGGAT0GCTTGAGOT 

AAAAAAATTCGTAAAAATAAAAAAAAATAOGCANOTCOGGNGGhnTATOCXrroTAATCT 

CTTTTOOAAGWrrAAOONGGAAAAATGfrrTNANOCAGAAGGGTOCTTNOGGNT^ 

AACWfNCmCKKX>CATAATQAQAhn™ATmTrT0CCX3AAAACAA^ 

GGT^KKJaGG^^KAACCCTNGACTCCCC^X^TAAAAGGTNAAATTC 

TAAACTGAATNTGCCCnKGATCCCNCTraNGAAAANCAAAACCT^^ 

SEQ ID NO: 3323 ACTOArKX»^CrrCA0Cim::AOTCCAAA0CXKn'AA^ 
OTOCroGCATTCCAGAACACAa^GGCACTOTCrACGTGCrGCAOT 
TCCTCTGTGACGATGACATCCOTOTGGOAGCrOCCATACTTOTGOAT^^ 
ACOrrGTCCACTACrrCAATGCATAATTCCAGGTOCCCATAC^ 
OAGGGCCrOAAGOTCACATAOGAGGCAAATTTGC<X:CCTGCATOAATTIT^ 
AGCATATCAAtmTCTOOCAAATAATGOTGGTtXTGANCAAATOC^^ 

CAANN>riTraAO<iOGGTTG>nT>mT^N>nTrAAAGhrrm 

TOOCrnXXMAATCCAOn-ACrrOTOACAAATCXnTOCOTNO^ 

^WCTTTTTGOANGT^^TTACACITGGGANAACCXXT^IGGAANGNCNAAAT^^I^ 

NGGNAAAAATmACl-mUTITGNNKTACAATGGCCGGCCTCNTTATrN 

SEQ ID NO: 3324 OOTACCTACATCAOATCTAACCrraATCCCAOCAATGTGGATrCCC^^ 
CGCTGCCCAGGCCAGCCAGGCCCTCTCAOGATGTOAOATCTCTATT^ 
TGCTICrOGCAGTITQTCAGTGAGGACrCATCIXnTACCCAGAT^ 
0T0GCmGGCCTT<XCTTOGCATCCX>AOAAG<>CTCAGTGCCCTTAC^ 
AGGAGACTGTGCTOGCAACAGTa:AGGCTCTGCAaACAOCATCC^ 

TmXAGTrTOAAAAAGOACTTIGGAACCACWANCGTTrrTTGNGGOTrOCC^ 
OKrcATCTGGGGAC^GACCCITC^^^^TAAOGGCGATCAGOC^mX:ACTTO 
ACCAGAAAAAACTTTOOTCCITntXJ^AANCCn'AANOGGGCXnTrGA AN^ 
TOAAANCnTNCCCCmCNCATrTGGGTNOOCCTGNGGGTmnTrc^ 

SEQ ID NO: 3325 COAOOTACrCAOCOCCAGCATCXKXXX^CnOATrrroaAOGGAT^^ 
TGGAAGATGCTGATGGGATITCCATrGATGACAAGCTr(XCGTTCTCAGCCTro 
AATTTOCCATCGGTCGAATCATATTGGAACATGTAAACCATGTAGTTG^ 
ATTGATGGCAACAATATCCACTTTACCAGAGTTAAAAGCAOaCCTGCTGAOCAGGC^^ 
CNNAACCAATTCOrrcACrCC»ACCTICACCTI«XX>T^^ 

CQACOCAAAA0AAaAT0C0OCraACTaTC0AACAN0AGOAQCAaAGAOCCaX}C^ 

1 lITi ri ri4 [ ri rT L OGCCTATGAAATGOGCtnTrCAAGGNCAAAAAAAANCNaAAAAANCCCAA 

OCCCCTrrOAATTrTTN(>ACAAACTTOAGOACITACCAATANAmT^ 

AAATCCCAAACTTOANAAAACTrnXXntlAAANNCCTrTAATr^^ 

CCAANAN0CNAATTTTTGGGGTriX3GAACCrCTCCCCAG0AOT^ 

AGrmCOCCCTG 

SEQ ID NO: 3326 ACAGTCTTTCATTAAATAAGAATACTTACACATACATTTrCAGATATTTCTAC 
CTTCCTOTATOTGTrTCOAATTOTATGTAGGTAGCCACTOAAAGAATTn^^ 
GOCAOTOGAAOTCCATGAAGTAAAGAOCATIXnTTAAAAAGCAOATTTaATrOCATACCrrriAQT 
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TATTrGAaATrCTGAGAATrCTOATAAACXXX:AAA0CAGAAAGATrCXTTATA0CXnTX3a^ 

GOAAAOOTOAGOOAAATATTraAAGCAGOOTCAGAACATCCACTAAOAACATAOCACCTC^ 

GAGCTTACATTATAOTXXXAGGGTAGAAGTTATTACTOAATAGCrrAGOATmTGAACATTAACCT 

TCCrACACGAGTAGTAA(>ACTaAmGGNGACX:ATGATrGGOCACCCnTAAT0TAACTGCAACT 

AAANAACAATTATTGG>rrTTOCCATATACnXnXrrAGONAGTGATNGGAOAAGf^ ^ 

TGNTTTAANAAAATCrrOATOOCCTCATITrOOATCAACATITT^ 

SEQ ID KO: 3327 ACOCOOGGOCOTCTItnTCTrOCCroaTOTCOOTOOTTAOrn^ 
TQTTGGOACTGCTOATAGOAAOATGTCTTCAOOAAATGCTAAAATT^GCACXX^ 
CAAAGa^CAOCTOrrATGCCAGATOGTCAGTTrAAAGATATCAGCCTXn^ 
AATATOTTOTOTTCTrcrrn-AOCCrCrraACT^ 

AGTGATAGGGCAGAAOAATTTAAGAAACTXIACTGCCAAGTGATrOGTOCTrCTGT^ 

TCTOTCATCTAGCATQGOTCAATACACCTAAGAAACAAGGAGCUCrOGGACXXATGAACATTC^ 

TTGGTTTCAAACCCGAANCNCACCATrGTrCAGGATTATGGOGrrCTTAAAG<^^ 

TCGTrCAOGOOOnTn-ANrCArrOATOATAAGOOT^nTCrrCOOT 

TTGTrGGOCGTTTTTGOGTTANAACmANAATrANTTAQGCCCITCCATTT 

AAAGTGNCCXACTIXKnrGNAACNCTXjCAGGQNACCnTAAACCCTGArrC^^ 

SEQ ID NO: 3328 AaKXJGGOAOTOGACACCATOCATrcTOCAAOCCAaxrraoaGTGCAQ^ 
GCrAGACATGOGACGGCOAOACGCCCAGCTOCrGGCAOCGCTCCTCGT^^ 
TGOCGGOOAGTGAOAAACCCTAGATGOGGmCOACATGTTOCCCAOOATOOTCTTOAA 
AGCTCAAGCAATCroCCCQCCrCGGCTTOCTAAAOTGOXKiGATTATAGCC^^ 
AOGCTGAGOCCCCATAACAGGACmACTGCGGCrnXXTGGAATCACCAOTG 
CAATOOATOCTOTrtXXJACTCCAGTaTCACTOQOOTmxrrQaTOT^ 
OAGTAOGATCAGTTKXnXATGGAGOTXirrcAAAACroAAAAACTKKK^ 
CCC0AGGAATGCNCTTmXIAAOTQaXJ^mTCCAACTTATni^ 
Q^AAGCTraNGAAAAACTGCC^mCTTAAAAAAC^^T^□OTrCX:AN 
GGGrrcCNAAACCAAAAAAAAACTTNCCCTrATNACNfrrAT>mTATrOA^ 

SEQ ID NO: 3329 CAAQACACrACGGGAACAGTTTGCCTCXCTCXX^OCCTDtAC^ 
CATOCnMOGCTGATGTOGGCTAGTAAGACTCCAOTTCrTAOAGGCOCT^ 
TGG<nCATCCriTAGOATACTTCrmAA0TGGGAQTCTCAGGCAA(^ 
TirrOTITOTTTTrraAAACAOOATCTTOCTCrOT^ 
GCCCAGTGCA0CCTCOACCACCTmGCTX>AG(::AATCCrCCCA 
ATGAACANOCGTGAGCCACAAGCTTCCAACCTAOOCOCTTAATCrTi^^ 
AAAAOOOTNGGCANTTTAAOCrAACCCTGGGTrAAAAAAGTTTrAGGOGCCGhrrTCT^ 
AAAGGGGGGTrrrGGGAAGGTTNTONGGCCCAAACAAAAACCIGCTr^^ 
AAAOCCrnTTAACCCCCTOOOCCGAATNTACACTGGAAGCCAACT^^ 
OCCrrnrGGTroGOGGGGAAOGGrTAAAAAANCCCNAATACCCCnTrm 

SEQ ID KG: 3330 ACGOGGGGOGTCGCXKKXXIAGGCnTGGCAGCIXXKKIACTOAGT^^ 
TCAGCATGArrCTTCAaAGGCTCTTCAGOTTCTCCtC^ 

GCGGAGGAACATTGoTOTTACAGCAOTGOCATrrAATAAGGAACTTGATCCTATACAaAA^ 
TimGaACAAOATTAGAQAATACAAATCrAAOCOACAGACATCIXK}AGGACCrOTTOATGCTAGT 
. TCAGAGTATCAGCAAGAOCTOOAGAGGOAGCTTmAAGCTtAAOCAAATOTrrGO^ 
CATGAATACATTTOCCACCTTCAAATTIGAAOATCCXAAAriTGAAGTC^ 
C^GAAAAAATAAAOTAAAATAAATT^GOTA^IT^OGTCACGG^r^AANTTGTNaT 
ACCCTAAGGGCGAATTCCANACACrnXXXXKXOTTATTATNGOATOCNACItXi 
OONAATATGOCATA C 1 10 Ml U 1 ] GGKOAAAATOOTATCCOTTACCAATrCCCCACAAATACAANC 
OGAAGCATAAANQTAAAACCNGGOGGCCTAATOAOGGGCTACCTCXAATrAATGNGTTGGCIN 

SEQ ID NO: 3 33 1 GQTACCCOCACOATTACACAGCnTOAACAAOTGAAACTATGGTrACCAAAC 
AOCTATTmATTATAOCATCTACACrrOTCTGGAAAAGGATGTAACAATAAATAACTGTAGATGCA 
TCACOAGACATCCArrrACTTAATCACAGAAGTOOATCTTGCTACATAOTTTTCTCA^ 
TCTTCAQATrcrCCAGOCnAOQAATOTCATCAATCTrCAAAATCAT^ 
GAOATATCTGTTGCTTTrTGCCAATCAAOGTTTCTATOACATOCTGT^^ 

ATAAOTCTGGATCGGATTCATGCCACTXjOTTTTCAOAAhUGGGCC^ 
CGTCGOCAAACCCTCTrATG00ATACTXKnTCTAAAGGNGGGG<>^ 
ACTGOCCAOOOCCNAAOGATATmAACAAOCCCCrraTCCTTACyLCC^ 
ATGAAGOTTtXXKKrrTGAOCCXX:AAACCTriTGCAANOGGA(^^ 
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AATTCCCCOCTAANAAAAA 

SEQ ID NO: 33 32 CGAGOTACCCCAGGTATCTGCrAOAAAOAGAAOAGAACTAGGGACACCTOC 
CAOXrrCITCCTAGATCAACXnXJATAACATTOCTCACATG^^ 
CATQTCAAGTACACTTGAAACCAAATTICTAAAACATOTrrTTCTTAA^ 
TAAACCATAACCTAATCAGTGTGTTCACTATGCmXACACTAGCCAGTCTTCTCAC^ 
OTnCAAOTCrCAAOaCCTOACAGACAOAACKKKTrOOAOA I H 1 Ul 1 U 1 1 1 ACAATTCAOTCrrC 
AGCAACTTGAGAAGCITTTTTCATOTTGTCAAGCAOCAOAGCTOT^ 
GAAAACAATTGCAATATCTTOIAGTOGATATCGOTTrrACCNCrrCANAAG^ 
TTAATCCTNGGGGACANACTNGGCCnTAAGGGANAAAAO OGGTTT OTCAGTTOT TTATTAAC CCA 
AATGGN0GCGO0C/-VCNNNTTITCCAATrTIT0GArn"rm"I"lTl'A^ 
TrGGATtXKWOOOOCCCCTCCAAOOOTX}AAAAATATCAAOOOTrO<K)CTaO(nN<^ 
CCTTTCTN 

SEQ ID NO: 3333 ACAGGTITCACTATTACAAATATATGATGTTAAACTAACAAACrCATGACCTT 
CAAAaATOTCrrCOTCOCACOCACACACATTrOTAAmGTOTX^ 
TAATCTTCAyyVTTATATACT TATX^ 

ATCACCACAACCACAATAAO^GCAATAACACCAGCTmAQACCCTQCATTGAOAATrCAGGTC^ 

I'l'rilCATCAACATAATAAATTAAAOTTGOACCAOGATCCAGATCCAGTrGGTrcCCC^ 

TCANOOTCCATTTTOT AANAAGOAACAANOGATCACCCTTTACAATCTTTTCA AAAAATAOGCCC 

ATTAGClTTGaitXXAlATTl'rcGOriTTmXMMAOAATmMANC^ 

ANTTTrAlCCAAAAACTCCOOOANAAATrraONNCCGTQAAAACCCOTOC^^ 

GOOGCCGGAAACirrCTATAAAANGGTrrCNNTCCmXKjOTr 

SEQ ID NO: 3334 GCTACAAATAAAATGAAAAAOAGCAGTGTraXJTTGTATTCATTTC^^ 
ATAGCnTATTAATTOCTAATOAAAATTAOAACrrrrCTOaOATCrrCTOACAAOATr^ 
ATCTTAAAATOCCriTrCTTCAOTGAAGCXIATCTTTGGAGT^ 
ATCTTGACrrCAACCraATATTtXritnTCTrnXWT^ 
OrrAAOOAAAGGTCATTTTTCCACAGTTCAAaTtntnXJA/^^ 
CAGCCCAGGAAGTGGAAGTAATCC(>TG<XTAAACCATCAGGGOCCAArr^ 
AACNCCT01«TrCH30<XOACCTATTrA1TAACNCCAONCTIOAAOOONCATG<^^ 
GaCCTT^TTNGGCXX»XTAATIT^^AAAANGGAAAGGGTAT^^r^AAAaOG^^ 
CAAAATTNOCCa^LTNGAAACCAACCATTATNGAmTWOaXTTTAA^ 
TTCAAAAAAAAGOGGGGCNGOTTTNNAATTCNGTTTTGCNCCCCCTr^^ 
AAA 

SEQ ID NO: 3335 GGTA Ci lU li 111 1 1 i 1 1 H L 1 1 1 L I GGCTGAAAO AGTrGACAATTTTATrrrc 
ACATTTCCCAATACAAAGGAAAACrGCATCTrrrrrGTOCXACITCr^^ 
rrCATAGGACAGGGGACCAAGTCTItXTTATGCTGTnANAAAACTCAGTATCACAGCA^^ 
CTOCTXjaTOAAOCAOAACAOGTAATATAAAACCOATACAATAAOOCCTCXXr^ 
TCTtXJWCGAGTCATTCCGGGCCGAGTGCGCACCATCATGGGACOGC^ 
GGCOGCCCAACCATTAATrGCK^LTATOOCCTCCCATOOOCOGCCTCATT^ 
ACT0OCATTATTCCAN0ANGAGGAGGGCCCATX>TTGGCArCATGO0ANCGGCC«^ 
GGNOCrNGNANTAATACCCAGOO<X;AAG0AOOACC00GAAACXnTGGOGG<^ 
CNCCCTrCNCWAOOAGOACCCAAAATQGNGTNGGANGCTTNl'riVI'l'i'rJGNAAAAGCAaXGTN 

TT^TTTTCCOCXXTNNCT^^KXX)ONG^^NNNJW^^^^^ 

NNNNATNTC 

SEQ ID NO: 3336 GOTAOGCQOOACTCAaAAGCrTGOACOGCATCCTAOCOGCCOACtt^ 
G<K>GOTGGGT0AGaAAATCX:AGAGTTGCCATCGA0AAAATTCCAGTaTCAOCATTXn^ 
TOTOAaXTCTCCTACACrCTtMKXAOAGATACCACACITCAAACCTGOAOa 
AGGACTCTCGACCCAAACTGCCCCAGACCCTCIXXAGACCTT^^ 

CAGACATATGAAGA-*GaCTATATAAATCCAA0ACAA0CAACAAACCCTrGATaATTA1T^ 
mOOATGAGTGOXACACAOTCAAGCmAAAAG^^AACTGriTItKrrGAAAATAA^ 
GAAAriXKX>.GAACAAGTroNCXX:rCTTAAACTGGT TATOA AA(^ 
ATGQCCATATOTCCCAAOAATATTOTraaT0<XCATTmTTAC^ 

ATTI>^^^AATCOTT^m^t^TACCAAACT^^WGAAC^IAG^^^ 

AOTOTTGAAANCTOAATGGAAAAAAAAAATTTCAANOCTCTTITGGA© 

SEQ ID NO; 3337 OGACGCGOOOGACTCCOOCAOCTITATCOayLOAOTCCCrQAACTCTCO^ 
TCTrriTAATCCCCTGCATCX3GATCACCCGCGTGCaX>C^ 



515 



wo 01/290S6 



PCT/llSOl/30732 



GCTCCGAAATCACCACCAAOOACTTAAAOGAGAAGAAGGAACrrTOTGGAAOACKrcA 

AAGAGACGCCCCrOCTAACGOGAATGCTGAGAATGAGGAAAATGGOGAOCAOQAOacrOACAAT 

OAOOTAGACGAAOAAGAGOAAaAAOOTOGGGAOGAAOAGOAGGAGGAAGAAGAAGGTGATOGT 

GAGGAAOAGGATGGAOATXiAAGATCAGGAACTGAGTCACnXXX}GGa:AACCGGCCAC^^ 

TOATAAGCATaACAATGTCOATNCCAAOAACX:AAAAAACCGACOAGaATOACTAOACAGCAAAA 

AAAGGAAAAGTmAACTTAAAAAAAAAAAGGCCGaXrroACCTATITACCCTTCACT^^ 

AAAATCTAAACGTOGTCCCCTTCAATAAAAANCGCCONCCCCXXNCCrGOOC^ 

ATTANACCGCTTTTCACCACOACCAAACCATaANAATTTCACCGOGGW^OG 

CNTTCAACGCCTOT 

SEQ ID NO: 3338 ACACCGGGrrOGCATTAACOOGTOAAOATOTCCCCCTTACGOAGCAGACOOTO 
TCTCAGGTGCTGCAQTCAG<XAAAGAACAOATCAAGTGGTCACTCCTItXiGTO 
T0CTXXK7rCTrcATCCTCTTCAAAAAATTT<K:ATGT^ 
OATaCTCrCAOGOTCATCrOOOOGATCACAOOGATCCrTAAATCTCCAT^^ 
CCTCAACCTXXXXTACACCCTrCCTATrCTriTrcATTOT 

AGCATAmAOATAATANOOOCAOGOGGAAOCACCCTCTTTCTTrCTAOAACTOOATTATGCT^ 

ATOCICCrrTGCCTnjACATnTXySNAAATCTTOG O C Cr^ 

OOAOAAOCCNANNJMANNNNNNNNNNNNNNT^^ 

CAACNACTOGCNOCCGTACTTATOGATtX>ACTIWANCAAACTTGCCNAATCAT^^ 
TTOCTGOGGAAATGOTITCCOTrCAAATrCCOCAAAAACAAACCGG 

SEQ ID NO: 3339 GGTACGCOQGOGTCATAGAATGGAATOQAATaGATTCATTOAATOGAATCAO 
ATGOAATCATCGAATGGACTOGAAT<HUATCATraAATtKJACTCQAAAGGGATt^ 
GAATTCAATCGAATCATOGAATOGTCTCaATrGOAATCATTATCAAATGOAATCGAATOGAA 
CCOAATAOAATXXJAATOGAACAATCATOOAATGGAaXi^AATOOAAT^^ 
AATOGAATTATtXJAATCCAATCGAATXHSAATrATCGAATOCAATCGAATAOAAT^ 
CTajAATGGAATCATCOAATGGAATaGAATCGAACAGTCAATaAACTaMATOQ^ 
AATGGAATOOAATGGAATCATCOACTGGAAAACGAATCOAATTATOATCOU^TCKIA^^ 
AATCATCATCAATWAATCAAAATAACCCTCATCAATGGTTTrGAATGGAATTGCATC^ 
CAAGOA^€TC^r^CAAKKlANCaAT0OATC^C^TTaA^^K3AATO 
GAATCTCTTCAATGGAACGATOGATCTATCAATOGATTATGOAT 

SEQ ID NO: 3340 ACAGATACTTAOAOGOIAGCTGOTCTrrAATrATGTXKnTCCGAAGCAAAT^ 
CTrOTATOOGCATCAATTOGAGaOOTItX:ATXTTTOAATACAGAATTCAOGC^ 
CAACCOCTCACirCCAGAGAOATOATrOCCtMOGCCXMCTrGTCTGAAAAOarCAATC 
ACACATCAGACTCrCCAGCCAATTCAAATAGaTTCTIGGCTOAOrC^^ 
TOTAGT0QATCACCCCGTrGGT0OCTAGQATOTCTTrATrGGAGATGATCaCCTT(XC^^ 
TGAOCATOTOCCXX)CrGCAGCCCACCra:AGTGTCGTOO^^ 
CACCGATOOCTrCAOCACACATAOTraACTTCAANATCTGGTTGTTCAN 
mWOnrajCCCAGAATNCOOTrCAAAONNTAACTACGGhmnT^^ 
GCCAAAACCGOQCCTTOOCCGQAACNCCCrrrAAQOCNAATTCNACNCC^^ 
ACCAAmXKWCCCAACCmGGGAAAANGOGAATATNGTCCTOGGNAATGG 

SEQ ID NO: 334 1 GGTACACTrGAAACCAAATTrCTAAAA Ci ltJri mCl ' [ A AAAAATAGTTOTT 
CrrAACATTAAACCATAACCTAATCAOTOIXmXIACTATQCnCCACACTAO 
TTCnCTCGTrTCAAOTCTCAAGOCCTGACAGACAGAAGQGCTTGGAOATTIT^^ 
AGTCTTCAGCAACTTCAGAOCTrrCTTCATGmrrCAAGC^ 

AGCATAGAGACGGTTTCAATATCTrCCAOTGATATCGGCTCTAACTGTCAGAGATGGC^^ 

ACATAATCCTGOOOACATACTOOCCATCANOAOAAAGGTGTrTGCAGTIX^ ^ 

QAOOAOOACAAACKKrrCTOCCCATTTCTGGATTCTITATm 

TCGCraNGGGOGCCCK^TCCAACTtjATGAATAATCATCAAGCOOTro^^ 

TANACTTmCATAT0CTQAAT(XANAAGAOrrGGGCCCCCAAarrT0OONAAOGNCTNGG<^ 

TTN0CONCAAAANCCTnM3GCCCTTTQOOT0CGGNnK}CT^^ 

SEQ ID NO: 3342 GOTACXTTATACTCTCAAGGTIXnTTAAACATGATAAGOTTAATCOCCATCrA 
CTIX:AA0TTrrAOAAAA0flAAACAA0AA0CTAAAAACA0CT(KnXTOACmAATATCT 
ATCTTTGATCTGTTTGCAGOTCATCCAAOTC'miCrAOOAATATATTT 
CTACTATTTTITAGACTCCrOAAAGITOTrCACATCAATGTGAAGACAAATT^ 
AG A ATGA AATTATGrrCTrQA ATCATATATTAAOAAOTAAAAATAATAOTGATCAOGCAQAAAAGA 
AAAATGGAGCATCTAAAAATGTATOTGCTAACTATATCATCCAGTGTOCAGTGGTGNGTAT^^ 
TAAACATOACAACATTOATOTGCCTrntlAGTaNAACAACAAATACTOOTAOT^ 
TTTATGTCATrrnKrmAAAAANATNACrGGAGTGTCCAATC^ 
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AAT(XXXX::AmXT^m^ATCOTAACCAAAA^TCC^TANTTCG^^^ 

SEQ ID NO: 3343 QQTA C 1 1 111 1 1 1 111 ] 11 111 i ] 1 t OATOAAAAAOAOCCTAAACGCITCTOAT 
TTGGOATAAAOAAAAAGGAGCATTAACCrraACTATCnCrrTAOat^ 
AAATT0CrGQOCAGOTOGOGOAG0GCTACFrCACGGAACGAAACrGTAAO<XOQACCAO<7T^ 
OOAOCKWACKnmTAAAAAOATrACAOraTOGAGGAGTQOAOCCraAOGAAGAATTGGGAOT 
ACrrG00CTGGAAAAGAAGG<XKlA0A0GTCAAATGGGTTTOTAGAAAAAQ0AAGATTAQAC^ 
CTCNCCACCCCCTOOGQCnTaaACTTOAaOOONCAOGOGOAOGGGAAAAAACWAAAAT^^ 
CAAOTnxrCTTGOCCCNAAAACTNAOAAAGGGNCGQTTtmTAAAAAAATGCCT 
OCNAAAACATTroOCCIT^T^TGCCAAAAATATmAGG^mXJ 
GOCmTTNGOCCTTTTAAACNTrQOOJAATTrrT^^ 

OG^^^mG^rIT^N0NNCGGGN0NG^m^AAAA0<^TNA^^T^raNCCCT 

SEQ ID NO: 3344 ACmXXXX>GCGCACOTTCTCTCTCCXy.c;riUrr 1 1 CCAATTTCAOCGOCTC 
CGT0<UTOACCOTG0GACCTGCCAGTOCTCTGTTT0CCTGCCAGACACCACC^^ 
AGTGGAACGCTrGGAATrCACAGCTCATGTrCTnxnx:AQAAOTTrc 
GA0<K3AATATGT(XAATTAATTA0TCrrQTATaAAAAaAAACTGTrAAACCTAACnn^ 
ACATCATCGAGAAaCATACKA^ITC^rACACTQAA^fNNGGACTTCGA0CTGATCAAGOTAGAAGT 
OAAGGAGATGGAAAAACrGGTCATACAGCTOAAOOAOAOTmOOTGOAAACTHAAAAATTOOT 
GACCAACTTGAOCNGGAAATAAAAATATTACThmrTTNGTAAAa^ACTrOOGACXTO 
CX:ATGTCCTTOChTrCNCCAAAAATCGGGGTriTNAAAACAACTO 
AAN^WAAAAACCCITQTGNCNOCT^t^^XX:^JT TCAAGGA C^ 
TAANNUUCNGTT(XXn"AACTTAATGGGAACOOTTTITrrATTTt}Ghrr 

SEQ ID NO: 3345 OGTACTOCCATTCTCATCAGAATOTGGTrCTGG<XTATTCTrATrT^ 
ATTAACTXJATtmxnTTtUCTTTtnXXriXrrATTm^^ 
GCTGOAA n TTC r rTTTCAGCTGOTreATCTTCTreAATTAAGOAAOCT^ 
TTAAOATTTTOAATAAATOCTTCCAATTCACCTTOCrOAAAOOGTCATCaATra 
CTCCATCrATCACrrrCCTCATTCTCATCXi(^GTAGCX>^ 
GAGCCCGTAACACrrTCTaZAAGGAOAACOCATrCICOGGTTC^ 
TONNCOGAAChnTTANCNGCNTCCrCGOGCCNCCAAOOOCX^ 
TOG^m^GCXXKX^TG<KXXXXAANCATACCCCCGGNAAACTAACCCGCOTCC^^ 
CNCCTOrrAA0CAAANNCNT0ATnmCTCCTTTCAAO0CAAQ<MXTN0G 
AGTCCAAANCXNOCCOAATICIXXAGTCKTOOATTCCACGGAN 

SEQ ID NO: 3346 ACG<WXK}GGaAKn-GGCA{XTCX:AATIXXX:AGCCCCGGCrr^ 
CCCAOOTOTrQACTCCAGCrCCAOCTrCAOCTCrAOCIXXAOGTC^^ 
CTTAGGCAGCGOAOGTTCTmXTrtXXIAGTroTTTnX^ATTI^ 
OACCrGOCAOTGCTCTOnTCCCTWX:AGACACCACC^^ 
ATTCACAGCrcATGTTCTrrcTCAGAAOlTrOAaAAACAACTn^ 
ATTAATTAGTCTGTATGAAAAOAAACTGTrAAACCTAACTGTCOGAATraACATCATXjQA 
ATCCATTTCTTACACTCAACTGGACmXiAGCTXiATC^AGaTAGAA^ 

CIXXnCATACAGCTTAAAGGNAAGTITOGTSGOAACCrAAAAAATTGCTGACCNACTTGANGr^ 

AQATAANAAATATOACTITCTQOTAQANAACTTTOAACCTTAACNAAAACATOTCCTTGC^^ 

CAANAAT(XWGC^NTGAAAACCAACTNAAAA^m^t}AGGCCmAAAAT^^^ 

SEQ ID NO: 3347 COAGGTACCAGCGGTTTCrCTGTTCTGrrGATCAATXnXjATTCACAOGAACTCC 
TTAAOrAACAAACGAAAT0A0a:A0(K}0C0TOQAAAATATOACncrATATrQ<nxrr^ 
CTATGAGCTCCAGCATITrcATrGGAGGAAGTTrCATTTTGAAAAAAAA 
CCAGGAAAGGCTCTATGAGAGCAGGTCAACGTOGCCATGCATATCrTAACGAATXHjTTOT^ 
GCTOGACTOCrOTCAATOGQAGCTOOTGAOOTOGCCAACTTCGCT^^ 
ACTCTAGTOACrCCACTAGGAGCTCTCAGOGTOCrAGTAAGTGCCATrCT^^ 
ATOAAAOACTrAATCTTCATGOOAAAAATOOCntjOTTGCTAAGTATTCTANGAT^ 
ONCATIX>T0CTCCCAAGGAAGAAGAGATT0A0ACmAAAT0AAATaiUrCAC^ 
ANCNAGOTT^0NO0GCTTT0TAACCCaG^tX)GC^^0^KK^^ 
CrCNOCTNGACAACAAACATnmTOTACTrOCCGCXGCKXXm^ 

SEQ ID NO: 3348 GGTACOCX}GGGOCTA(XCOGGCATCAGCC0CGAGGAAT0CCCCTCIt^ 
TOCCOCTrCIXXAACTTCATCTrT0AA(m;CCCTGGTa 

ATTACTAAOAGAGGCTXJGTTCCAGAGOATGCATCTCOCrCACCGOaTGTTCCGAAACCA^ 
AAACTIXXlCCTTATCAGCm^TATTTCATOAAATCCraGGriT^ 
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CAATCKrmAACATATAATrTCmAAATAAAAOCCrrAAAATCTGCAAA>mANANAN^^ 

hWhW»mAAAAGTACCTGCCCGGCCOOCCXTrrOOAAAOG<KXJAATTOC^ 

ACCTAOKKK]ATCCNNCCTCaGANCCAACCTTOO<XFrAAT^ 

AATGOTATTCCCTTCCAANTCCNCCCACATCCAACCCGGAAACCTTAAAGTGOT 

COCCTAATGACNGACCTACCTCNCATTAANTCQGGTGCKXnTACTItKXXINTTTC^^ 

CTTGCTNCCACTTGCN 

SEQ ID NO: 3349 ACCKXMSGGGGCGCroAGAACGOiOGTCCACGCOTOTOATCGT^^ 
AGCCrnXKCCACGCAOCTTTCAOTCATGaCCTCCGOTAAa^ 
ACTTCAACKXXACAGCOanKnrroATGGCGCCTTCAAAGAGGTC 

AAGTACOCGGOOGGACAAACGGAAGTGTAGGTTACGQTCTGAGACATCACOGCCAAGCTGGGCA 

TCOOOOAOATOOCCOAOACTOACCCCAAGACCOTGCAGGACXTCACCItXjGTCGT^ 

^TGCACAAAATNCAAAATAAT^TNNAANCATT^TrNACCCAA^m^^^^^ 

NANTA^^NC^xr^^^GNTQAT^^^ooAAAANAAT^^IxxK^ 

AATT0GAAAOTGAAACAAAAATACCTX3CCG0CXAAAAAAATT0AAOGTGNTrAATAATrATr(^ 
GGAATTNGCNTTnTCC:AANa>AAAAAAANTCAATOG>nTr^^ 

ACoaTIT^mmTTTAAAO>mJNcnTC^rT^^^ACCTAA^^ 

c 

SEQ ID NO: 3350 ACTtKnCCAGGAGTTATOCAOOATAGArrrTCACCCACCATGGGACOTCATC 
OTTCAAATCAACTCrrcyo^TCGCCATOGGGGACACATCATGCCT 

AGATGGGAGGCAAOmATGAAAAGCCAGGOACTAAOCCAOCraACCATAACCAGAGTCAGGO 

ACrCTTATm:AOCTGCAAGOACAaTC0AA0OATAT0<X><XnXM<mTrcrAAOAAA^ 

TTAATOCAOATOAaATTAGCCroAGGCCroCTCAGTtXmXXTAATaWTAA^ 

AAOCrrCAGCCOCAOATAACTATGATKXTOn'AAOGCCAACACCACOiXCTTNA^ 

TOGGACACAACACCrAACNnGOCTrCAAAmAATCCACCACTrATCCAOSAAAACC^^ 

gaaccagcaaaaaoccnccacccxmiaaaggaagaactccttaaot 

ngat^t^•aatatn0oaat0caatgaaant^itccatog0ttanaaaaaataag<xr^cttaac^ 

tnttcrmoatcotaaccaagaatatctgtccn'atrraaccgttaaataaanaaaaan 

seq id no: 335 1 ggtactatagaccagtttgaatataatgctrotgacaatrcrgatgcatat^ 
acaaatqaaoqotaacooaoagatggtatatgacnjcactaoctcttccrma^ 
oatoatgagtccagagoacagctgoctctccaagtoocagcgaotcaotaactttaagc^ 
naatatgcxjgtotcagtcactggtooccroocccaaggaatcxrtx)^ 
aotgocctacaaatccaoagacacaoctataaaqacctaocaaoatgcaaggctgccagcatctt 
TGcrcrccACcnxxrrocx:TCTGcrrATTix;ritrriCTGGAACTAAAT^ 

CCTACCCTCCAATrCAOACTCAGCTOACTOTTQAQAOAOCAOCACATCATTTTAT^ 

TTTtKiACTACAOGTXKKjGTCGGAGGGATrrcGGTrGGTGOATAACAG^ 

TAGGATCCTGATTTTCrACCCCOOGOCCAOOTTOOONCrmCCX^ 

TCAATAATrraAACCTCAAAAAOTTTGAAOGCCCOAAAAAAAAAAAAAAAAAAAAGTCC 

SEQ ID NO: 3352 ACGOOGOGGGGGAAGAGGOGAACATGGACATGAAGAGGAGGATOCAOCTOG 
AGCTOACXJAACCOGAaXXXSGCAGCrGTTCXlAQAACTTGT^^ 

OGAAAAATTGAGGOCTTAACAGCTGAATTTOTGAACnAGAGTrcCTCACTrrAATAAAT^ 
TTGATCTCAGTrrCAAATCTCCCCAAOCTGCCTAAATTO 

ATCTrTGGAGOTCrOAACATGTTAGCTOAAAAACnXXAAATCTCACACATCr^ 
AATAAACTOAAAQATATCAGCACCTrGGAACCTTTGAAAAAGTTAGAATGTCTC 
CCIUmAACTGTQAGGGTACCykACCrGAATGACrACCCOAaAOAOTOT 
CTTACCTACrnGCATOGCrATGAOCCGAGAGOCCANaAACCCntiAC^ 

GTGTXWATrAAAAAGAGCAOGNCCAAAAANQAGAAOATAOGAANACCAGGNCATNANQNTOM 
OAAAAAAAGAOTTTGnOAAAAAATOTTAAAATNAAArnTAAAGOOGNTAGGNC 

SEQ ID NO: 3353 GGTACrOATACAATTGJWGG<XCrrOCACTATAAATAOaATGGAOGATGGGT 
CACTQTQTCCOTATTACCAATOACAOTCAOCCCAAGAAACACAAGCAaCTOC^ 
AGGGGGTAGAGCCACTATACrrCTCATOTAGATCAOCCACATTOTCACrG^ 
CCATCCTCCCGCAayrGOTAGAOGTrGACTGCACCIXXTOAGTAGGCATC^ 
TAaATOOCrCOACOOOCCAGATCATAGOCCTOCTCCACrrCCAGGrrC^ 

CATGACCCCATATGCATACACAaAGCCAOAACCTACAGAGAAGGNOGCCCCTGAAATCGGTrCCT 

TX>CTOTOCAOCn'AOTAAAAGCCA0GaCCTnTrATCCCACCACA0ATCATO 

CCCCATGCCTTTOT 
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SEQ ID NO: 3354 ACTOAACrrOCCGCAGCAAAGAGACrGTCTCraACGTGATCnXX^ 
OIACAAAAOCCAOCAATOAAATCaCTGCrtUOOCTCACACCrOaAA 
AACTAACTCCACATAAGCTTCICrrGAATATCX:CXnXXXK:ATGOCCI^^ 
ACTCTdOOCTXKKrAOGTOOATCroTTTACAOATtyrTATCTCTCrCATOAAT^^ 
ATCAGGAAAATCCTTGGOGTXXKjGAOAGCTAAAACGGATCCTCATT^^ 
CCrOATCCAGAAOATGA(XAAAACGAACTCCrcrrrG<nTOGhm^ 
ACaACTOAGArrGGTAGGCACTGCCTGTTGAACTCGACCTTOGAATroTC^^ 
TCTOACOCAGAAOTONCACTrKITIX^CCCCTGGTAAAAAGCn^^ 
GCAATANOOOOACItnTmrrrONOOCOOOaJAAAOOACAATOCAATAC^^ 
NCCCNCTlGAlTaAACAAAAGCNACT(XKXrriXX}CCTIOyrOGATGGOA 

SEQ ID NO: 3355 TCNCNGOOGOnCCCGCGTGGCCATTrnXJTTGGTGGTOrrCAGTrGTGGO^ 
TnXTOQTCAOTAACAGOCAAOATOCTQCOOAATCTOCTGGC^^ 

COATAAGCACTQCTIXXXX3CAGGCATTTrAAAAATAAAGTTCCGGAGAAGCAAAAACT0Tr^ 

OAGGATGATQAAATrCCACTOTATCTAAAOOGTOOGOTAacraATQCCCTCCTOT 

CATaATTCrrACAGTTGaTOOAACAOCATATOOCATATATGAGCKHSC^^ 

OAAOCAGOAGTOACTTCAOTCATCOCAGCAATCGCrroGTItAAGm 

CCAGTAATCTTGATAAATAACCGAGCra-rWmtKJGGATCAATATTATrGACT^ 

CCANCAATAAAGCAGTCTITACCACGATTAATTANANNCNATTATTNAAAGTNCrrNGGCCGCG^ 

CACCCTAAOOOCAATTCCAOCCACTGCOaNCOTTCTAATGGAANCCAOCTCO^ 

AAACATG0CANACTrGTTOnt3NGN0AAfrrNriTTCCNT^^ 

SEQ ID NO: 33 56 ACrCTOOAGAAACTAGAAATATATCmrCriTGA<UGT00ACTCAW 
TAAATCTGAATaCOATCACACTGTAOTGATATAOTAOAAOCAQTOATQATATrCAGATTOTm 
TTTGATTCAAATGATGAATCTGrrcACACCTTATAACTCCC^ 

AOAGATCAAGTAAGATACATGTKnTCATATTCATAACAACTTATrAaAGCTATGTAGOAA^ 
AGOTATCTGTOATCAAATOACrrQTCATroGCACTTTCAGTCnTr^ 

GOTTaTATTTTTTATTATrAATGACATAmAAGTCrTCAGATATATTATmAAAGGCAC^ 

TOTAAAACAONTATTACCAATATmATCNTAAATCKnTOCnNOATT^ 

ATTCCAOAAAAGGAATCmGAATTTCGTGCAITOTAOOAOCAACCAGGCITCTANmmTAA^ 

aACTOGGAAATTrANTATAAATTCOTA>mTATCANGGCAACTOGCCTT^ 

OACTACOOGTAAATANTGGAACCCTTA 

SEQ ID NO: 3357 ACTOTGGAGAAAGrTAGAAATATATCCGTCTTT<UGAGTGGACn:AAACmT 
TAAATCTGAATOCGATCACACTGTAGTGATATAGTAQAAOCAOTOATOATATrCAaATTGTnTAT 
TTTGATTCAAATGATGAATCTGTTGACACCITATAACTCCX>TCrc<XTATCT^ 
AGAGATCAAGTAAOATACATOTTOTTCATATTCATAACAACn'ATrAGAGCTATGTAGCAACATAO 
AGOTATCTOTGATCAAATaACTraTCATrOGCACTTTCAOTC^^ 

GNTrOTATl-rTTrAlTATTAATOACATAriTAAGTCTrcAOATATATTATmAAAGGCACCCC^ 

TGTAAAACAONTATTACCAATATTITATCNTAAATCTCTTOaTOOATTTC^ 

ATTCt>GAAAAGOAATCTN0AATTTXXmKATT0TAOGA0CAACX>OGCTTCT 

GACT(XiCAAATTTANTATAAATIOrrAimTATCANGGCAACTGGCGTTTT^ 

OACTACGGGTAAATANTOOAAOCCTTA 

SEQ ID NO: 3358 GGTAOCGCrrOTTTATCCAAATTTTCCTCTOCAAGTGOAOCAT^^ 
CAATAGCAGCAGTGTrAAGCAGGAAGCTACATICTGTTmyLAAGGGATGOajATG<^^ 
ATAAAGCCCTATCCrcAAGTGCTGATOATGaiTCTTTGCrrAATGCCTCAATr^ 
AAGCrACrrCTCCAOTGAAATCTACTACATCTATCACTOATGCTAAAAGTTGTOAGOGACA^ 
CTGAGCTACTTtXAAAAACTCCTATTAOTOCrcrOAAAAaKKKK^ 
CAACTTTATCOCAOACAOTTXXATCCAAOGGGAGAATTAAGTAGAGAAA'inG-lCI 
TCTAAAGACAAATCTACGACACCAGOAGGAACAOGAATTAAGaTnTCTGOAACGCTm 
OCmTGTCAAGAACATAGCAAAAAAAOTa>Cran'AOCCACCXXAC>A^ 
TCCAATACAANGCCATICCAGAAAGAATAnCAAGCCAAQACCAT^ 
CAGCTCAGCNOGACCGTCAAAAAAACTrC^^^OGNTIt^NOGC^ 

SEQ ID NO: 3359 GGTAOCOACCATAQAGCAAGAATCAAGATTCTGCTAACrCCrGCACAOCCC^ 
GTCCICrrOCrnCItKTAGCCTGGCTAAATCTGCrCATT^ 

AAGAOTQATAAGOOOCCTACTACACTGOClTrmAGOCTrAGAQACAOAAACmAOCATrGGCC 

CAGTAOTGOCTItrrAGCTCrAAATXmTGCOCCOCCATCCCTnCC^ 

CCCCTOICTCTtKKnXjrCTCaAOCAOTCTAGAAO^ 

tctttogcc^taaoaagtaaaoatttqaagacagaaaogaagaaacicaggagtaagc^ 
ccccTTNAAcr^^^TACACCCIT^rrGCCT^mTCA 
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OTOOTnXXnrrOGNhrrGOCUAOCnTACAAGAAAATCCTGCTANOaTGAaNGGOC^ 

TTAAAAAACATGG<jroCTNGCCOG<XnTO<XXX}TraAAAO<XKjAATTCCACW 

AATO<UACCACTCGOONa::AACrT0OOOAAAATNO0ATAAT 

SEQ ID NO: 3360 A ci i- in -i'i ri T T r rn - n m - rii I i x^GOAOCCAAGGAAcnTrArrrACTCTAc 
tgggtgacaggagg<k:anagtgctxx^vga<xjagacccagatacatcaacca^ 

NATTraOCTTT0CrCTIXXAGG<XntX:ACATOCTGCCm3AT^ 
TGACH3GCCTC0A0CK}CroCTCCCTC0ACTnxrn^ 

CAGOGCACTGhrrCTGTGCTOACnCCACTXKAGCCAAGOaTCAAAATGAAO^ 

AOOACTCCTTGOCATCGOACACAOTCAGOOGAAAAGCCCCCTOAC^^ 

GOOTCATITGOCAOOAAAACACTGQTGTGCXXAGGOAAACGAGCATGArrTTroGAOT^ 

ATCK:ATaGGCTGQAG^^^CAATAAAC^aaAAAAO^^^^ 

GTOAACnTTOTCAATCTmATTGOTNAAAAGAOCTrrAAACTO 

ACGAACN^mc^r^^m^AA^T^mT^^GaANOoaaATNG<Kr^^NAA^ 

SEQ ID NO: 3361 C0AO0TACACAATXKmTATTAAAGGAATGTATa<KXXACATCAACCTAGCA 
AGGATTCTACroOTAAACCrnXCATOGCCAAAGGAAAAACAAOCAGGAOTTaACTO 
C<XXmiCACCCAATOOAGA0A0O0CAOAAOOOTOTAGAAGCTGAAaGOOGCTAGAAGCTTACTC 
CTQAGTTTCTTCCn CllJTCri tlAAATCTTTAmCTTATq 
TGOAGAnKACTCTTCTACACrrGCTCOAOACAOCCAGAOACAOGGQ^ 

TOTGGAAAOOOATOOCOOOOCAAACATrrANAGCTAGAAGCCACTACTOOGCCAATOCrA^ 
TCTarCICTAAACCTAAAAAACCCAOTOTAGTANOOCCCTTATCACTCnAOTT^^ 
OCTCroAAATAATGAAOCAOAATTACCN0OCrACAAAAANOAAAAOACCO<X}CTT^^ 
AAOCANAATCmTGATTCTrGCICTATGGCNGQACCTTGCCCC^^ 

SE Q ID NO : 3362 ACTCTGTAGCATGACAATTmAOAOCnKIAGGTTAOCTATTATAGCTC^ 

L^rri^tjrrttji-ti-rn-icriuAcrnxnTCACATCTOAGa ii T crr it nv r^ 

GTOTGCTAAArTTOGGAAACAQATCrATTCTAACTCATTAAC(XAGTrGAAAT^ 

CTTAATATATCATGCAGTAAAAGOAACAATOTrrTAAAaraACOCAOCTm^ 

ATTOTCCTGCrnjIXKnaATGOrroraArrAOaTOTAGAGAATTTOCTTAAAT^ 

ATCACACAGCrTAATGAGTCACATTOAAATACTCACjrAATTGAACTOCANATCAC^^ 

AAAAGAGCTAAAGCAT0TCA^^^aOCATOAT0C^TCKX>IAGCCAGAATCTAAGCATICT 

ATTAANOGAnTTAOTATGCAATTrACTOOa^TAATATCAAANGGCAGTAACAGNGaTC^^ 

Cm^GCCTATGTATCCTITAAGATACCrOCATGAATACTITaATACm 

*aKXKXJGGA0AGAGGCCGAGAKXKAOATGAGATT0CCAA<^^ 

CGACACGATCmOGGAAGATCATCX;GCAAaOAAATACCAGCCAAAATCATTTTTX3AOGATaAC^ 
(WTOCCITCKriTIXXATaACATTrCCOCTtlAAGCACC^ 

ATATATCCCAGAmCTOTCGCAGAAGATCATOATGAAAGTCTTCTTOOACAmAATGATTC 

CCAAGAAATOTOCTGCTOATCraGOCCTGAATAAOCGrrATCGAATOCTGOTaAATO^ 

AATGGTOGGACAAQTCrOTCTATCACGTTCATCTCCATGTTCmSGAGGGi^^ 

CrNCTGGTTAAACACCGTITrOGCGATAATTrCTCTTCmANOCAATOArrAAAOTANGC^ 

CAATATTmAAGTAACACACnATrTrroCCTOTGTNrro^ 

ACCCCATACATAATAAAAAACATTGTGCOCQAAAAAAAAAAAAAAAAAAAAAAAGACCTOrc 

SEQ ID NO: 33 64 ACTXiCGG0COTOTGOOT0AOTrQGCTQCCGOOO0GT0AATA C -[ | 11 ICIU OOG 
AAOQAAOATrATTTau.0TQOAATATGCCATTOAOGCTATCAA0CmXjTTCT 
TCCAGACATCAGAGGGTGTGTQCCTAGCrGTOGACAAGAGAATrACTIXXCCACrOATC 
AGCACCATT0AOAAAAnx3TAGAGATTGATtXrrCACATAGGTITrrGOCATO^ 
TGATCCTAAGACrn-AATrOATAAAGOCAGAOTGOAOACACAOAACCACTOOTTCACCrAC^ 
ACACAATOACAOTtKJAAOAOrOTOACCCAAaCTOTOTa:AATCrGGCTT^^ 
NAAAATOCANATCANGTOCCATOTCTTOTCCTTTQGAGTAACCATTATrAT^^ 
AAOAAAAGOCCOCACTOmATATX}aACOCATOTOGOACCTTGTANCTTO^ 
TAAOaCGAATITCAACACACTrGCOCGCCGTACrANTtKlATCCNNCTCGOm 
ATCATOOCATACTTONTOTIXKKWfUAATOTnCOGTTACAATT^^ 

SEQ ID NO: 3365 ACnTOACTNACTAOCGTGArrCAAAOTrrCAOGAAAAAOAAAATrO^ 
>m>TITICTrAATCTrATrAAACCrAAACATAAGAATGCCAAAAAATACAGAGCTCAC^ 
TGG<>TACATTrcCAAATTITrAATOCCT(XCTGACAGaTGAATmAAO^ 
ACITTX>AAACATTCCTroTGATOAAGTAGAAAAAGCCCraGATAAGTOQCX^ 
TITTGTCTCAATTCTTCCATTAACTrTACAGTCTrCA^^ 



520 



wo «2/2WW6 



PCT/USOI/J0732 



AfnCACATCATTATAAAOTOCTCntXnATTAAAATCrGATAAACATAOATOAAAA 

ATGTTT<lAAATACrACATAAATACACATAATAACTAAATTa:ACAO<riTACGATtnGAGAOACCr 

ATCTOACCAGT>rrrATAAAANGGAGCTrATrACANAAAGCNCACCTTATrCATrACAGG^^ 

TTCATATCirn'GNGCAAGOATTrAATAAATGAGTTm 

CANACCIXKXiAAACrcAAAGCnXXTAATQTnTANO 

SEQ ID NO: 3366 ACC0C0<XKK}CTX3ACTCTCTmCGGACTX;AGCCC(KCT 
ATAAACAGCCATCTTOCTCACACAAAOCCTOTTroCT^^ 
TGGrrOCCAT0ACT00OAT0O0O0OACCTCCCTT0OOAGAT<>ATC^^ 
TCCCTOAGAAAGATCCACCTACOACCTCAGOTCrTXAGACXXACGAO^ 
AATTTCAAATCTXKJTAAGCAGCCTCnTn'ACTCrCITCTa 
CmCTanrrCXATCTTOGCACTACACTrCAATCTCTC^^ 

GGTAGAAGACAAAAAGAAGACACGTITTATOCCGNGGACCCAAAACTTCNGCOC^^ 
ACTGGOAAAGOCAC CC^T^CCTIXJ(J^<Km^AAACA^mJNANGOATG<X^T^^^ 
aNTTCAAGGrOGTAAGACACOCANGOACNACTGGCNTOGTCCrcACa^^ 
NTOGOAAOGGCAAOTCXnTOOCOGOAACACCrAAaoaOAATTCAA 

SEQ ID NO: 3367 ACGCGOGaGAAAATAAAGAATTACATATTOATQAOOATCAAAAGTCAGOCT 
TrrATTGAGATOGAGACAAGAGAAGATGCAATOGCAATCGTTOACCATTGTrroAAAA^ 
TTOOTTTCAOOOOAGATCrrOTaAAQOTTGACCTaTCTOAOAAATATAAAA/^ 
TTOCAAACAGAGOCArrOATTrACIXlAAAAAAOATAAATCCWiAA^ 

ggcaaagaatctccaagtgataagaaatccaaaactgatogti«x:agaagac^ 

CCGAAGGTAAAGAACAAQAAGACAAOTCCGOTOAAGATiGOTGAOAAAaACACAAAGOATOACC 
AGACAGAGCAGaAACCrAATATOCntnTQAATCTOAAaATOAGCTACrrGTAOATGAAAGAAAA 
AACAACCAOCACraOTAQAAAaTOGCAOrrCAATGQaAaAajAAAACCMATC^^ 
■ TOATGT0CT^CTaA^T^OGGAAAAN0OA^CATCAATAAGCTlXXKiAAAAAOATQQAAff^^ 
CCTCACCAAAAAAAACTTAAAANGGOOACAGATCaAOOACrrONCAAOAAACCAACNGCGTO 

SEQ ID NO: 3368 OGTACTATTCXrrOCWAAGACAGCATCCraACTCOCTXjrCTAC^ 
AGTAAAAAT0ATGCATCmX}CTOGAAATATTATTGO0GOTTCGAAGAC:AGA 
AAGTCAGAAACAOACACTAAGAOTCACTGOCAGOCrcACrOCAOOTATCAACTCT^^ 
TAOAATAAAAOAOCrAQOGOCTGAAGOGOAnAACAC AAAA GCIXJrrAACAATrAAQTGATATCT 
CTGGAAGATrC0AGAGAC>OOAGGCAACTCrTCTATAGTTrroAOTG<XCAGCA 
ACA(>AATGTGTATATGCACCTG<MTOCATGCACATACACATACACACCAATQAT^^ 
GAAGCCCGCCTATAAATAAAGAGAAATAGTIXXXAAAAGOAAAGCXCACATGAGCAGCAACATC 
TqOACXjrCACACTTXK^AAOAAACmCACCXXAACTOAAAACrOOC rOOGN AAANG 
TCAAOCTCTTGCAATAATGCCCAAAANCGCTGATCAGAAAGOACAGCTrT^^ 
ATTrmCTrrCACACTAOCTGGTCCroCCCXSGCGG^^ 

SEQ ID NO; 3369 A0QC0OOOTOATG0<XQC0O0OCTCTCCAGAACATCATCCCraCCTCTAC^ 
OCXjCTGCCT^GGCrOTGOOCAAGOTCATCCCTGAGCraAACXKeAAGCT^ 
CCTCTCCCCACrOCCAACOTOTCAGTGOTGGACCTCAOCrOC^^ 
GATGACATCAAGAAGGTCOTGAACCAGGCGTCGGAGGGCCCCCTCAAGGGTAT^ 
TGAGCACCAGGTGQTCTCCTCTGACrTCAACAGCOACACCC^ 
TGGCATnMXXrrCAACGACCACrrTOrrCAAOCTCAmC^ 

AACAOCGGTGGTGGACCrCATOGCCACATGOCCTCX:AAAGAGTAAAaACCCrGGACC^ 
CAOCAAOAOTCAAOAGQAAGAAAOAGACCmAATTtjrroGGGAAOTCCTr^^ 
CCAACACACTGAATCrnXXnrcnTWACAGTTCCCATOT 
AOGAGCCOGACCirrGrrATOTACTTaJGCCOGAACACCTrANQGCNATTCAC^ 

SEQ ID NO: 3370 A<XCAOACAAAACCCGOCCACQTOTAAGCCAGAT0CrGATTrr^ 
TCAAGGTCAAOGCCATOGTGCTCAACTICITOAAAJCAGTrCATAGATACrAC^ 
ACTrCTTGATATrA<>OCOGAAGATCCATCCAAAAGCrATGTGAAA^^ 
GAAGCmOTCAAGArnXXXXn'GTrTTrCCCGGGAAAAATrAATGCAAOOArr C^ 
GCCCATAGAOOCACAACAGAAGTTCAAAATAAATAAGCAACAOOCTAGAAGGOTTTATGAAATr 
CTTCGACTACTOOTAACTOACATOAOTOATOCXXXlAACAATACAOAAOCTACAOACTOaAT^ 
AAAGAAGACTAATTAGCCCATATAAOAAAAAOCAGAGAGAnrrraCTAAGATGAGAAAATXJTCrC 
ANACCAOAAAOAACTXSACCAACCAOATGAAOCCAATAGAAATAAGQn-GCCACATC 
GAAGAGACrmCAGGACTATGGGAANATACCGOarraOTATIXMACXjACrGCT^ 
TATCTGOThrrrGACCTNGGCCG<KJAaXXXTANOOCQAATTCAC^^ 
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SEQ ID NO: 3371 OGTACACTTOAAAOCAAATnCTAAAAtl lUl 1 1 1 l AAAAAATAOTTGTT 
(KAACATTAAACCATAACCTAATCAGTCjTGTrCACTATGCTrcCACACTAGC^ 
TTCrrCT<KjTTrcAA<nXnX:!AAGGCCTGACAGACACJ^ I lACAATTC 

AQTCTTCAOCAACTrOAGAGCTntriTCATOTroTCAAOC^ 
AOCATAGACJACXHjnTGAATATCTnXAmXjATA'rcGGCT^ 
ACATAATCCTGOGGACATACTGGCCATCAGGAGAAAGGTGTTTgrCAaT^^ 
TOAOGAGOACAAACT0CTC^KXj^ATT^CTC0ATrTCT^TA^^^ 
TTGACTONGTOGGCACrCATCCAAGTGATGAATAATCATCAAGGOTrGOTGCTI^^ 
ATAQACCrCTCATATOTCTaA0T(Xi\^TOAOTTOQTCACCCAA0t^^ 
N'mXjOGlCANAAarOCTltiNGTOCmTOGCTNOCXXXXrrACCTa 

SEQ ID NO: 3372 ACNCCGGGGAGAAGCrrGOACm::ATCCTA<XXX3<XGACTC^ 
GCm}OOTaAQQAAATCX:AOAaTTCKXV^T0QAQAAAATTCCAGT0TCA0CATr^^ 
CCCICrCCTACACTCTtKXX:AGAOATACCA(>GTCAAACCTGGA0C^^ 
TCTCGACCCAAACTOCCCCA<MCXXrixntXAQAOGrrrG^ 

ATATGAAOAAOCTCTATATAAATtXAAGACAAOCAACAAACCCTTOOTOATTATTCAT^ 

ATGACnWCCACACAGTCAAGCTITAAAGAAAOTOTrroCTaAAAATAA^ 

GCACAGCAGTITGTCCrCTCAATCrGGTn-ATGAAACAACTaACAAACACC^^ 

AOTATGTCOCCAOGATrATCmTGTrcACCCATCnCraACAGTTAQACO^ 

ATTCAAACC0TCTmA^Xr^^ACGAACC^OCAOATACAOCI^T OT^0OT 

Tm-AAGrrGTTGAAGCTGAATTrAAANAAAAAAATITCAANCCi ri 1 1 1 10 

SEQ ID NO: 3373 A Ul lU ' l^ ' ri^T IT l T I T r n m T l TATIXKSNGTCCCAOA'nTATTOAAAATAAT 
ACAOCACTACAOAAAAAATTCAAACAOGTrcmXSAOOCOTTrraAAATTCATOCC^ 
TGAGTOACCnSAA00TTtXlACAGACT0CCGAA0TCCAAAAOCrn>OCATr^^ 
ATCTACTrCAATAATXnrCTGATOCAAOOCraAQACC^ 
TCCTCCItXTOCAOCITGATGGAGATACCTCrrACrGGGCCTC^^ 
TGACGTAACCTOCrATCnxnTGCOOAOCnTTTOCTOGGGATAAT^ 



SEQ ID NO: 3374 GOTACACTTGAAAOCAAATTTCTAAAACI IIjI i t I ICl lAAAAAATAGTTGTT 
GTAACATTAAACCATAACCTAATCAGTOTGTTCACTATGCITCCACACTAOCCAGTCI^ 
TTmCrOOTrrCAAOTCTCAAGOCCTOACAOACAOAAG<KK:rriXK}AOAn^ 
CAGTCTTCAGCAACTTGAGAGCTTIXnTCATGTrCrrCAAGCAACAGAOC^ 
AAGCATAOAOACCairraAATATCrTCCAaTOATATCOGCTCrAACTOTC^ 
AAACATAATCCTCKXKACATACTCKSCCATCAGGAOAAAOGNGOTntjOCA^ 
CAANAATrOOAGGGAGCGA(>AACTt3CTCTOGCCAATrrmOQATIt^^ 
TT^CTTTNAAGCCmKlATGOGGGQ<WACCTGGACAG^C^^•ACAOOT 

TANTGNTTGGGGAAAAAAACCAGACTATGCACICmAAAAGmCGTNAAAGNGNCCAGGCC^ 
ATTAATATTNTOOACTOCCCOGCNCKKXXnWAAAOQCAAATCAACAANTO 

SEQ ID NO: 3375 GOTA Cl JTlTl t rn 11 1 1 1 I TT T rTMOCTGAOQAOTGCnTATTTCCAACTAT 
QTOGTCAACTTrGOAAGAGGTGTGOTOTOOTOamNAANAATXrrATAT^ 
OAOTrCTOTAOATOTCTATTAOOTCCXXrrTOOTOCANAOCTGAOTTCAATTCC^ 
AACTrrCTXJirTrGrTCAIXntiTCrAATOTTGACAOTOr OGTC l^ 
TGQOAOTCTAAGTCACTITOTAGGTCACTAAGGACTTQCTTrATCAAT^^ 
GGTGCATATATATTTAGGATAAOrrAATTCTTCTTGTTOAATTGATrc 
CCTCmOOCTCTrrraANCmOOTGOOTrAAAAOCTGTmATOC^^ 
OC ClMT I ^ -I XJ I IlN CATTGCrrOGOAAAACITCCCCACCCTTTATm 
aXZATGANAANOGGTINCTCAATChWCATCTOATOOOGTCroGCT^ 
OCmAATOOOCCTTTACCCATrCCTT 

SEQ ID NO: 3376 GCTACCTCTTtXtXrrCAATAAAGAAATAACACAAATTAAAGCTATAAAATTT 
AAGCATGGOCATATOTATTTCXlfcATTCTCCTrAATAAOTAATTOCAAAC^ 
TOTGACTAAOTAOTAACAAAnAAAAQAAAATAGACTmCIXTOOCAOTQATITt^ 
AATGTTACTTCCCATATOCTGAATrATATTTAAATGATmATQTATmAATTCAOT^ 
TrTOAOTTTATAAAATOCrrAATmATTCTTTOTGCAGAAAAGmCAATTTGAGATACr^ 
AGCOVAAACAAAACAAAACAAAACAAAAACCTGTATrrTGCTACTAGTAOTATOAATAC^ 
AAGATACAAGOATTCAAAGTrrGOTGGAAACTItXnXlAAGAAAAATCTCraAG^ 
TCrTQATACrTCAATTATCAATTAAaTmOQAACATrCTAQTITAAOAATAT(nX:AOAAAA 
CTCTQTAATATAACCTCTGACTAAACAGCrcCNCCCAAGCAaAAAC^ 
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AGATCAACTTGNGOCX-TnCl 1 nTCTATTATAATNGGGAGGGGTOAGTTTTAAOTTA 

seq id no: 3377 cacaat0qttrattaaaagaatgtatg<kxxyicatcaacctaocaag<jattc 
tact00taaaccttcctatgoccaaao(maaaacaaocao0aottoa0tckk^^ 
aggcaatggagaoao<xk:aoaaogctotaqaagctoaaggggtctaoaaocti'act 



GCACTCTTCTAOACTCCrCQAGACAOCCAOAaACA(XK»AGGAGOGAAOAA^ 
AGG0ATC<XXK]GGCAAACAmAGA<XTAGAAGCCACTACroGGa>ATOCTA^ 
CTAACCTAAAAAAACCAarrOTAOTANGOCCCITATCACTCTTAOTrrOCrA<K^^ 
TAATGAGCAaATnAGCCAAGCriAOCAAAAAOGAAGANOACGOGGCriXn'CCAGaANlTAACAG 
AATCrrrOAATCTTOCTCTATGCNCCGOACCrCGGCCGCGACACa 

SEQ ID NO; 3378 actoaocaogattaccatgocaacaacacatcatcaotaogotaaaactaac 

CTGTCrCACQACGGTCTAATCCCAGCrcACOTrCCCTATrAGTGOaro 

aAATTCraCTTCACAATOATAOOAAOAGaXJACATCGAAOOATCAAAAAGCCUaTrC^ 

CGCTItKyXG<X>CAAGCCACTTATCCCTGTCOTAACrrrTC^ 

AAOOTCAGAAGOGTCOTCAGGOOCCGCrnrACGOTXnxn-ATIWrACl 1111111111111111111 

TTTNTmCQTCACTACCTCCCCXWCTCOGaAOTOGGTAAT^^ 

GNGGNAG(XXnTrNTAAGCTCCCTKriXXXIAATC>UACCCTG^ 

OGNAOOCACOONOACTACCArrcAAAarraATAOCHKAAACCTTCNAATGOOTT^ 

CCCOGGTCCTTGGGCGGGAACACXXnTAGGG>roAAATIt:ACCAC^^ 

AACTCOGANCAACTTOOGOAAAANGOCATACTGTTNCTGO 

SEQ ID NO: 3379 CGCACTOAAOOAGACAAOAAAOCAOCAAAQGTrCAAAAOCTOTCTAAOAAT 
0AAGTGCTt>TCGTOAACATA0aATCCCTOTCAACAG0A0GOAaAGrrA0T^^ 
TTTOGGTAAAATTOTrnGACCAATCCAGTOTCCACAGAGGTAGQAGAAAAAATO 
OAAGAGTTGAAAAACACTGOCOTTTAATroOTrGOGGTCAGATAAGAAGAG<WGTOACAATC 
GCCAACAGTAQATGATQACTQAAGAATACCAGTTAAATAATACATKXKjATOQATmGAA 
QAAnCCIXnTAACAACCAAGGOOTnATmCAAAOCAATATTGOGO^ 

GTrACCTTAGTAGOTAACGCTAAOOGTATTCTL'rrn'ri'rrrri 1"! GGCTATGAAAACrrAGOGCTA 
AAATrAATATAAAAATTXjGCATAATCTIXXKTrOAA TCrCA TTTrOOCAQAAA 
ACATAATGTCNAAAhrrATACATCAlXH:NAaTCrGGOTTTTTGOTTtK3CTTAAT^^ 
TGAAGCTGGCTTreTCACOCAOCTGGANTOCAGTOOCGTGATTAA^ 

SEQ ID NO: 3380 OOTACACTTOAAACCAAATrrCTAAAACATGTITmTTAAAAATAOTrGf^ 
TAACATTAAACCATAACCTAATCACTCTGTTCACTATGCmX^CACTA^ 
CTTCTGCTTrCAAGTCTCAAGGCCrGACAGACAOAAG0<KnT0GAQAlTlT 
OTtTTCAGCAACTTOAGAOCTTTCrnZATtTmiTlIAAOCAACAGAO^ 
GCATAGAGACGATTTGAATATCnTCCAGTGATATCGGCTCTAACTGTCAGAGATGG<^ 
CATAATCCTG<XX}ACATACT00<XATCAOOAGAAAGOTOTTnrrC^ 
OAGOAGGACAAACTGCrCTtjCCAATrTCTOGATmrm 
TGA(TONCTGCGCACrcATCCAAGTGATCAATAATCATCAAOCCTrGGTGNm 
TAGAACTiriTCATATOCTGAOTCCA>WTOAOTTGGCACCCCACCr^ 
rm'GGCGCCANAGNCCTTGGQCXjnTrOGCTCAGOTn^^ 

SEQ ID NO: 338 1 OGTACCCTITGOATrrCAAACAGTAACATCGGATGTAAACAAACn-AGTTCCT 
TrrACTCACTGAAACTAATCAAGCGGCTrTACGTAGACAAATCTCTGAATCTTTCrACAt^ 
TCAOCTCTACOAAGAGACCaATGCAAAOGAArrGGAAACrCTraACrr^^ 
QAAACGAAAGGTCAOATCAACAACTCAATTAAGGATCTCACAQATOOCCACTTTO 
AGCTGACAACAGTGTOAACGACCAGACCAAAATCCTrGTGGTrAATGCrGCCTAC^^ 
OKWATGAAOAAATTTOCroAATCAQAAACAAAAGAATOTCCTTrCAGAOTCAA^ 
CCAAACCAGTGCAOATOATGAACATCOAOOCCCOTTCTGTATGGGAAACArrGACAGTAT^^ 
QNAAGATCATAGAOCTTNChnrrcAAAATAAGCATCTCAACATGTTCATOm 
OGAGGATGAGTCCCANOCTTOGAOAAGATCAAAAACAACTCACCrCAAAGTCAC^ 
GCTAATNCCAOCCCATGGCCATOCCANGNCAACnmXTTCCNAAATTAGOOT 

SEQ ID NO: 3382 TNOCGGGAACTTACCGCAAGOCTGltXATCATOOTOTTAACCAOCTAAAOTr 
TOCTOGAAGCCrrCAGTtXGTCGCAGAGGAGCGAGCTGGACOaiAC^^ 
TGAATTCTTACTQGCTTOGTGAAGATTCCACATACAAATTTITraAOCTTATC^^ 
(XATAAAOCTATCAGAAOAAATCCTGACACCCAOTGGATCACCAAACCAOTCCAC^ 
AOATCCGTOOGCTXUCATCnXAOOCCGAAAGAOCCXXJTOGCXrm 




JCTGnrCATACCCTCGAGAT 




lCTCTCCANCTOCACCGTTC 
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CONTTj^TATAACrrAAAOTTGCA/^TTCATACriTAATAAACAATNTAGaACAGCCCT^ 
TONNTNT>nWAAAAAGTNCTmGaCC(KJACOCCCTrANGGNQAATr^^ 
TTNTAATCXJATKX:AACTTXKKINCCAACTTTOaCaNAACATO 
TmmrCCGCTACAAATTChWCAAANITXAANCCGGAfWTTAATC 

SEQ ID NO: 3383 GCTACTNCAGOAOCKXXXnOTAmAATGAAACTCCCATCAACCCTCGGAAA 
TOTOCCCACATCCTCACCAAQATTCmATCnX>TAAACCAGCKj^ 
AGCGACCOAOOCCTTCITIXJCCATGACCAAOCTXnTrCAGTCCA^ 
OTOCTACTTOACCATCAAOOAGATCncrrOCATrOCAaAaQATOTC^ 

aacaaaagacatgactoooaaagaaoacaactacxoogocccooccctgcgaoccctctc 

ATCACTCATAGCACCATOCTGCAGGCTATTCAGCXjCTACATGAAACAAGC^ 
OCCCAAOIXmnrXrAOCTCTGCCTCGTOTCTrCCTroC^ 

AAACGCTT<X}TGAATGAAOCTCANGAOOCAGCATNCAATQATACATCATNOGCCANTAOCT<^ 

OOCOO<K30TTrOUWAGONOAAATCX>CACACTOG<KKKra^^ 

ACITGGGGNATATGOCATAACTOGTTCTGGGGCAATGGTrrCCGTNCAAT^^ 

SEQ ID NO: 3384 A CTl 1 L I m 1 11 1 1 m 1 111 HI ] I GGGCAGATnTAAGGGinTATTTAAAO 
AAATTATATaTTAAACCATTOAAAATOAGOAAAAGATOGTrAANAAAACCCAQaA-nTCATGAAA 
TATGAAGCTGATAAOGCGAA Ol 1 ILi IN r i lO OrrTOGqAACAOCCGOTGAOCCANATGCATCCTC 
T0aAACCAGCCTCT^m•AGTAATOCCA0TCr^CCACANACTTNGG^ 

CAAAQATGAAGTTGOAGAAOCAOCACrrCXOAGAGGCGCATrcCTCGGOOCroATOCCCGOGTAQ 
CCACAGTTTCTTXXSOTCTQAGACCmCATOAaXyLCTGATCCOACI^^ 

AAACAOCANOOOACCCCAOTQACACTGOAAOTNOAAACAGCATCATTOGNAAAACACIW^ 
TOGGOATTOCNAOOAAACCCCAOTmCTNCTaNTATIXXKKJC^ 
TrGNGGAAAAA«7riCAAC(KIOCCCXXXXrraoaANCrraGQCCOOACC^ 
AOCACreGGGOCOTTCTNANOGANOCACTTOGAOXAACTOONOAATTATOCKn^ 

SEQ ID NO: 3385 ACGOGGGGAAACGACAOGOGAAAOGAGGTCTCACnXIAGCACCGTCCCAOCA 
TCmjA(>OCACAG<XW<XCTTCGCTCCACGCAOAAAACC^ 
TTtXrrrcOCCAAAGCCAGAAGATGCACAAGGAOGAACATGAGCTGGCIXnXJC^^ 
CAOCA(XATCCTTCCAAOGTCCACCGTGATCAACATCCACAGCGAGAanXXXr^^ 
CGTCTOOTCCCTOrrCAACACCCrCTTCTrOAAC^^ 
TCOrrGAACnCTAGGOACAGGAAGATGGTrGGajAOGTGACCGGGa 
CGCCAAATGCCTOAACATCTOGGOXTCATTCTrOGOCATCCTCATaAOCATT^ 
ACTGGGANTCNGCrCTTOGACAGNCTACATATTATGGTACAAGATAATACAAGGAAAAAa^^ 
ACTTATNACCCGCOiTAGNCTTGOAACCTTGGACTIXlACTGGGCA^ 
NTGOTGOCCCTOaaXXJrraGNCTONCCTAAAAAKAOAAGTTrrACCCCACAC^^ 

SEQ ID NO: 3386 GGTACACTrGAAACCAAATTTCTAAAACATG rnTIClTAAAAATAGrnjl-rG 
TAACATTAAACCATAACCTAAT(>OTGTGTrCACTATOCTI«>CACrAG^ 
CTTCIXKnritVU^GTCTCAAGGCCTGACAGACAGAAOGGCT^ 
GTCTIX^GCAACrraAGAGCITIXrrrcATGrrGTCAAGCAACAGAGC^ 
OCATAOAGACGATTrGAATATCrrCCAGraATATCGOCTCrAACraHCAOAaATOOOTAAC^ 
TTCAGGAGOACAAACTGCTCIGCCAATTTCrGGATITCriTArrrTC 
CrTGACrGNaTGGOCACTCATCCAAAGCGATGAATAATCATCAANGGGT^^ 
TTATATANAACITCrTCATATTGTCTQAAGOCNAATOANGTOGTCXXXX^ 
nOGGCAATTGGOQCCrWANGaxnTGOGOCCITTrOGGTCCAGOTrGAC^^ 
CWKKACCTOCCCOaNQOCCOTTAAAAGGNCAArrCC^ 

SEQ ID NO; 3387 OOTAOriTCTGOGGCATACAACATGOCAGCACGGCCTCaGGAAGAQGGOTA 
GGAGGACCGAOCAOCATTCTCTCTAGAGGAACACAGOAAAOOAOACCCTCT^^ 
OGAGGGTTOTCCCTQAAGAGAAQGOCAGtmKKUQAOaTrCCCTGTTACTTAAGAG^ 
ClXXXXAAQAGCACAATGAAaAGCATQATQATAAAAACAATCACGCAGATAAGOACAATCATCr 
TCACOTTCTICCACCAOAATrTTCGAOCCACCTTCrOCG^ 
GCTr(XAGATCCTCTOTCrrGrmXXjOAAQAAGTrrCCAAGTmtX^^ 
CrCCACATnCTCQGNCATAAATATTCTTAAmCCTTCCCXX:^ 
AAC:ATTTTCTTaxrrrTA>rrrGOTTTCrTCATTGT^ 
ATrcCTTATCGGTTKnrCOCANGGGGONCCmCCGCOTACTrOC^^ 
ATTTX^CNCKTrOGOOCCOTATTANOATCCAACTCGOACCAAC^^ 

SEQ ID NO: 3388 ACATTTACATTCAAGTTGATAACACTGGTaQTrTCATTTCAATACAAAT^^ 
CTAGAGAACTOACATTTCAGACATGCTCATATATATGCTATTTGAATIXXT^ 
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CTTGATTCTOAATXn"CTTaATOATAGATGTXH:ACKTAATnOTtXXX;AA^ 

TATTaCTTGATOOTCTGTAnOCOCOGGAT€CTCnAGGrCTCOCAG<K^ 

aT0ATATTXmm:AaACAOQTATAGTAOGAaACAAGCAOCTACAAGACAAOATCTCa:AAG^ 

CCATAGCAOTOrATTAACGaTrntXKmAATrrrrAAGOCANGmtiNAAA 

ACAACA0CTX}0CTATOTCANaAGrnXrrTCATCTOCOATTO<^ 

TtKntX^iAAAAAATCCAAAAOCTrOGOACTCTm-ATTrrQAC^ 

AAAAATGNC^WATACCCCACCCmAK^C^m:NGGATCTIT^GG^ 

TTCCAACCOOAAAANCNCTTAACCGGGAAACGGAACATTNCrrCGGANAAA 

SEQ ID NO: 3389 ogtactgngataaatgaagaagaaggcataaggacaataaacatggaactc 

CACKXrAAATOGATmATGCAGCrcAGGAAAGTTroGOCTrATTAGTAr^ 
CCAAGTirrCTCrATrGajOACAAOOTAACTACCAGCTCC^^ 
GAA0TIXXX::AQTA0OTTCTGTCAmTT0TT0GCACATAGQCOCrGAAT ACAO OTC^ 
CCCATGAGOTCTCCnXATraTGAAACTACCCAAATATAO^^ 

TXXACACTtlAOGAAGACAGAACCATrrANGCACAOTOACATTOGTGAAATATXmTCAT^ 

ACAAGAATAATTGACGGAGATATATCATTCTOAGTCAAGAOONOGCACAAGTrATAN OCTCn ^ 

AGCOGAGATGmTCGAAGrrACCnJCTGAGAACIXKK}CITNCCAAATAAATAO0TT^ 

GAGTOAAAKGOOTXKJACAACTrCCGCNGOGTOAATAATTTCCTTTGAAAAAA^ 

ACOTCNAOCCTOAC^^ 

OOnTTTTTT 

SEQ ED NO: 3390 ACAOTCTATAATACrCCAACAGrCTm:ATCTGTArrcAAKKKXK^ 
TACAOTCCTTrOTTrOQATOCTOOOGAGAGTAATCCCTACm;AAOCACCATATAOAT^ 
CCCTCTCCAGTraAGCTGAACCACAGACCCTrrOCTGATOTrCACCACACCACCATQAC^ 
axn-GGAGTOOGAGGAGOOTOGACaACAGOGOTOTTrraATCTrTAGAGOCCTCACAC^^ 
0C^tK^XrITCAOAAOCACGA^TTC^X]GCGAATOOCAAGGACATTa^^ 
AAGCTTCrCTACCAAAGAAGAAGTCATATTTCrrATTCTtmX^ 

OAGCirrcACCCAAAcrrrrcmAAcrooATOAACAAQTTm 

ACAAGCTCGNAAGCCX:A^^[TNAGNOGNAGAAANNGNATTCC^T^ITCCTOATGNC^ 

TANGGTTAAAAAOTrCTrrTCTTNCaXCTAAnAATNOGCCTATTCCCNAC^^ 

TTNAACrrrTGANAAAAANATGANCGCGAATTCAACGGTNCNTTTG 

^^*CAAG(KA(TrrGTCoS^ 

CTGCAATGATTAAACAOCAAGGGAAGOCrcCCrrCCCAOT^^ 

TCCACOGATAAAAOOTGTCTCTITTGTCTCTACCAGAAAATGAAAGGAATrGA^ 

OGAQAGATTOAAGTtn'AGrrGCCAA<UTTOAAAaGAGAAAGTGGTTCAGGGATATGTOAGG^ 

GTTGGAGAAAANANTAAAAAGACGCTCCITACCAGAmaAAATTCGTGAGATGTTTCTrOO 

COTCXKritTOANOOACCn'AAOTan'AAaTOOATCnTCraAAOQAGCAAA^ 

GGAOGAT^rAAT^mX;AANGGAGGTCCCCCOA^rcCAATTATGOCACCAAATI^ 

OTGAAAAAACCAa:AACAAGGC^^X}Wm)ANOCACATGNT^OTr^^^ 

<K3CTTA^r^TC^NAAAAAAAAAAAGCNCCCNTTCTTGCC^^ 

ACTTGGGGGCO 

SEQ ID NO: 3392 TCNCNTTACA0TOA0TrACTATroGTCATCTX3CTACAGTTm 

OAAOAACAATTTTTCCATGCTAOTTCACTITOAAAAGAAGCAAACATGTAOTAAAGACTATAGGC 

AATTATCACATKn-AATAaATnTn^CAAAAATGCAGATCAGOACCATroTAATTC 

AAACAATGOAAOAATOCrCCAAACTGAAACTOGACCrAAOCTAOCAAATTQTCC^ 

CCAGAAAGAACAAAOQTAAAOCAGCCAATCCTAACATAATrGCATAAOGNATO^AaAAGOCCCr 

TCACCJ^TrGCTONAOGTCAGATATGGAAAT(ntX;ACACATTTTCr 

ATAG^^^^0AAOAATNATCCOATTT^ITGaACCAaATACCAC0OGGCTGAT^^ 

ACATNOOAAAATTirrOATQAAGCCGACACKrrrTTOCmTC^ 

1TCAACTrGNCCATOGGOTAATTCCCTNKKKrmGGATCCO<^ 

AACTTGATItX3ACAAGGGGCN^^mCIXCTGTCCTCNTGGC^ 

oc 

SEQ ID NO: 3393 OOTACATOTTTTCTCAGAAOACTtAAOATTTCGCCCACATXXXrr^ 
OOCTAOATCTGCOOCOCGOCTOCATrTGTCCCACICn-CAGGACAOAOT^^ 
ACTTCATAGTCTTrOTAAOOqCTCOGCCAAOCOTGqOOCCC^^ 
OaCTGGTTCTXX::AOCTCAAAAT0T0TGOAATAGOGGGCATAGAGC^^ 
ACCTrGAGTOA7TrCCTCrroAGAGTAGGCTCT0TrCTa:ACACCATA GC^ 
OOmACAGAAAGCATOCTCAACAOGATAATOCATGGCOTTTCGNTGCTTTOTO 
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CTC0GGACTAACATAAGCCAACACK3AACACCCACCATTGQCACKXAA0CKXXJ 
TOTCATT0CCCTGCO0<XAAAGNANGAAACATGAGCAG0<KKiATCCA00aXXA^ 
AO0hCATC00NOAGGQAA>rrOTOGCCCATGTTTTrrCANCaXKi^^ 
aAG0CAT0AGGNAO0OCNCGCCTrAAAAAAAQA(XTNCHjGAACCGANCUAAAGTITrr 

SEQ ID NO: 3394 OOTACANCAGCrrCnAACTCTACACACOCACTTAAATTTTTTTAAAGC^ 
AOOTTATGTCTTATTACACCATGATCCTGOCTAATAGCrmCAAAACTTTGAOAAAAATC^ 
AAAOGTrrCACATCm><XTGAAACrrACAAAmAACATTATCAAAOAAGGAATCCr^ 
CTTACAAA0ACCACTAaAAAOAAACAACAATTAAAAA0CTAAOAAACTtmnt:AAAOaC^ 
TTTACAATCCTTCCTCCCAQTAGGGrAATGTTATTAAATAATCCAArc 
TCTQCATCtCKrrCTtXrmJCTrClKjCCATAT^ 
TCT^AACATra^^■ATTTCOATAACCTOAAACTOT^CTGT^NCC^^^ 

TATCCAGNOAAGCCCATNGCAAATAGGAAAGAAGCTCAGNATTCGCKfTCTTCCAACATAACCC^ 
AACTrrCTTTCACTGOCTnXXXXjCaNANCTnTTriTNC^^ 

^mGOTIU^I>r^TT^T^TrmAAAAAAGOGNN^w^^^K^n^ 

TTCAAAA 

SEQ ID NO: 3395 ACACTraAAACCAAATTTCTAAAACATGTnTrCTTAAAAAATAOTTGTTGTA 
ACATrAAACX>TAACCrAATCAOTGTGTrCACTATGCTrCCACACTAOCCAGTCn^ 
TCTOaTTTCAAOTCTCAAOOCXTOACAGACAOAAOOOCTroOAOAr 
TCTCAOCAACTTGAGAGCrrrCTTCATOTTGTCAAGCAACAOAGaXn-ATCT^ 
CATAGAGACGATTTGAATATCTrCCAOTGATATCGOCTCTAAG^ 
ATAATCCTGGGGACATACTXKKX:ATCAGOAGAAA<KJNGTTTGNCAGTny^^ 
GACOAGGACAAACTChrrCTGCCAATTTCTGGATmnTrATT^ 
TOOCTONONOaOCACTtrATTCCAONOATOAATAATCTCAAGOaQT^ 
AAAOCTTTTCATATOTCTGAGNCCAANGANOTOGOCOCCCCAAOC^^ 
TTNOGGCAAAAACCCrraQOCCTTTTOGNTCCNOOOTQACmOO 

SEQ ID NO: 3396 GCTACATCTTAGG ri rr [LU ' l C CTrTAGTCrrGAAOAOGCOTrnXACCAAC^ 
ACAOCTCraCClXXJAijrTmACTACMTTOCTOCAAATrrCAT^ 
TCCATTTATTOQAOCCAAAAATICTAGGCOCrAQAATGOaAACAA(KrrAQTX>GC^ 
AACATAACAAAACAOGAAACOCCCGACAOAACACATGOATCTAGATAOTAGATAATCAGAAACA 
CCAAAOAAACCACACCCATGATGGCAOOTGOAAACKAOOCTCTrrcC^ 
0<XATCAG<>TXrCCGCGTACTrcrrG<XrrGTAOAATrTCC^ 
AATAACraNGAAGAACACOGGCAATOOATnXXJIGACCACrCOGATCTrAAANAO 
COACOCXJNCTGTCACTTTOOCOAACCCCIiACmGOACAAOTTCCC^ 
AACrCTCNTTTITriTCCOCGAAAATrCANCCTrc 
CGGCCGTrCGANGAAAAAACCCCOGTACTNCCCOOGOOCCrrCAAA^ 

SEQ ID NO: 3397 AC r r m T T T I Tn -I T II TlT l llt mil 11 II ACrGATAAACAGACATOTTTAA 
TOATAGCrrOCTmtiACAGANATOTCTACAGAGACrnTAATCTATAATCCAGGAGTNTATAArc 
ATGCAGCACAGACCAATTAOCCAAATGCAAAATAAACTANATTCTrACCACAACTOT 
CACTOCAACTAnCTITCCAAAAGGACCCTAATCTTATGNGAAAACACCTACTG^^ 
NAAACCTANCTNmraOCCAAAAOGAAAAAATAGCAATTNAOGGCTTAAT^^ 
AATOTATOCCCrn'ACItmGCTOCTGTTGCTOCTOTTGGTGC^ 
GCOGNAOOOGCTOCTaCrocrOCTGCTOfmxnXKTnr^^ 
NTmrTCACCCTIXXATOTrNACI>riTAAGNOGOTroAATAA AO^^ 
TOCANCIXXXWUGGGGTOGATTTTrrmCCCTTaHjGGNGTT^ 
GGC^^^^^'AANACCIm^AANANC^^OOGC0QAACNCCrANGO0QAATT^ 

SEQ ID NO: 3398 ootacocoooqqaqacaoactgacgcgaocagc caago tottggagcagct 

CACAGOGCAGACCCCKmnrrrOCAAAOCTAGATACACrGTCA^ 

TOAAAAGATroCrOTCCACrOCACAGTITOAQOOOCCAAOOCAOAAOAAATC TrgO A OAAOO OTC 

TAAAGGTOOOGGAGTATGAGTTAAGAAAAAACAACTTCrcAGATACTGOAAACriT^^ 

ATCCAGOAACACATCCATCTGOOTATCAAATATGACOCAAGCATTGaTATCrAOCAOT 

TATOTGGTGCrOGGTAGGCCAOOTlTCAACATCGCAaACAANAAGCXSCAN^ 

OOOOCCAAACACAGAATCMGCAAAOAGaAGGCCATGCCGCTGOTrNCNCAAAAG TATC^ 

ATCATTCTICrOOCAAATAAATTCCCrnXTrTTOCAAAOACCArm^^ 

TAOrrTGCCGGCJ^GCCGTTCAAAAGGNGAATTCCACCCACTGGCGGOCOniCni 

TCGNNNCCAACTT0NCOATCAT0GCANAATT(m'CTraNO0AAATGNTNTCCCrcC^ 
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SEQ IDNO: 3399 CGA<Kn-AOOaKXK::AAGACCOGCGAGGCGGAGACCATCACCAGCCACTACT 
TGTTTCCOCTAOGCOrrrACCGCAaJCTCTATCTC^^ 

cncnasAOCTCATooocATTaraocAGGocTGOt^ 

CTCrATATCACOJ^OTCCTAAAOOGGAACAAGTraACnTrOCOGQ<>TAOC^ 

TCTCTCT(XnXX}OCA0CAGCO0GA0OCAGAOaAAOCHXOCAGAAGATGAAGA0CTr^^ 

G<XiCTOACTrrmAAAAACCCACCTCrrGTGCTCC^ 

CAAOTqGAOOATCCAGGTmGOCMAACTCAAOACTTroOOCTOGTTGNAA Ul 1 L I I GCCmTAG 

AOCNGAA^WAN^WN^lNN^lN^WNN>^N^IN^I^ 

^m^t}ATGACAAAATACCAAAGGCrCCCCATCCCNAAGATNAAAAAATAm 
AAAAANGGTraCCGTGGNOTTTrAANANAAGTTAAACCmAACCCATGC^rr 
A 

SEQ ID NO: 3400 GOTACACGTCTCrGTCTGGGCCTOOOCCAOGaTOOCOAOOOCCAOC^ 
Aa:AGGAOCAGGGCCCJ^OATCACC^mTC^m^TGGTGOCCA^TCCCTC 

aiAOcccoAcrrcAGOGATCCOXJCOTACTTrrrrriT^ 

TrATTCCCrCACATOOOTG0TrCACATACACAOCACANAG0CACX}GGCACC^ 
CACTCCTOCXTTCTOAOCOOATCTTGCCCTCACGOT^ 

CCXTCACTACGCCCTAOQOAACXXAOQAOCAAATCECACCACGCCTTCAT^^ 

ACCACCrn30N0ACG>riTAATTCCAA<Xi^TTATANNAACWOGANAANGGA 

ATTNCAGOGNGAAGATTNAACN*GNNAAOGAAaATACANTrcOTOAAOOCa:NNGA^ 

GAC0CCTT^WAANCT^^^^^KXK)GAAAGOG^nm^cTGGG^^^^ 

NAAGGNNGGAAAA^^It3NATOG0^mmT^NGN^mNN^^ANNNC^^NAAT^( 

SEQ ID NO: 340 1 aCTACCACTCTOOCCTOTXnTCIXX^CCCGAAGTATCaj^^ 
CCAan^TOGGAAGAGCOOrrAaAOOGCTTin-CAAa^TrAATGAAACCn^^ 
OCAATCrCTCTCrni^GAGaATrrOAACCACATOCAAOAOOTGTATAOT^^ 
AOCTTATCAAAATTCTTTAGCAAAAGaGGCCACTaAGACaTXrrCCAACTTA^ 
GOTTTCATAAGAAATTCTrcAGCOTGTTCn-AT^^ 
CCTTTrrCTTCTTATXjrTTCmOOCAAAATAAmCTr^ 
GTGOGTOCCG<XOGOTCCAAACCnAGGGAACGACCGCCCCarACCrO(X»^^ 
GGCAAATTNCA 

SEQ ID NO; 3402 ACOCOOOGAGOGCGGCAO<»OCOACCaOAOCOGTAGOAGCAGCAATTTATC 
CGTOTGCAGCCCCAAACTOGAAAGAAGATOCTAATTAAAGTOAAGACGCrGACCGGAA^ 
TTGAGAnGACATTOAACCTACAGACAAGCTGGAGOOAATCAAOQAOaSTGTOaA 
GGOAATCCCCCCACAACAOCAGAGOCrCATCrACAGTGGCAAGCAGATGLfU^TGAT^ 
GCAGCTOATrACAAOATTrTAGGTOGTrCAGTCCTTCACCTGGTOTTOGCT^^ 
OOTCTTAGOCAOTGATGQACCCTCCATmACCTCTITAOCCTmt^^ 
ATOCItnX^CTCTCTOGGACACCATAACCACTGNCCCmaXn^ 
ACTaGT(XK3AAGACTGNGAGOATCCCAGAATrCAATArrrCTTOO<XCAAAA0<^^ 
CTOGOTOGTAAGTTOCAAGCCTGTGNGCmXTTTmmTQAC^^ 
ANrrCCTGGOTrCTTGGAAAAAANNNWTOWNNWft^^ 

SEQ ID NO: 3403 ACGCGGOGGAOCCGAGCOGTAGCrGGTCTOQCOAGOrnTATACACCraAAA 
GAAGAGAATGTCAAGACGAAGTAGCCGTTrACAAOCTAAGCAGCAG(XCCA000CA0CCAGAra 
GAAT(XCCCCAAGAAGCCCAOATAATai\GO(X:AAQAAGA00AAAACTACX^ 
AAAOAAGAGAGGAGGTCACCAAGAAACATCAGTATGAAATrAGQAArTGTTGQCCACCTOTATT 
TCrGGGOGGATCAGTOCTTGCATTATCATTQ AAACAO CreACAAAGAAAT^ 
CTCCAQATTTACAAATTACAOATTTAAAAATCTTTrrATTAATCCTTCAC^^ 
TtKXKUTXmx:AAAAOAAaTCTaGCTAAACAT0TrAAAAAAGGAGAGCAGATATGTT^ 
ACATTTTOAAOTTCTOCATICItUCTrOOAACCACAOATOAAaNC^ 

AAAAONATTGCKJAACTATACACACTITATANOaAAACAmTrAlCTrOCACAANAATITrrra 
ATTATGTrGCCCCAAAAGCGmAATNAAATTTCmAAatWrT^ 

SEQ ID NO: 3404 caogocactqu^ataacacccaaaaqaocaack:agcccctocc^ 

GAAAATAATCCnXJAGCAOOAACTGGCATCAAAAAAAAAAAAAAAAAAAAAAAAAAGGTACT^ 
TTTTTTrrTnTnTTTTrrrGOCCCCTAA^ 

OCCCCAOOTCAOQTCTOCCAAAGGGTrTOCCAOC^OTCACTTCAAAGT^^ 

CAAaCANACCTTTTNCTOOGGAGOCCGTOTCGCCACCCCAAO^^ 

CCTGTCACCTOOOC^CCAOT^^^NTOCCXJATO0CCTCAACAGGAC^^ 

AAGGGGCXXnNAAGCrrOGCAtXUACCACGTGGACAAAAGTTITOTAN^ 

GCAGCCrOG^'GATCAAATCCa:ANCCCTCCGGGT^NraOG^ 
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NAAGGGGACrniCCCrnTCAAOGCATAANANA 11 1 N 1 1 1 1 IICCAAACCCCATMOGCNOCTNGO 
000<XNACXK3NAATGG<KTOGTTOCAATAATGTCCAACATGA^ 

SEQ ID NO: 3405 ACnTTCCTOOAAOTAAAOTOAOOTCAAOCCACAOOCTOCTCCAaOAC^^ 
ATGAAAAATranTTACAATCTCCrrcrGAACATOGACCnXTCC^ 
AAAaraATGATAOCCATTCT(nX}CCCTrGOCTATT0<m^ 

CCTTCATGAAATACAAGGCAGGGCTCCAAKKiAGACTGTOGAAOGAATACrrCATACT^ 

OOAATAOCGGGTTrrGCAGCAATIOTTGCATATGG^TTATATAAACroAAGAOCACOGOAW 

TAAAATGTCCATTCATCTQATtXACATOOQTOTaOCAOCXX:^ 

■[XKmKJTATO30CrATTCCATCn-ATCG<KiAATrCT00GCA^ 

CTGCTIOCTCrrCKJTOOAAOAACTrCKTrrAATTAOATGCTrATr^ 

ONAATAAAC^AAATTGGATNCKKmANATC<i^L^ACATGOC^^Tra^ 

CTOArrOCnXKWACCGAATACTAOCGCTAOTTACrACTAGOCATC 

SEQ ID NO; 3406 AcxxKKA(KnrrtnxKnxnxxACAOOOCixxcc^^ 

GCGCOOCCACItKKKn-ACAACKXXAACCAAOGTTACGTrATATATAOGATrO^^ 
TGO<XXfAAAACGCCCAOTIXXTAAGGCTOCAACnTACXKKAAQCCT^ 
AOCTAAAGTrrGCrCXiAACCCTTCAGTCCOTrOCAGAGGAGCOAOCTGOACOCCAC^^ 
CTOAGAOTCXntlAATTXrrrACTXXXnTCGTCAAGATO 

ATTOATCCATTCCATAAAAOCTATCAOAAAAAATCTGACACCXAATOOATCAC C^ 

CAAGCACAAOOAAATTCCTMKXn'OACATNTGCAGOCCOAAAOAACCGGOGG 

CCACAA(TITNCACCACACTTrTNGOCCNTITrCXKXX}OCAACT^ 

GTT(>AC10CTACCGGTTATATAATAAAGTTro>WAAAATTATCCTTAATAAA<^ 

0AAAAAAAAAAAAAAAAAAAATNCTrG0CCGGACNCCTAfK3<K?NATrCCAC(r^ 

SEQ ID NO: 3407 ACCAOTGOAOOAAGOCCTraXMKXWAACATGOCAOTOAACTGCTCOT 
OCOCrrGAAGAQCrCCTGOATGCCnnXCTATTOCCAATQAAOGTGA^ 
AGCrrOOGATGTCACAGACOOCIXnCITGACATTOTTGOGGATC^ 
CTTCnTCTXKIACCJrTAAGCATCTGCTCATtXiACCTXXTrCAT^ 
AACCAC0CroAGGTATC0GCCGrrCO<XKXK}0TCACAAGGCAGCCAT^ 
ACXTOCTOOOTOAGTTXXOGCACTOTGAOANCIxmTACroCTGGCrrCAC^^ 
GCAAANCCCGCATTAANAAAKKSAACATOCOAAhKXKUCCATGrrC^ 
GANACATrNAACTGGCCCNOGAAACC0AGOCANGNO0GGACACCNCTCATKK3OG*^ 
AGGGOGGTAAAANCXXrCANOGTaOGONGGGCAACnNAAAANGCGOAACNNATTnATAAAAO 
GCCTT^^trNNAANGCAAAAGGGTrATTAANANT^^T^XAOCGATOGGGGGy^ 

SEQ ID NO; 3408 CCOGOCAOOTA Cl i i H 1 J H J 1 1 1 1 111 1 1 1 1 1 1 111 IN OaATITOTCTOCCAA 
OTtlANAANAACTrCCANATAANAATTTCTGCCTOCAGCTCCAGTAT^ 
CAG(XACAACACANAGCroONOCIXXrrCACACGTCCTXrn^ 

OCAOCTt^OOAOaTGGGGCCTrrATCAGAAAACAAATCAAOGOCCCGGGACACXlATGT^ 

GTCCCGOCCACCaTCATAATCCACAGGAAACTCCCrnCTXJAAATATCT^ 

TTAATCCCOQATTOOOAaOCCAAAACCTNATTGACTNGAACN™ 

CTTKXXHrrGOTCTTTTACTTTrNAANCTTNGOGNAANCCACT^^ 

NCCCCCCCAG<XX}CATTAAACTTAACCCTrWAACNTmATTN^ 

TOGGOGNTNGGAAATTAAATACAATCC^mACTTGAA^r^NTAN^r^TTGCC^ 

(mrrnXKmCGNCCCCAAOGGANCTrrANAATOGGCGCAAAATTAANG 

SEQ ID NO: 3409 ACAOCCAACOOTTTCCCTTOGGGOCITrOAAATAACACCACCAGTaG^ 
AGGTTOAAGTCTOGTrcAGOGCXAGTGCATATrAGTGGACAGCACn'AOTAGCTGnrGC^ 
TGCAGAOTCAGAAGATGAACAOaACGAGaATGTaAAACrcrrAAOTATATXntKiAAAGCGGT^ 
CCCCTOOAOGTCGTAGCAAGGTrcrACAOAAAAAAOTAAAACTT ti^ 

GATGATGATCAAGAGCATGATGATGAAaATCATOATGATGATGATTTTGATGATGAGGAAQCT^ 
AOAAAAAOCOCCAOTGAAOAAATCTATACCAAOATACnXAQCCAAAAATOCACAAAAOTCAAA 
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ACTCAAAAAGOOACTATTTAAAOGAAACTCAAATCAGaAOAAOOCGOTAOGCATCAGAOG^ 

OGCKIACCAAOOCCTTAOGOCOGOAACACrmCAACXXAAGCCAGOCTrC^^ 

AACAGACC(X;AATTrCCACAOOOOAN0<>OATCriCrATCCTtL^OOTGGCW 

actoc^ooattaaatataaaaaogorroattctgaatagaccaacto 
attttaatttcotgaataaaaaccx^attaottcctgaatctnatttog 
CAATCCAAAACTOOACTrrrcATOorrGATrrmnxx}oc(^ 

SEQ ID NO: 34 11 GC n'ACN NCAATATAAGCAAATCTCAAATACAACATACrraTAA'ITAGAACAC 
AATOCAATGACTTGATTTTAOCAAflAACTAGACACn'AATTTOOTAAAAQAAAt^^ 
TTATATTOAATACTAAOCTAAOrrACCAT^ArrAGTCTl'AaW^ 
TOAACATCTAAATTrAAACCTAAATTnTTAATTAAAKHXnXnTCAACAAAO 
ACACAmATOTAAATTTACATTCrAGAATAOCAGOOTAAACAAGOAGACGTTATrCAAAGATO 
ATGACAAAGTTCTATTCTTnTCATCAr^ 
CTirrCTAACCACrOCTnOGCCrCTTCANQAATCTOATT^^ 
CCTAAAGAATnmO^ 

crocATTmA-nmAAfnrnxrnxMAT^^ 

NAACCAANCACrmrTOOGGOOCrcOAGCAGCCCCACTrTTrr^ 
G 

SEQ ID NO: 34 12 ACGCNOGGaOGaATaTGOOACCt€CAATrCXX>OCCOOaOCT^ 
CCCAOGTOTTQACTCCAOCTCCAGCTTCAOCTtXIAGCTCC^ 

CTTAGGCAGOGCAOGTrCTGTGTCCCA tjl lUl U IC CAAnTCACCOOCrCCOTOOATOACCGTGO 

GACCTOCCAaTOCTCTOmOCCraOCAGACACCACCTTrC^^ 

ATrCACAGCTCATXTrTClTTCTCAGAAGTmSAGAAAGAACTITC^^ 

ATTAATTAAGTaTOrATOAAAAGAAACTOOTAAACCTAACrOTOCGAATTGACAT^ 

GOATACCATmrrACACTOAACTOOACrrCOAACaXihrrCAAGGGTAAAAOTGA^ 

GAAAAACTTOGGCATTACACTOAAAOOAAAAOTITrGONOQOAAOCrCANAAATTNOT^ 

CTOGAGGNGOAOATAANAAATTrOACTKrrTTOGNOANAAACnTGGAACCNTAA^^ 

OCCrrOamXXKXXWNAAATOOGOnrmiAAAAOCAACTNNAAAj^ 

AAACCOC 

SEQ ID NO; 3413 GOT ACm CTGC^GAGCTACATItACTGCATAGCGCATAAACTCATTOCIXK^ 
CCTGAGAACAGAAAGOCrnxnTGOTGGCrnXKUTACTATCOCTCAAAAO^ 
TCOACACAAATTTCATOGACAAAAQACTATCACTrerrCTTGTTGGCX^ 
COCTTITrCATrAAaAATGTCAGAaAGrrTrTnOTAACACATAAATAAAOCTCAGAAT^^ 
ACCTATTCrrACTOAAACTACTrATGCAaAAATTONATTCTATTAAAAOCTCACAAACr^ 
CAAOAAAAOCACAGATATTATnTTTATXKXlVU^TACTCmXKJrcrGAAAAAAAT^ 
OCAGNTCAAA0<nCrrCCAACTC^ 

AAOOTniltXKJAOAATOCCTCTTCTTCCTCl 1 1 1 IL'l ICl ri'ltXriXIWlUOAANGGNATGAATNGA 

NTTCITATACCAOOCCAACCATGCTGGATCXAOCXXAGCAANTATrCCA 

ACTTAn ICl l lW<KK:AaANACAAGClTrrAATTTAAAaAAGCA(XAANCAGGTrrNGNN 

*TTIXKnX?TCCAAACAGGA'^'i^^ 

CTTACIXXTrrATAACCACCAGCrCAAOAAGGAACCTACAGOCITrrOGAAA 

GGCnX3ACrGaTrOGGTCTGTAOaXKTTTCACT<lT 

ACTACAOrmAGCrCXrreATTOCCTOTOTOOCAAACAOTQATATCT^ 

AOGO<XAAATTrGAGTOCTCTTTAAGACGTATGACAAGGACATCACCrrrc 

TTCAAACCANTCAAAATAAACrrCAAGAACCCCTITnTCGCANCA^ 

AAAAATrAATrrCTNOGNAAAGGAAATGAANOTATATrrraNTTAAAACCTT^ 

CTCy^CANCrOO^r^■a;aKX;AANTCANANAAACAACTTITGATITTN^ 

GG<X}ATOaNAACAANOOAAAANOGANCCCAOCTNAACChnXWNCCTINATGNCXTr^ 

SEQ ID NO; 34 15 O GT ACACXT TGAAGGCOAOOTrAATrAAATCCnmOTCCAGTTTOAQOGCC 
GGAATTTAATTTTTOGAOTTTTATrrAATATCGGQAGCAOATTOGCTAATAAAATOTATATT^ 
ATAAGA(XKKXrrTTT0ACCTmAGGOTCTAGO0CTOTAAAOCOTtr^ 
GOCATOAACTGOCCrGGtnTrrrATATTTGATGAAAAAGAGCCTAAACGCTTCrrGAT^ 
AGAAAAAGOAGCATrAACCTrGACTATOTCTnAGCIXX:A0CCACCTrmAA0AOT>^ 
GGCAGGTOGOGOAGGGCTAQTCAOXiAACXiAAACTGTAAOCOCOGACCAXOTOTOAGGAGGOT 
OGTOATAAAAAGATTCAGOGTCGAGGAATXKrAOCCTGAAGAAGAATTOOaACCTAAC^^ 
0GAAGAO0AA0aOAAAAOOCAAAATQG<jTTrCTANAAAAAGOAAAATTNACAx:ACTCAC^ 
CroGGGOTNOGACrrAAGOGACANOOTGGAAGGAAAHWGOAAAATTNGGACCAATrOOT 
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NCANAACTNGOAAGGACGOTOGGTAAAAAATGCTGOAATTAOTNCTAAACATrrCC^^ 

SEQIDNO: 3416 OOAGCTA C] H J U mi 111 H H I il i 1 11 1 i OGOAOmTAnTAATATCGG 
GAGCAGATTG<MTAATAAAATarATATItUGAATAAGACO{XXTTrT(MCC^^ 
CCnTrAAAGTOTCItVK0G<mtXTCKXXiAAC0AOCCAT0AACTGC^^ 
AAAAAGAGCCTAAACGCTrcrrcATTroGOATAAA0AAAAAGGACK>TrAACCTr^ 
TAOCTCCAG(X>CCTrTrrAA0A0TAAATTGCTGGOCAGGT0GGCaA0<KjCTAGTCAC^ 
AACTOTNAOCCGGACCNOTGTGAGGAGGGGAGOTOATAAAAAGATTACAOOOTOGAGGAOTGGA 
ACCTOAOOAAAAAATroOOACCTrrWCmiGCNTNOAAAaGAAGGOANAGOaNAAATCOOT^^ 
rraAAAANOAAGATTTNACCCNCrcACCACXXrcroGGGlTTOGACmAAGGa^ 
GAAA>WAAGAAAATr^GGOACCNA^nTCNCTGOCCCCAAAACTrcCC^ 
AAANTGOCTNGAATTAGGN'CCTAAAO^mT^CXXrrTTTTNACAAATT^^ 

SEQ ID NO: 3417 GGTACACITQAAACCAAATrrCTAAAACATGTrm'tTrrAAAAAATAGTrC^ 
GT/^CATTAAACCATAAOCTAATCAGTCmjTrCACrATGCTTCXACACTAGCCAaiO^ 
TTCTTCTGGmCAAGTCTCAAGGCCTOACAGACAGAAGGOCTTGOAGAl l-rri ri^ 1 lACAATT 
CAGTCnt:AOCAACrrGAOTtKnTrcrrCATGTrcTCAAGCAACA^ 

AAOCATAGAOACOAmOAATATXrmrAOTOATATtXracrCTAACTQTCAGAGATGGGT^^ 
AACATAATOCTTOOGGACATACroCCCATCAGGAGAAAGGTGGrnJMCAACTr^^ 
AGATTOAOOOAOQACAAACrONTCnKXIATrnnXKIATTCrrrATrr^ 
AGCTTXUCTOOOTOGCXrajCGTCCTGOCCGGCGONCGGT^^ 

SEQ ID NO: 34 i 8 GOTACGCGOGACTCAAAAOCTrOOACCOCATNCTANOCXJNCOACTCACA 
GGCAGA0TTQ<XATQOANAAAATNCCANTOTCA0CATrcrT0Crh4CT^ 
CTGGCCAOAaATNa:ACAGTCAAACCTGCAGCCAAAAAGCACACAAAGOACTCTOCAC^^ 
TGCCXXAGACCXrrCTCCANAGGTNGOGOrraACCAACrCATCTC^ 
CTATATAAATCCAAGACAAOCAACAAACCXrmGATGATrATTtlATCACTNGO^ 
CAGTCAAGCTTTAAAGAAAOTOrrrOCTGAAAATAAANAAATXX^GAAATTG^ 
OCTCCrCAATCTGGOrrATOAAACAACrcACAAACACCirnmxr^ 
GAN7NNOTTrOOTNCCC>TNTCTCCAOGTACAOa3«UTKXnX30AANATATTAAAC^ 
T^ACCAACCTGGAGATCAGCT^m^TONTrOOCAACATQAAaAAAOCT^K^ANa^ 
TGANTGTAAAAAAAAAAACTCAANrcCTTrTTCTOGAAGGCTTOAAACTraAACCC^ 
AAAA 

SEQ ID NO: 3419 ACCTAATCACCOCCOIACCrGOCATATACTOOCTXKJCCTC^^ 
GGGTCrrAGTCOrniACCACATGOCTGTrGlt^O O TCOrrGCTOC^^ 
GGOGACIXrrCCCTGTGCTOGrrrrOCAOTGTOT^^ 
TICTtrrACTACGAAGACCTGCTTOGAGACCAAGCCTrCTTATCn^A^^ 
AGGGAaACAOCC0ATACATCCT00GTGACAACTrcACj^GTGTGCATOGAAACCATCACAGC^ 
CTOTOO0GAa:ACTCAACCTXntXIOTQ0T0ATCGC L -|- lUUri tX3C^ 
CAGCTrmXXSTCTCTGTOOGCCAOATCTATOGGGATOTOCTCTACTrCCTOAC^ 
N0ATTNCAACACNGAGACCrGO0CCACCT>mn'ACTTCTGOaTITA(X^^ 
TGGCrGGTOCTONCrGCAGTCCrnXiCrrGATCITGGAAACANC^ 
0CAAGG(XXX>AAOCCAAAA(XAAAAAAACTGOGNaNKmGGACAGCnNA 

SEQ ID NO: 3420 ACOOOGaOOCraACICrCTTrrOOOACTCAG<XCXXXnXK:AO^ 
TAAACAGCCATCTrOCTCACACA/JkOCCTOTTroGTGGTCTCI^^ 
GOTGCCATOACTOOGATTOGGGGACCTCCCrrOGGAOATCAATC^ 
CCCTCAOAAAOATCCACCTACXlACXnXUGGTCCrCAGACCOACGAGCCX^ 
ATITCAAATCTCXn'AAGCAGCCTCril'riACrmnTCTCX^ 
TTTCTCCTTrCAATCITGGCACrACACTrCAATCTC^^ 

TOOTAGAOACAAAAGANACACGITITATCCCTGGAaxrAAAACTTCNGCGCroGT^ 

OAAAGCAOCCmXX:rTOGNGGTrAATCATrcNANOGATOCCTNT>rr^^ 

AANGGGOGTAAAAC *JCGCNNGG ACAACTGOC^mH^CCTrCAa:^^■ACAACAAGTCCGCTT^^^ 

CCGAAAOOGCAAGACCTIGG0<XGGACACCTTANCGNGAATTtACCAT0aO0GCG0T^ 

A 

SEQ ID NO: 342 1 OCGTGGNCNNOGCCOACGNWCACTCTATAATACTCCAACAOTCnXCATCTG 
TAT^K>ATGGCO(X:ACCCAATACAaACC^T^G^^raOATGCT^^ 
CACCATATACATAANAAAACCCrCTtXAGhrraAOCTOAACCAC^ 
ACACCACCATOACCACAOhnXXXnXXlAaTOGGAGQAGOQTGGACCACAOGGOTaT^ 
ATAGOCCTrUNACTCTTrCAOCTTOOTCnCANAOCCACGATm^^ 
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QTTTTTGGNTANNGTXnCAAOCTTNTCrACXAAKACAGTNATA-m 

NCAACAATmrrGAOCTGCNN(XNAAACTCTCCTTGACK^ 

ACTmACCTraAACAahrrANAAG<m:AATrCAAGGGAAANAANGGTATTCC^^ 

CAAraXKINAAACNNOOTim(MNOG<mTrrCAraOCCCCICAATO 

NTTAOAAAQTTTnCTTOAACNrrraANAAANAAATOAACTTTGAATTOC^ 

NAGGNAAGGNGGTTTOOCAOGNAACAAA 

SEQ ID NO: 3422 OOTACGCGOOGACTCCGTCCCGACATQATtKiOGAGCATGCGAGTOG^ 
OAOCTO<UOOATCTTCA<UAaAAGCCTCCCCCATACCTOCOaAAC^^ 
TCrnXTGGTOTOGCACCKnOCCTOCTACaXlACCA^ 
GCGCATCAOCrrCCaKXXKlAaTATOCOTTCAAOCCTCCCA 
CCACCCCAACGTOOACOAOAACGGACAGATTTCKXTGCCCATCATCAO^^ 
CTrGCACCAAQACTTGCCAAGIXXrrGGAOOCCCnXAAT^^ 
OAOCCCCItXXKlATOaACCrCGCTQACCTOCTGACACAGAATCGO^ 
OAAAAAOTTCATaTrCGATTCCKJAOTOOArcOGOCOCTr^ 

TO<UTCCTCONATACNOACNOOCACACOTUTOOCTQAGOCCAAANOCXTOOGOCC^ 
AT^Tp^•CCTT^^^■ACK}TOOTAOCAT^AATITNGT<JI^^ 

Tcrrr 

SEQ ID NO: 3423 0CrrA<XHM00O0CCAOA0ATA(XACA0TCAAA0CT0OAGCCAAAAAGGACA 
CAAAGOACIXrrCGAaXAAACT0OCCX::AaAC0CTCTCX:AW 

ACrCAGACATATGAAQAAGCTCTATATAAATCCAAOACAAGCAACAAACCCrraAT^ 

TCACTTtKjATGAGTGCCCACACAGTCAAGCrrTAAAGAAAGTGTTT^^ 

AOAAATTOOCAOAGCAOTTTCTCCTCCTCAATCTOGmATGAAACAAC^ 

CroATGOCCAOTATGTOCrCAGOATTAT Xr rT TC T I OACCCATCTCnU 

CnjGAAQATATTCAAACXXmnCrATGCTTACCAACCTOCAGATACAGCT^^ 

TQAAOAAAOCTCTCAAOTTCKrrOAAOACTOAATrGrrAAOAAAAAAAATCTrCCAGCCC:^^ 

GC>NCXXint}AGACTIlCNAACCAOAAAAATGTOAGAAGACTGGCTNGTCTGC^ 

OCTGGhTrAGGTATOOGTNATGOTtXACAACTmrrrrAAaAAAACAAOmANAAAT^^ 

GNGACCTCKXXXi 

SEQ ID NO: 3424 <XAGGTA Cn ' [ I Tl r i T I Tn ' ri t ' L rri ri ' L 'I W GGOGTTACCAAACAGCrATTrT 
TATTATAGCATCrACAGTGTCTGGAAAAGCATGTAACAATAAATAACTGTAOACOCATXi^COAO 
CATXX^ATrrACTTAATtACAGAAGTGCUTXrrniCrACATAGTrnrrCAA 
rrCTCCAGGCTrACGAATGTCATCAATCTTCAAAATCATTCrAACX>T^ 
TXnTOCniniOa:AA7t:AAOGTnCTATOACATGCrOTTOCm> 
AACAOTCCATOCCAAGAGCAGGGTrCATClXXTrCAOCTOTCnXKjCTCGGAC^^ 
CKjATCXXjATTCATaCCACTGrriTCAAAAAOOOCCATOOOOATOACC^^ 
GCTCT^WTGONATACTCX}TCTAANGGGOGGCCITATCCCNC^^m■0<XT 
ATATNTAANAGCCCCTCTCATACCXACACNATATTGCNGOTGAGGTTa^ 
NQOOAANQGACGNTCNCCCCCCAAAGAAAACTATTTCCCCCTAANAAAAAGGT^ 

SEQ ID NO: 3425 AC0COGG0<XiAAA<nGTGGanX2ATGOCCGCGG0GCnjnX^ 
OCTGCCTCTACnXKXjCTOCCAAGGCTCTGOGCAAGan^ 

ACCTOCrAAATATGATGACATt:AAOAAGGTGOTOAAGCAGG<Xr^^ 
ATCCTOGOCTACACTOAGCACCAGGTGOTCTXXTCTOACTn^ACAGCOACAC^ 

TrraACGcrGCTrGCToaCAriXKXXTCAACXjACCACTrn^^ 

AAmOCCTACAOCAACA0a<m30TQOACCTCAT0OCCCACAT0GCCTC^ 

GGACCA0CANCCXXACCAGAACCCAAGANOAAQANACAGACCCTT*ACTTOTO<^^ 

ACACTCAATCXXCCANCACACrOAATCTCCCIXnNACAGTGNCATGTAAA 

GOGCCTAAGGAGCCCACCrrCKATrrACrrTGGCGaAACACCTAAGGOOAATTCACAA^ 

OOGTT 

SEQ ID NO: 3426 GGTA Cl U U 1 L 111 1 L 1 H U H L 1 1 11 M OOOAOmTAnTAATATCOOOAO 
CAGATraOGTAATAAAATOTATATTGAOAATAAOACGQCCTTTTOACC^^ 
TAAAGCGTCTCAGOGTTGCrGCCOAACGAOCCATgAACTGGGCTOGGTrmATAT^^ 
AGAOCCTAAACOCTTCTGATTrOOOATAAAOAAAAAGGAOCATTAACrrrOACTATOTCm 
a:AQCCACX7ITrTrAA0AGTAAArrGCrGGGCAGGTG0GGGA0GGCTAGTCA0GGAACXJAAA^ 
TAAOCCOGACCAGOrGTaAOQAOaOOAOOTQATAAAAAOATTACAOOOTOOACOAaTOGAOOCT 
GAOOAAAAATTGCGACCTAACTTOGCWTOGAAAOGAOGOGAAAOGTCAAAT^^ 
AGGAAGATTANACNCNCTCNOCAAOCCCTGOaG^mX)QACroAAGQGACANGTNGGAAG^ 
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AAGOAAAATrrOOACCAATTGCCTOGGCNCAANACTTNGAAGGACOGATIXm*AA^ 
AATrAO(>(<XnTAAACATrrOCX7rrrTTOANAAAAT>m'AC<nT^^ 

SEQ ID NO: 3427 ACA<KrrACNO(TIX:ATCAA0CTrAOAAC0QAmt3ACTrTGQAQAC«^ 
TATCAAGAAOTTCTGAATTATCAATCTtXAGCAACATGCCAO TOATn TACCW 
OCATQOCITAAATAAOAOOAAACAOCCOTTCACCCAACATrrOCrrrTOCrCT^ 
NATOTCAACATGaAAGCAGTCAAAOGTTCCTQACCTrGTAaKJGAO^ 
AGCTOCCACVVOOCAOCCCTOOGACATAAGAAOCTOOOAGCAAOGAAAOOGTCTTAOT^ 
CCCOAAOTIXXrrTOAAAGCACTCGGANAATrGTOCANCTGTATrrATtrrATC 
GCAA0CAAaTNCTATGAaTGAAA<KKJAOCCANANAACTCATTGGAGG<KXXrrATCr^^ 
GOCATCTOTTOOACTTTCACCTOOTCATATACTCrrCAOC^ 
CA>K:ATOANCTT0CTGrrGTA(XTC>«XXXK:GACCACCCTTAN0<KXJAAN^ 
CGTCTAATCGATCCGANCTXXKIANCAACCTGOCOGATX^TTOOCN^ 
TATCCCTCN 

SEQ ID NO; 3428 GOTACMaKMAOTCAOBTCAAOOTOTATrnmCCTACGAATTGGAGATC 
AOATOTAO<KXX«3GTGATCTrroGTCICITaX3AAAGA 

GOCCTTAOCTACAOOAGAGAAAGOATTTGGCTACAAAAACAOCAAATKrATCCnOTAATC^^ 

ACTrCATGATCCAOGGCGCAGACTTCACCAGG<WAOATGGCACAGGAGGAAGGAGCATCTAC^ 

TGAOCOCiraXCOATQAGAACTTCAAACTOAAaCACTAOOGGOCTGGCT 

ACXH>GGCAAAOACACCAACOCCTC(XAGTTCTrCATCXCOACAGTCAAaAW 

G<KXAOCATCmK7TOTITGOCAAA0TTUrAGAAGGCATrGAA0TXKrrGC00AA^ 

CCAAQACAOACAGCCCOOATAAAOCCCTQAAGCUTGTGATCATCGCAAACTXK^^ 

NGTG0AGAAACOCTTrTNCATCOC0CANOAOTAG0G<>CANGaACATXTmmT 

TTCAGGCCrrTAATCCCCCCANGOTTmACrmACTXKJN0CCX3NGCT^ 

CCCTNCCTTCATT 

SEQ ID NO: 3429 ogtacagnctotcctcccaoccagoaacacxxtctgcctaatqacaogacct 

TtXTOOOAAGCATCrrGACAGCACnt30CAaATGAAOAGCCAQAATCAACTCC^^ 

GGAAGTGACAAGAGTtXTrrCAOCXXWOTAGCATCAATtKimCCC^^ 

AOOCATOGAGOAGCKICOtKlCAGAAATQAGTAmTOACTACroAGCITC^ 

GCTACAAGAOTCTAAAOAAGAAGCCATCAGGACTCKKAGCGAAAAATTTGTGAGCrC^^ 

GGCTOCAOGCCCAOOAAOAACAGCATCAOGAAOTCCAOAAOOCAAAAOAAGCAGACATAGAOA 

AGCTCAACCAGOCCrrtmxnTGClQCTACAAGAATOAAAAGQAGCrCCATC 

AGAATX)AGAAGATCCTAGAACAOATAGACCAGAGrGGCGA0Cn>TAACCNTAaAGANGANONC 

ACCACvrACCOCrCACTTTOONGrTGOOGAOACAGANACCAAATGTTCANGAAOGCCT^^ 

CrGOCTCAACTOCAGCTATCGCCCX:AATGOATCNGGANAAGNGNGm>rnTAOT 

TTONANTN 

SEQ ID NO: 3430 TGAAGGTrCTGGQAQCTCTGOAO nn Tr f C C TTCT l lTCTOGAGCACGGGAAO 
GACTCCOGCTCTTOGTirrGTQACGGGGAGATCGAGAATGACrGCQCmXTIXn^^ 
OOaAAQATCOTCTTCTA^^ 

1 1 1 1 1 iCTCTGCTOCi I 1 1 1 iCi lOCTTATCTLl i I lATui'ilo'iuJ'JCATCTTGCl i'l'i JCATAGA 

TGOCAOTTTTTCTTXmCAATCTOTCTTTGNTTTATT^^ 

GGGATTCCCXKXJATOTrTTCrrcTCCACITANCAACAGOGAC^ 

GCATnTrrCCATTCAAAAATCACrCAOOCTGATTTGCATCATTTT^ 

CNACTNOGTGAATATAAACTCAATaXACATCATCnxa^AACOCA^ 

GGTATNCAAOGCTrrATAACCrCCAAATTACTTroCnXATGnX^ 

AATmAACrOmTrCCCCGTACCIOTCCCOOCOOOCGTr 

SEQ ID NO: 3431 ACCCmAACCXCTrCTCCrrCACCCTTAGCAOCAAOTC^ 
GGCAAGAAACCCCAAACCXXnTCCCTaxnmcmAOGCTXnCIT^^ 
ACTATOOOCAAOCTrCCATCCTtXATTCCTCCTrC^^ 
CTXnrCAACTCACACCrGACCrAAAAOCTAAATGCCrcATTTTC^^ 
ATACAAACTTQACAATOOCTCrAAATOGCCAOAAAATOOCACTrTCOATm 
CCTAAATAATTmGTt>AAAAATGGCCAAATGGTCTOAOQTGCCTaATGTCW 
CACATCCGCCCnXXTAGTCIUTmGCCCAGTOCAACTCGTtX:^^ 
GTCCCTCAGTCCCAAOCCAGGCGOTGCTGAGTOOOCTAATCTTCCT^^ 
NCCTCrrrcACCCAAGCTAGGTCCAATNTTCTKAGOCT^ 
TrA>«CC0COCO0GTAAAaTTONTCCOOOATAN0C 
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SEQ ID NO : 3432 CXiAOGTACAGGACACATntMAGATCTTTTATCOTATCCCCrGAACTAOCroC 
AGTrrrOTXriXX>GCAAOTTCAaTrTCrOCCmTCAACA^ 

GTAOOTATAGAGATTGGCTTGOCXiAOGGCTOCTrCKJGGAATXXXjCACAAGTrCT^^ 

AACATCACCTCCGTCCACroAnArmGTO'ntMTrCIOTAaCAGO 

CACCAOOOC7t>GGAOGATOACOACCAC7TCACAGOCCAAAmmTCTGGAC^ 

OTOAO0ATTGGAAA(TnKTGaTGTAGCT0CACmXlA0OTAAOrit>T^ 

TGOTOATACTCACTQATGOAAGATaAThKjAGAAGGAGAAAOAAGOOACTt^NTAAGW 

CACCnCATOTGTtKi^TTCrnjrrOATAAGGAAAAGGAGOTCC^^ 

CT0CTrOCrrmOOA0AAOAAAN0AAAAAAACCG0a:ANCATTCTKrr^^ 

TGAACTGGAOfrrrnTITTTTriTTrrTrTCNACW 

OGAACCCCCNCTC 

SEQ ID NO: 3433 ACACmOAAOCimtAOOATCKXriXiAAOTCCAAariCTt^ 
CAQAGCA(XCCrrATCAmATGGAGGCIUraTOACraGG<3CAA0Cra^ 
OOn'GATDTGOATGOCTIXXTraTOOCTaOTGCTrcCCrCAAOC^ 
TOCCAAACAATGAOCCCCATCCATCmXXrTACCCTrCCn^^ 
AAOCCCAGTAACTOCCCTITCCCTGCATATCCTTCrGATGOTGrrC^ 
TCAAACTXJNATC^TCrmACTO^rrTATATC^^CACCCTt^•AATOG^ 
TCrrCACTTACTATAATGGNTOGAACTAAACGTCACCAAAGTOOCrrC^^ 
AAAGCTGOTGOGAATrGCTTCTOOONTOn-AGOCCTAATGANGOCAANAANAAAAAACCATr^ 
TTCCrorrTACACCGNGAOGCCAAAKrCCCn-ANAAAGO 
NCCrmOCnxnTTTOAACCCCCAnGAAGOAAhUACCGGNCTAGOCTm 

SEQ ID NO: 3434 OOTAOOCOOOOAGCAOCCCTOACAAGAOAQTKXrnMAGCCCAAGCr^ 
CCACAaAGOACAAOCAGOCAOCAOAOACCA-roOGGTaXXnTCAOOCTOTC^ 
ArrCCCnSGCAGOGOCrCCTOCTCACAGOCTCGCrmAACm 
CAGACCAATATTaATOTCOTOCCGTTCAATOTCGCAGAACGGAAGGAOQTC^ 
AATaAOTCCCAGAATCTTTATCGCrACAACrOGTACCrrcCQAAATACTTCC^ 
CCAAOAATATTTCrOOAAOCATaTaATQAOTTtmnXjATGAAGATAGAGCCC^ 
CCAGGACACOTTCTXmKjCOTrOAAGAGCANAAAGCAATOAAGTCCTTCTTC^ 
AAACAOCATCTTCCTTCANCTTCTNAGATOACTGTOAAAAOOCCCTTrCA^ 
TCrrrTGAOCCAATTCXXIACOOGTrCNANCTTmGANGGCTGQ 
mXANTGANOGCANTrOCOOTGrrGATiyiXKjAAAAGOOOCATAAAACAC^ 
ATCNAAN 

SEQ ID NO: 3435 GGTACACTTOAAACCAAATTTCTAAAAC IIG r T TlTCrf AAAAAATAOTTGTT 
GTAACATTAAACCATAACCTAATCAOTCTOTTCACTATOCTrtXACACTAGCCAO^ 
TTCTTCTQOTrreAA^nCrCAAGOCCTOACACACAGA^ 
CACTCTTOjGCAACnXiAGAOCTrTmx^TOTTOTCAAGCAACAGAGCTC^^ 
AAGCATAOAOACOOTTTOAATATCTTCAOTOATATCOOCTCntXXXKXlT 

SEQ ID NO: 3436 A0GC0G0ACTCAGAAGCrrQGACCGCATCCrAG(XX3CC0ACTCACAa^ 
AGOTGOGTOAOGAAATCCAGAGTTGCCATGGAGAAAATnXAGTGTCAOCATT^^ 
OCCXrrCTan-ACACIXTOGCCAOAGATACCACAGTCAAACCTGOAOCC^^ 
CTC1XX}ACCCAAACnoaXXAGA0CCICrCX>OAGGrrrG0OGTO^ 
ATATOAAGAAGCTCTATATAAATCCAAOACAAOCAACAAACCCTIXWTOATrATrCAT^ 
ATGAGTGCCCA(>CAGTCAAGCTrrAAACAAAGrrGlTTT)CIGAAAATAAAGA^ 
OCAGAGCAGTrrCTOCTCCTCAATCTGGTTTATOAAACAACTGACAAACAOT 
CAOTATOTCCCCAGOATTATOI 1 i UJ l OCCCATrrCTGACAONTAOAGCCCOATATACTCCNAOAT 
ATrNAATCOTCTTCrATOCTrACXUACXTttX:AATCAGCT^^ 
CTAACTXKrrOAAGATOATraThfNWAAAAhWINNNNN^ 

SEQ ID NO: 3437 ACGCGOGGOCTAAATCrGCrcAmTTTCAGACGOGAAACCTAGCAAACrAA 
OAOTGATAAGOGCCCTACTACACnKKTrTTTTAGGCTTAGAGACAGAAAC^ 
GTAQTOOCnCTAOCTCTAAATOTrrOCCCCOCEATCCCTITC^ 
CCIXTTCTCTCGCrGTCnra^OCAGTCTAGAAGAGTGCATCT^ 

TOOCCATAAaAAOTAAAGATTTOAAOACAGAAOGAAaAAACTCAOOAOTAAGCTTCTAOOC^^ 
TCAGCTTCrACACCCTTCTOCCTCTCTCCATTGCCTO^ 
GTTTTTCXrrrroGOCATGOGAAOGOrrACCACrAGAATCCTrOCTAGOTTO 
TCCTTTAATAAACCATrOTOTACC 
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SEQ ID NO: 343 8 ACCTGGCXTArrmAAACTAGTOTAATtACCCTAOTCATACCATTCAOTA 
TTTGClTnTAAAA'rAAOTAACXi^CAATrAAOTTOTTOTAOCCCTrOCAm 
TTACTrTCA(nTGlCTGTrAGGTCCATlCrtnTrACTAGACOOATaTTAATAAAAACTATO 
TGAATOAATIXnXIAGCCAAATrrAAGTCTrOTCTCrCATCrrOATTaQArrAATO 
ATOATTCAAOTCX^CAATAOCTCTAGOGaATGAAAAATrrQOCTTACTTTCCC^^ 
CTOOrrCAACmSOCAAATOCTAAACTOTAAAGCrC^^ 
GaraOCATaTAATTrTTCTAAAOOTCKmt)OCAACACThrrOT 
GCOAATTTCAACACACTTCKXWjCCCn'ACTAAOTNGAATCCAAC^^ 
TCATTOG<>TAACTNGTrrcnOCNaNAAATOOTATrCCCrrTACNATTC^ 
GaAACATrAAATGTNAAACCTNG(KX}CKXTAAG>«K}NGOCCTACCTACAATTAATOGOTG0C^ 
TNOaXNTT 

SEQ ID N O: 3439 ACWACCATAOAGCAAGAATCAAGATTCTOCTAACTCCTGCACAGCCCCOTC 
CTCTTCCrrrCIGCTAGCCTGGCTAAATCTOCTCATrATTTCAG^ 

AOTDATAAGGGCCCTACTACACTCGCrrrmAGGCTrAGAGACAQAAACTrrAGCAT^^ 

TAOTOGCTrCrAOCTCTAAATOmGCCCCOCCATCCCrrrCCA^ 

CnxnCTCTOGCTGTCTCOAOCAGTCTAAAAGAOTGCATCTCCAOCCTATQAAAC^ 

COa^TAAGAAAOrrAAAOATTraAAGACAGAAOGAAAAAACTCAGGAGTAAACCTrCTAACCXX; 

CTT CAACnCTACA CCCTTCTGCCTmrrreATr^^ 

TTOQTrrrrCTTrNOOCATNaQAANOOTTACCAaTAAAATCCIXKTAGOTGATGNG^ 

TTCrnTAATAAACCATTGNGACCrnjGOCCGGACCmniAAOC^^ 

GTCTAATOQATCCACCTCXINACCAACrnKJOGAANAATOGCANAClNON^ 

SEQ ID NO: 3440 COTACOCOGOOOCaaKnGTTGGG^CTTGCTTOGAGOrrGGCGOOOCOGGGC 
TGAAOGCTAGCAAACCGAGCGATCATO'rCGCACAAACAAATrrACTATTCOGACAAATA^^ 
CQAGQAOTnXjAGTATCCACATOTCATtXrroOCOWQOACATAGCCAAC^^ 
ATCTOATOTCTQAATCTGAACCOAOOAATCTTOOCCTTCAGCAOAOT^ 
ATGATCCATGAACXAOAAOCTCACATCTrCCTCrnmXX^ 
ATGAAOCTXJGCAAOCTACTnTCAACCrCAAGCTTTACACAGCrOTOCTTAC^ 
TOATAACArTATTATOGTOCCnClTOOriTCTCACnTGATAmAAAAAAAGGTCAATA 
TTOAATOGTOCTNGTAACrGhnrroaTCTTGAOTAGAACCAC^ 
TOCTTTQTOAACCACAGCCTAAmAATGGGACCCAAAANCCCCAATI^^ 
CTOGCAAATCCAGGAACArrTAATTrGAOOCCTOOTTxmiAAOOGATATr^ 

SEQ ro NO: 3441 ACTXrraCANACn:ATOTANAAAACCACTClXKn'AATT0mGAAAAGTT^ 
ATOAGCAAATAflAAGAAAGrGAGAAOCATACrQCAAArrATGATACAfiAGOAAAGAOTAGOATC 
TTCATCTTCTGAGTCTTOTGCrcAACATrmxri^^ 

ACTOJAOAATACAOOTATAQACKKTAATGTTTTOTGTTrGGAAAGTGAGATTTC^^ 

TGAAAAAOGAGGTGATOCATTOOAAAAGCAAGACCANATATCTGOACrrrcACAATC^ 

AOACAOATOTATGTACCmOOC 

SEQ ID NO: 3442 OOTAOOOGOOGAaAAGCnxXjACCGCATCCTAGCOGCCGACTCACACAAG 
AGGTGOGTGAaGAAATCCAGAGnGCCATGGAGAAAATlXX>.GTOTCAGCATTCTr^^ 
GCCCmCCTACACTCTGGCCAGAQATAOCACAGTCAAAOCTGGAGC^^ 
CTCTCGACCCAAACrocCCCAGACCCTCTXXAGAGCTTGOOOT^ 

ATATOAAOAAOCrCTATATAAATXX>A0ACAA0CAACAAA<XCTtGATGATrATTCAT^ 

ATOAOTGCCCACACAGTCAAGCrTTAAAOAAAGTOTTIXKTOAAAATAAAQAAAK^ 

GCAGANCAGTrTGTCCTCaX:AATCroGmAT0AAACAACT0ACAAACAC rr i 4^ ^ 

CAGTATOTCXX^VOOAmTOTrrOGTOCCCATCTNTOACAOOTAGAOC^ 

TTNAATCQTI>mXKnTACNAAOCTTaANATCAACTrnGTGNTTC 

ACTTOTTGAAAACTQAATIXrrAAAAAAAAAAmXXAANCXnTTm^ 

SEQ ID NO: 3443 ggtacgcoggoactcagaaccttggaccocatcctaocooccoactcacaca 

AGGCAOAGTnjaL\TGGAOAAAATTXXAGTGTCAGCATIt:rroCrCCr^^ 

TCraOCCAOAOATACCACAGTCAAACCTOOAOCCAAAAAOGACACAAAGOACIX:^ 

CTGCC(XAGACCCTCrcCA0A0OTTGQG0TGACCAACrcATCTGGACrCAOACATATO 

TCTATATAAATCCAAOACAAGCAACAAAC(XTTOATGATTATrcATCACn^^ 

CAGTt^AGCrrrAAAGAAAOTOTrroCTGAAAATAAAOAAATCCAGAAATTGGCAQAGCAC^^ 

TCCTCCrcAATCTOOrrrATGAAACAACTGACAAACACXTr^^ 

AOATTATGTnXjGTGCXX^TCTNTGACAGhn'AGAGCCOATATCACTGGAAQATAT^^ 

TATGCTrACCAACCTOCAGATCAGCr^r^0TQCTTGACACATOAAAAAGCT^^ 
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AATGGAAAAAAAAATTTCANCCTTTrOTTOTAQOCTrOOAATroAACNAAAAAAGG<M 

SEQ ID NO: 34+4 ACACATCCAOATOACTOGrrrCCTOCCrrAACraATOACATTCCU^ 
GAAATOAAAATCCCCTCriTCCTGCCTTAACTGATOACATTATCTTCT 
ATCCTQACTCAAAAOCrCCCCTACTQAOCATCTTCrrOACC^^ 
COCCCTTTCUCTGTAATnTCCrrTACCTAOCC^ 
TCACTOACTXncrrrrCtKjACTCAGCCCCCXnX^ 
ACAAAaCCTC>rrTG<TrGGTCntriTCACATCGACGCACATGA^ 

o<KJACCTcc^rGGGANA^rcAATCT^cQCCIX^x}^^^cr^x^^ 

TANOTCCraAAAHCQACAACCCCAGAAACAThrTTAOCCATTTXSUA 

CI 1 U U 1 ICANTrCCTTACrnXXrTTAACACNTTnt^nTNAATmJOOCr 

TNmAAATTNAATCCrrCAmTCTNOOANAAAAAAAAAA^ 

SEQ ID NO: 3445 CCOGOCAQQTACAGCCR. 1 1 L m U i CATGAATGAGCATGCrGAAGGGCTATr 
TACICTCCTATGAAAAAATCmXnTACAGTAAATCACAAOTCTTATCAACACAATOAAC^^ 
OTTAGATGTrAACTGTGCnK3CACC(XATOTOAACCrOCQAGTC 
TTTGCTOCATOCAAACCTOCTAATACAAAGCGGGCnxnOACrrAAGOAC^^ 
AGACAATGACOCAAOCAGACCTAGTATAAAAAOOTAOTCIXKKXX^GrrrAAATTCC^^ 
GOAGACTANCAGCAGGAGCTGAAGCTCATCATGTANAAAAGAACCrCAAAOGGTGCAAGTrAAA 
GTIArrACAAAGaAACAAAAA(nX)TAAOTATOCAAAAAOCrGTOTAAANAAATKKm^ 
AOAAKKIAOTTACAATOCAAAAAQAAACAAOTCACATCCTOCAGACCC^ 
TlCCXTONTKL^GQACACCNCCAATTTCTACTrnXTrAaNrr^ 
GGOTa<nX3AGGNGQTNCroATrCCTXK3<CTTrcrr^^ 

SHQ ID NO: 344tf GOTACOCOOOOOCTOACTCTCTTrnXKACTCAOCCOGCCT^^ 
AAATAAACAGCCATOTIXSCTCACACAAAOCCTOTTTGGTOOTCr^ 
TnXXyraCCATGACTCGGATCOOOGOACXnXXCnOGGAaATCAATCCr 
CTCCCTOAOAAAOATCCACCTACOACCTCAGOTCCTCAGACCGA^^ 
CAAT TTC AAAT CTGCTAAGCAGCCTCTrrrrAUlX-N L'lUtnOCAAC^ 
CTTTCTCCTITCAATCTrO<X:ACTACACTTCAATCTCT^^ 

CTGGTAOAAACAAAAAAAACACOTmATCCGTGGACCCAAAACTTCGOCOCraGTCAC^ 

OOAAAGCAGCCrnxnTOClTOOnAATCANTaNAAGGATGanTmrnjA^ 

AAOaTO^XXAAACACGCAAOGACCACTNGC^^OGTCCTTCA 

GQAAGOGCAAGTCN 1 1 1 I'ri-ri-rrrrriTrrrnTrnoiATTCAAATTTnTAAAAAAAAA 

SEQ ID NO: 3447 OGTACXJCOOOGOCXXrAGOCTACAGAGCCATCXIACnXCrCOGGGCCGGAGA 
OTOGCCCTACA(XAG<XAGC0CTGAGCAGAATGAGTA0CrAGGTAGGGGCAG<JIX^^ 
AAGCTGCAAAAACTOrrGCTGTCCTTGTaACKjrCACTGC^^ 
OTGCa^OAAAGGAAGGGOCTArrOCCnXXTCCCAOCCACGTTCCCTTrOT^ 
CGATIXnXXX:ATCAG<X:ATC7XK7rrcTCCrrCTrAAGGC^ 
AOTTAOGTTACTOATGTCAAATCCTCC^^ 

AGGCAGOTGTTGOTyiTCTTCCCAATrCTrrTNCAAGTAAOamOm 

TGAAOCAAAAaTOOOOTOCrrATACTNCCAAACCTTTGAOTGNTCAACCTTC^^ 

CIXrrrOGGCTGNGCCTAATGGCACCTGGOCTGOGWKJACACrGOOCCGCT 

ACTAA<7rCNAACTCCACCTGGAATAAT0<KIKTTrnT0ACKKrrAA^ 

GAOONTTATOGGAAAACTT 

SEQ ID NO: 3448 CGAGOTACrilVrn-l-lllTnU-llTrrrmTnKUCTrTGCTAAATXKXXTGT 
TTATTCTGCAAACAAOGmANAAATOTATAOACACAATmrrAOOCTOAGAAATC^^ 
TGTG ACTTCCATAaTOCGCAAAAOTAGTAAGGACAOCGACTTCTOAAC^ 
•niUO-rrTAGAGCGAACACA<nX)AGAAT0GATrCCTAAC00CCTaTTTCTCTAAOTGA 
OTICOCTOCTCTAOTCTrTAANAACAAGTGCAOCXTrcACITa^OC^ 
GOOCTTCAACAATTOCAAOCXJTAAACTCGAAGCrOGT 
CCCGCTTTTTOIACACNATTCTCAAAAQAaGNONAATQACCTCAT^ 
A0GNGNaTGGNACTTTri2TCAAACCTrm 
GT^^TANGCCXTCC»GT^INCNGGTCCTCAAATIKCT^ACAC^ 

ATArarrOOGOAACCCCCACAATOGNCCCTTTrrGATnAAGGGNATANGAAAAGATC^ 
CCTGNC 

SEQ ID NO: 3449 OQAGOTACTTQCCCCrrCXXXAaAAAAOaKJQACrrOCTO 
OACCAAAOCAOTraTtXCTtKXJTOQTCTOACAOCCITGAAACGTOGGT^ 
TOCCTGCAATOATTAAACACCAAGGOAAOGCTOCCTrCCCAGTCTOTO 



535 



wo 0M9O86 



PCT/US01/M732 



GOrOCACOQATAAAACOTOTCTCTTrroTCTCTACCAGA^ 

AC0GA<UQATraAACTGTAhmjCCAA0ATrGAAAGGAOAAA0TOQTTOAO0OATANTOAGOOAA 
AGTTCGAAAAAAAAATAAAAAAAAGCTGCTrACCACATTTGAAAATtKrraAGATCT^^ 
TC>nXXXrrCrGA0GACCTTAAGNCGTANGTCH3ATCTTT>miANGGACKXA^ 
NGGATTQATCTTCCCAAGCAGGNCOCCKANMCNAOQAT^^ 

NAAACCXXX:ACAAC<nTTTGOOACACAAGOI\Xi7r("riAACCNGGGCAOGGOCn^ATCC^ 

ATAACWXXCTACnNCCCK3G0G0GTTTAA>JGG0ATrCANC(XT000C0T^^ 

ACTOOOAANATOOrrNT 

SEQ ID NO; 3450 ACCCGOOGOCT0ACTCICrrrTCAGACTC>G<XCACT^^ 
AACAGOCTTGTTGCrCACACAAAGOTOTTTAGGTGCnCmn'ATACOGACATG^ 
TOOCAAAATCroGO(X:AOOGGOACTCCTrOaTGAOAOCGOCCCOCTGT^^ 
'AAOAGATCCACXTGCGACCTOKKmXnX>CACCAGOrcAAGOAAC^ 
GGATCrCrrCWCTTAQT00CTQAA0ACTGATOCTG<XCaATC0CC^ 
CACAOATOCCCAGCirCOGOTAACItTTACGGTCOAOGATrCCCAGCCATATO^ 
CTOOACGATCAAGTCaTGTCAAAAGTCTOAOCCCTCAAACTCTACAGOC^^ 
TACCNOTCAmATAGACAOCAACTGCCGNCCATCTGCAGGACCrrrW 
GAATAAAOCATaCCATCAGACAGCACmUCTNTCTmTCTXnXKlAOCC^ 
Gn'AAACAACACTAAACCrrANTrTQGAOGCCANCAA0GAT>rnxri^ 
ACNGTrCAAAOG 

SEQ ID NO: 3451 ACaXTrAACCCCTratrrrCACCCTTA0CA0<>AOKXX>C^^ 
GGCAAQAAAOrCAAACCACTIXXXTCCXnGTCrmACGCT^ 
CTAGOGGCAACCTrCCAlCCTCCATTCCrCCTTCrCC^ 
TCntAACTCACACCraACCrAAAACCTAAATO<XTCATmCr^^ 
ATACAAACrrGACAATGGCICTAAATGGCCAOAAAATGOCACirrC^ 
CCTAAATAATrrritrrCAAAAAATCXlCCAAATGGTCraAOGTGCCTOATOTCCAffi 
CACATtXXnXXXXrrCCTAGTCTCroTGCOCAOTGCAACTC^ 
CTOTtrCTCACTCCCAACCCAGGCGTOCTGACrcONCTAATC^^ 
TirCCCTCTmtAaKXAAaCTAOOTOCAATTXn^ 
CTTaxr^CTrACACltKJNCOCNTAAAOTrarrT0G^ffiATA(^^ 
TAAAGGOONTGGGTOAANA 

SEQ ID NO: 3452 <Kn"ACACAATGOriTATTAAA(KlAAKrrATGOCCXACATCAACCTAGC^ 
ATrCTACTGGTAAACCrrCCCATGGCCAAAGOAAAAACAAGCAGGAGTTQAOTGOCT^ 
GTGCAGGCAATGGAGAGAQGaCAGAAGOOTOTAOAAOCTXjAAGGGGGCTAGAAGCTTACTtX^ 
AOTTTCrnXTTCTOTCTTCAAATCTrTACrTCTTATGOCC^ 

AOATOCACTCrrCrAOACTOCrajAGACAGOCAOAQACAGGOGAGQAGOaAAGAACa^TACTOT 

G0AAAaGQAT00CaaOGCAAACAmAOA0CrAGAA0(XACTACT0OGCCAATGCTAAAGTrrCT 

OTCTCTAAGCCTAAAAAAGCCAOTGTAGTAOOOCCCTrATCACrCTrAOTTTOCTAOGT^^ 

CTOAAATAATGAaCAOAmAACCAGOCTAQCANAAANOAAOAGGACCGCKKrntrrGCAOT 

TANCAAAATCrroATCTrGCTCTATGGNCGGACTTrTnTr rnTll 1 ' l - lTriTl GGAGATCTAAATC 

TTCTOG<XKK[AAAAATAACTOm■CAAAAAAAOAAAOO^^ 

GNTTTTNTAOCTNCIXIGCOGCCCrCAAAOGNGAATCCACCm 

SEQ ID NO: 3453 GOTACTOAGACCTATrOGAGCTrOTOGCCACCATCCCATCTOCACCGrrOOTC 
AGOTCACTGTCACCTATOGCnX^CACKXTrGOGQACCAOACrrCTG^ 
CAQACCAGCrOXCGACTCCAOCCCXrcAAOOCATOCCCATCCGCr^ 
GQATTTCAAATCTGTrGTCAAAATTAAACAGATACrCAAATTr^^ 
TOAaxmrrCTIXKTOOTOTTCACGATOATCAAOOOCAGGTQGATQACTO 
GOCCGQCTGGTCTCKnWrCCGCATGCCGGTrrcrCT^ 
AQAATAAOrrCTTGAAGTTOAOACTtmTCTtnTnATTCnTr^^ 
TAAOTTXTOACATTCCTGAGCOOAOTTOCnXJOGCACACCAATCCACTT^ 
OAOATGATGITCATGONCATrAACACOmAAGOCATCOTANACOCCaXKnTATGmT^^ 
AAAC^GACTC^^^TGGOTAAGATGTGGTT0CGGCACACTCAACT(XaAACCAC^ 
TIGAhraAAGNGGNCCTrTCTTTOACCrrrraAAACTTATGG 

SEQ ID NO; 3454 ACATGGCCTTTCTOaAATACATGOCAaATaKXJAATACCTCCC^ 
ACAAG0ACAOGQTraCG0CnK:AATtKKKXnTCC(XrrrCT^^ 
TTnTCACCTTOCCACXyLGCATCAAC^ 
AACITTlTCACCXXXXATCTroCAAGATGGGAAAGACCCGC^ 
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SEQ ID NO: 3455 (KrrACACG<U0CrGOmCAAA0ACCTItnTKyrCCT0TAOTAACa^^ 
AQAATTCmUG C TTlTCCAATACKTrcmACCTCroAOAGTT^^ 
TCTOACTGACCAAOOAATrciXXTITATCATXmXXi^GAOACTTAAC^^ 

TATCATCAOTrATAOCAOCmOOGAACTAATATTCrrACAOCmAACrTCC ATTO C 1 II H C AOT 

OAOCATrrrCCCTAGCTGAAGAACTCCTOGOCTCTCTOATXKJAACro 

AAOACTATGATOCTirCTAGATTGOACTOOGOACKn'AACTTCAACAOArrrAGOTO^ 

TTCCATTCCATCTCCrGCTCXrrCGGATTOOGOATGAAQTAACOGT^ 

TAAGCATrrTTTCACCrrAAOCACraAAAATTCACTCCAA(W 

CTTATTCTCCOTATTTCANaaOOCTCACrrQATAOCKrrCT™ 

ACANAO.- ] N mX H 1 l CK}AAOCC0ACC30OCCCCNaiTCIXKXyqGCGGCG<ri>^ 

TCACNCACTGGCGG<XCm:rrAATOCANCCNCNT 

SEQ H) NO: 3456 00A0crrA C TTrrn-i ii -rrT iTnu i- n Ti-rT r TT0OATATcrTgit^ 

ACIXnTTCTAAAGAAAAACTCCATTATCCCAACKiVAAAAOCACAaA^ 

GAOATCTrrAACTCAAAATCTrAOCKXTAOCAGAaAATCACCAAATTrATtKK^^ 

TTTAACAGOAAOOAAXjnXXTrrACrrAA(nTCTCAAGCCA0ACKKT0OAGGCAOC^ 

QAOGACAGCATCCTCAGTOAAAGTGAGCCAnCGCKXJTGGCATOTt>CTC^ 

CTTAAAAACAAATQATTTCOTAOCUTAQCACAaTGACATXKmKIACrGTGAACC^ 

TGTCAAACTGTCX^CTCKnTGTOAATAOGGAGAGCCNAAAATTATGrrCCTACT 

CAAT(X]mXX3ATCCTmx:ACACTGAANnNTQTA0AACACTT>^ 

CTTNTTOOGOGACniTCTCNGCAaXCAOGAAOOOCTGGGAANCCCrrAGOC^ 

AGCOCAAATOCOOaATGGOCrrACnXmXJOCNCrAACrccCCTAAAOCa^ 

ACAACCNAAATAATGCNTnTN 

SEQ ID NO: 3457 CCAGGTACOCOGGOOCCAAAmOAaXjOOCXjTTCrOCTGTAACGAGCQGGC 
TOXJAGGTCCTCCXXXnXKTGTCATXSGTIXXmxXKTA^ 
TOGOCATCOGCAAGAAOGGGGAOCItKXXTOGCCACOGCTCAGOAATGAAT^ 
AGAATGACCACAACCTCTTCAOTAaAAOOTAAACAGAATCTGGTCATrATGGGTAAOA^ 
OmTOCATTCCTQAOAAGAATCOACCmAAAGGGTAOAATTAAmAOTT^^ 
CAAGGAAOCTCCACAAOGAGCTCAlTTTCTTray^GAAtrrCTAOA^ 
ACAACCAGAATTACCAAATAAAGTAGACATOOTCTOGATAOTTOOTOOCAOTTC^^ 
AGCCATGAATCACm^GCCATCTTAAACrATTTGTCACAAGGATCATCCAAGAC^^ 
ACGTTITrrrcAGAAATTOATrTGGAGAAATATAAACTrCTOCCX^^ 
TOTCCAOANGAOAAAGNATTAATACCTGNCCCOGCGOCCGTCCAAAGGNOAATTt^ 
OGCCGTTCrAATGCATOCACTCGGACCAACITOCTATATGNCTAN 

SEQ ID NO; 3458 ACCACAAAGCAGACAACAAAAATarCICTrTAAAAAATQAACAAATTC^ 
OTATAAOOAAATTTCAGCTCA I - riUllCIUU ATTTXXn-AGGGOAACGCAArroTm^ 
ATGGCGCITCCTTOAACTAGGGGACAC0TCAGOCATOGACCA(XACCriW 
GCATCTCKXX:ACATCXXAGCCOCCTOT0ACAQAATCTrCCATACACATOGCTC^ 
AAAOOGAGACCACAOCAOATACGATTCTGTGGACTTCAGAACQUVATCTATACAAATATm 
CTCrAAGTCCCCAAGAGAACATTAACATrTTTOTCTATQAAATAAmmX^OC^ 
TGATATTCACCAGCTOCQAOGOAACAAAGCATACCTraAAGACATATGTArrTGGCATC^ 
CACATAAACACACCACAOCTOOGAACACACATGCTNTCTATGTCCCNTaTGAC^ 
CAT(rraAAGNOCTCCAATACTC^t>AA(XiCKGGAATCO^O^AAOACTT^ 
GTCCTTGGCGGGACACCTANOONOATI^CCCTGGOGCOTCTATOOfKXyLC^^ 
AANGOAAATOTCCTOO 

SEQ ID NO: 3459 OOTACAAOAACATWGCTTCACTGTOTOOGACGTGGOTOOCCAOGACAAGAT 
COGG<XXXnmGG<XXXACrACnXX>GAACACACAAGGOCTGATCnXXjrOGT^^ 
ACAOAOAQCaTaTGAACGAOOOCCGTQAGGAOCTCATGATOATGCTGOCCOAOT 
0GATGCTCTCCTCCrCOTGTTCGCCAACAAGCA00ACCTCm>AC^^ 
TCACAOACAAOCTGGGGCTGCACICACTACGCCACAOOAACrOGTACnTrrrmrrnil 
TTTTCC<XAAAATrcCTrrATOTAAAGTAACCCTNTCAACA0TGCTAA^ 
TTTCATirroCANATOAAAANCATOGATrrrGGGACGTCAGGTCTATGCG^ 
irnTTTGANCTCCCCTGCAACNGQGNCrrCAAGGACCATGACC^^ 
NTGATTAACTCCTT^XiACAAGTOCACACAATOGNGGmCAAThrrGOGr^ 
GAAATCCAGOhfCCXXATAAQOOhrrTOCCCCOOGGNAAAGOOOTrNAAANAXa^OC^^ 
TrrrOGGGGOGGOCTrCACTGTGaACCCAA 

SEQ ID NO: 3460 acacgaaogtcctgctcaggaggacooccagogcoaaoatccaggagtca 

TCTGOTTOTCACXXTIOTTACATaCCCAOACCCTCaQT^^ 
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OCn'CrroOOQTGOGOCACACAC^ 

OAOAOAACKXXrrrTTTCrrTCCAGCCAOCTTCnTCATCAOCT 

OCACAATTQOCACCACCACCATANOCTCnmAATGCCAOAAAOGATCTrCAA>^ 

CC:ATCAOTCGCCX>.TrA0TCriTCT CAATGA AGCACATKXlG0CTOT^ 

T<n>JCAAGcroGTcccTGacToarrcrriTTcmo^ 

CCCCKXrrCCCCAAACrraAAAAAATGGATAATTCmOCTGGACACrrGC^^ 

CITNAAAAACCrGTCCNAAAGCAAAAATOCTNOOCrrAWmAT^^ 

TAjyCCCATAACACNTTTNAANDCTNNAT^ 

CTTCCnNNTOCTTN 

SEQ ID NO: 346 1 ACAGTOGCCCCCOOTCAAAOOCAQAATTCnXWTTITCCr 
CCAOTGTOCAAATAAOQOCTCKrrGTn'CCACOACACECrmxrrOOGC^ 
TAATACCATCOACGTCCCTOCAOAAOAOGAOTGTOAATmAGACACTTCTOCAOOOAT^^ 
CATCCTXMCOCOOTOCCOTCCCCAOCACGOTOATrACTrC^ 
ACACCTCAOACACOCTTCTCCACCTCTCKXnXJOOC^ 

CTACTCAAAATrGGOCTAAAAATTAAAACUOATCOATNCCNANAAAAAAAANNAAAANNNAAAA 
AQTACC 

SEQ ID NO: 1462 CMrrACCKaXjGOGO<XrrOOAOGCGCATGCOCCGCATt K;i 1 1 1 l l l l CCOOTAO 
GTCGCCAGCTGAGGCOGTITOTAAaTrrrOOCriXXiCAaTATCKrrAGAAT^ 
ATOAAAATTOAGCrOTOCATGCAGOCATGOAACCCOGOTrACAOCAGrrGAOGG^ 
AOAAACTrACACATGTCCAAAAAT0ATTGAGATOQAOCAOOC0aAG<XXX:A0CmK^ 
ACCTOCrAaCCAaTATaTrCCCTGGTGAGAATOACKnCATAaTGAATGAC^ 
GAACroAAAGATTOTATraAAAAQAACACAATOOAGOGOCOATCTTCAAAACmTACm 
CAATATOAACCTOOATCn^ATCTCUCOAAAAAATaGCXJATGTrnCrCTG^^ 
AATACOCGCAOTCTOCCTOAAAmCTOCAOATCACT^ 
AOCANATTGACraCrrTCTOCAAAACATIOTATO<^^ 

AAAAAAACCCTTTG^mmTANANAAANCrrrnirACCCCOCAGOAAOCAATCC^ 
TNTTTACAAAATTGaTrrA 

SEQ ID NO: 3463 GOTA Cl J I i 1 HU 1 1 i ] 1 i 1 1 L LI ] ill l m l ATACGNTrrmTATTAOAAAA 
OAAATTAACmmxnrACTAOAOAlTOAAOGAAATTAACAATCTTAC^ 
CrrCATOATAAGGGATTCTTAAGm^ 

AGG<K30CrCJrAAOTmATCAGAT0TTTTACATrrCATCCTTTAATC^^ 

AATAAATACACAGTOTTCAGOATrATCTGGATAGCCATCTTAAATAACAGAOTGTTCCaCATGArc 

ACAG<X:rCnKCTT0TGCCrCTOATCCATATQACACCAAGACCAACT^ 

CCAATrGOOanGGaACCCTTOCAAOAAATOOTGlAACArrmC^^ 

AACIXX^ATCTGNmAACCCGCGTCAOCACTCXXACTrQTaCAAAAAAATAAAC^^ 

^^mCAT^mACraATAAAAC^CAGGCNCCGTC^^GTOCCNGACTT^^ 

AGOGOGGTGOOGNTGOOAATCNACAATCCNAAACCT^mX}ATOGCAATOACmOOAC^^^ 

CCTCANNC 

SEQ ID NO: 3464 ACXH^OGOCAGAGAOAGOOrATCACCCTGCAATGGCAGOAAAATCAG^ 
TGCGGCAGAAGTCGGACCATGGTtXXTACAG(XLAAAOCCCAOCCATAAAAAAACAGC^ 



CTOCAAGAATCCAGTTGCCACCTGCCCACCnTrCTCCCTC^ 

AAACAGAGOCCCCAAOCOCCAATAAACTTrACGTCAAGOCTAAACCGCAGGOCATCAn^ 

OTATOGAGGGOTGGATGOCGOATOCTrroATAGGCACCnATCmAGCACATIXX^^ 

AATOTTCCNGQAAGCCAATGAAOATGAAAAGTOGCrrCACCTOCTGTOCATTCr^ 

CNOOTNCTQATGCTTOGCACCTOCCANOOCAACTGAAGCTTrrAATO^ 

GACOCCACrNTAACIOTACAACn^AACmACACATTTTQAACCntlJ^ 

ACATrONTAnXKM(KX:ANCTTraCTGNCTTTtKKiAAaAATC>^ 

AANOCrrrrGAGTAAAANCCnXXSOACXKn-ATrGGCN^ 

SEQ ID NO: 3463 ACAGCTTCTCCAGGTTACTOCnXMACAGOCTarrCCAOAOCCT 
ATCTOTCCCrOAOCCATCCTTCTCGOOOCrCTOCATCA^ 
GGCCAOAAACTTGTOCrrQAACCCAAAATGAATOCrQAAAGTCTACACGC^ 
AAAmATirOCAGACTCrrcAATCATTTCCCOCAAATTCTCm 
AACntnXXnTCACCraCTTCAOGGOAAGAOCCATGTCTGC^ 
CCATOCACACACCACAAOCTrOAACCTXJCTCCGOTATAOCTOGT^^ 
OOACCAOTOCTOGTAGAOCACCANGCGCAGGTCATACTCAAAGGAGATCCGTGTOCAarcC^ 
AGAGTO^KKnxnXCTCATCCTCONTCCNGTGQTroTGGa}OGAAA(Xr^ 
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AACATCAATCACOTATTTCATTTOAATQATtTITaTNTGOACC^^ 

OCACCACACATOTANTNANOTCCTTGOACAGATTCAAGCCACTroAACATTAN^ 

GTCCATOANTTATCTOTCNAANNAAAAQATnTT 

SEQ ID NO: 3466 GOTACACTCXXAGTOTGTCACTOOOCTGOTCmCTCCOCACACAOT^^ 
TCCTO(XXTCTCTCACrrOAAGACTGCTCACTTGCTOTOCTCM 
AACKXIAAOCCCACAGAOACTITOTOAQAOATQOTACrroGTCmX^ 
CACAGTGGACTtKWACCATCAOGTCCrrcCACXAanTG^ 

OACCTACAAOTQTTACTGAGTAGATTOGATn'AAOACAAAAAOCAAOTCCCCCATOAOTaT^ 

TTCrmKXntXXTCTAACTTGNGAGACAACACACWAGCCTT^ 

CTGTOCCOCTAATAGG<>TCOn:iXTCAAOCT0A^ 

NGGANaAAAAACTrrrNTAAAAATQGACATOTATTTO(XTGTCJUmJTOTG^ 
TGGHAGNOGCNOOATAAAAAAATCATCTAAAAAAAAAAAAAAAAAAAGTCCrNGCGGGACNCCT 
ANGGCGAATCAACCATCNOOCOTITAACKWCCACrCONCCACTrQNQAANANCOTAATm 
GAAAOTTCCTCA 

SEQ ID NO: 3467 ACCOGGGAGAAGCTTGGACXXjCATCCTaGCOOCCGACICACACAAOO^ 
GGOT0AOGAAATCCANAQTTGCCATGQA0AAAArrtX:AhnrGTCyvGCArr^^ 
TCTCCTACACrCTOGCCAGAOATACCACACTCAAACCnXiAaCCAAAAAGGA 
CTACCCAAACTOCCCCAGACCCIXTO^OAOOTTGOOOroACCAACTtt 
OAAOAAOCTCrATATAAATCCAAGACAAOCAACAAACCCTm^T^ 
OTCCCCACACAGTCAAGCmAAAGAAAOTGrnGCTOAAAATAAANAAAltXj^ 
AGCAOTTTOTCCTNCTCAATCnXKnTTATQAAACAACTOACAAAC^ 
CTATTOTCCCCAAGOATTATTaTnGTTGACCCATCTC^^ 
QATATItikACCOTCTrrrTTtKTrACGAACCTOCNOATAC^^ 
OAAAOCTNCC 

SEQ ID NO: 3468 ACGCGOGOAOCACTCnTrOACAATOGATtKrrOTTrCOACT 
GOCTCCCCTOCTOrmXACCCCCTCCCAAAQCAAaAOTCOOAT^ 
AC(X)AAOAAACnnX}GCrACCCGOGCATCA0CCIXXiACGAATGC0CCT^^ 
TCCAACTTCATtnTrGAAGTGCCCTGCTGCTirrraXX^ 
OA0AOOCT0OTT0CAOA0aAT0CATCTGGCTCAC«3<XnrGTTC^ 

CCTTATCAGCTTC ATAT TrCATaAAATCCrGGUl ] I'JU'l'J'AAOCATCTTTTCCTCATTrTCAATOGTT 
TAACATATAATirmTAAATAAAACCCTTAAAATCTCCTAAAAAAAAAAAA 

SEQ ID NO: 3469 • OCTACTTTTmTTTGTrmTGmTrTAATTCAAGAAOTAGGCK^ 
TGOCTCATOCCTGTAATCCTGGCACrmJOGAGOCTGOGGCAGOCGGATCAC^ 
TtX:AAGACCAOCCra<KXi^ACATOOTX]WAACCCCAKnCT 
ACTAGCTOTGCTnKjTGACrrOGOCCTGTAGTCCCAGTTGCTTOGOA 
CTTCOGCCTGAQAOCTOGAGOTTAnKriGAQAAACrOAaATTOCA^ 

TOACA0AG00ACACCCT0TCrCUAGAAAGAAaTATTCK3GAAG0TrAmAAAGACTCTAC^^ 

AATCAGGCGTGGCAGCTCA00(XTCTAATCXXANCATTm30GAG0CT0A00^ 

TGAOGTCANGAGTTCAAAAACAACCTCOCtXACATGTGAAACCCATTTrACTAAAAATCC^^ 

TACCCGGCATGOTOGCNGOCCCCTGTAATCCAACTC^^roGAG^^ 

GG AGOT OCATOACrmAATACNCCTtiACTTANCCTOaTOACAACNAACTCOC^^ 

TrcnTNAAATCACTTATAATn^ 

SEQ ID NO: 3470 A C! Ti L rrn - i 1 1 1 ] 1 1 H I ' l l i L ici rn i- i rri rmTiTi J i i n i l i riTJ NOC 

TO0G0CAaA0CK>ICAATChrrmA>WTAAAAAAAAArrOAACAAAGAlWC(X^ 
GANATaAOGCOCT0CCAT0CAAA0QA0TCCX>GCAGU^ 

GTTT^m^X3CAAAGACAC0ACCT0OGOACANANAACX^GTCC^XACCC^ ■ 

OCTrATCANCTTTGOGCTOANOCrnXKKnXjATNAMAO 

CITNTGGCXAAACTTGCATrTGCATmGCACTCATaACCATOATOAT^^ 

CCACOCAAATQANCCCCCAACCTOGAGGrmfTOCCANrrCATAATAAAAAG^ 

hTTAGCKATrGGCGTCCAAGACA^KJAAANCCrNOCACKJAACCAAACAGGNCXyLNOC^ 

CATTTNAAACCCTTGi:CrrrGTNOTTAACAAO>rraANCCCa^^ 

OGNGAATTCCA0CACTOGGGGCCOTI>riTnM0TCCANCT^ 

AACINTnXTOGGNAAATOrmXimXAA 

SEQ ID NO: 347 1 AC'i4"i"iu'i"rrrrj'rrJMi rrrrriiG0AATACAAcriTATTCT0ATTcrAAAC0A 

AAAOGAATGGGAATGACAOTAACAAACAAOATTTCACCACTGAATATrOTOATGTGACTOCAOCA 
GTCITATATATGAAACTCAAGGAATCAACTOCOTTCCAAAACAOCTAAATATOCAGOT^^ 
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ATGAAOrrATrrmAAACTDCCACATTCACTCCGAAGCCCA 

ATGAAGCACATOrrCCOCTTAOCrACATAATAATOAGONGGCACACACOCTaCACCGCrGACATC 

ACA0CACAGCTGCCTATAAAACTAi3ACrTNTaACGCTTK3OCTCCAGC^^ 

AT(XrcATTCCGOANANCAN>m3TC™AACA0CC^ 

NGGTCATOACAACTrrrTGTCKK)aCCAAA(K:AGCAT(WCACCA^ 

0ACTTCTACCAGKCCAAGOAAGON^^^mAAAATGTAGTrA(nT^XK}ANAAAAff 

rrGGCGOGACACCCTAAOGNGAATIXi^CNACTGGOOCCH3TCrATOQA7ratfCrrOOGCC^ 

OGQAAAATGGCAAAATOTTCrr 

SEQ ID NO: 3472 OGTACAAGACCAGCAAA0CCX:AGCrrCIXX30CraT(UGCTCTGAa^ 
GOACGATCCCKXXTCCT0TT0COATGGCAAT<^^0<TO^ATTTCAG<T 
CAGOCAAOTIOTTCraAAOAAGTAAGTOATrrGCrrCATCATCAAAOC^^ 
G OTTA GCACCAGTtnCmAATTrcnTGAATt>lXrrCTTOVAArn^^ 

<KrrrrATAATCITCOACAGAGQTCACATCCAQCn'ATGCr 1 10 1 ' l ' L fl OOTTTGOGTOGrTCAAATC 

QACATOTGAOAATTGCAATCntXKATCrrCCAC I lUU'rriXjGCATCTGNGOaTQACrQAAATCCTT 

OTCCACAATCACGCCCTTAATCAAGmAATOTOCTCCAOCCTOCCOCC^ 

ATAAAGCrCAAAOTCAACXrrCTCTCCOCrCCATATtnWn-A^^ 

CrCANNCAlCrGGCNaNOACACrGGTOACCACTrraGAGCCACaTOOTrrrOraNC^ 

OGTOjGNOTCCirATOTNACAAOaACCTTTCXTONCrnMCNOOGGT^^ 

TANAACCroormTOATOGNGAAO 

SEQ ID NCh 3473 ACOCGGQATGCATOGTOGTITrCAOmAOCrACOOCAATCCTGAACrnX^ 
AAOATGTaTroATQTOCAQCTGGCAnCCTnXiACTTCTC^ 

CATATCACTXH::AAAAATAGCATTGCATACATGGATCAGOCCAOTCGAAATaTAAAaAAOGCC^ 

AAOCTOATtKXKm^AATGAAGCTQAATTCAAOOCnXiAAOGAAATAOC^^ 

TCTXTOAOOAKXJTTGCACOAAACACACTGGGaAATGCAOCAAAACACTCTr^^ 

0CAAGGCIXm3A0ACTA(XTATT0 TAGATA TIUCACCCTATQACATT00T00TCX:TaA 

TTOOTOTOaACOTTGGCCCTQTTIOCl 1 1 1 lATAAACCAAACTCTATlCTGAAATCCCAACAAAAAA 

AAmAACTNCATATOTOTTCCTmOGTCTAATCrrGNCNACCAGTOCAAOTOAC^ 

NCAGTIKrrATTTrCX:AAAT0TTT0OAAAQWT^^ 

TGTTGrrCCNOCAATCAATTNAAAOCCnTGnri I ri'l lACCAATCCANTTAAAAOGTOAAOOTWTT 
AATAANAACrrACCTTTOTAAACAA 

SEQ ID NO: 3474 aOTACQCaOOOOaaTCCCCTOOKKrrTCTATOTAATACCATC^ 
CAGAAGAGGAGTGTGAATmACACACTICrOCAaCKUTCTGOCTGCATCC^ 
arAGCACGGTOATTACnCCCAGAGCTCOQCKKX^CCTXXACCGOAC^ 
CAOCrCTOCCTCOGCTCACAACACAGATTGACTGCTCTCACTrrGACTACT^^ 
AATTAAAAOACATCXIATACANAANAAAAAAAAAAAAAAAAAAAOTACCC 

SEQ ID NO: 3475 COAGGTA LUl U 1 nil I M 1 1 Ul 11 1 11 1 1 l ATTXiGAAGCTTCaACAAAAAT 
TCCACAGCTQTAATXXTCAGGATt^CnTGCAGTCTTCA^G^ 
TCAACCTTTCAGAGAAGACATrCCAOCraX^TXWTCTCATCAACX^ 

CACAOTOCAGOAOTGAAGAAGCTGJ J ItU J IACACAATAGTTATOXIATACGiXTACTrCTGCTGC 
CAOCATTGGCTCAOCnn'CrTCAOCT^^ 

AATTrTAACTAATOTrnXTOAAGONCAAAACCAQAGTrCTGAGCAAaAAOC^^ 

OCAATCCATCAACAAATOCTTGOACTCCAAACTGNOCCCriX^^ 

CAAGCK^T^rrGOCAT^GC0CTTKACOOOCCA0CCCTOGAACAC^ 

OAAaCCTNAANCGNCCTfUCTGOATTTTCACTGAGGGAOGGGGOCCTATm^^ 

ACNQACAAAOIIAXGGOTOOTCAmTrTNAANAAOOaAAA rj I N | - n -ia>AGOTACTAATACAAA 

NCTICrrrCCAACCACAGGANTrGNTN 

SEQ ID NO: 3476 (Xn'ACTGTTCaTOCCGCAGCA<X}ACOCCTGCGTtKrrCC^ 
TTCCACCGQATCCTOOAGCCTGQTITOAACATCCTCATOOCTXJTar^ 
CAQAGTCrcAAOOAAATTCTCATCAACOItKCItLVOCAGT^^ 
CTOCAAATaJATCGACTOCTTTACCTGOTCATCATOOACrcn 

GGACOCTGAGTATOCOQTCACXXAOCTAGCTCAAACAAOCATOAGATCAQAOCTCOC^^ 

CTtTOGACAAAGTCTItXXJOOAAOOGGAOTtXXTOAATGCCAQCAT^^ 

GCTOCTGACTCXntKWGTATCOCOCTOCCnXa 

Oa<nXlAAAAGAaTCTATGCANATOCAOGTGOAGGCAGANCGOaXlAAACCGOCCAC^ 

A>rrCITANO0OACCNAAA0TOGGCATTAAT0TWK:ANAAN(KK3AAAAACAGGCC^ 

TTCXlAACANAAAAOOTraACAAATAATAGCNOCAOGANAGGCOnXKj^lTrro^^ 
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AACTOAACrTTTNAATCrGOTNaAOTrNAACAAATAAOOAANCAN 

SEQ ID NO: 3477 ACAGCTnxXrACTCAAATATOCCTrATTXrrGGCATTCAGGaAG 
A0A0CKK:AO<KJCT0CGGTCXXTO0CTCACTra:ACATAOT 
CATCAAAACCACKXCOGTOCAATCCATCCCCAOOCACTCKKn^ 
ACnCTOAOOTAOCrCTATCAOCTATrTCAGOCCrro^^ 
TQTTOCCTTOCrroOACnXlAOOCAOaTrCCTAaTTTCATra 
AGOCnXJATGCrATCXXX:ACAGACCTCCITCTrO<KWGaA<XATO^ 
OTCCCCCCGTAOCrn-CAATAAAOCCCTCn'CCOTnTrCCCrACOT 
CCTCTCKXXiCAACCrcATCAAGTtnCAATGATAAGOTCATATXXn^ 
(XCAG0GGGCAa>GCCATAA0(XXGAAAA(7mGTCACANATCKrTGGT0OT 
OOOTTOCTGQOOGGCAAGCANCAACGaNAATCAATNTGOGCCCTTATTOC^ 
CAAA 

SEQ ID NO: 3478 GNCNCNGCCGACOTAOCTCNACrraANAAACTGATITATQATCACTTOOAAK^ 
Tmcm^TAOTimATAAAACmACNTAAAATarTOOTTTCAATGAOCTCTATr^^ 
ATCACACGATGOGTATTAAACrTCnrCAaAATrCCTTrcT^ 
TrfraOCNCTTATTATGANOCTATTAAAAOAATCCAAATCCAANCTAAT 

SEQ ID NO: 3479 ACGCGGGAO Cl 1 1 1 lU TCIXnTCAGCCnX3CX}GCOCCCACAATrTOCGCOCTC 
TCITICrXMrrOCTCCCCAGCTCTCOOATACAGCCGA 
TOCCOiKXTCCAOOTacrCAAOGAmCCTOOCOOACAAaAGCTACAT^ 
CACAAGCAGA'IXrrOGCAGTATrrGAAOCOOTGTCCAGCCCAOaK^^^ 
TACGTTCKrrATAATT::ACATCAAOTCnACaAAAA(KlAAAAGOCCA(XX^ 
AGCTTTGOOCAAATATGGTCCTGCXX^A^t^aGAAAaACACTNCANQAA^m^ 
GTAAANATGATGATCACATrrOACCNCTrrTGGATrn-GATGArrOANCGACOAAACr^^ 
CNAAANAOGGCTNAAGNAAAAAACGTTroCACAATATGAATTNAAAANAhUOGCAAAAAACCTT 
CACriTaTrNCCAATnTCAATCTTACTAAATOTTAAANCr 

SEQ ID NO: 3480 ACi rm ri'ITri'l I I'ri i rri ri'lOAACITGAAOCGGAACCANAATGOGATOA 
Ta:ANATGATOACCrOTOOajTCTOOOTAAAOTAGTOaTGTTOCGTAOGGGTTGOC^^ 
GGGTTXimAGT G - r r r iCACCAATCACCTTTOGGCTXMCCGTGAT^ ^ 
ACCANATaAAOAANATCTAOAACQAmCTCTTTCTOmxrrCT^ 
ATAAAATCnTrmCCn'G<KK3AAAAAAAGCCXCrGTCGNACCT0NAGCroGATC^^ 

SEQ ID NO: 348 1 ACAACACAGGCTACATGTAGOCAAOCTAAAGCTATCATQAAAGOAOGATAC 
AOTAOOCAAAOATCCOTTCTOTAOOTATCATTCACrATCCTCCATOCAAGOt^^ 
KTrCKKXXATXmxntK^CATACrcGAGCAAAOaTCTA^^ 
ATCCATTAGTITCTAACAGATAOAATTCACATKn'AATATATOATTCATOCT 
GGAAAGGCATATGAAAATCTAOTTmAATACANAAOTAGCANCAGCAATCAATCTrGTATr^ 
AOCTACTCCAAATTCCTCIACTrTGGATGCCAAAAACACACATCTANGACCC^ 
TATACTTrrCANAGAATACXrrNGCATAOAATCTTCTrOAAATATACXOTAN 
TOTTTCCNTAATTAAAAATGTrCACCCrAATGCrrCXKjATAACN^^ 
CNTTNCAATNTNCNTOCCrcr 

SEQ ID NO: 34?2 ACOCOOGOATCGGCATCCCOGO<^'CNCTCTGQAAGTCOGT^^^ANAAGTC^ 
CACCCTCANACAAAAOAAOTAAACOTGOAGATGACAGAOOGrrCTAGAAGTANAGATAOANATAG 
OAGOACAGAOAGOTCrcOTAOCAOOQATAAAANAAQATatXHJTCAAOGGACANOAAGCGT^ 
ANACGirOCANAAOrrAGAGAOAOAGACAGAANCOIAOAGCGAAATAAOATCITOAANTNC^ 
TAGGAGACOCTCANNGANTANAATOCTOOTOCOG 

SEQ ID NO: 3483 ACATCCCAACTOAGCTOGATCAAOTTAGAAAAnKJATTTXnXjACTtXJAATCT 
CACCAOXIAAAAAAACKACACOCTnTAAGACTACTrrATGAGOCACTTmX^ 
GTOATXKnXKrrrCAAAAOTCATGOTOOAATrOCTCOGAAOTrACACAGAOT 
OCTOOAOTTOATOCCCACAOOTOTATTOT 

SEQ ID NO: 34S4 ACGCXKKiGAGGGACNOQAOGCGAGCAAGATCOCX)CANACGCANOOCACCC 
OOAOOAAAOrmrOTTACTACTACXUOOGGOATOTroaAAArrACTAmTOOACAAGGC^ 
ATGAAGCCTCACCGAATCCNCATGACrCATAATrrOCTOCTC^ 

GAAATCTATCGCOCTCACAAAGCCAATOCraANGAQATOACCAAOTACaAAarAOOTC^ 

GTTGACCOTATTTACAOTrrCTACAAACTTACAOCTCATAAACATAAAATGAATAC^^ 

CmACAAGCANAAGAAaAATOCnCTATAAGCATTCCrmATnXCAOAAACACCrGT 
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CAAAATANTTTCAAGACTTAM^^CCANATTGGGTTTNGAAAAGAGAT^ 

ATCCCCCTGCAGCTATTNAAATGGTGATGGATNCCCTGNCATTCCCTNTTNAAT/^^ 

ANTTTTCA 

SEQ ID NO: 3485 ACGCGGGGGCTGTTGCCrrCAGGTGACCACGGATTCGCCATCGTGAGTTCCA 
GACAGGGTCAGATGTGTTACTGGGCTGGAGTGTGGTGGCACAATCACGCTCACTGCATCCTTGACC 
TCCTGGGCTCAAGTGATCTTCCCACCTCAGCCTCCCAAGTAGCTGGGACTACAGGATTTTCAAA^ 
GGCTGCAGCTCCTCAAGCACCGGGGCGGGGATCTCTCCGTAAGACGAGACCTCTGGTTGTGAAGA 
CGTCGTTGAACAACCCATACATCATCCGCTGGAGCGCTCTGGAAAGCGAGGATATGCACTTCATC 
CTACAGACGCTTGAGGACAGGCTTAAAGCTATTGGACTTCANANGATTGAAGATAANAAGAAAAA 
GAACAAANNCCTTTTCTGAAAAAAGANAGCANNGAGAAATGCAGCATO 

GAATCTNAAGGACAANAAAACAGATGCTAATCCACCAAGTG'mANGGTGGACNCTGCCACACGTC 
ATGGAAGCANCmmCCTTmGCGTTTO 

GTAATTCTGGTATCNAAAANAATCAACCCCCCATATNCCTCNNANTNGAm 
AATTTCCNTCC 

SEQ ID NO: 3486 GTACGCGGGGGGGAGGTGGGAACGCTGTGGCCATTCGGATTTGGCGCGAGCG 
CGGCTGGAGTTTGCTGCTGCCGCTGTGCANrmGTTCAGGGGCTTGTGGTGGTGAGTCCAAGAGGC 
TGCTTGTGAGAGACGTGAGAAGGATCCTGCACTGAGGAGGTGGAAAGAANAGGATTGCTCGAGG 
AGGCCTGNGGTCTGTAANGCAGCGGANCTGGNTQAAGGCTGCCGGGTTCCGGCGAGGCCTGANCT 
GTGCTGTa>fT(:j^TGCCTCAAACCCGATCCCAGGCACAGGCTACAATCANTTTTCC/^^ 
CTGTCT^^^GGCATTNAACAAANCTAANAACTCCTGTNATGCCAAACTANAA^ 
AACCGTAACCTGGTTCTCCTCGTGTAAAANCCCTNCCrm'CANCCCAATGAAAACGT 
TNACAACCTAATNCAACACTCCCCATTTACCTCCTTNTTCTCCNCA/^ 
GGTrCCCCTCACTCACATACACTTANGGGACTAAAAATTGNTATTTGANANATNAN^^ 
AANTCTNCTAACCAAAANAAAACTAACCAAANTTTCNNCCAAAACAAAAT 
AAATCANAANNTNC 

SEQ ID NO: 3487 ACirmTTTTTTTTTT^^ 

TTTTGGAAATATAACAAAATAATTGGCAAAAACCAAACCAAAACAGAACCAAAAAAATGACATG 

TTATATNAGTGATCATCTCCAAGCACAACAGCNTTTATTAATGAATATAAAAAATAAACT 

TTTTTTTTGAAACGGAGTCTTGCTlNrrGTCTCCCAGGCTGGAGTGCCGTGGT^ 

GCAAGCTCTGCCTCCrGGGTTTTACACCATTCTCCTGCCTNANCCTCCCCAOTA 

GGTGTGCACCACCTAOsfCCCAGCTAATATTTTTTTGTATITrTA 

SEQ ID NO: 3488 ACTTmrmrriT^ 

TTACTGTGCTTAATTTGGACCAAATTTTATTTAGCTTAATATGGACACT^ 

TACATTANACATATCANAGCAGTGTATTTCTGGATCATTrrTTAAATGACCrCTTCT 

CTGTCACTTACCTGAAATGCTGCATCCTAAAATTCCAAAATTATATTGAGCAATC 

AAGCCAACTGACTTAAAGGTAATCATTTCAAGCTAAGATTAAATTTA^ 

AGCTAGTTTTTAAAATAATGATCTCAGATTTTrAAAAAGGATATAGGAACCT^ 

TGAATTAAAAACTGATGGTTTCTATCAITATTTANCCCCACCT^^ 

ACATTTATNAACCAATGCGNANTGGACTTAGCCANNCACAATGGAAATTTANACCTTTGACT^ 
GGTGTTTTCCAGNTCACAAAAGGTGG 

SEQ ID NO: 3489 ACGCGGGGGAGGCAGCCATGTCTTATCCCGCTGATGATTATGAGTCTGAGGC 
GGCTTATGACCCCTACGCTTATCCCAGCGACTATGATATGCACACAGGAGATCCAAAGCAGGACC 
TTGCTTATGAACGTCANTATGAACAGCAAACCTATCAGGTAGATCCCTGAGGTGATCAAAAACTT 
ATCCAGTATTTCCACAAAACTGTCTCANATTTGATTGACCANGAAAGTGTATGAG 
TNAT 

SEQ ID NO: 3490 ACCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCTGCACAGNCCCGTC 
CTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCATTATTTCAGAGGGGAAACCTA 
AGTGATAAGGGCCCTACTACACTGGCTTTTTTAGGCTTAAAGACAGAAACTTTAN^ 
TAGTGGCTTCTAGCTCTAAATGTTTGCCCCGCCATCCCTTTCCACANTATCCTT^ 
GTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCNTCTCCAGCCTATGAAACAGCTTGGGGTCT^ 
GGCCATNAGAAGTTANAGATTTGAANACNGAANGAACAAACTCACQGAGTAAGCTTCTANCCCC^ 
TTTAACTTTTANACCCTTCTGCCCTNTm'CNATTGCATGCA 

SEQ ID NO: 3491 ACACTTGAAACCAAATTTCTAAAACTTGTTTTTCT^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCrrCCACACTAGCCAGTCnTCT^ 
TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTm 
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CTTCAGCAACITGAGAGCTrTCTTCATGTTGTCAAGCAACAG^^ 

ATAGAGACGGTTTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC^^ 

AATCCTGGGGACATACTGGCCATCANGAGAAAGGTGTrrGTCAGTTGTTTCATAAACCAGAT^^ 

GGAGGACAAACTGCTCTGCCAATITCTGGATTTCrrTlATTTCAGCAAA^^ 

CTGTGTGGCACTCATCCAAGTGATGAATAATCATCAAGGGTTTGTTGCTTGCTTGGAm 

AGCTTCITCATATGTCTGAGTCCAAATCCCCGCGTTACCTCGGCCGCGCCACCCCTAAGGGC^ 

SEQ ID NO: 3492 ACTTTTTCmATTATTACTTTTh^ 

TTTGCNCTGACAAAAATNCCAANATAAAAOTGNATTCTTAAGTTCCCATTN 
GCTGGCCNGATGCATNTTNGAGTNCGTNTTAGNTGATAAATNAACAGTAATAN<^ 
AACC^^^^ATGGATGCTTGCANTTTTOAATATTGCGGATANTCNGTO^ 
NTGGNTNrrTACAGATTCNACTGTNTTAGAAATCAAACCTTGCCO^AAAC;^ 

SEQ ED NO: 3493 ACTITGTAGGCTATACGTrrrAAACTCCTGTAAAGAACATTACAA 
ATGTAATTTAATTITGTGCTGAATATTCTTCAAATGACTTTATAAAT^ 
TCTTCAAATTCACTTGTTGTTATACCATATTGTTGCTTTCTCCAACTTT 

CCAGGTAAATTTAGGAGGGATCCAAGGACTGGCACTCTTCTAATAAAGCCAACAACGACAGGA^ 

GAAGCCCCTGAACAAGAGAAAAAATCCATAAATTTCGAAGATCATGCCTATCAAAGGCCAACCAA 

TAAGGACTACAAATACACCACCCAGAAAAAAACCTGTAGCmCATTTTATGTT^ 

ATCTGAATGTTCTTTCTAAACCAATTACAAAAGCCAAGCCGGCTCAAATAAAAC^ 

CAGTANNTGCTTTGCAAAAAAAGAGAATCATTCCAA^ 

TTAATCCCCATTCCAATTTTCTGGCGTGTCCCGTAAAGGAGATCATGGGGCTGCAGGTGGNTGG 

ACAGCCGCCAAAGCCACCTCTGAAAAGCCCACGGTGGGAAACTTAATTCGAGANAAATGAAACC 

CGGGAAA 

SEQ ID NO: 3494 ACTTTTTTTTTTTTTTT^^ 

ANAGGTCAAATAATOTACCAAACnrAAAGCACTATTAAGTCNTGAAACAA^ 

ANAAGTCANNCANTATAATCCACAGTTTCAGGTTTTCCNAACAACNATCAIW 

ATCTATTTTTTTTAAAGTCTCAAOT 

AACTmTvrACCAACAAAAAA(>[CCCAAACrmTO 

CCTG 

SEQ ID NO: 3495 ACCATTTGGAAGAATGGAAGCTGATGCATCTGrrGACATGTTTTCCAAAGTCC 
TGGAGCATCAGCTGCrrCAGACTACCAAACTGGTGGAAGAACATTTGGATTCTGAAAT^ 
CTGGATCAGATGGATGAGGATGAATTGGAACGCCTTAAAGAAAAGAGACTCCAGGCACTAAGGA 
AAGCTCAACAGCAGAAACAAGAATGGCTITCTAAAGGACATGGGGAATACAGAGAAATCCCTAG 
TGAAAGAGA(mTmCAAGAAGTCAAGGAGAGTGAAAATGTGGTTTGCCATTTCTAC^ 
CCACATTCAGGTGTAAAATACTAGACAGACATCTGGCAATATTGTCCAAGAAACACCTCGAGACC 
AAATTmGAAGCTGAATGTGGAAAAAGCACCmCCm 

CCCACACTAGCACTGCTAAAAGATGGGAAAACACAAGATTATGTTGTTGGGTTTCTGACCTAGGA 
AATACAGATGACITCCCCCAGAAACTTTANAATGGAGGCTCGGTTCTTCT^ 
GTGGGAAATTAAATGGACCNCCnriTCAGAACCAAAAGAATTTTGGAC^^ 
AAAAGAAAACTT 

SEQ ID NO; 3496 ACCTCTCAAATTGCTCATCAATATAGGAGATAATraTCTTAAAACAATCTCT^ 
CAGTTGATAGCGTCACCATAGCCAGGGGTATCTACCACTGTCAGGCGTAGCTTGACCCCTCGCTCT 
TCAATTTCAACAGTTGAAGCCTCAATCTGGACAGTTCTTTCAATTTm 

TTAATAGGGAATATAAGCAGCATGTGAAGAGGAAAGCTCACGTTCTTACGACAGGTGACTTCANA 

GTCTACTGCCTCTCTACAGAGGTAGGTTCTCCAAAGATACCATGGGGATTAAGAAAAAAAAGATC 

CCACAGAAGTCNNANNGGGGAAGGTTAAAGCTAAAATTATTGTGCTO 

CACAAATGNTNAAAATATAATNTCNGCCNTGGQAAAAAGAAANCANm 

CTANATNCATAAATNTTOSnSfAAANGGCasFCTNANTCT^ 

NNGNTCNCTATTTCATCCAANGTCTTTT 

SEQ ID NO: 3497 accttaacatctgttgagaaaatacaaataaatatgatgctaataaatggcc 

ACTGATAACTCAGTAGCCATCTGAATAGTCATGCGGTTTAAGAATACATCCTTGTATAATCTGACA 
TACAAATTTGTCATTTCCTGCACATGCACACCArrGTTAAAAAAAAAAAv^^ 

SEQ ID NO: 3498 ACACnTGAAACCAAATTTCTAAAACATGTTTTTCTTA^^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCTCA 

TCTGGTTTCAAGTCrCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTT^ 

TCTTCAGCAACTTGAGAGCTTrCITCATGTTGTCAAGCAACAGAGCTGTATCTGCAGGTTGG 
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CATAGAGACNATTTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC 

ATAATCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTCAGTTGTTTCATAAAC^^ 

AGGAGGACAAACTGCTCTGCCAATTTCTGOATTTCTTTATTTTCAGCAM 

gactgtgtgggcactcatccaagtgatgaataatcatcaagggtttggtgcttgcttgga^^ 
tagaacttccttcatatgtctgagtncanatgagttggncaccccaacctcttggagagggtot 

GGCANTTTGGGT^n^^AGAGTCCTTTGTGTCCTTTTTGGCTCNAGGTTGA^ 
A 

SEQ ID NO: 3499 ACTTTITIOTTTTTTT^^ 

CCATTTTCAATTTCATTTATGAGCTGCTACATTATAAATGANATGCrCT 

GTrGTTGTTGTTATNAAACAATGAAAATTCCTGTTOGGAACACAAGTTGCTGm 

TCTCTTAAATAGNNTGANAANAANTAAGGGGGAGCTGTTGGGAAAGCCCATOT^ 

AATTATCTTCTTGGTTCAGCCATOTCCACCACAGATTTTTAA^ 

AGTm'CCNATTTAATTNCTGAATATANGGAATACCC^ITAATGC^ 

CNAAAAGGTCAATTCNTGATCTGGACNCCATTTTTCGGTTCAAANTAAACT 

CCCTTNAAGGCTCNNTGCAANTCTTCGGNTNAAAAGGGGCGGCGGGGTNTCCTO^ 

TNTGOCCAANATTAATTTCTTCAAAATGCCCAAANAATTCT 

SEQ ID NO: 3500 acccccatgcaatatatggctctacaatcctcancatgttaatcgaanccttg 
ttgagcttcacaaaggttccattgaagamgamaangcgaagaagctgcancaccm 
ctttgggctcactccattnatacctwgattctgatgacag 
narit^cggtanatgctcantaacttatt^^^tgtt 

seq id no: 350 1 acaaatatccattgcttcataggttcaagttacataaattaaagtcaaataat 
tggaaactgattcaatagggaaaactatacatgaaatgaaggtcaaaaggagctatacagcaat 
atttcarrgtttatagattatgagttactttcaggaccttaacaaagat^ 
ctttgttgtattttatacttaaatatctccatacctatactgagtcaaactacttgaccaa^ 
tgatttaggaaagcatctagctttatagcacangtttttccatntacatgtactatcttca^ 
atatacatcacaatgttgacaaaaanacctcctggttccmtgaacaatg^ 
atgttaactccatggtaagtcaaataggtcctnggc 

seq id no: 3502 actccagcctaagcagtaattctctaagtttcgcaaaaaactccitcctttgg 
aaatgcagaaaagttaattcttgagttatcttcaacaactctaatggtttatagcca 

TTATGCAGAACTGAAGTAACTCrrTTAATCAGAC^TGAGTrrCTGCm 

CCAACACTGCCAGGGCTTGCTCGCCAGTTAGGTCCTCTGACATCAATAACATAGTTGATT^ 

GTTTCTTTTCTTCAGGTCCTGCAATGGGTCCCAATGCTACAAACAAT^ 

GATTACACAGAGGAATCArrTCAGTTAGCTTCTTTTTAGCCATTATAATAAA^^ 

ATTGTAGAAATTNGTATACACTAAGTATTTTACTGATGGAATCCAAATCAAGGTG 

CTTAAAAAATCCGTTATTACATCriTCTAATAGTGGTTGATAACGATATCTAACAm 

CTTTTGCTATGCTTTrTGCAACGTTGACCTCANAANAAATCTATGG^ 

T^TImTCACCAGNTGTTGTNAAAATGTCGGGGAT^^ITAAAAN^ 

SEQ ED NO: 3503 GTACACTTGAAACCAAATITCrAAAACATGTTTTTCTT 

AACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTATCCANTCTT 

TTCTGGTTTCAAGTCTNAAGGCCTGACAGACAGAAGGGCITGGAGATriT^ 

GTNTTCAGCAACTTGAGAGCTITCTTCATGTTGTCNAGCNACAJSnSIAGCTGT^^ 

AGCATANAGACGATTTGAATNTCTTCCANTNGATNTCCGGCTCTACTGCAGAGATGGGTCAACAA 

ACATATTNCTGGGGACATAOTGGCCATCAGGAGAANNGTGTTTTGNCATTTGm 

SEQ ID NO: 3504 GTACAACGCTTCAGCCTACTGCAAATCCAAACACAGGTTTGGTGGAAGATTT 
GGACAGGACAGGACCTCTTTCAATGACAACGCAGCATAGTAATTCTCANAGCTTCTCTACATCACA 
TGAAGGCTTGGAAGAANATAAAGACCATCCAACAACTTCTACTCTGACATCAAGNNATACGAATG 
ATGTlWCAGGTGGAAGAAGAGACCCAAATCATTCTGAAGGCTCAACTACTrTACTGG^ 
ACCTCTCATTACCCACACACGAAGGAAAGCGGGACCTTOATCCCAGTGACCTCAGCTAANACTGG 
GTCCTTTGNAGrrACCTGCANATACTGTTGGAGATTCCACTCTAATOTCA^ 

GACCANCACACATTCCACCCNCAGGGGGGGGTCCCATACCNCTCATGNATCTGAATCATATGGAC 
ACTCTNNTG 

SEQ ID NO: 3505 ACTGGAAGCATGCTCCAAAGACCTGTAAGAACTTTGCTGAGTTGGCTCGTCG 
AGGTTACTACAATGGCACAAAATTCCACAGAArrATCAAAGACTTCATGATCCAAGGAGGTGACC 
CAACAGGGACAGGTCGAGGTGGTGCATCTATCTATGGCAAACAGTTTGAAGATGAACTTCATCCA 
GACTTGAAATTCACGGGGGCTGGAATTCTCGCAATGGCCAATGCGGGGCCAGATACCAATGGCAG 
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ccagttctrrgtgaccctcgcccccacccagtggcttgacggcaaacacaccattm 

gtgtcagggcataagaatggtgaatx:gcgtgggaatggtanaaacaaactcccangaccgcctgt 

ggacgacgtgaagatcattaaggcatacccttctgggtanacttgctaccctcttgagca^ 

CTTGAGATGGCCCCAATGGAACCANCTTCTANATGACATANAATGACATTGTAATGCTTA^ 

rmggttttgcaagtcatgaaacttangaggcctggctctttggtg^ 
cttcngnccggaccacnctaaaggccga 

SEQ ID NO: 3506 accataataagtttgtagtagtataggctaggcttaaaactgcactcctcct 
cctccccgcccccttccaaaaaaaagaaaaagtgaaaggaactctaaagagtgtcatatgcaggc 
acatctatataaaagcrgggaaagaaattcatatccctgccattttcctttc^ 

TTAATTCAACTACCTCAAAAAAAAAAAAAAGAAAAAATAAAAGGAACCCTAAA^^ 

gtaggcacatctatataaaagctgggaaagaaattcatagccctgccattttcctt^ 

ctcaatttaattcaactacttcnacagaaatgtatgttaganaagagtatgctcanaaa 

ttgtgggtaagctaacttcttgctttnaacaatggctgtcttaaagt^^ 

GGTGGTCTACTTTTCNTTCATTAATNATCNGGTATCNCATTAAANCCCATTCAC^^ 
NATNGGGGTNGGGGAGGNTGANATATTTATNCCAANNANTCTTCAAATTATCNT^ 

SEQ ID NO: 3507 ACGCGGGGAAAAACAGAGTAGCAGCTCAGACTGCCAGAGATCGAAAGAAGG 
CTCGAATGAGTGAGCTGGAACAGCAAGTGGTAGATTTAGAAGAAGAGAACCAAAAACrm 
GAAAATCAGCTTTTACGAGAGAAAACTCATGGCCTTGTAGTTGAGAACC^ 

CTTGGGGATGGATGCCCTGGTTGCTGAAGAGGAGGCGGAAGCCAAGGGGAATGAAGTGAGGCCA 

GTGGCCGGGTCTGCTGAGTCCGCAGCACTCAGACTACGTGCACCTCTGCANCAGGTGCAGGCCCA 

GTTGTCACCCCrCCANAACATCTCCCCATGGATTCTGGCGGTATTGACTCTTCANATTCAAAG 

GATATCCTGGTGGGGCATTCTGGACAACTTGGACCCAGCATGTTCmANATGCCCTTCCCCANA 

CCTGCCNGCCTGTAGGGAGCTTCCAGANGTCTCCCAGAAGGACCCCANTTTCTTACCATC CTC^ 

TTCmGTNAGTTGGGGACNTNATCATCCNAGCCTGGAAGCCmr^ 

ACCACATTATNTACCAAGCCCCTT 

SEQ ID NO: 3508 GTACTCGGGACTGTGTCAGGAATGATGGAAACCAAATCAGGGAGAGTTTCTT 
TGGCAATTTCATTCTGCANAGAAAGATTCCGGGACAGATTCCTCANCANCGAGATGGCTGTC^^ 
TCACACTTGGGTCACCAACATGCANCATCmCGGGTGTGCTGCAGGCCACmCCTTCT^ 
CTGTCTGAGCCACTGATGTCGGCNTTGGTCCACTrCCGGCCGACCCACNT 

SEQ ID NO: 3509 ACTCATGTATTTTTTTTa:AGATCTCTrrCCCCAAGTTGCTAT^^ 

TCTGCTQCGTGTQGATGCAGTTATACACATTAAAGCAGATCTGGAGTCTGAAGTA GCTA TAAAGC 

AGCTATAAAACAGAAATACATGCATAGCTGCAGAAACCATGATAG GTAGA GGACTTTrCTTT^ 

TT^n:GTTTTGTTTTGTTr^GNTTTGT^mGG^^^ 

AATTCCAGTGAATTGTGCANAAATGCTGGTTmACACCATCCTAAAGAAAAACT^ 
TTTTGGAGTACAAAAAAGGTNATAAAGTTGGAATCTTAANT 

SEQ ID NO: 35 1 0 ACGCGGGGACATGTGTATGTGCCAGCTCACACCTAGGGGCGGGCTGCCTCTC 
TGTGTGCATCCATGCCAGCGCGTGCGGCTGCAAATCTGGTTTCCATCTTCTCCCGGTGCTCTGAGG 
AACATTTCGGGAGCCTCCAGAACTGAAGAGGGGGCCAACTCTGTGGAGAAAAATGAATGGATGTA 
GGCCCAAAGGAGCCGAATGTTTTCCTTGGCTGAATGGCAAGCAAGCACCAAGTGTCCA^^ 
AAAATCAGCCAGCTTTCCAGGCAGATACCACCCTGACTCCCCACAAAATATCTGAGTTTCCGGAGT 
GTGGATATGAACAGCCGCTTGGGTCCTGAGGTGGAANCCATGTGTGQAAAGATGGANGGCATCGG 
TTAAAAGGAGTCTAGTCCTGATGGTCACTGAGCTGCAAACCAACCrGGGCTGCTTCCTC^ 
TCACnACTAAAGAGCGAATTAAATGTGCTTCANCTACTGTACmGGGT^ 
NANATAATCCTAATCAATATGAAATATATAAANTAAACAAAA 

SEQ ID NO: 35 11 GGACGCGGGTGAATACATTTCTACTTTATTTTGAAACATTTGCCAAACTAA^ 
ACTGTAACACTGTATAACATTTAAAAATGTTAAAGAACTGCTTAGTATTAGAAGCAGATCA 
CAAAATTCTAAGAGCAGCAGCATATGTTGTTGCTTGTATAAAGCCTAGCGATAATTTTTAGACT^ 
CTTCCATGGTGCCCTGTTGGCATTAGCACTACCATTGT 

SEQ ID NO: 35 1 2 ACGCGGGTGAATACAmCTACTTTATTTTGAAACATTTGCCAAACTi^ 
TGTAACACTGTATAACATTTAAAAATGTTAAAGAACTGCTTAGTATTAGAAGCAGATCATTT^ 
AAATTCTAAGAGCAGCAGCATATGrrGTTGCTTGTATAAAGCCTAGCGATAAmTTAGACTA^ 
TCCATGGTGCCCTGTTGGCATTAGCACTACCATTGT 

SEQ ID NO: 3513 ACATTTACATTCAAGTTGATAACACCGGTGGTTTCATTTCAATACAAATTATG 
CTAGANAACTGACATOTCAKACATGGTCATATATATGCTATTTGNATTNC^ 
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TCTTGATTGAGAATCTCTrGATGATNGANGTGCANCrAATTCGTCCCGAAACTCATGA^ 
GNTTTGNTTGATGGTOTGTATTGCCCCNGATCCTCTTTGGTCTOACACGCT 
GGTGATATITGCGTCAAACACNNATAGNACGANACNAGCNAACTACAATNACNAAGATO^ 
NGTANCNCTTATTAACTGTATCAANGTTTTTCCGCAT 

SEQ ID NO: 35 1 4 ACCACAAAGGAGAAGTTGATAGGGAATCTAATTTTAGAATGTGCCAAATGGT 
CTGTGCTCAACAATATAATTGAACTCrcrCAACTCTACCTCACCATTTCrrTATC^ 
TGGCTTTGTCAAACGTTGGATTITATTTCTGCAGCCTAGTATCTCCCCATO 
ATGTTAAAGACAGCAGATGrrTTGAGGAGAATGGCATTGTGGAGATGCAGAGATGCGTTGCTAAG 
TTGAGGTGGATCCAGTATAGAGATACCTCTATTTCTTCTTTATGGCTCAAGAGCTAGGACTTGGAT 
TTTGTTTTAAGAGATGGGCAGCTGGCCGGGTGCAGTGGTTCACGTNTGTAATCCCAA 
AGGCCGAAGCNGTGGATCACGAGGTCAGGAGTTCAAAGACCAGGCCTGCCAANANGGTGAAACC 
CCNTCTTTACCTNANAATACAAAAAAAAAATTAACCTGGGTGTTGGTGGNCGGGC^ 
CAGCTACTCNNGAGGGCTATNCAGANAAATCATTTNAACCCCAGGAGANCGGACm 

SEQ ID NO: 35 15 ACGGATCACGCTTTCCCCAGGATGACCTCTGCCAGTATATCACATCAGATGA 
CCTCACTCAGATGCTGGACAACCTAGGGCrrAAGTATGAGTGCTATGACCTTTTGTCCACCATGG 
TATATCTGACTGCTTTATTGATGGTAATGAAAATGGAGACCTGCTTTGGGATTTT^ 
CTGCAACTTTAATGCCACAGCACCACCTGATCTCAGAGCAGAGCTTGGGAAAGATCTACAAGAGC 
CTGAATTTAGTGCTAAGAAAGAGGGGAAGGTTCTTTTTAATAATACTCT 

AGGCATAACTATCAATCACAAAAGTATATTCAAAAATTATATTTTGAACAACTCGAATCACTCA 

TATTTCCATATTAAAATCACAAACTCATCCATTAATGTAGATAAAGCACTGTTTGGATATGAGATG 

TAGCNAAATTCCAATACATTATTGGACTTCCATTTGGAATCCATATGGGGATACTGCT 

TCCTGTCCCTCCTCCAGGTAAGAGAGACCACAAGCAGGCTCAACATAACATAAGGCTAGAAAAAA 

TTAGATGACTGAATTTCTATGGGCATATTGATAATAAAAATNATTCCAm 

TTNTT 

SEQ ID NO: 3516 GNGTACGCGGGGGACGGCCGGGGGCATTCGTATTGCGCCGCTAGAGGTGAA 
ATTCTTGGACCGGCGCAAGACGGACCAGAAGCGAAAGCATTrGCCAAGAATGTTTTCATTAOT 
AGAACGAAAGTCGGAGGTTCGAANACGATCAGATACCGCGTAGTTCCNACCATAAACGATGCCNA 
CCGGCGATGO^GCGGCNTTATrCCCATGACCCGCCGGGCAGCTTCCNGGAAACCAAANC^ 
TTCCGGGGGGAGTATGGTTGCCAAAAANAAAA 

SEQ ID NO: 3517 GTACTGCAGCATGCACTGGCATACTATAGCTTGGTCCAGCTCTTCCANAGCCC 
CAACCACnTmCAAATTGAATCCTAAAGTTTTTGGTGCA^ 
ACTGATCTGAAACCGCTTTTCTATTTTCTTCTTTTCCCCAAAAAGT^^ 
AGCATTGTTTGCACCAAAAGGAACTACCCCTTCAAriTGTGCCAATTTGTTGTCC^^ 
CAAAGCACCCTGCAAGCTCTTTCTTTAAATGGNCAACTTGATTNCCATTCAACATTGOT 
ATTGCATTCTTATCATAAGGGTTATTANGATCTC^aTTGNATGCAACCATTTCATTAT^^ 
TCCCGTOTAATAGCGTAATTOCAACCACATGACCTNTCAAACTTCCAAATAAAA^^ 
CTTCATACCTArraNAAAAGTCAThrrGGANGGATNACNTCrrGGAim 
AAGTTGGATATAAANTGGCNGNNNAAAAATITNCATGAACTNNAAACTGG 
CTGCCTGGCAG 

SEQ ID NO: 3518 ACGCGGGATAACCATGCACACTACTATAACCACCCTAACCCTAACrrCCCTA 
ATTCCCCCCATCCTTACCACCCTCGTTAACCCTAACAAAAAAAACTCATACCCCCATTATGTAAAA 
TCCATTGTCGCATCCACClTrATTATCAGTCTCTTCCa:ACAACAATATTCATGT^ 
AAGTTATTATCTCGAACTGACACTGAGCCACAACCCAAACAACCCAGCTCTCCCTAAGCITCAAAC 
TAGACTACTTCTCCATAATATTCATCCCTGTAGCATTGTTCGTTACATGGCCATCATAGAATTCTO 
TGTGATATATAAACTCAGACCCAACATTAATCAGTCTTCAAATATCTACTCATCTTCCTAATTACCA 
TACTATCTTAGTACCGTAACACCTATCCAATGTCATCGGTGAANGGCGTAGGAATATTCCTCTGCT 
ATCAGTTGAGATCNCCGAGCA 

SEQ ID NO: 3 5 1 9 actggcgtggattctgcataatggtgatcacacgttccacctcatcctcagtg 
agttctcccgccctcttggtgaggtcaatgtctgcttrcctcaacaccacat 
ccacaccotaatggcagtgatggcaaaggctattttccgccgcccatcgatgttggtgttgagta 

CATTCCGTTCTTTTITrTTrrTGAGACAGTCTCG 

TCTCGGCTACTGCAACCTCCACCTCCGGGTTCACCCATTCTCTGCTCANCTCCCAGTACT^ 

ggcccgccaccacgcccgctaatrmgattttagtagaacangggttcaccgtgt)^ 

gctcatctcctgacttggatcaccacctcgnctccaaatgctggatacaggctgagcactgcccng 

cnnattcattctatnaanaaataccagcttatcttgatgatcgatatccatat^ 
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AAT 

SEQ ID NO: 3520 ACGAGATCCrrATGGAAATCGTCCCTTATGCATTGGTCGTCTTATTCCAGTGT 
CTGATATAAATGACAAAGAGAAAAAAACATCAGAAACGGAAGGATGGGTGGTGTCTTCAGAATC 
TTGTAGCTTCTTATCTATTGGTGCAAGATATTACCGTGAAGTCTTGCCTGGAGAAATTGTGGA^ 
ATCCAGACACAATGTCCAAACTCTTGATATTATATCAAGGTCTGAAGGAAACCCAGTGGCTTm 
TATCTTTGAATATGTTTATTTTGCAAGACCAGACAGTATGTrCGAAGACCAAATGGGT^^ 
AAGATACCGTTGTGGCCANCANCTANCGATTGAACCCCTGTGGATGCANATTTGGTAGACTGTCC 
AGATCTGCTNCCCTGTGTCTTGCTACCCAGGAAGTGTGACTTCATATGTGAGNGCTGGCAAACCGT 
TTGTAGGAAACCTCATNACCAACTGAGGTAAACACCTGTGTGAAAAAATTGGAGNTTG^ 
TAAGCAAANATTGTCT 

SEQ ID NO: 3521 ACCTGAAGCTCAGGAGGAGATGAAAGAAGTAGCCAAACACCCAAAGAATCC 
TGAGGTTGGCTTGAAGCCTGTGTGGTATAGTCCCAAAGTTTTCATTGAAGGTGCTGATGC^^ 
TTTTTCGGAGGGTGAGATGGrTACATTTATAAATTGGGGCAACCTCAACATTACAA 
AAATGCAGATGGAAAAATCATATCTCTTGATGCAAAGTTGAATTTGGAAAACAAAGACTACAAGA 
AAACCCTAAGGTCACTTGGCTTGCAGAGACTACACATGCTCTTCCTATTCCAGTAATCTGTGTC^ 
TTATGAGCACTTGATCACAAACCATGCTAGGAAAAGACGAGACTTTAAGCAGTATGTCAACAGAA 
CAGTAAGCATGAAGAGCTAATGCTAGGGGATCCCTGCTTAAGGATTGAAAAAAGGAGATTTATAC 
ACTCCAGAAANAGATCTTCTATGTGATCAACCTTATGACCTGTANCCANTANTGCANGAGCCCGGT 
GTTTGTATCATTCTGTGGCCACAAGGAATCCAT 

SEQ ID NO: 3522 ACGCGGGGGAGATGGCAGATGAGATTGCCAAGGCTCAGGTCGCTCGGCCTGG 
TGGCGACACGATCTTTGGGAAGATCATCCGCAAGGAAATACCAGCCAAAATCATTTTTGAGGATG 
ACCGGTGCCTTGCTTTCCATGACATTTCCCCTCAAGCACCAACACATTTTCTGGTGA^^^ 
AACATATATCCCAGATTTCTGTGGCANAAGATGATGATGAAAGTCTTCTTGGACACTTAATGA 
TTGCAAGAAATGTGCTGCTGATCTGGGCCTAGATTAAGGGTTATCGAATGGTGGTGAATGAAGGT 
CANATGGNGGACAGTCTGTNTATCCCGTTCATCTCNTGTCTNGAGGCCGCAAATGCATTGGCCTCN 
TGTTAACACGTTTTGGGGATAATTTCTCTTCTITANGCATGATTAA 
GAACACCTTATTITrGCCTGTNNTGGAGAATTCANAAATATTTTAAA 
TNTNACTCAAAAAAAAAAAAANAAAA 

SEQ ID NO: 3523 ACTTTriTTTTTTTTTITI^^ 

ATTTNGGTCTTACAAATGATCACTTTTAAATGGACTTTTCTGTAAGAATGTi^^ 

GCCAAGTOTGTATCTGATCCACACAAATCCCTAGAAAGGTTTCTGTGTAGTOT 

TCTTTGAGAATGTTTCACTCrrACTGTAGGATCTTGAATNTGTTTTAC^ 

TTTTATGCAAGTGCATTCATTGTAAACTATAAATAACATTTGTATTTAAAANAAAGCTGGG 

AAAANTAGAGAGACrCTNGGGANCAGGCAATCTGTTGAGGCTNAGTTATCTTATTTGOT 

CTGNTTGCATTCCTTCAAAGACACACAGCCATGAGGGNNGTATlSri^ 

SEQ ID NO: 3524 ACTTTTTTTTTTTTTTT^^ 

GGGATAANATGGTITCTTGGGGGATAGATTCAAGAGGAGTTGAGAATGTTTT^ 

TCCCmrCCTGGAAGGGTGGACAGCAAGATTTAGGACAAGCTAAAATCATCCCCTATTTAy^^ 

AAAAAAAAAAGTCACCANCAANTISOTCCCGGNTGGGAGGNGGGANCANANTAAA^^ 

AATGATTCCrANTTGTTTTCAATACANAACCITGGGAAGGGNT^ 

AACTNCAGTTATNTTGGGGAAGGTTTAAGGTCrrTC>R^IGCTTGC(^^ 

TTNTCTGGCA>rmCTCAACTTTTGGCTGGNCCTCTTGOT 

GATT 

SEQ ID NO: 3525 ACAAAATGTAGATCTATTTATTTAGCACTTTGTTCACTCAGATAAATTTA^^^ 
TGCATATCTAATGAGATATGCAATCATCTCCAAAGATTATAACTCCTATAATTCAAAACAA^^ 
CCCTTTAAAGGTTATTCACTATATGAAAACAGAAAGGGAACAATTACAACAATGA^ 
ACTTTTTCAAATCTGATTCTAATGTGTATTGACATATATTTAATTATATAAA^ 

gggcattaacagtacgcggggactgganacactgaagaaggcaggggcccttaaagtcttggttg 

CCAAACANATTTGCAGATCAAGGANACCCAGGAGTTCAAAAAACNCTAGTAAGNNT^ 

tgcnctcctacatcctcagggtaggaagaaaaggnttccaaacatgcggtgntcnattgttgact 

CCTGCCAAAACAGGATNCTGG 

SEQ ID NO: 3526 ACGCGGGGGCTTTCCACTATGGCTTCCAGCACTGTCCCGGTGAGCGCTGCTG 
GCTCGGCTAATGAAACTCCCGAAATACCGGACAACGTGGGAGATTGGCTTCGGGGCGTCTACCGC 
TTTGCCACTGATAGGAATGACTTCCGGAGGAACTTGATACTAAATTTGGGACTCTTTGCTGC^^ 
GTTTGGCTGGCCAGGAACITGAGTGACATTGACCTCATGGCACCTCAGCCAGGGGTGTAGCCi^^ 
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TAGACAAATGGAATCCTGTGCTGAACCCGAATCITCCAAAAAACAGCCTACAATCTGTGACCAC^ 

ACAAGATGTGCCCTGATGGCANCTGAAGTTTGATTCAAATGGCCTTTTOT 

TCCTTTTGTTCTTGATCCACGCANAATTCATTCTCTGGTCACAACAGGCTAACTAA^ 

TTCTGAAAGWGACCATGACTGCCCCAAGCTACANGANATCTTTATTTGG^^ 

TGAAAAATTTANT 

SEQ ID NO: 3527 ACAATTACCCACCACTGGATTTGACTCAGAGAGGACCCCCAGAGGGTGTCTC 
CATCTTCCCTATTTATTTTCAGCCCTTGAGGGCTTCArrGTAGATCAA^ 
GGTGACATACTCCTGGAAGTTCACCTCCTGGTCCTTGTTCCGGTCCAAGTCTTCCATCAGCOT 
ACCCGCGT 

SEQ ID NO: 3528 ACGATAATCCCACACCATATCTTGGATTTCTTGGAAATTGACTCAACTCTCCA 
TTCTAATAACATCTCCATTCTCCAGGAACTGTACTTTIT^^ 
AGGTTGACATirrCCATAACAGGTGTAAGAGTGTTGAAAAAAAAATTCAAATT^ 
AGGGAAGGAGTTAATGAAACTGTATTGCACAATGCTCTGATCAATCCTTCTTT^ 
CAATTTAAGCAAGJNGATGTGCAAAAAAAATGGAAGATTCANCTTTCANTTAAAAAAAAAN 
AANAAATGGCCAAGAAAAAGTTTTTCAAArrCTTTCTT^ 

CAAACTGGGCCATGGCANAAAAATTCTGNTNACACCNCCGANTCCAAGGTCCATTCAAGGANGCN 

AGCGGANGCTATTGTITGGNTCAATGATTGCTrCCCATCTTTGCTTTTA^ 

TAGGACCTGTGTGCCT 

SEQ ID NO: 3529 ACTTTTTTTTTTTTTTIT^^ 

AGTrCATAAAAAAATACrGCCCTGATATACACAAAATTTTCTACT 

CACCAATATTCAGTCTANATTGGTTTAATCTTGAAGTGTAATCCAATAAGACTGAAGACCAAACAC 

rrCAGGTCCTGGACAAGATAATAAAATACTa^TAAGCCrrCTGGATCCTrGGATTGAT^^ 

ATAAGGGAACCAATTTTTGATGTTGAAAAANAAATGNGTCATCTCCAATO^ 

CCGGCCCACTO^GNCAGGANGAGGGCACAATGCATAATCCrCITrGNAATTNAC^ 

TTCTCTTCAGTNCrrCATCCGCTTTATGNCCTGCCGGCGGCCGTCNAAGGGC 

SEQ ID NO: 3530 ACTCTCTCAGCrCAGGTCTCTTAGCTTTTAGTGTTGGTGTCAGCAAGTCATm 
GAACTGAGAACATGTCAGAATGGATGTGAATGGCTTTAACCTGCTCAAAAGAATGGAGTCCAC^ 
TCTTTTCCTAACCTCACCATATCTTCCAAAATGGCrrrcr^ 

CATATGTTCTTTCAArrCCTCTCTTCTGGGCCCAGGAGGGCATAACTTCAGGGTCAG 

TGCCTACCAAAAAGGarmAAGCTGCCCCATGGACATAGATTTGCGCCACAGGT^^ 

TAGATGTCrCAATCTCTCGGTGCAACATATCTCCTGAGCAAGTTTAAATATATGCTT^ 

TAATTTAAGAGTTCCTGCCGGCACCATTTCCGATGCrCCAGTGTAANCAGCCATCGTG^^^ 

CTTCGTCTGCTGATCTTTAAGTANCTTTGACACATTGGNCTTAACATATC^ 

TAGTTAGTCCTACT 

SEQ ID NO: 353 1 ACTGCTTGTTCTCTGTGGAGGAGATGATTAAAATAAAAATCCCAGAAGGACA 
GGAGGATTCTCAACACTTTGAATTTTTTTAACTTCATT^ 

AATGCTTATTTCCAATTAAAACGCCTACAGCTGCCTCCTAGAATATAGACTGTCTGTA^^ 

ACCTATAATTAGTCATTATGATGCTTTAAAGCTGTACGCGGGGCCTTTCGGCCGGAACCGCCATCT 

TCCAGTAATTCGCCAAAATGACGAACACAAAGGGAAAGAGGAGAGGCCCCGATATATQTTCTCTA 

GCCTTTTAGAAAACATGGAGTTGTTCCTTTGCCCATATATGCAATCTAT/^ 

ACATCAAGGAATGGGTCCTGCCCGGCGGCCGTCAAANGGCN 

SEQ ID NO: 3532 ACTTGAACTGGAGGGCAAGAAGTGGAGAGTGGAAAATCAGGAAAATGrnN 
CAACCTGGTGArrGAGGACACAGAGCTGAAACAGGTGGCTTACATATACAAGTGTGTCAACACGA 
CATTGCAAATCAAGGGCAAAATTAACTCCATTACAGTAGATAACTGTAAGAAACTTGGCCT^ 
rrCGATGACGTGGTGGGCATTGTGGAGATAATCAACAGTAAGGATGTCAAAGTTCAGGTAATGGG 
TAAAGTGCCAACCATATCCATCAACAAAACAGATGGTTGCCATGCTTCCTGAGCAAGAATTCCC^^ 
GATTGNGAAATAOTCAGTGCCAAATCTTCCGAATGAATGTCTATTCCTACAAAANGCGTGACm 
TGAATTCCAGTTCCTGA^mTTCAAGACCCTNTGAACNGGCANAATTGGCACACAGTGC^ 
TGTTGATANCCAAGTGCCCTGGGTCTTGCCTCCCTTCCACNATGGGTAAATCTGTTAAACGGTCm 
CTAAATTCTTCCTTTTGCITTAACTGTTN 

SEQ ID NO: 3533 ACTTAGCATTGATCAAAGAAATTTCAAATrACGATCAATTGGGTGGGGAGAA 
GAATTTTCATTGTCCAAGCACCCTCAGGGAACAGAAGTCAAAGCAATAACATATO^ 
GGTCTATAATGAAGAGAACCCGGAAGTTTTTGTGATCATTGACATTTAAGACACCAAAAAATAAA 
AGACTCCTACGAAGAAATrrCATAAAAAAAAAA 
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SEQ ID NO: 3534 ACATGTTTACAATACCAAAAAAGAAAAAATCCACAAAAGCCACTTTATm 
AAATATCATGTGACAGATACTITCCAGAGCTACAACATGCCATCTATAGTTGCCAGCCCTGGTCAG 
TTTTGATTCTTAACCCCATGGACTCCTTTCCCTTT(nTCT^ 
TCTTTTTTTTAACTGAGTTGAATTGAGATTGATGTGTTTTCAC 
TGCACTTAACAATATGAAATAGAAACTTTTGCTTTACTGAGATGAGGATATGm 
TTGGATAATGTGGGAAAATGACATCTAAGCTTTACCTGGCACCATGTGATGTGATCAGATGCTTGA 
ATTTAACCrTTTCACTTGGTTCITATCTGATGCCNCrCT^ 
TACTGNTTGAACATATGGAAGTTTAATTmGTATAAAGATO 

SEQ ID NO: 3535 ACGCGGGGTGTATNATQCCTGrrACTANTATTCACATGGAACAAATTGCTGCC 
GTGGGAGGATGACANANAAGCNTGAGTCACCCTGCTGGATAAACrrAGACTTNAGGCTTT 
rmCAATCTGTNAATCATAATCAGGTCACTGGGATGTTCAACCTTAAACTAAN 
GGTTATTTAAAAGATTTTCAGTANTATNCTAAATGCAAACATTTTCATTTAAATO^ 
TTNGNTATTTCANTAANANAATATATATTCATGGTCATTCTTAATNAGCAGG^ 
TTNTAATGNTCATAATCACCTTTGATNCACTGGTTNAATTGCGGCTAAATACAAm 

SEQ ID NO: 3536 ACAAAAAAATTAGCTGGGCATGATGGCGTGTGCCTATAATCCCACCTACTCG 
GGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAAGTTGCAGTGAGCCGAGATGGCG 
CTACTGTAGTCCAGTCTGGGCGACAGAGTGAGATTCTGCCTCAAAATAAATAAATAAATAAAAAT 
TTGTATTCCATTGATTTGGGTAGACACCAGGAATGTGCATTTCTAACAGGCTTTC^^ 
ATAGTAAGTCATCTGTGGACTACTTTAAGAAACTCTTCTATAGAGAATGGAGTT^ 
AGGTGATTITITACACTGGACTGATTCACAAGAACCTAAACAGTAGTCCATG^ 
TGGNACTATTTGCCCCGCTCACTCTGAAAGCANAGGAGATGTTGTTTACTTTGT^ 
TGAGATAATTTTGAATGAAAGTTTTCTCTTATGCnTCCTGGTCTTTC^ 
GCACNTGCATCATACCTTTAAAAGATGCATTTN 

SEQ ID NO: 3537 ACACTTGAAACCAAATTTCTAAAACCATGTTTTCTTAAAAAATAG 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCTCACACTTCT 

TCTGGTTTCAAGTCTCAAGGCCCGACAGACAGAAGGGCTTGGAGATTTTTT^^ 

TCTTCAGCAACTTGAGAGCriTCTTCATGTTGTCAAGCAACAGAGCTGTATCT 

CATAGAGACGATTTGAATATCrrCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC 

ATAATCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTCAGTTGTTTCATAACCAGATTGA 

GGAGGACAAACTGCTCTGCCAATTTCTGGATTTCnTTATTTTCAGCAAACAC^ 

CTGGTGGGCACTCTTCAAGTGATGATATCATCAAGGGTTGTTGCTTGTCTGGATTATATAGAGCT^ 

TCATATGTCTGAGTCAGAGATTGGCACCCCACCCTGAGAGGCTGGGCANTTGGGC 

SEQ ID NO: 3538 acacttctcgaaatctaattgggggcgctgacatcattgtgatcaaatacaa 

CGTTAATGACAAGTTTTCATTCCATGAAGTAAAGGATAATTATATTCCAGTGATAAAAAGAC^ 
AAATTCAGTNCCAGTAATTATTGCTGCTGTTGGT 

SEQ ID NO: 3539 ACCATGGAGAAGGAGTCGAAAACCACCCGATTCTGTCTTATCTGTAACTATG 
TCAGTCGAATAATTGAACCCCTGACCICTAGATGTTCAAAATTCX^^ 

AAATTCAACAGCAGCGATTACTAGACATTGCCAAGAAGGA^VATGTCAAAATTAGTGATGAGGGA 

atagotatcttgttaaagtgtcagaaggagacnrraagaaaagccattacatttot 

actcgattaacaggtggaaaggagatcacagagaaagtgattacagacattgctggggtaatacc 

agctgagaaaatttgatggagtatttgctgcctgtcagagtggctcttttgaca^ 

tggcaaggattaatagatgagggtcatgcaacactcactcgtcantcactncntgatgtgggtnt 

agaaatnact^m:tgttaacagantntatttcacanaaaact 

ctgagaacattgnctatctcctttggcantggac 

seq id no: 3540 actttttttttitrt^^ 

acttcaagircgngtcattxttacatggcaattacrrtnaaatacc^ 

tccaaattcacgttttgcitgacactttgtatttctaaat^^ 

actgatttcratmaattttatgaatcctagctititraa;^ 

gactttctgaaattaiwgcaaagngaccgaggcacttggcaacactcot 

antagtgccttctgtcattcaatccntacaactcanggggaggaatcccccctt^ 

gtatatta 

SEQ ID NO: 3541 acagacaa aacaaaatctgccttaggctgtgtgataaacctatcatacctcc 
atcatcaaaaaaccttcagaataatttgggggatgcitacaaatgcattcatatacac^ 
tggaaatcttaaatatgttgcccagtcttccacaatccatgccatacatgggot 

CrrCCTTCCATTGCAGGAAGCTCCTTCTAGAAGCCITGGGTGGGTAATTTGA^ 
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AGATTGATGGTCTGACAAGTCTTCAATGACTCACTGGACAGTCTCTGCTGAAAGTCTGCTCTCAT^ 

CAGGCACAGAGCATGGTGCACANATTCATACTGCTCACTGGrrACACCATrCCACCnTrATCCT^ 

GAAGCTGGCANACAATGCITATGCATNCACACTCCTTCTTTTCAGCTGT^^ 

AATAAACACCCTGTNTACTATTCTGCCTGAGTGACACCCAGCCTCGNCCTGGGAACAAGCTGCT^ 
TTNCATCCACT 

SEQ ID NO: 3 542 ACATTTTCATGACTGGGGAATGGATTTTCTGAAGTCATCTTCAATAGGGCAAA 
AACTTAGAAACAAAAAAAAAAAACAACCTGAATGTTGAGTTCAGTTCTTTATATA^ 
AAAAATGAAAGAATAAAAACAAACCGAAAAAGAGGGGCAGGGTAAAATITTTTAA^ 
AGGAAAGAGAGGAAAAGAAAATAAAATAAGACGATTTATTGCTTCTCCTCAGTATCCTCOT 
CTCCTCCriT CACC GAGAGAGCTTCTAGCTTTTCCGCCACTm 
TCCTGCTTTCTTTTCTCTCTCTNCGAACTCTT^ 

GCOTTCTNAGCAITCAGGAAGCGGGATGGCCAANCANCTTTTGGCITGG 

TACAAGCGNNGGGTGTTCCANACCCAGGCANCGGGNNlsrrACTGlWTTGGGCT^ 

NGGCGTGATGTAATGGNTOGNANAAATTTTAAGGTCTTNCCCTCC 

SEQ ID NO: 3543 ACGCGGGGAGAAGCTTGGACCGCATCCTANCCGCCGACTCACACAAGGCAG 
GTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAAATTCCAGTGTCAGCArrCTTGCTCC^ 
CCTCTCCTACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACT 
CTCGACCCAAACTGCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAGACA 
TATNAAGAAGCTCTATATAAATCCAAGACAANCAACTAACCCTTGATGATTATTCATCACTNGGAT 
GAGTGCCCACACAGTCAAGCTTTAAAAGAAAGTGTITGCTGAAAATAAGAAATCCANAAAT^^ 
GAGCOTTTTGCCTTCTCAATCTGTTTATNAAACAACT^ 
GCCCCAGGATATTGTTTNTGACCCACTNTACANTTGANCCANTTTACTGG 
AGCT 

SEQ ID NO: 3544 acgcgggggtctggagctgcctgaggatgaggaggagaagaagaagatgga 
agagagcaaggcaaagmgagaacctctgcaagctcatgaaagaaatcttagataagaaggto 
agaaggtgacaatctccaatagacttgtgtcttcaccttgctgcattgtgaccagcacctacggct 
ggacagccaatatggagcggatcatgaaagcccangcacttctggacaactccaccatgggctat 
atgatggccaaaaagcacctggagatcaaccctgaccaccccattgtggagacgctgcggcanaa 
ggctgangcccgacaagaatgataangcagttnatgacctggtggtgctgcttgtttgaaac 

CTGCTTCTTCTTGCTTTTCCTTGNGGATCCCNAGACCACTCCACCGCAT^ 

GTNTAGNTTAOTGAANATNAGTGCACAAAGGACCCATTNTTNATTCCTGNGAATC^^ 

ATAAGATCGTTCCATGGAA 

seq id no: 3545 acataatcgtntgtggagtcggcacagttcaggttatggaggcacgtaattc 
accaaagtgcaaaaaaggcaaaggaaaacacgctgcattgtagaataaggcattcaaatgtgct 

GT TAAC GTTTAAGGCAGCTAATGGCCAAAACAGGCAAGTCAAGAAAAGTGGTCTGGTTTGGAGGT 

gattttgcatctagaaggcattctcttctcgtgacctcaaagactgagcactgtagagca 

cttcctcaaggccaatgatacitcagataccagatggtttcatttttcaat^^ 

ggttgagttgggccagaattgcaatcagccaaaagagatagcagcaaactgaacaggtcaccaa 

caiggtaatgatactcccggitaggacccttagggatgaaccaaggcccaagatccgacnanccc 

anaccgctctccatgagagccagtgaggccgtgatcccntgnncccgctgacgcccccgntctgc 

cggcggcg 

seq id no: 3546 acagattgcctatttgaggaccttggccgcrctgtaagcatctgactcatct^ 
agaaatgtcaattcttaaacactgtggcaacangacctagaatggctgacgcattaaggtm 
cttgtgtcctgttctattattgttttaagacctcagtaaccatttc^ 
tctccatagtatttcagtcatggaaggatcatttatgcaggtnggcattccaggagtii^ 
ttctgtctcaaggcataggtgtgtattgntccgggactggtttgggtgggacaaagttaaaat^^ 

CTGAAGATCACACATTCATACTGTTG>n^CTKrGGNNGmTrANGATTC 

TCTTGCACnTSINCATACTATa^ANTTNCATTGGNN 

NAGGGTCTGGOTGCTNCTGCnTrGCCTATGGTTGGCCTTTGTGCAC^ 

SEQ ID NO: 3547 ACGCGGGGrrGAGGCTTrGCAATTrCACrrGTGrrAAAGGCTCTGGCATTl^ 
CCATTTCTATGCAAATTTCTTTGAAGCAGAATTGCTTGCATAm 

AGAGTTTCTTTCAAACTTCACTGAGGCATCAGTTGCTCTTTGGCAATGTCCCTTAACC^^ 

AACTAAGTTTGTGGCITGAGTTrACAAATTCTACTTGTTGCATTGATGTTC 

TTTTAGTTTGATTGTGAAAAAACCCTGGGGCTGAAGTTGGNATTrCAGT^ 

ACTAGTCCCAGATTTGAAAACITGTAATAAAATTGAAACTCACTGGriTrCT^^ 

NGNAATCGAGTTTTGATCATAmCTATTAAAGTGGCTACACCCAAAAAAANAAAAAAA^ 
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TNTANTCGGNCGNACCNNCCTAGGCGAATCCACTCATGGNGNCGTACTANGGNCCACNTCGGTC^ 
AGCTTGCTNTATATGGCA 

SEQ ID NO: 3548 ACAGCCTTGTGGCCAGCCTTGACAACGTTAGGAATCTCTCCACTATCTTGAAA 
GCTATTCATTTCCGAGAACATGCCACGTGTTTCGCAACTAAAAATGGTATCAAAGTAACAGTG 
AAATGCAAAGTGTGTGCAAGCAAATGCTTTTATTCAGGCTGGAATATTTCAGGAG TT^ 
GGAAGAGTCTGTTACTTrrCGAATTAATTTAACTGTCCTTTTAGA 

AGTCCTATGCCAGGGACTTTAACTGCACTTCGAATGTGTTACCAAGGTTATGGrrACCC^ 

CTGTTCCTGGAAGAAGGAGGAGTGGTGACAGTCTGCAAAKTCAATACACAGGAACCTGAGGAGA 

CCCTGGACTTGATTTCTGCAGACCAATGTATTAATAAAATTATTCTGCAGTCAGANGGGCTCC^ 

AGCATTTCTGAATTGGATATACNAGTGAANCCTACAATACCATGCTCTGNAGCCTAm 

ACTTTGGAATGCAGAGTTCCACTTN 

SEQ ID NO: 3549 ACGCGGGGACTGCGATAGAAATCATGTCTGGTCGCGGCAAAGGCGGAAAAG 
GCITGGGGAAGGGTGGTGCTAAGCGCCATCGTAAGGTGCTCCGGGATAACATCCAGGGCATTACA 
AAACCGGCTATTCGCCGTTTGGCTCGGCGCGGTGGCGTCAAGCGCATlTCCGGTCTrATCrATG 
GAGACTCGAGGTGTGCTTAAGGTTTTCTTAGAGAACGTTATTCNAGACGCCGTCACCTATACGGAG 
CACGCCAAGCGCAAAACTGTCACAGCCATGGATGTAGTATATGCCCTAAAACGTCAGGGGCGCAC 
TCTGTATGGCTTCGGCGGCTGAATCTAAGAATCGCGGTCTCCTGAGAACTCCAAAAAAAV^^ 
AAAAAAAAAAAGTCCACTTAAACCTGCCAGGTCCCAGNTCTATTTATGGTGC^ 
TGCATCCAAACAGGITAGTGGAGTGGTNTGGACANATTTANTCTATCATCACANATTGGACCTAA 
GATTTGCAGGNTTTCT 

SEQ ID NO: 3550 ACGCGGGGACTCAGAAGCTTGGACCGCATCCTAGCCGCCGACTCACACAAGG 
CAGGTGGGTGAGGAAATCCAGAGTTGCCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGT 
GACCCTCTCCTACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGG 
ACTCTCGACCCAAACTGCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAG 
ACATATGAAGAAGCTCTATATAAATCCAAGACAAGCAACAAACCCTTGATGATTATTCATC^ 
GATGAGTGCCCACACAGTCAAGCTTTAAAGAAAGTGTTGCTGAAAATAAAGAAATC^ 
GCAGAGCAGTTTGCCTCCTAATCTGGTTATGAACAACTGACAACACCTTCTCCTGATGCC AGTTG T 
CCAGGATTATGTTTGTTGACCCATCTCTGCAGTTAGACCGTATACTGGAAATTTCAATCGCT^ 
TACCAACCTGCAA'mAGCTTGTG 

SEQ ID NO: 3551 ACCACAGTTCACAAGTGCAGGAGAGAATTTTGATAAATTGTTAGCTGGAAAG 
CTGAGAGAGACTTTGAACATATCTCGACCACCTCTGAAGGCAGGGAAGACTCGAACCTTT^ 
TCTGCATCAGGACITCCCCAGCGTGGTGCTAGTTGGCCTCGGCAAAAAGGCAGCTC 
AACAGGAAAACTGGCATGAAGGCAAAGAAAACATCANAGCTGCTGTTGCAGCGGGGTGCAGGCA 
GATTCAANACCTGGAGCTCTCGTCTGTGGAGGTGGATCCCTGTGGAGACGCrCAGGCTGCTGCGG 
AGGGAGTGGTGCTTGGTCTCTATGAATACGATGCCTAAAGCAAAAAAAGAAGATGGCTGTGTCGC 
AAACTCTATGGAAGTGGGGATCAGGAGCCTGCNAAAAGGAGTCCTGTTGCTTCTGGCANACTTGG 
CACCCAATTISrrTGAACCCACCATGANATACCCACCAAArrGCTAArrTTGA^ 
ATAGTAAACCGAGNCATTAAACCCA 

SEQ ID NO: 3552 ACCCTTTGGATrrCAAACAGTAACATCGGATGTAAACAAACTTAGTTCCT^ 
ACTCACTGAAACTAATCAAGCGGCTCTACGTAGACAAATCTCTGAATCTTTCTACAGAGTTC^^ 
GCTCTACGAAGAGACCCTATGCAAAGGAATTGGAAACTGTTGACnTCAAAGATAAAT^^ 
ACGAAAGGTCAGATCAACAACTCAATTAAGGATCTCACAGATGGCCACTTTGAGAACATTTTAGC 
TGACAACAGTGTGAACGACCAGACCAAAATCCTTGTGGTTAATGCTGCCTACTTTGTTGGCA^ 
GATGAAGAAATTTCCTGAATCAGAAACAAAAAGAATGCCTTTCAGAGTCAACAAGACAGACCC^ 
ACCAGTGCAGATGATGACATGGAGCCACGTTCTGNTNGGAAACTTGNCAGATAATTGNAGATCAT 
AGAGCTTCTTTTCAAATAGCTCTCACATGTCTNCTCTCCCA 
A 

SEQ ID NO: 3553 ACTTCAGACAGGATCCCAACCCCCACCCAAATTCAATGTCGACCGTCTGAGC 
AGCCAGCTTCATTGGCTGCAAACGCCTCTCTCAGGTGAGTCAAAGGAGACACGACGGGGAACCAG 
GGGGCCCTAGGTGAGGATGTCATGGGCCTGGTGCTCCACCAGCATCTCCATGCTCTTCACATCCGT 
GCACCAGAACTCCAGGCGGTCCTTCATTCCCITGATCTGTTGCAAATCCAACACT^ 
CCAGGTCATGTGGACTCGTTTGTCCACCTCGTCTATACTGCCTTTCACCAGCCCCACCGAAAGGGC 
CITCATCACCAGAAGCTCCACCTCATTCACTGTGATTTTAGCACTTTTGG 

AGTTGTCTGTGATTGGCAGGTCGTGTGAAAGTCATCTCCATGAGGCACAACAACTGAATTTTCCTC 
AGAAGCTGGGCTTCATTAGCTGCTAAATCAGGCTGCTGGCCCCAGGCAAGTCTTCAGAGTCTGGA 
ACCGCTCTACGTTGCCACTGTTGAAGGCATAGANGGTGTCAATCAGCCACTGCCGGTCAGTATTNC 
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TCAGGGACTCCAGCACANGGTGCATGAAGAGTCrCCAAAGTTAAAAACTCCCTCCCGANAAGTCC 
TGCTANCCCAACG 

SEQ ID NO: 3554 ACTTTTTCCCCTAATTCCAACACTAGTTTATATATATAGCGAATAAATCTAGT 
TGTATAAATTTITAAATGCCGTCNNTAAAAAGCACACAANGTTATGATT^^ 
TGATTTCTTTCACTTCTGATCCTITrCCT^^ 
ACGATGGGTANGAATTTTGAGATTAATGTTAATTTTCCCTTTITG^ 
ATGCTTTTGTCCANAAGGATCA>mGAATTCTACCATCCCTTGGGTCTTNGT^^ 
ATAAAGGTNGACTCAGNCmAANATATTAGACANTTTTTTTA^ 
ATNACTTNCTATANAATATTTrGGCTTCGAACTATANCTCAAATAGTTT^ 

SEQ ID NO: 3555 ACGCGGGGAGGTTGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAG 
CTCTATATAAATCCAAGACAAGCAACAAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCAC 
ACAGTCAAGCTTTAAAGAAAGTGTTTGCTGAAAATAAAGAAATCCAGAAATTGG 
GTCCTCCTCAATCTGGTTTATGAAACAACTGACAAACACCTTTCTCCTGATGGCCAGTATGTC^ 
AGGATTATGTTTGTTGACCCATCTCTGACAGTTAGAGCCGATATCACTGGAAGATATTCAAATCGT 
CTCTATGCTTACGAACCTGCAGATCAGCTCTGrrGCTTGACAACATGAAGAAAGCTCTCi^ 
TGAAGACTGATrGTAAAGAAAAAAATCTCAAGCCCTTCTGCTGTCAGGCCrrGA^ 
AAGAAGTGTGANAAACTGCTANGTGGAACATAGGAACACACTGATTAGGTATGGTTATGTACAAC 
ACTTTTTrAAAAAACAGTTTAAAAT 

SEQ ID NO: 3 556 ACirmTiTiTrn^^ 

CATAANATTTACAGAAGTTTCCANACAANCCATACAAAATGGTCACAANCTTT^ 

GAATCTACACTTGACAGCAATGTTNTTAGTGAGGGCTGTGATGTTTGTTTAATGT^ 

TCCAACAATCAAGCTTGTCCATCTACAGCGTTAAATAAAGTTAGACTTGGCTAGAGCATATTCT 

AGACCTGGTrAGCTGCTTTTAACCAATGCAATTAQATCCCAAAAAAGGGGGAAAGGACCCATAAA 

ATTAAACTACCTCCCCCCTCAAAATAAAATTAAATNANATAAANAAAAACCCCCAO^CC^ 

TACCTGACACTCCTTATCACAGTGCTTATCTTAACCAGGNTGGGGGAAATGAAT/^ 

GCCCTGTTTTAAACGTTCACACATCCAANGAACTT>n:ACC^ 

AACTA 

SEQ ID NO: 3557 ACTTTTTTrTTTTTITI^^ 

ANCANAGTGGCTCCAGGCCCTTCACGCCTNTNANACACCACCCATGAGGGTTTAGGAAGGTGCCA 

TCATTCTGTGAAGGCCCANAGOTACCCAAGTCTTGGAGCCCAAGTTGAATCACCAACCANAGGG 

TTGGGAGAGGAAAAGGAAACAGGCANAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGAT 

ATCAAGGGAATGCTGAGGTCCACCAGTGTCTCCTGAAGGCATGCTGCATCCTAAGGCTCCTCAGG 

ACTGGATGGAGTAGGANATCTOTGTGTTGAGCAGTTCACATCTATATGGCAACTI^ 

CTTGATTCAGGCTCAATGTTATGGTTGGGAAGGTCCGCTGACCIWGAAGCTCTCCT^ 

ANGAACroTAAGNTCAGCCCATNTTGACGGCCANAGGCTCCAATATAACT^ 

ATGGGTCN 

SEQ ID NO: 3558 ACAGCCTGICCTCCCACCCAGGAACACCCTCTGCCTAATGACAGGACCTTCCT 
GGGAAGCATCTTGACAGCAGTGGCAGATGAAGAGCCAGAATCAACTCCTGTGCCCITGCT^ 
GTGACAAGAGTGCTTTCACCCGAGTAGCATCAATGGTTTCCCTTCAGCCCGCAGAGACCCCAGGC 
ATGGAGGAGAGCCTGCAGAAATGAGTATTATGACTACTGAGCTTCAGAGTCirrGTTCCCTGC^^ 
AAGAGTCTAAAGAAGAAGCCATCAGGACTCTGCAGCGAAAAArrrGTGAGCTGCAAAGCTAGGCT 
GCAGGCCCAGGAAGAACAGCATCAGGAAGTNCAGAAGGCAAAAGAAGCAGACATAGAGAAGCT 
GACCAGCCTTGTGCTTGCCTNCAAGATGAAAAGGACTCAGAAAGTGATCAGCANCT 
TCTAGACNATAGACAAGATGGCNACTATANCTTANAAGGAGGNACCACTTCCGTT 

SEQ ED NO: 3 559 ACGCGGGCTCATTTTAAAATTGGTGCTTTCCACAACATGCATCGAGACCATCT 
TGGAGCATTTACrrrTGAAGCAmTGTTTAAGACCCCGGATAAGAAAA 

GAAGTGACTTGTCCAAGATCAACAGTGAATTATTAGTTGGAACGCCAGCCTGATACTCCTAGCTAT 

ATCTCACTGGAAAAGCATTGGAGAAAATGAAACCATTTTAATATTCTAAGCT^ 

AATAGGATTATAGGCGTGAGCCACCATGCCCGACCAGTTTCTGCTTTTArrAAAATTGT^^ 

TTTATACATTCATGTTCATTAAAAATGCTATTTAGAAAAGAGTTTGATAAAATAAA 

AATTCGAAGAAAAAAGAAAAGAGTTTCTGTITCAGTCACAAATTAGGGTrATO 

ATGATGACCGTTGAACAAATGTGAAGAATACTGTGAATTCTATGACTTTATCAAAAATCAGTCACA 

TCCAGGAGCTTOCAGTTGTTGACCAAATGAATGATGACATAGAGTAGTTCAGATCTATCATGTGCT 

CTTCTATCTAATCAAGCAATATITCCITrGGNCCTCAGCCAACATTCA^^ 

TCAT 
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SEQ ID NO: 3560 ACAGTCATTAGATAGCCGGAGTGTAAGTGAAATCAATTCAGATGATGAATTG 
TCAGGCAAGGGATATGCTTTAGTGCCTATTATAGTTAATTCITCAATTCCAAAGTCTA^^ 
GAATCTGCTGAAGGAAAATCTGAAGAAGTAAATGAAACATTAKTTATACCCACTGAGGAA^ 
AATGGAAGAAAGTGGACGAAGTGCAACTCCTGTTAACTGTGAACAGCCTGATATCTTGGTTTCTTC 
TACACCAATAAATGAAGGACAGACTGTGTTAGACAAGGTGGCTGAGCAGTTGTGAACCTGCTGAA 
AGTCAGCCAGAAGCACirrCTGAGAAGGAAGATGriTGCAAGGTAACTCTAACAGI^^ 
GAATGAAAAGCrGGAAAAAAGGGAGGCTCAGTTATTATCTCTTATTAAGGAAAAACACTTCTAGA 
AGAAGCTTTTGATAACCTGAAAGATGAAAATGTTCAGAGTGAAAGAAGAAAGCATTAN 
TCCTTGAAAGATGAimTACTCAAAGAATTGCAGAAGCAGGAAAAGAAAGT^ 
AGAGAGAGATGCTGCTTNAAAANGAAATTNAAAAACATAAAAGAAGAACCT^^ 
AATNGTAATGAAACTGC 

SEQ ID NO: 3561 ACAATGTTGAACAAAAGACCACAGGGGGACCTTTTGTTCAAAGTAGCACCAA 
TCCACACCTGATTGTGTTTCCAACATTAACCITCCTGTTGACTCTATCATTGGCAC 
CTTCTCCTGCTTTAGTGAGGATTCCTACGCTGACTAAGCACACTGTGTTGCTAAACTCTCTACA^ 
GTGTGGCAGCATCAACCCGGGAAATGGCACATTTGAACCAGGATCGCCCTCCCGCGT 

SEQ ID NO: 3562 ACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAA 
AAAACAAAAACAGCAAGTTTACATGATCCCTGTAACAGCCATGGTCTCAAACTCAGATGCTTCC^^ 
CATCTGCCAAGTGTGTTCTGGATACANAGCACATCGTGGCTACTGGGGCCACACTCAGCTNANGCT 
GTGGGTCCACAAGAGCACTCATCTGGCTGGGCTATGGTGGTGGTGGCTCTACTCAAGAAGCAAAG 
CANGTTACCAACACATTCAAACTGTGTTITGAACATCTTTTAATATCAAAGTGAAGA^ 
GGCNACATNNTAATGTTNTCTAAAAGATGNTAGGAAGTANNGACAGCTNTGTA^ 
NAAAAG>nrNrrTGNasfCTTCAATTTTTGG™ 

SEQ ID NO: 3563 ACATGACAAGGTGCGGCTCCCTAGGCCCCTCCCCTCTTCAAGGGGTCTACAT 
GGCAACTGTGAGGAGGGGAGATTCAGTGTGGTGGGGGACTGAGTGTGGCAGGGACTCCCCANCA 
GTGAGGGTCTCTCTCrrCCTCTTGTGCTCTTGCTGGGGCTGGTGGTCCANGGGTCTTACTCCT^ 
GGCCATGTGGGCCATGAGGTCCACCACCCTGTTGCTGTAGCCAAATTCGTTGTCATACCANGAAAT 
GAGCTTGACAAAGTGGTCGTTGAGGGCAATGCCAGCCCCACGTCAAAGGTGGAGGAGTGGGTGTC 
NCTGTGANGTCAGAGGAACCACNTGGTGCmAhrrGTACCCANGATGCCCTTGAAGGGGCCCT^ 
GACCCTAGTTACNACTTCTTGATGCATCATATTTGCNAGTTTNTATA^ 
CACGTT 

SEQ ID NO; 3564 ACll'Il l i ai TmTTTTTTTriTITITCTAG 

CATGGAArrTGGGGTGGCGAGGGTAGCCACACCCTX:CAGGAGGATGAGGTAGGTTCAGTGCTTCC 

AGCTCACACCTTTCCTGGGTCTTCCTTCAGTGTGACAGTTCGGTTAAANACAANC^ 

GCTGTCTGGATCCACGGANAAATATCCAANACGCTCAAACTGGAACTTGTCGAAGGGTm 

GGGCCACAGAGCAGTCCACTAATGCTGCATCCACCACGTGTAGTGATGCCAGGTTCAGGTCACTT 

AAAAATCCCCAGNCACCTCAGTAGGATCrrCAGGGCTCTTGNGCTGGAATAGTCCTCATAGAGCN 

ACCTCACACATNAAGGCTGNGACCCCAGTGAATAAAGCCTTGGCTTTNTCACATOT 

GTACCTCAACTmTAOTAACCCTGGGGCCCTTGAAACATGCTGACTN 

SEQ ID NO: 3565 ACCATAAGGAGACACAAGAAGAAAGGTGACACTAAGGCTACAGTGCACAGA 
AAACAGACCAGGTGTGGCnTCGACTGTGCGGACCTGCCCACTAGCCTATGCTACAGATTrGAAAT 
GTCmCACTTTGACATGACACACGGTTTATATTACACAAAATGAATGAAACGACAATGGCTAAAA 
ATAAATGAGACAG(XTGCACACAAAAAGATGATGACTGCTACTTTCCTCCCATCAGAAAA 
CAAAAAGGGAGTATTTAAAGGAAACTCAAATCAGGAGAACCCGGTAGGCATCAGAGGTTCAGGG 
CACCAAGGCCTTAGGGCGGGAACACTTTTCAACCCAAGCCAGGCTTCAGGGGCAAGCCCACC^ 
AGACCCCAATITCCACAGGGGAGGCAGATCTTCTATACCTACAGTGACAGAAAATACACTGAAGT 
GCAGTATAAAATATAAAAAGGTTTGATTCTGAATAGACCAACTGCTAATTTTCOT 
TTAATTTGGGTTTGAGTAAAAACCAAATTAGTTCACTTGATCTCATm 
TGCAATAeCAAAAACTGGAGCTTATGACTGCTTTGATTITCTCTGTAGCAC^^ 
GTGGGAGAACACT 

SEQ ID NO: 3566 ACAGTCTATAATACTCCAACAGTCTCCCATCTGTATTCAATGGCGCCACCCAA 
TACAGTCCTTTGTTTGGATGCTGGGGANAGTAATCCCTACCCCAAGCACCATATAGATAAGAAAA 
CCCTCTCCAGTTGAGCTGAACCACAGACGGTTTGCTGATGTTCACCACACCACCATGACCACAGCT 
CCCTGGAGTGGGAGGAGGGTGGACNACAGGGGTGTTTTGATCTTTANAGGCCTCACACrC^^ 
GCTTGGTCTTCANAGCCACGATTTCTCGGCGAATGGCANGGACATTGirmG 
GCTTCTCTACCAAGAGAGTCATATTTCTTATCTCCACCTNCAGCTGGTCAACAATT^ 
ACCAAAACTCTCCTTCAGCTGTATGACCAGTTTTTCCATCTCCTTCAOT 
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AAGTCCAGTTCAGTGTAAGAAATGGTATCCTTCTCCATGATGTCAATTCGGACAGNTAGGTTTAAC 

AGTTT^mT^CATACACAC^AATTA^m'GGACATATTCCCTCNCTNTGGAAAGTT^^ 

CTGANAAAGAACATGAANCTGNGAATNCCNAGCGTTCCACTCTGTTCACGGGAAAAGGNGGTGTC 
TGGCA 

SEQ ID NO: 3567 GTACGCGGGGTTGAAAAATGGCGACTGTGGCAOAGTTGAAGGCTGTnTAAA 

GGACACCTTGGAAAAAAAGGGGGTATTAGGGCATTTAAAAGCAAGGATCCGAGCTGAAGTTTTCA 

ATGCCCTAGATGATGACCGTGAACCCCGACCATCArrGTCTCATGAAAACCTTCTAATTAATGAAT 

TAATTCGAGAGTATTTAGAATTCAACAAATATAAGTATACAGCATCTGTCCTCATAGCAGAATCTG 

GTCAACCTGTAhnnrCCGTTGGACAGACAGTrrCTCATCCATGAACrrAAATGCATTTGAAGAATCAA 

AGGATAATACAATACCTCTTTrATATGGGATTTTAGCCCATTTCTTGOiTGGAACTAAGGATG^^ 

TCCAGAATGCATTTCTGAAAGGGCCrrCACTTCViNCCTrCANACa:AAGTCTrGGCAGACAACCrA 

OTANAANAAAGCCNATGGATGACCACCTATGAAAGGAGGAACATAAAANTACCTGCCCNGNCGG 

NCCCCCGNGCAGGTAa^JCGGGCTGTrCAACTCCAGTGCACTTGQGTCCCCTGGTGTTCCACCCCCT 

CCAANCAAGANTCGGATCAAGTGCGTCATGQAAGNNTCACACNGAAAAAACTGTGNTTCCCNOGC 
CTTNACCCCCGANG ' 

SBQ ID NO: 3568 ACCCACCACCATGCCTGGCTAATTTTTTGTATTmAGTAGAGACAAGGOT 
ACCATGTTGTCCAGGCTGGTCTCATCTCCTGACCTCATTATCCGCCCGCCTCAGCCTTCCAAAGTGC 
TGGGTGGGATTATAAGCATGAACCACCACACCCAGCCCCTTTTTAAACTCTTAAACAGACACT 
TTCrATGTATCAAAAGGAAAGTTGATGAAAATATrACAACCAACCTGGTTTTTGTTO 
AAGGAACTAGAAGATCTTCTGCCATCTAGACCGGAAGTTCCTAACATAATTGGAGAAACAGAG^^ 
GGTAGAGCTTCAGGAATTTGATAGCACTCGAGGCTCANGAGGTGGTCAAGAGGCGTGAAGCCTAT 
AATGATAGCTCTGATGAAGAAAGCAGCAGCCATCATGGACCTGGAGTGCAGTGTGCCCATCAGTA 
AACTCTGCAAACAAATTGCACAGGTGGATTI7CTTrCCACAm 

AGCTGGAGTGTCTTATCAATCCANATGAACTGAGGGACATCTGTTGGCTATGTTTAACTm 

TTGGTATAGTATCTACAGAAGTGTATAAmAAACTAACCACAAAGCTTTANATCTTCA 
TGTCCAT 

SEQ ID NO: 3569 ACGCGGGGACAACATGAAGAAAGCTCTCAAGTTGCTGAAGACTGAATTGTAA 
AGAAAAAAAATCTCCAAGCCCrrCTGTCTGTCAGGCCITGAGACrrTG^^ 

GAAGACTGGCTAGTGTGGAAGCATAGTGAACACACTGATTAGGTTATGGTTTAATGTTACAACAA 
CTATTTTTTAAGAAAAACAAGrmAGAAATTTGGTTTCAAGTG^ 

SEQ ID NO: 3570 ACGCGGGGATCCCGGAGTTGGAAAACAATGAAAAGGCCCCCAAGGTAGTTA 
TCCTTAAAAAAGCCACAGCATACATCCTGTCCGTCCAAGCAGAGGAGCAAAAGCTCAm 
GAGGACrrGTTGCGGAAACGACGAGAACAGTTGAAACACAAACTTGAACAGCTACGGAACTCTO 
TGCGTAAGGAAAAGTAAGGAAAACGATTCCTTCTAACAGAAATGTCCTGAGCAATCACCTATGAA 
CTTGTTTCAAATGCATGATCAAATGCAACCTCACAACCTTGGCTGAGTCTTGAGACTGA^ 
AGCCATAATGTAAACTGCCTCAAATTGGACmGGGCATAAAAGAACTT^^ 
TTTTTTTTCnTrAACAGATTTGTATTTAAGAATO 

CTCTGTAAATATTGCCATTAAATGTAAATAACTITAATAAAACGmATAGCAGTTCAA;^^ 
AAAAAAAAAAAAAAAAGT 

SEQ ID NO: 3571 ACAGGCTGAACAGAATTGAGAATGCCTTGAAGACAATAGAAAGTGCCAACC 
AGCAGACAGACAAACTGAAGGAGCTTTATGGACAAGTGTTATACCGTTTGGAACGCTATGATGAA 
TGCTTAGCAGTGTATAGAGATCTCGTCCGAAACTCCCAAGATGATTATGATGAGGAGAGGAAAAC 
AAACCTITCAGCAGTTGTTGCAGCTCAAAGCAATTGGGAAAAAGTGGTTCCAGAGAACCTGGGCC 
TCCAAGAAGGCACACATGAGCTGTGCTACAACACTGCATGTOCACTGATAGGCCAAGGCCAGCTG 
AACCAGGCCATGAAAATCCTACAAAAAGCTGAAGATCTTTGCCGCCGTrCATTATCAAGAAGACA 
CTGATGGGACTGAGGAAGACCCACAGGCAGAACTGGCCATCATTCATGGTCAGATGGCTTATATT 
CTGCAGCTTCAGGGGTCGAACAGAGGAGGCriTGCAACTTTACAATCAAATAAT/^^ 
CAACAGATGTGGGATTACTAGCTGTAATTGCAAATAACATCATTACCATTAACAAGGACCAAAAT 
GTCTTTGACTNCAAGAANAAAGTGAAATTAACCAATGCGGAAGGAGTAGAGTTTAAGC^^ 
GAAACAACTACAAGCT 

SEQ ID NO: 3572 ACTTCTCAGAGGTATTTGCAGCTTGATGCAAAGTAGTCTCTAATGAGTAGGCA 
TTCAGGTGGTTCTTCCCAGCAGOTGGAGAAGAAAGGGAGGAGATGAAGAACACTGAGAGGGGAG 
TGGCACCTTCCCAGGCTGCCCAGCTCAGTCTCTTGCCCTGTTCCTGTGACTCAGCTGCCCACTCCCC 
CAACTTTGTTrCCCTCCCTCCCAGTCTCTGAAAGTGTCAGGTGTTTCTCTCCT^ 
AGCAACAGTAAGACAAAATTCAAGGCAGCCTTTTAAAGTTACCGAACAGTTATTA^^^ 
ACAGACCTAAGCAGAATGAGAGTTTATACATTGTTTTTAGTTGCCTGTATTTAT^^^ 
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ATTACCTTAAAGTTGAGATCTTTCrCTTCTTTTCCTAAATTTT^ 

ACATCTGGAAAACTCCAAGTATAAGAGACCCTGGACTGATGATGGCCCANCCAAGTATATGGAGG 
GACAAGAGTTCTCTCTGTCATrAATGANGACATCGGNTTTCACAATTGAACCTCATTGCACT^ 
ACAGCATCTCACCTAACTCCTGNATCTCCTGATCTGNTTTAAAAATAGTTAGTTAGGCT^ 
CACCA 

SEQ ID NO; 3573 acaaactatgcatatcataactaccagcttcatgaggtcaagactaaatcaa 
aaattctcaaggacttttctaaaaatttttttagcataaact 

aagtgaacctgtctttccactgggatgactgtctacatgctcaggaatcctgaattcatgactatg 

ttcaactcccacgaccarmtaaatcatcaaaatggcagcaaatgtcattacctggaccaaaa^ 

ctgcaggttttgtggcaaatgggaaatgtgttatagaatatoactgggaaagagggam 

tgctgcccagaattaattcagcaaacactgaacattactacgtgccacatataggatcatactgca 

tttcaggtcaccatcaaaggtaaattacaaggctgggcgcagtggctcatgcctgtaatcccagc 

actttgggaggccagggcaggcggatcacctgaggtcaggagttcaagaccagcctgccaacatg 

gcgaaacccccgctctactaaaaatacaaaaattagccagcatgctggcatgcgcctgtagtccc 

agctacitgggaggctaaggcaggaaaatcgcttgaacctgggaagtggaagttgcagttagctg 

aaatggccc 

seq id no: 3574 acgcggggcggccaaggtgccggccgacaccgaggtggtttgtgctccccct 
actgcctatatcgacttcgcccggcagaagctagatcccaagattgctgtggctgcgcagaactg 
ctacaaagtgactaatggggcttttactggggagatcagccctggcatgatcaaagact^ 
ccacgtgggtggtcctggggcacrcagagagaaggcatgtctttggggagtcagatgagcr^^ 
gggcagaaagtggcccatgctctggcagagggactcggagtaatcgcctgcattggggagaagct 
agatgaaagggaagctggcatcactgagaaggttgttttcgagcagacaaaggtcatcgcanat^ 
acgtgaaggactggagcaaggtcgtcctgcctatgancctgtgtgggccattggt 

seq id no: 3575 actgaacactgtaggaaattgtaacaaaatgttaagtatttatgagtctaaa 
cacagaaaatacacagagtaaacatgcagtataaaagataaaaaaggatgcacctgcataggac 
actccatgaacatagotgcaagactagaagttgctctgggtgagtcagttgagtggtgagtgaa 
tgcgaaggcccaggatgttactgt 

seq ed no: 3576 acttgccatgaaaatgccctggggaccctctgacgacactgttgggatggcr 
gaggcacctgttcccttttggtcccaccggtggcctaaagtgtggtctcagagtgcttgtcttct^ 
tgctcactgggtggggcttgtgagcatctgttctctcttcctaggatcgcagaagaactgcagcrr 
cctggggagcagtttcacctccacaggcatcgcttcatactcctcactgtcaatgcta^ 

CCCTGCTCCCTCCGGGATAAGCAAAGTGCACTGGCTGGCTTGGAGACACTCCGTGCCCTCCACGTG 

CAGCTTGGGGTTTCTCACCTTTCGACrrCCTATAGTTATAAAGTCTCCT^ 

TCAATGCANATATTCAGAAAATCnTCTTTGCTTGTCGGGTCAAGCTGATTATTCCGTGTTGTC 

ACAG'ITCAATGGTGGACAGCTGCACATCmCCAGACCTNCGGGCTCACCTCTTGGGAAAGGGC^^ 

CCTGTGGTTGTGCCCAGTAGGACGCAAGCCTTNGTAANATTCTCCTGCNCAAGAANAT^ 

CCANTCCTTGCTGAAAGACAAGTCTGAATGCNCCACTTTTTCAATTCTCTC^ 

TCAA 

SEQ ID NO: 3577 acgcgggacacttcctggtgggatccgagtgaggcgacggggtaggggttgg 

CGCTCAGGCGGCGACCATGGCGTATCACGGCCTCACTGTGCCTCTCATTGTGATGAGCGTGTTCTG 

GGGCirCGTCGGCrrCTTGGTGCCITGGTTCATCCCTAAGGGTCCTAACCGGGGAGTTATCAT^^ 

CATGTTGGTGACCTGTTCAGTTTGCTGCTATCTCTTTTGGCTGAriGCAArrCTGG^ 

CCTCTCTTTGGACCGCAATTGAAAAATGAAACCATCTGGTATCTGAAGTATCATT^ 

AGAAGACATGCTCTACAGTGCTCAKTCTTTGAGGTCACGAGAAGAGAATGCOT 

ATCACCTCCAAACCAGACCACTTTTCITGACTTGCCTGTTTTGGCCATTAGCTGCOT 

CAGCACATTTGAATGCCTTATTCTACAATGCAGCGTGTTTCCTTTGCCTT^ 

TTACGTGCCrrCATAACCTGAACTGTGCCGACTCACAAAACGATTATGTACCTNGCCGGAACACNC 

TAAGGGCG 

SEQ ID NO: 3578 acgcggggtcccatggctggccagaggaggaacgctttgtgttctcatcgga 
gctgcatgggaagtctgcatacagcaaagtgacctgcatgcctcaccttatggaaaggatggtgg 
gctctggcctcctgtggctggccttggtctcctgcattctgacccaggcatctgcagtgcagcgag 
acccatccactgtggaggacaagtgtgagaaggcctgccgccccgaggaggagtgccitgccctg 
aacagcacctggggctgtttctgcagacaggacctcaatagttctgatgtccacagm 
cagctagactgtgggcccanggagatcaaggtgaaggtggacaaatgtttgcttgggaagcctgg 
gtttgggggaggaggtcattgcctacctgcgagacccaaactgcagcacatctttgcanacanaa 
ggaaaggaactgggtatcttgtgaccacccccgtccagctagtgcctgcaggaacattctggana 
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AAAATCAAACCCATGCCATCTTACAAAAACCCCTCTCTTGGTCAATGATTTCATCATN^ 

ATCCTAACATNAACTTCCAATGTCCTACCACTGGACATNAAATh^CCTCC/^ 

TTATAArmCC 

SEQ ID NO: 3579 ACTGGGTCCAATTCCTGTGATCTQTTITITGATCAGCTGTAACTCCATATGT^^ 
TATTTTTATTCITACTAATAAGAAGTAAATTrTr^ 

TTATTATATTCAAATTCTATATGTAGACAATCTTCAATGCCCAClTCCATCrrA^^ 

CATCAGGATAGGTGGCAAGCTGGTGAACAATAAGATCATACTCTTTTACCAAATCTGTCAG 

TCACTATTGTCACmAAGAAAATACCTCAAGCGGACATTGGCACCGATGTAAGATTCAT^^ 

TTTCAACrTGCATAAATTCAAAATCATAACTTCTGCTCTGAGTCAGTTCTCCA^ 

TTTCACTAGGTTTACAAATTCATGAGTAmCTCTTGTCATTGAAAAGlTCAATTT^^^ 

TCAATTCTAATTCCTTGGTGTTCTACCTCmCCAGGTTGCTTAAAG 

AACGGATTCTCCGCATAGAAAGAGATAGTGTTTTTCTACTTTGCCATCTTCAGTm 

ATTTTCCTGGTTTCCCCATCATTAAGAACAATATCGATCTCACAAATTGGACCAA^ 

SEQ ID NO: 3580 acttatttcaaactgggacaattggaatcactggtcatctaagaggaatatt 

AATATCTACCATATTTAACAATAAAACTAATAGTTTCCTCCATTTAGTGAAA^^ 

TTAAAAAACATAGTAACGTCAATATTTTATAAATTATITCAATITCCATTrGTAGAC 

GAAACTCTGGGCAAACAATCTCCTTGGTGGTCAGGTTTATTTGTTAGTT^ 

TATTCACTAGAGACTTTGGTTAAATTAAATAGTATCACTATGCTCTATTAGATCTGATGT^^ 

TATTTATGAGCCTCAACATmAAAAATTAAAAAGGAAATGATTGCACAATTCTT^ 

TACAAATATTTAAGAGTGTTGATTGGGAGTAAGGGAATGTCAACTGCCAATAAAGTGGAAGATGA 

AAGAATAGGACrn-CACAGAGCATATTTAGTTATGGGTCTCTGCTCCTCCCCCACAGAAAA^ 

CAGACATTCATGACTTCATCCCCCTGCTGCANATAGGCCATATTTCCTGCCCCCTGGCTCTTCACC^ 

CAAAANGGTTATCTTTX:CCCCACTANCAAANAACCCTCAGGACACACAACTCATACTGGCCANCT 

AANC 

SEQ ED NO: 3581 ACGCGGGGAAGATGGCGGCGCACAAGTCAGGTCCGGCACATGTTTCCGCGG 
AGCGGACCCAGCAATGACGGATGATATCACCrCTTCTTCrCTGGTGAGAGTCTGAGGATAG^^ 
TTTTTTCTCACCATGAATGTCACCCCAGAGGTCAAGAGTCGTGGGATGAAGTTTG^^^ 
CTGCTAAAGCATGGATGGACTCAAGGCAAAGGCCTCGGCCGGAAGGAGAATGGTATCACTCAGGC 
TCTCAGGGTGACACTGAAGCAAGACACTCATGGGGTAGGACATGACCCTGCCAAGGAGTTCACAA 
ACCACTGGTGGAATGAGCTCTTCAACAAG 

SEQ ED NO: 3582 ACACATGGAAAAGACATGATCACCAAGTGAAAACAATCTAACCAGAAAGCT 
TTAACGTCTGTCAGTTAAGCTGAAGCTGAAATTCTGGGAGCATGACATGCTGCAGGGCCAAAAGG 
AATGGATAATTAGTATTCCTCTCCTTCTTCCTCACCCTCTCCTTCAACAG/^ 

tCATAATCCTTCTCAAGGGCAGCCATATCTTCACGGGCCTCTGAAAACTCGCCTTCCTCCATCCCCT 
CACCCACGTACTTT^^ITITTTTTTTTT^^ 

GGCITGTTAGGATAGTTAAAAAAGCTGCCTATTGGCTGGAGGGAGAGGCrrAGGCA^ 

TTACmGCAAGGGGCCCTTCAAAAGTCCGCTGGGCTCAAAAGGCTCTTAATCGTGGTTGANAGTG 

AGCCTTTCNAANANATACTCGCCCANCCCACCTCGGGCCGCCAACCTGGGGAGGTTGGCAGGTGG 

CACCCATCTTTTGATAAGCTTACTTCCTOATCTANGAAGTGAGTCTCCAGGAAGT^^ 

GGGTCCNNGCNGGCAAAACCCAGGCNTNAAAATCCAAAAGGGCCTGGTTNAAC^^ 

GCCATG 

SEQ ID NO: 3583 acagaaagtaaaaatgctgttacaatctcagtgtaactggtagccacagacg 
gttgactcaccaatcacagctatccacgctacagcaagaatcttacacaaaaacctaacaacgct 
tacaattttgttggggtgaagtccacaagcttggtggtagattaccaagagggactacatggaag 
gaaggcggaatgtcacaggaactattctttggcrctttgggtgggtgggtgtcctggggtttgta 

CACAGCTACCirrTTTTTTTTAAACAGCT^ 

TCTGTTTCTGCAGGTCATCTGGGGTCTTAATCATCTTCTCCTTCATCATCTGTTTCTCTC^ 

TTNACCTTTCCCACCTTCTTCCTCCTCTTCATCCTCATCCTCATCTTCATCTTCTTC^ 

ATTCTTCrrCCTCCTCACTGACTTATCGTCGTCCTCATCCCCTTCTACATCrrCATCTTCATC 

CTTCATCAAACTCCTCTTTTCACCATCCTCATCGNCCTCGTCTTCCTNATCITCTC^ 

TNCTTTTTAATCNNAACCATCCACCTNGGCTTTTGATCANGNG^ 

SEQ ID NO: 3584 ACCGACCATAGAGCAAGAATCAAGAITCTGCTAACTCCTGCACAGCCCCGTC 
CTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCATTArrTCAGAGGGGAAACCTAGCAA^ 
AGTGATAAGGGCCCTAGTACACTGGCTTTmAGGCTTAGAGACAGAAACTTTAGCATTGGC 
TAGTGGCirCTAGCTCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTTCTTCCCTCCTCCC 
CTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATCrCCAGCCTATGAAACAGCTGGGTCTTT 
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GGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGOTCTAGCCCCOT 
CAGCTTCTACACCCTTCTGCCCTCTCTCCATTGCCTGCACCCCACCCCAGCCACTCAACTCCTGOT 
GTTmCCTTTGGCCATGGGAAGGTTTACCAAGTAGAATCCTTGCTAN 
rrCCTTTAATAAACCATTTGGT 

SEQ ID NO: 3585 ACGCGGGAACGTAACAATCAACTTGCCAATACTGCCTTCTCTTCCGATAGCTA 
CGTTCTCCGTCCTATTTTAAGAACTCAGTTCTTCATAAGACTTGTGTGGT^ 
GTCTGGrrGATCCTTGTGTTGrrATTTTTTTAAATGTGTATCGTCTGTTCAGC 
GCArrCTTAAAAAAATCTTAACCCTATCAAAAATTGTGTTTGTTTAAAGGAGGA^^ 
GCCAGCTTrrACTAGGAAGAGTGTAAATGCTGACGTATTTAGGTAGCTCTAAATACTGAG 
TATTCTAACCACAAAATAGATAGCCTTTCTTTTGTCTTCACTTTC^^ 
ATACCGNTTCTTCATCTATAACACAATTATAATGATATAGGAAGCCACTCAAATAATO 
GTTGCGTTGCGCTTAAAAAAAAAAAAGAAGTCTCTCTGTGGCACGGAATGAGGTGTGGCTCG^^ 
CTAGAATCTCCAGTGAAAACCAATGAAAGANGGTGAAACCCCGTGTCTATCAAAAAAAAAAAAA 
AAAAAAAAAKNCCTT 

SEQ ED NO: 3586 GTACANCCTGGCAAAAGTTAAAAGGGGTGTGGCAGCTCCCATCAGGTCTGGA 
GGTGGTCTATAAGCACAGTTGACAGTTGTGCATTGGGATGGGTGGAGAAAGACGACAAGAGAGC 
AGAGAATCTGCTGATGTGGCTGCGCTTACirrTAGTGACITrATGTACTTTTT^^ 
TTTTTTCGANACTACCTCCCCGGNTCGGGAGTGGGTAATTTGCGm^ 
GGTANCCAGTTTTOAGGCTNCNTCTNCNGAATCAAACCCTGTT^ 
GTNGGCACGNAGATTNCOTnrNAAAAGTTTATAGGGCAAACCT^ 

SEQ ID NO: 3587 ACTGCCAAGGACAAGTTGATTrCTGGCCAGGCAAAGTTAACTCAGTTTm 
ACTATAAATTTGTGTCTTATATGCTTTAGGTTTATGTATCTATAAACCATTCACC^^ 
AATTirrAAGAGATCAAGGTGTAAATTATGATGATTTATTATITTGGTCT^ 
AGTATGTTAAGCATTGTTTAAAAATACTAGTAAGTCATAATTATGCAGAATTTTCACA^ 
TGCACAGAGAAAGCATATCATTTCAGrrACTGATACATCITAACACrACTTTC^^ 
ATTTAACATACACAAGTTATAGTAGCAGTATGGACTTCTCCTCCCATTGGCAATTAAATGCT^ 
TTTCTTCTGAAAAGATGATGTGGACCAACAGGTATCAGACTTGCCAACAAG^^^ 
CCAGCATACATCTGAGCACTGAAGGAAGAAGAAAGTTTAAATTGTTTAAAGGACTATA^ 
CACAAAATTTATTAAGAAAAAAAGAATGGATCTAGTATAACTAATTCTGAGTAAACCAAAATG^^ 
AATAATTAATTNGTGCTATTTAATCCCCATTTTITGGCAGGNGTAATTG^ 
T 

SEQ ID NO: 3588 gtactttttaaatcatgttccccctaaacatggctgttaacccactgcatgca 
gaaacttggatgtcactgcctgacattcaotccagagaggacctatcccaaatgtggaattgact 
gcctatgccaagtccctggaaaaggagcttcagtattgtggggctcataaaacatgaatcaagca 
atccagcctcatgggaagtcctggcacagtttrrgtaaagcccttgcacagctggagaaa^^ 
cattataagctatgagttgaaatgttctgtcaaatgtgtctcacatctacacg 
tttatggggccctgtccaggtagaaaagaaatggtatgtagagcttanatgtccctattgtgacag 

AGCCATGGTGTGTTTGTAATAATAAAACCNAAGAANCCTT 

SEQ ID NO: 3589 ACGGGGCAGGATTATGTTTGTTGACCCATCTCTGACAGTTAGAGCCGATATCA 
CTGGAAGATATTCAAACCGTCTCTATGCTTACGAACCTGCAGATACAGCTCTGTTGCTTGACAAC^ 
TGAAGAAAGCTCTCAAGTTGCTGAAGACTGAATTGTAAAGAAAAAAATCTCCAAGCCCTTCTGTC 
TGTCAGGCCTTGAGAOTGAAACCAGAAGAAGTGTGAGAAGACrGGCTAGTATGGAAGCATAGTG 
AACACACTGATTAGGTTATGGTTTAATGTTACAACAACTATTITITAAGAAA^ 
ATTTGGTTTCAAGTGT 

SEQ ID NO: 3590 acgcgggggccggagaccgtcgtcgctcttgggaatccttggccgcccagac 

AGAAGGGAAGTAGGCGCCGGAGAGCCCGTTCTGCATTTTGATTCATCTCGGGCCCTATTAGAATCC 
CAAGAAAATCAAATGGCATCCGGGGATTTCTGCTCACCTGGAGAAGGGATGGAAATACT^ 

agtgtgcagcaaacaacitcctccttgtaacctgagtaaagaggacctgttacagaacccatactt 

cagcaagcttctcctgaatctctcacagcatgtggatgagagtggcttaagcctcaccctagc 

ggagcaggctcaggcatggaaggaagttcgactgcataagacaacatggttgaggtctganattt 

tacacagagtcattcaagagttgcttgtggactactatgtgaagatacaagacacaaatgtaact 

tctgaggacaaaaaggattttgtgtggatganggctcggctacagcaagaagtanaggagcagct 

CAAAAAGAAATGTTTACTCTGCTCTGCTACTATGATCCCAATTCANATGCTGACAGTGA^ 

GAAGGCAGCAAAGGTGTGGAAACTCCANAGGTCCTGGTGGGTGAACANCANCATGCCAGGATCC 

AAGACCANCNN 
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SEQ ID NO: 359 1 ACTTTITITITTTTm 

AAGGAANATTATCTGGGCAAGTCCAGTGAAGGCAGACAAACCACAAGACCTAGTGCCAGGTTTAT 

TCCCTCACATGGGTGGTTCACATACACAGCACAGAGGCACGGGCACCATGGGAGAGGGCAGCACT 

CCTGCCTTCTGAGGGGATCrrGGCCTCACGGTGTAANAAGGGAGAGGATGGTTTCTCT^ 

CACTAGGGCCTAGGGAACCCANGAGCAAATCCCACCACGCCTTCCATCTCTCAGCCAAGGAGAAG 

CCACCTTGGTGACGTTTAGTTCCAACCATTATAGTAAGTGGAGAAGGGATTGGCCTGGTCCCA^^ 

ATTACAGGGTGAAGATATAAACAGTAAAGGAAGATACAGTTTGGATGAAGGCCACAGGAAGGAG 

canatgacaccatcanaagcatatgcagggaaagggcagttactgggcttctgggctgcttaatc 
cctggcttggcangaagggtanggaanatggatggggctcattgtttggcatttgato 
cx:aattncngcttgagggaagcaccacccacaaggaanccatccccatcaggctggct^ 
ctccttgcagg 

SEQ ID NO: 3592 acctcacactgcaacacatgtagatagcgcttttgaaaaatacatatat^^ 
actatagtatgagaggtcctggggctcaaactttaaacatttcctaacrcagcttac^ 
aaatgtccaggtctctgctgaattctcttgttgaattcatagctctccaacctcct 

CCTATGCCCCTAACCCTGCGGCCTCTITGTTITATTIT^ 

GCAAGGAAATTTTCTTTTGAGGACrTATAGGTTGATTGCCC^ 

TCCTCTCATTTCTTGCCATTGGATTGTTTCCTAAACGTTGGATAGGAGAATO 

TTTTTATGAAGCTGTGCCCTTGATCAAATCTGTGCTGTGGGTCTGAGTCACr^ 

TCAGTTTCAACACTCTGATCAGCATGCAGGCGAAGGGGCATGTTTAATTNCTTCACTAACTC 

ATTCTATAGCTTTANACTTCATATTTCTTATCCACCTTTGTCTAGGCTCACGATCm 

AGCirmTCACCGTGGrTGGGCTANAACAAATATTGGAAANNTTACCAACAC^ 

GT 

SEQ ID NO: 3593 ACGCGGGGGGGGCCTTrTTTCCTCTCITCAGCGTGGG^^ 
CGCTCTCTTTCTGCTGCTCCCCAGCTCTCGGATACAGCCGACACCATGGGm 
AGCCCTGCCGGCCTCCAGGTGCTCAACGATTACCTGGCGGACAAGAGCTACATCGAGGGGTATGT 
GCCATCACAAGCAGATGTGGCAGTATTTGAAGCCGTGTCCAGCCCACCGCCTGCCGACTTGTGTCA 
TGCCCTACGTTGGTATAATCACATCAAGTCTTACGAAAAGGAAAAGGCCAGCCTGCCAGGAGTGA 
AGAAAGCTTTGGGCAAATATGGTCCTGCCGATGTGGAAGACACTACAGGAAGTGGAGCTACAGAT 
AGTAAAGATGATGATGACATTGACCTCTTTGGATCTGATGATGANGAGGAAAGTGAAANAACA^ 
GAGGCTAANGGAAGAACGTCITGCACAATATGAATCAAAGAAAGCCAAAANAACCTTGACT^ 
GCCAAGTCTTNATCTTACTANAAGTGAAACCTTNGGATGAATGA 

SEQ ID NO: 3594 acaaagataatcaatgctgattcggaggacccaaaatacattatcaacgtaa 
agcagtttgccaagtttgtggtggaccttagtgatcaggtggcacctactgacattgaagaaggg 
atgagagtgggcgtggatagaaataaatatcaaattcacattccattgcctcctaagattqaccc 
aacagttaccatgatgcaggtggaagagaaacctgatgtcacatacagtgatgttggtggctgta 
aggaacagattgagaaactgcgagaagtagttgaaaccccattacttcatccagagaggtttgtg 
aaccttggcattgagcctcccaagggccgtgctgctctttggtccacccggt 

seq id no: 3595 acacggcca.tgtaatatcagtatatcccaagttaatgaaagtgttcatttaca 

TAGGTAATGGAGACCTTTGCATTTTGATCCATAGAACATAGGAGGATGTTOT 

GCTCTATATGTTTACATATTATTTCTGTAGATTGTTTTCAGGAGAAAGrrTTG^ 

GTGAGCACTTTGGCTTATGTATAAGTTAGAAATAATTGTTAGTTTTTAAT^ 

AATrrCTTAGACGTATGCAAGCAAGTGAAAACAATTAGGGCCAGTGGTATTAACTACTT^ 

ATTTTATTTTTGTITGTAAGAAGTCATCTACTTAAGGCCCAG 

TTAAGGAATACCCAGAGATTGCTGCTGTTCTATTTATTTTACAGAAAGGGTAGCTAGAT^^ 

TCITCAGTGGACCTTGAGCTAATAGATCnTITACCACTAAAAGAGCAT^^ 

AGAATAATAATTACATACTTGGCATAATAAATGCCTAAAAGACATTTTAm 

TCTTGCTATAATGGGGATATTGGAAATTATGCATTTGTATAAATGGNATTCTTAAANCAATCTATG 

AA 

SEQ ID NO: 3 596 ACAGCTACTATCCAGCCCAGTCAACAAGCCCAG ATTGTCACTCGGTCAGTGT 
TGCAGGCAGCAGCAGCTGCTGCTGCTGCTQCTTCTATGCAACTGCCTCCACCCCGACTACAGCCCC 
CTCCATTACAACAGATGCCACAGCCCCCGACTCAGCAGCAAGTTACCATTCTGCAGCAGCCTCCTC 
CACTCCAGGCCATGCAACANCCTCCACCTNAGAAAGTTCGAATCAATTTACAGCAACAGCCTCCT 
. CCTCTGCANATCAAGAGTGTGCCTCTACCCACITTNAAAATGCAGACTACOT^ 
GTGGAAAGTAATCCTGAGCGGCCTATGAACAACAGCCCTGAGGCCCATACAGTGGAGGCACCTTC 
TCCTGAAACTATCTGTNANATGATCACAAGATOTANTTCCTGAAGTTGAATCTCOT 
ATGTTNAATTGGTGAGTGGGTCNCCTGTGGCACTCTCANCNCAGCCTTANTNTGTGAAGT 
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SEQ ID NO: 3597 ACGCGGGGAGCAGCAGGAGGAGGCAGAGCACAGCATCGTCGGGACCAGACT 
CGTCTCAGGCCAGTTGCAGCCITCTCAGCCAAACGCCGACCAAGGAAAACTCACTACCATGAG^ 
TTGCAGTGATTTGCTTTTGTCTCCTAGGCATCACCTGTGCCATACCAGTTAAACA^^ 
AAGTTCTGAGGAAAAGCAGCTTTACAACAAATACCCAGATGCTGTGGCCACATGGCTAAACCCTG 
ACCCATCTCAGAAGCAGAATCTCCTAGCCCCACAGACCCTTCCAAGTAAGTCCAACGAAAGCCAT 
GACCACATGGATGATATGGGATGATGAAGATGATGATGACCATGTGGACAGCCAGGACTCCATTG 
NCTCGAACGACTCTGATGATGGTAGATGACACTGATGATTCTCACCAATCTGATGAATOTACCATT 
CTTGATGAATCTGATGAACTGGTNCITGATTNTCCCACGGGACCTTGCAACAACCGAAGT^^ 
TCCAGTTGTCCCCACANTATNACANATATGNATGGNCCGANGTGATANTGTGGATTTA 

SEQ ID NO: 3598 gtaccgaccatagagcaagaatcaagattctgctaactcctgcacagccccg 
tcctotccritctgctagcctggctaaatctgctcattatrtcagaggggaa^ 
agagtgataagggccctactacactggcttttttaggcttagagacagaaactttagca 
agtagtggcttctagctctaaatgtctgccccgccatccctttccacagtatcct^ 
ccctgtctctggctgtctcgagcagtctagaagagtgcatctccagcctatgaaacagctgggtct 
ttggccataagaagtaaagatttgaagacagaaggaagaaactcaggagtaagcttctagaccc 
ttcagcttttacacccttctgccctctctccattgcctgcccccaccccanccactcaa 

TGTTmCCTTTGGCCATAGGAAGGTTACCAGTANAATCCTTGCTAGGTTGATGTGGGCCATACAT 
TTCCnTAATAAACCATTNTGTAC 

SEQ ED NO: 3599 ACCGTTTTTTCAGGCACAAGGAAGGTTTCACCCCGTTGCCGAAAGACT^ 
GTCCCGTTGGCCAGTGCAACGAAGAACGGAGGAAGCGACAAGATTTCAGTTTGTAAGTAGTGAGA 
GCAATGATCTTGGCTTTGCATGGTGCTCAGGAAGCCrrCCCCGCTCCCAAACCAGAAAGG^^ 
ATCCAGATTITCCTCCACTGTCTGGATGGCCGCTCTGTGGATGCTTCCATCTGTATCCATTGCCOT 
ACCTGGTGAGATTCCGTCrCTAGAGCCTTTGAGGTCACTCCTGACGCTGACATGGCTGTGAAGAGC 
TGGGTGCCAGGCATTACTGCCTCCAAGGTTGCTTTGCGAGGAATAGGTGGGGCATCAGGAAGAAG 
CCAGTTGCAGGCAAGGCCTCTGCTGATGCTGCTTTTTrCTCCTGT 

SEQ ID NO: 3600 ACGCGGGCGGGAAGGGCCTGTCCCAGTCGGCTTTACCCTATCGACGCAGCGT 
CCCCACTTGGTTGAAGTTGACATCTGACGACGTGAAGGAGCAGATTTACAAACTGGCCAAGAAGG 
GCCITACrrCCTTCACAGATCGGTGTAATCCTGAGAGATTCACATGGTGTTGCACAAGTACCATTGG 
TGGTGGTATCTTTCAAGCAATCAAAGGTTTTCGCAATTCTCCAGTGGGAGTAAACCACAGACT^ 
AGGGAGTTTGACAGCTATTAAAACCAGGGCTCCACAGTTAGGAGGTAGCTTTGCAGTTTGGGGAG 
GGCTGTTTTCCATGATTGGCrGTAGTATGGTTCAAGTCAGAGGAAAGGAAGATCCCTGGAACTCCA 
TCACAAGTGGTGCCTTAACGGGAGCCATACTGGCAGCAAGAAATGGACCANTGGCCATGGTTGGG 
TCAACCGCAATGGGTGGCATTCTCCTAGCTTTAATTGAAGGAGCTGGTATCTTGTTGACAANAm 
GCCTCTGCACAGTTTCCCAATGGTCCTCANTTTGCAAAAAACCCCTCCA>nSfTGCC^ 
TACCrrCCTCACCTTTTGGANACTrCGACAATATTAAGTAGGACTTCTTTTCTTA 
NAAAACCAAT 

SEQ ID NO: 3601 ACTGAACTCCCATCACAACATCATCTTCCTCTAATAACTGTAACACAACACCT 
TCAATAAACTTTGCATTGGGCTCTGCCATAGCTGCTTTCOGGAGACTCATGATGAATCrrCCGTGA 
TGGAAAGCTCTTCCACTCTGCACTTGATTGTTTTCTGACAGAGGGTAAGGAATC^^ 
TTGCTrrCCTGATCATGAATCATGTAACCATTTACAACCTGGGCATCAAGACCTTCCACTGTATCTC 
CAAGACCAAGGTCTTTGAGAACATGATAACCACCCGGCTGCAGGAATTCTCCAACTATTCTGTCAG 
GCTCrmAAGTCTCTCTCAATGACTGTCACCTTTCrrCCATCTCTGGAAAGCACAGCT 
AGAGCCAAGCACGCCAGCTCCCACGATGATAACTTCTGGGTCATTCTGAGAAGATGTTGATGT 

SEQ ID NO: 3602 ACTGTGTCTCAGCTCCAGCAGTCTCAACTGGGAAGACCCAGGACTCCTGCTCT 
TTTCTCTAATCCCTGGGAGACGAGGTCCAGCTAAGGTAGAGTAAGCAGTCAGTGACCAGGCAGGC 
TGGTTTGGGAGGTCACTGCCTGGAGGACGGGATCTTGTATTCTTCGGAAGATGGCTGGGAAATTCT 
TCCCTCCATTACGTAGAACriTCTTCCCCTCCTCAGTTGAGGTGCCrAOATGTCCCACAA^ 
CTTCACTCAGGTCCTCCAGAGGCACACGCTCAAACAGTGGGTGCTCTTCGAAATGAGTGCACATCC 
AGTCGTGTANCTCCAGCACATCGGTTATGGTATACACCAGCCCCCCAACTCTTAGCACGTAGGCAT 
ATTCTGCTAGCAGGGTGGGACTGATGATTCGCCACTTGTGCITrGTCCCTTGAA^ 
GAAGANGAANAACATTTTTGT(>IAGCTGGCCCITGTAAAAAAAATT^ 

TGCTACGGANACANGCGATGTTNrmGAANCCACCTGCATGGACCTGGCGCNTNANGNCCCA^ 
CCGGTTTTGTACCTTGGGCCN 

SEQ ID NO: 3603 ACGCGGGACTCnGAAGAGGAGGCCGCCTCCTCGGGGCTCCAGGCTGGCTTG 
CCCGCGCTCTTTCTTCCCTCGTGACAGTGGTGTGTGGTGTCGTCTGTGAATGCTAACTCCATCACCC 
TTTCCAGCACACTGCCAATAAACAGCTATTCAAGGTTGGAGGAGGGGGGGCGGGGAGAGAACTAC 
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AGCCAGTATCCCCAATGAACATAGATACAAAATTTCTCAACAAAATACCAGCAAACTAAATGCAA 

CAGCACATCAAAAAGACAATATACCGTGATCAAGTGGGATTTATAGCAGGGACGCAAGGATGCTT 

CGACACACACAAATCCGTAAACATGACACATCACACCAACAAAATGAAGGACAAAAACCATATG 

ATTATCTCAATAGGCACAGAAAAAGCATTTGATAAAATTCAACATCGCTTTATGATAAAACA^^ 

CAACAAACTAGACATAGAAGGGAACATAACTCAAAATAACAAAGACCATATACAGCCAACCCAC 

AGCTNAACATCATACTGAATGGAGTAAANTTGAACGCCTTTGCTCCAAGAACTG^^ 

ANGGATrrCTCACTTTCACCAATTCCTTATTCAACATAGT 

SEQ ID NO: 3 604 ACTTTTTTTTTTITITITTTT^ 

ATTGCATGCTACATAATACACTGTAACCTTCTANCAGGGGTAGATGGCCATAACTGAGTTATT^ 

TTCATCAATCAGGAAAATGCTTCCTTAATCAATGTTCTCCAAACCCTrrGACACATAGT^ 

AACTCCAGAGATGTGCCTATCTCTTTCCATCANACTCCAGTGATCCCAAAATGAGCAACCC^^ 

TCCTTGCCCCCGTTAGTGAACCTGNGGATGTACNCGGGGACITTTTGAAAGCCAGGAGG^ 

ATTGCACCGGCAGCTGCCNGGGCGTATGTGTNGGTGCTAAAGGCAGCTGCAAGGTCTCGNTTGGG 

GGCCGCTNCGGACCAATTTTNGNAAAGGTACCTGCCCCNGGCGGCCGCNC^ 

SEQ ID NO: 3605 ACGCGGGGGGGCTTGCACGTCTGCAGAGCAGGGAGCACAAACCTGCTCTCTC 
AGCAGCCACTTGGCAAGGGCTGGTTGTGGATCCCAGCCCTCACCCTCTCCTGGCCTTTCCTCTGCT 
CTCCTCTGCTCAAGTCCACTTCTAACCTGGTCTTCGGAGCTGGGTTGGCCCOT^ 
AAGCAGCCTTAGCACACGGGCCTCTCCTCCCTCACTACTGGGTGCTGCCCTGCGTGGCT 
TGGCCCAGGATTTCACAGTCGAAAAGGAAGCCACCACTGATGCCTCCCACTGTGACAGGCCCTGT 

caccaccaatatcttatttcaacctcacagttgacctgagaaatcgagattatcactccact^^ 

agacaaggaaactgaggctcanggaagccaagtgacaagtccaaggtcacgaagactttct^ 

agcccgaaacacx:acctctgctcctcttctcctgtcctgcccaagcatcctaggggctgaaatcct 

ggaaaccgtgggctggtgtganaaggttgcatgctcaaancaaaaaaagggctctc^^ 

cgtgattccagggccanaaccatgccanncccaaaaacccccaacctaacctgggggcaggtcca 

nantccaagc 

seq id no: 3606 acttttttttttttttttttm 

tacgttggaaaaggaaatatcttcccataacaactagacagaagcattctcagaaactagm 
gatgtgtgtcctcaactaacacagttgaacatttctttanacanaacagtt^ 

GTGGAATCTGCAAGTGGrrATTTGGCTAGATTTGAGGATTTCGTTGGAAACGGGA^ 

AAGCAGTCAGCAGCATTCTCANAAAGTTCTTTGTGATGATTGCATTCAAGTCACAGAATT^ 

TCCCTTTCACAGAGCAGGTTTGAAACACTCTTTTTGTAGTGTGT^^ 

TACGGCCTAAGGTGAAAAAGGAAATATCTTCCCATAAAAACTATACAGAACATTCTCAGAi^ 

ACTCGTGATGTGTGTCCTCACTAAAGGAGTANAACCTTTNTTTTCATA/^ 

(nTTTTTGGGQAATCTGCAAGTGGATATTTGGCTAGNTTTTGAAGG 

TTCATACCAAATTGCAAACTGNANCGNTIWrGAAAAACATCTm 

ACAA 

SEQ ID NO: 3607 ACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGC 
TGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCA 
ACTGGGmCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTCC 
GGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCATTACACCACTCAG 
TATATGTGAGGGAGGGATGTGCCTCTGGCCACX3TGGTTACCTTGCAGTGCACAGCCTGTGGTCATA 
GAANGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAGTGGCCCTGGCAG 
GCCANCCITCANTTCCAGGGCTACCACNTGTGCCGGGGGCTCTGTCATCACGCCCCTTGTGGATCG 
CACTGCTGCACACTGTTTTATGAACTTTGTACCTTCC 

SEQ ID NO: 3608 ACCCAAGGAAACAGCGGTATTATATTTACTGTCACAGCTAGACCCTATCAGG 
TGAACAGGGCTCTCCACCTGCGGCtOTTGCTTCGCAAAATATCOT 

GCTCTAGAGTGCTTTTAGGCAGAGTGACGGTGATGTCATGGGCACCACGCCAATCrrATTCCTGGA 
CAATGACTTAAGCCCCATCGTGAGAATGGTTTTTTCCTCTTCAACA^ 

TAAGATGATGCTACATGTTCTAAAAGAGTGCCACAGAAACCCACCCAGGATCACAAGAATGACAA 

CAAAAGGAAAGTGGCTATTTCTTGTGTAAGGGATGACGACTCCCCCTCCCATGGGTCCCAm 

CCCCAGTTGCAGGTGTTTTAAGGGACAGAAAGGCTGTGGAAAGGACCACTGGCTGGGAGAGTTAT 

TGCACAACCCGGAAGAATGAGAAAAACGTGGGCGGCAAGGGGTAGGGGAGAGGGTTGCTGCGAC 

TAAAACAAGGTAACACTGAATTAGGAGCTCGAACAGTTCTGCAACAGTCANACATGTATGTGm 

ATGTGTTAACACTTGGAGACTGGGGAAGTAAAAGTAAACCCrrTATCATATAAACrrrGGGCT^ 

ATGGGAAGCNAAA 
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SEQ ID NO: 3609 ACAAGACTCTTGACAGTTGTGCTrCTCTAGGAGGTTGGGTTTTm 

GAATTATCTGTGAACCATACGTGATTAATAAAGATTTCCTTTAAGGCAGAGGCTGGTCGAGATGCT 

GCTGTTATCTTCTGCCTCAGACAGACAGTATAAGTGGTCTTGTTTCTAAGATTCCTACCACC^ 

CTTTGGGCCAAGTATCCACATCCCCTTGCGTATGGGAGGTGOGTGAAGAGTGTTGGATGCAAAGT 

GGTTATTATGGGAAGTAGCTCGATGGTAAAAGGACAAACACCTATCTATCTTAGAGCTTAAGCCT 

GTATGTGCTTATTCCCAAGGGAGATAGAGGTGTTTAATCACAAGGACAGCATGAGTTAGAG GACA 

CTGGCATCAACAGCTGCCACAGCCGTGCACACCAGGGCCAGAGCAGCCCACTGACATCTGTCTTT 

GGTCTTGANATCAAATGCATCCCATTCITCATACATTAGAAGGTCGACCTCCrTGAA^ 

ANGTTTAGCAAGCCTCTAAAAGGACTACTGANAAACAGAATCANAAACTCTANAACTCTAN^ 

GGCCCTTCANCAGGGCTGCANAGCCTCCCTGGATCCCAGGCCTGGGGAAAGCCTGCTGGTCTTGT 

CCCCCAGGT 

SEQ ID NO: 3610 acgcgggtgcttttgtcacctttgaaagcccaocagacgctaaggatgcagc 
cagagacatgaatggaaagtcattagatggaaaagccatcaaggtggaacaagccaccaaacca 
tcatttgaaagtggtagacgtggaccgcctccacctccaagaagtagaggccctccaagaggtct 
tagaggtggaagaggaggaagtggaggaaccaggggacctccctcacggggaggacacatggat 
gacggtggatattccatgaattttaacatgagttcttccaggggaccactcccagtaaaaagagg 
accaccaccaagaagtgggggtcctcctcctaaganatctgcaccttcaggaccagttcgcagta 
ncagtggaatgggaggaagagctcctgtatcacgtggaagagatagttatggaggtccacctcga 
agggaaccgctgcctctcgtagaaatgtttatttgtccccaagaagatgatgggtattctacti^ 

GACAGCTATTCAAGCANAGATTACCCAAGTTCTNGTGATCTAGANATTTTGCCCCCCCACGAGAT^ 
TTCTTACCGGATNTGGCATTCCAGTTCACGTGATGACTATCCTTCANAAGATTTA 

SEQ ID NO: 361 1 ACAGAAATTTCACAAGAAGTCAAACACAGTGATGCCATTTGCTATGTm 
TTGCTAGTAGCTTATTAAACATAACATGCAAATAATCAAAGAGAAACATACATGACTTAGAGTGA 
AAAATAATTCTAGAAAAGTTTCACTAGGTAAGTATGCAAATTCTTATTCT 
GTGCATGAAACrrCATGTNTTTTGCAATATTCTTGGNCTCAATATCTACCACCTAT^ 
ACAAATTTTGAATTGTATCAAATAATTTGGGGCAAGAAAAGNACCTTC^^ 
CTTTGNGGAACATTTAAAATTTNAGGNTTTTTAAACC^ 
GTTTGNCCTCATTTTGACNGTNAAATTAAAAAACACTNATTTNC^ 
CNTTCGCGTTATGTGNGNGCCTGGGATGAAATAGGCCTGCNTTTCirm 

SEQ ID NO: 3612 ACGCGGGCTCGCTCAGCTCACCCACGCTGCTGGCCCTGTGAGGGGGCAGGGA 
AGGGGAGGCAGCCGGCACCCACAAGTGCCACTGCCCGAGCTGGTGCATTACANAGAGGAGAAAC 
ACATCrrCCCTAGAGGGTTCCTGTAGACCTAGGGAGGACCTTATCTGTGCGTGAAACACACCAGCT 
GTGGGCCTCAAGGACTTGAAAGCTTCCATGTGGTGGACTCAAAGTCCTTACTTTTTC 
NTACNAAAACGCATGGNAGTGTGTATTGGTTCCAAGTGACACTTAAAAAAOTGGTA 
CCATTNTGNACCAAGCCTGGGNCTGNGGCTCCTTTmTTTNTTC^ 
AATCNATTTGGGTNATNNTTGAATAAACCTGGGGNTGGNTATTC^ 
TTTAANAANAACTATTGTTNNTGGCATAANGGGTTCTGATTAAAA^ 
ANACACTTITITTCTGGTAANANAATAANNANCTTATCATTO 

SEQ ID NO: 3613 ACAGCCTGTCCTCCCACCCAGGAACACCCTCTGCCTAATGACAGGACCTTCCT 
GGGAAGCATCTTGACAGCAGTGGCAGATGAAGAGCCAGAATCAACTCCTGTGCCCTTGCTTGGAA 
GTGACAAGAGTGCTTTCACCCGAGTAGCL\TCAATGGriTCCCTTCAGCX:CGCAGAGACCC^^ 
ATGGAGGAGAGCCTGGCAGAAATGAGTArrATGACTACTGAGCTTCAGAGTCTTTGTTCCCTGCTA 
CAAGAGTCTAAAGAAGANGCCATCAGGACTCTGCAGCGAAAAATTTGTGAGCTGCAAGCTAGGCT 
GCAGGCCCAGGAAGAACAGCATCAGGAAGTCCAGATGCNAAANAAGC^mACNTAGAGAAGC^ 
ACCAGGCCTTGTGCTTGCGCTACATGAATGAANGGAGCrrCAGGAAGTGATACAGCTCTNATG^ 
AAATTCTATAACAGATAGACAIWANTGGCGAGCTCATAAGCCTTATAGAGGNGGTGACCTCOT^ 
CNGTCCTT 

SEQ ID NO: 3614 ACTATTTACAGCTAAACCAGCTATACAGGATCATAAAAAGTGAGAACATTTT 
TGTAGCCCTGAATCAAAGCirrCATAATCTTTTTT^ 

GATACATGTGCAGGTTNGTTACATTGGTATATTTGAGTGATGCTTGAGATTQGGGGATATGAATNA 
TCCTATCACAGCAAATGCTGGCATANTACCTCGGNCGNGACCACGCT 

SEQ ID NO: 3615 ACGCGGGGCAGCAGGACTCGGTCTAGCAAGGCCATCTTGTTGCCGGACCTTT 
CTGAACCAAACAATGAGCCTTTATTTTCTCCAGCGTCAGANGTTCCAAGGAAAGCAA^ 
AAAATAGAGGTTCCTGCACAGCTGAAAGAATTAGTTTCGGATTTATCTTCTCAGTN^ 
ACCTCCTGCnTrAAGGAGCANACTAANAAACACTTTCTATAANAACNANCm 
AAAGATGATGCNCAANCAGTAGAAACTCTGGGAAAGCCANAAGCGAAACGAATCAGGACGTCAA 
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AAACAAAACAAGCAANNCAATAACNCAGGAAAANAAAGTGCTTGGTACCTNCrc^^ 
CNGGTGArrTCCCCTTGGCTTGCCCACCTGACTAATCAANANCANACCNAGAAT^ 
TGNCAGGACAGTCTTNGAANGAACAI^AANNAACTNGNTTCTATTCA 
TGNTGTTTTCTTGGGAGATTTATNG 

seq id no: 3616 cgcggcgaggtacgcggggtctcgcggccgactcgcaagatggcgccgcag 
aaagacaggaagcccaagaggtcaacctggaggtttaatttggaccttactcatccagtaaaa 
tggaatititgattctggaaattrrgagcaatttctacgggagaaggrraaag 
tggaaatctcgggaatgttgttcacattgaacgcttcaagaataaaatcacagt™ 
ancagttctctaanaggtatrrgaaatccttactaagaaatccttaagaaga^ 

seq id no: 3617 actttttntititittt^^ 

naaattatatgttaaaccattgaaaatgaggaaaagatggttaaaaaaacccaggam 

atatgaagctgataaggc^faagtitcttcttrggttrcggaacacccggtgagc 

ctggaaccagcctlsrmttanttaatggcagtttncac^ 

tnaaaaatgaanttgggagaaacnacccttccgaaaggcgcattnctcggggctgnt^^ 

ntgccacanttatttcggtcntgaganctccntgancnactggtccgacrm 

tgnaaacccanggggacccc 

seq id no: 3618 acacaagatggtggccrrggcggttactcttccaaccacttccacaattccag 

AGATTTCTTCATCAAGGGGTTCCATCAACTCGATGGTTCCATTITITCCTTC^^ 

AAACATTTTTCCGGTGGGATGAATCTTTTCCAGCCrCCCTACGAAGCANACAGGCT^ 

TTGAGCTAGCATGCCGGCGTTGATGCGCGACCTGGGCAAGTCCATCATGTCACCATGATTATGGTC 

CAAAGACTCCCCGCGTCAACATCTACAATGTTGGCTCAAGTGCTGCATTANACGTGGA^ 

ATGATTCCTTCCCTOCAAAAAATTTGGCATTTGTGTCTGGAATGCTNAATGTCA^ 

TTGCCCTGCTAGCAATGTNTCCCANANTGNGGTCTNCCCCTCTCAAGCTNGI^^ 

GGNACAATNNTTACCTNGCCAGAAGGATGGCOSrTTOTATGGNN^ 

GAATCTrGCTTACGGNATTTTAANGGTAATTTCCTTTTTN 

SEQ ID NO: 3619 CGCGGCGAGGTACTGGAGATGTATTTGATAACCAAGGTTTTAGGTAAATTTTC 
ACCAGTATTAGTTCTATTTGCAAACrGAAAAATGTrGTAGGCTTAATGTAAAATAACC^ 
GAACATTATATCTCTTAGAAGAAAGGCCATATTTTGCTCCTGCTTCTC^ 

AAGGGGAAATAATGGTAhnSITGTGACCTTTCACTTAATTCCTACTCCCTTAATGTGAGAGAGACAA 

ATTGAGCTGAAGAAGGAAAATTCTGGANTTACACTCCACAACCTTGAACATACTGACGGACATCT 

CTGTITGACAACGATTTCTCCATGCCCCCATGCTCTAATGCCTTGTGGATCACGGACACCh^ 

CACAANCTACAGCATNNGCNANGTTATTTTGCATCAANCACTGCTNGATAATGAC 

NCTCCTGGGTrrGGCATCNTTACACCANTNCCGCTTTGATTTGAAATNTCCACT^ 

SEQ ID NO: 3620 CGCGGCGAGGACGCGGGGAGAGAAGTTTGTCATGCAGGAGGAGTTCTCGCGT 
GATGGGAAGGCTCTGGAGAGGTTCCTGCAGGATTACTTTGATGGCAATCTGAANAGATACCTGAN 
GTCTGAACCTATCCCANAGAGCAATGATGGGCCTGTGAAGGTAGTGGTAGCAGANAATTTTGOT^ 
ANATATTGAATTATGANAATAAANATGTGCTGATTGAATTTTATGCCCCTAGGTGTG^ 
ANAACCTGGANCCCATTATAAAGAACTTGNCGAGAA>OTCAGCNAAAGACCCAA^ 
AGCCAAGATGGATGCCNCANTCAATGATNTGCCTTCTCCAT 

SEQ ID NO: 362 1 ACCAAGGCTTTAACGTGTCTGTGCAGGGTATTATCATCTACCGAGCCGCCTAC 
TTCGGTATCTATGACACTGCAAAGGGAATGCTTCCGGATCCCAAGAACACTCACATCGTCATCAGC 
TGGATGATCGCACAGACTGTCACTGCTGTTGCCGGGTTGACTTCCTATCCATTTGACACCGTTCGC 
CGCCGCATGATGATGCAGTCAGGGCCGCAAAGGAACTGACATCATGTACAAGGTGGGAGGAGAA 
TATGCCTCATTCATTACTCAATCAATTTCTTGGCCTTGAAGAAGCGTCGCAGGTANAAGACCT^ 
AGGTAGCTAGTCCAATGAGACAGAACATTGAANAGATGCTGAAGTATAGGACCCGAGTGTTTGTT 
GACTCGTGGTATCACGCATCTACTCTTCTNTCTTCTTNATGTAGCNAAATCATTTCA 
ANGGCTTNNAGCGT^^^AANTCTACC^TAATGGNNTANCTT^ 
ACNCCCCCTCCTGCTTAGTTTAGGAACNAAAT 

SEQ ID NO: 3622 ACAATCTATCGACAAAACAAACTCCAAAAGAAGGTGTGAAAGTTAACAAGG 

tgatggttgctgaagccttggatatttccagagaaacctacctggcaattctgatggaccggtcct 
gcaatggccccgtgctggtgggcagcccx:caggggggcgtcgacattgaagaggtggctgcttca 
aacccggagctgatttttaaggagcaaatttacatttito 

gctggatggccgaaaatctaggc™agttgggccttrgaaaagcnaggctgnaaatca>^ 

taagctgtntaatctcttctgaaaattgattgctactcangnggaagtgantcccm 

tccanaangaa>jagttmctgtttgatgccaaaataaatm 
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AATAC 

SEQ ID NO: 3623 ACGCGGGGAGTGAGGAGGAACGCGAAAAGGTAACGCCACTCATGGTCAAAG 
GCTTTCGGGATGCGGNTGAAGAAGGAGGGACGGGATTGACCGGTGGGCAANCGGTGGTCAACCCT 
TGGATTATAATCGGTGGAGTTGCCANTGTNTTATTNCCAACCAAATGAGTTO 
CGCCNTCATTNGGNGATGTGNTGGTGTTAACNATACCTNTANGAACCCAATTI^^ 
TCATTNGCTGGNTTAATCCTTGAAAGANGTAATAAIWTTTTTO 
NGGTAAC 

SEQ ID NO: 3624 ACAAAAACCCAAATTGATAAATCTGCAAAATCTTAAACTTCTTAATCCATO 
TAAATATTAAGGAGTAGCTGGTCTTCAAACACCGTAAAAAGTTAAAGGGTTGAAAACATGTTCAC 
CCTTTGCTATACGGTGGCATCTGAAATTATCAGGCATGACAACTTGAAriTGTTTTGCTCT^^ 
CAAGAAGGTATATATATGCATTCTNAANANAGCTTANAANGTTACAGTTNACTTC^^ 
GGTAAAGTTGACTANGGATGGCT 

SEQ ID NO: 3625 CGCGGCGGGTACAGATCTCACAGGGACACTCCTTATCCCTTGCAGAGTTCCA 
GACACTACTGATGGTCACCAAAGCAACATTTCATCAGAAAACACAGTGCTGGGCTTGTGAAGAAG 
GTGTCCAGCAGAGCTTCCACTGCCCCTGTAGGCTGCAGGCAGCTGCTTCAGTTGAGAGATACACTG 
AGCTCCTCAAAGAATTCCTATTTAAGTTTAGGCTATGAGCATAGATGGAAGAGGTGGCTGCTGTAT 
TGGATCAAGGATATGAGCAGAGATGGATGAGGCAGCGGCTGTAATGGATTAAGGCTATGAGCATC 
TTGTGAAGTAGGATGTAAAATCTGTAAATGCCAGAATGAATCCAAACTTCCAATTCAAAGCTGGA 
TraTAAAACAGCTCAAAAAAAGACTGAAACACAGCTCTCACTTTATGAGGCTNAGGCAGGCAGAT 
TTGCTTAGTTCANANTTGGANANAAGTTAAGAAANTTGGAAANAC>mT^ 
TATTAGAGNTTTGTGGATNNAACNTGGGGTCCAATTTNAOGGGTT 

SEQ ID NO: 3626 ACGCGGGGCAGGACAGCATTTCATATGTAACCATTTGAATGTTTTTGCTGTTT 
rrAGAATTCAGAGCCCTTGCTGGGGGGTGCCTGGGAGATGGGGTAAGAAGAGCTTTCATTTGTCTG 
GTAGATAGATAGCATGTAAGGGGGTGGTTGTCCCAGGAGGCAGCTGCTGACAGGTrTGCTACACA 
CAGCCCCGGACTGTGTTGCCTGGGTGCTCATTCAGAGAGGGGCTATCATCTGGGAGCCTGTGCCCC 
TGGGTCCTCGAGGGTCATGGCTTGTCCCTGGTCAGTCCrGTCTGACTGCCTNANGGCCTACCTCTCT 
GCCTTCCCTGCCGGTTCCTACTCACCTGCTAGGGCCNAGTGCCATTTTCAGCNCTACCCATTT^ 
TTTCAAGAAANCTTNGTTACrrGTNGNACCAAGCANAAATGCTCCACTANTCACT 
GATTA 

SEQ ID NO: 3 627 acaaataaaatcaaaaagagcagtgttctgttgtattcatttctgcatgtata 
gctttattaattgctaatgaaaattagaacttttctgggatcttctgacaagatttt^ 
taaaatgccttttcttcagtgaagccatctttggagttagtcattactctcaccttatctgtc^ 
tgacrrcaacctgatattcctcttcttttggtccagaccctcaaattrraaaagtag^ 

AGGAAAGGTCATTTTTCCACAGTTCAGTTCTCTGAAAAACTTCCATCTCCCACT^ 
CCANGGAGTGAAGTAATCACATGCTNGAACATCANGGCCAATTGNAAAGCATTATGAACACTNGC 
NTNGGTCGACrrATTTATCACCACANCCGTGAATATGCAATGTTCTGAAAAAAGGGACCl^^ 
CCCACGTTATTTTTANAA 

SEQ ED NO: 3 628 ACGAGACATGTCATGCTGCCCAAGGACATAGCCAAGCTGGTCCCTAAAACCC 
ATCTGATGTCTGAATCTGAATGGAGGAATCTTGGCGTTCAGCANAGTCAGGGATGGGNCCATTAT 
ATGATCCATGANCCATAACCTCACATCTNGCTGTTCCGGCNCCCACTACNCAAGATNCCAANGAA 
ATGNAGCTGGCAAGCTACTTTTCAACCTNAANCTTTACACANGCTGTCCTTACTTCCTAAa 
CTGATAACATTATTATTATCCCTTCTTGTNTNTTACTTTNGATAm 

SEQ ID NO: 3629 accacagttcacaagtgcaggagagaattttgataaattgttagctggaaag 
ctgagagagacttrgaacatatctggaccacctctgaaggcagggaagactcgaaccttttatgg 
tctgcatcaggaotccccagcgtggtgctagttggcctcggcaaaaaggcagctggaatcgacg 

AACAGGAAAACTGGCATGAAGGCAAAGAAAACATCAGAGCTGCTGTTGCAGCGGGGTGCAGGCA 

gattcaagacctggagctctcgtctgtggaggtggatccctgtggagatgctcaggctgctgcgg 
agggagtggtgcttggtctctatgaatacgatgacctaaancaaanaaagaagatggctgtgtcg 
gcaaagctctatggaagtgggggatcangagcctggctgaaaggatcctgtttgctttgggcana 
nttggaccccanttotggngnccctcccttgntatgaccccatcnnt^^tm 

TTCCAAAGNCTG 

SEQ ID NO: 3630 ACAGATACTTATGAGGCCAGCTGGTCTTTAATTATGTGGGTCCGAAGCAAATT 

ccttgtatgggcatcaattggaggggttca^tctttgaatacagaarrcaggggagccagga 
tcaaccgctcacttccaaatagatgattgccgaggccggcttgtctgaaaaggtcaatggctgtgg 
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ACACATCAGACTCTGOSrGCCANTTCAAATAGCGTCTTGGCTGAGTCTGGNATGANTAGCrC^^ 
TGTAGKTGGATCACCCCGNTGGTGGCTANGATGTCTTTTTGGAGATGATOSICCT^ 

SEQ ED NO: 3631 ACGCGGGGAACTATGGAATAAAACTACTGATGCAGTGAANACAGTTGAAAA 
GATCrraACAAATGCCTATCTATATTTATAATGAACAAATTCAAGAAAAAGGACTACGG/^ 
AGGACATCAAAGAAGTCANGCAAAACTCATCTTGACCCCTGTTGCATGCTAAGGAACNCAGOT 
GAAGAAAAGATGATATANCAGTTAACAGCGNTGCATACNTGGNAGANGNTTOCNTAATNATC^ 
TTATCAATAACCATTnTATTTTANATTG 

SEQ ID NO: 3 632 ACCTTTCCTTTTCCAAATCTAGCTGAAATTTCTCACTTGTm 

AGCTAACCACCGGCATTTCTCCTGATGTTGATTTAGGATCGGCAGGGGAAGCAGGATGCCACACT 

GAGAACATCTCCTCAGAATGTCATAGGCTGCTTTATCTCCTCTAGGGGAGCTGGTCCT^ 

GCTCTGACATCNAAANTGGATCCA 

SEQ DD NO: 3633 GGGTACCAAACTGACCAATGGGCTGCAAGAGGTITAGATTATTGCTACCCA 
AAAATTCTGAGCCAAATTGATAATGGTCATCATTAGTGACATCTCACCATGATGATAAGAAGACA 
TTTCAGCCACTGATCCAGCTAATTGGGCAACCrrTACTTCTCGCTTGTCATT 
AAACAAAACCTTTCTCTGACCTGGTTTCAAACCATCCACCATNGANGGGATAGATC™ 
AGAATTTGAGAACANAGATAAGTTCCnTGTTGATGAANTCATTATATGCAGATATC^ 
TNCATCAAGTAATNCTCAGGAACCCAAGTANCTThrrCMrrGTCTTTATCOT 
ACCATTCCTTCNACNTANAThrrGNTTTTTGCTNAAAG^^^ 

SEQ ED NO: 3634 GGGTACAGGCGGCAACTTCCAGAGCTTCCCCTCAGTGCTTGGTGACTGGCAC 
AGACACGATGCCATTTGCTCATGTTCCCCTAATACAGACnXjGTAGAGCTGTGGTGGATCCATTGTC 
AGGATGCAAGGGGCACAGCCAGGTCCCTGAGTGACCCTTCTCCTTCCCACAGCCCCTCCACTCCCA 
TCCCTATCTTCCCACTCACACTCACTGGTTCTTCTGCTTTGGTTTAGCCTCCAATGAT^ 
CATCAGTGAGGCTTITCCATCTGAGGTAGCrTTTCTAATATAAAGAAAGA;^ 
TTTAAGGCITAGCAGCCTTCTGGCITAGTTGTTTGTAAATGAAAATAT^ 
GCCAAAAANATAACCCANAAAATGGGAGAAAATATNGACAAATATATATCTTGTAA 
ATCTANAATTAAAANAACTTGANACTNGCATAAAANAACACCThmTTA^ 
TGCCTTTTTGG 

SEQ ID NO: 3635 ACAGGGAAAGGAAAAATITTCACCAACAGAGGCAGTCTCTCATAGAGTATAA 
AGCAGCTGTCACACTTCAAAGAGCAGCGCTTAAATTCCTAGCGAAGTGCCGTAAGAAAAAGAAAC 
TATTTGCTCCTTGGCGAGGACTCCAAGAACTCACTGATGCACGCCGAGTTGAACTGAAGAANCGA 
GTGGATGACTATGTCAGAAGAC^^mGGGCTCTCCAATGTCAGATGTGGTCANTTGGGAGCTCCAT 
GCX:CAAGCrrCATGAACGACTGCAACACTACTITATGGGCAGGGCCCTAGAAGAG(^ 
GCACANAGAAAGCTCTGNTNCACANATCANCACCAACCGTTGAACAGCTATTGAAGGCACCTAAG 
TCTGGANNGAGGAATAANGGAAANACCTTGAGCTCTTCCTANATTATTCCAGGCT^ 
AGNCA>n>ICAGGCCCATTTACACCCrTTGAANCCATACAACCCCCTGGTGAANAAN 
TOGAAATNAAATGANTTCCAAAGNTAACCTATTATGAANTATAAATTTm 

SEQ ID NO: 3636 AGCGGCGAGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATT 
TCGACCGCGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAG 
CGCGAGGGCTACGANCGTCTCNTGAAGATGCANAACCNGCTTGNCGGCCGTNCTCTCTTCCAGGA 
CATCAANAAGCCAGm^AAGATNATTGGGGAAGAACCCCATGACNCNATGAANNCTGNC^^ 
CTGGTGAAAAAGCT 

SEQ ID NO: 3637 ACGCGGGTATTCAGAGTGATAGrTTGTGGCTTGTANAATTCTATGCTCCATGG 
TGTGGTCACTGTCAAAGATTAACACCAGAATGGAAGAAAGCAGCANCTGCNTTAAAAGATGTTGT 

caaagtttggtgcagttgatgcagatnagcatcattccctaggaggtcattatggtgttcagggat 
. ttcctaccattaaaattctttggatcctctaanncanacctgaatatca^ 
gncgaaacctntgtngatgctgctctgaattgcttn 

seq id no: 3638 actttttctttgaagttttagcggtcaam 

ttatactgtggctatgcaacagctctcacctacgcgagtcttactttgagn'agtgcc^ 

ccactgtatagtttacttctcacx:atttgagttgcccatcttgnttcac^ 

taagtgcctttagttntaacaggncactrmacagtgctatm 

tgcntaaaattgcgtotaantatangarranggaataaatngtncntng^ 

nggcnaaattccaagacattg 



S64 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 3 639 GCGGCGAGGTACGCGGGGGCTGTCGGCGGTGGACTCGTCX5GAGCCGCGGGC 
GGTCAGGAATTTGACCCTCTAGGGCATGAATACTGTGCTGTTCAGTTCTGAGCTGTGCTAGCAATA 
CCCnCAAAGGAAGAGCAATGGCTGCAGCAGCAGCTTCTCACCTGAACCTGGATGCCCTCCGGGA 
AGTGCTAGAATGCCCCATCTTGCATGGAGTCCTTCCAGAAGANCATCTGGGTCCCAAGCTTCT^ 
CTGNGGCCATACCATCTGGCGCCAGNTGCCTGGAGAAGCTAITGNCCANTNNCATTAATGGGTGT 
NC^^CTNTCCCTTTNCANCAAGATTACCCCCATAACCACTTGGACCCAG^ 
ATTGCNAAAATCATTGATCCAGCTTGGGNTCANCANNGCTNTNGGGGTTNTAATN 
TGGCGG 

SEQ ID NO: 3640 CGGTACCCATGCACAGCAGCTGTGGGGTGTGGGCATCTTCAGGTGGTGTTCT 
GGTAAAATCGTTAGAATAGCTGTATGAAGGAAACCAGTGGGAGGAAATGAAGTTTTCAGGATGGT 
GGGATGTGATTTAGACAGTGCACATGCTGTTATGGCTGTCACTAGGGAGTGGCCTTNATGGAGGG 
TGGTAACANCACCTNAGTCCANCCAAAATGTTTAGAAACACTTCATTTC^ 
GCTTrrCAGGNCATGCNNCAAGTNAAGGGGCACCTTCNTNAAAGATGATTG 

SEQ ID NO: 364 1 TTACACATTGCCTCACTTTATATTTTAAATGAGAATCTTGm 

GCAGGAGTTGGTAGATTGGCCCTCTATNGNGGTCAGCGGCGAGGTACGCGGGTCTCTCCAAGTAG 

TAAATGATGACATTTCTTAATAAATACTGGGCTTGTGATTAAAGTTCAACCTGGCACATGl^ 

CTTTGAAGAATGGGAAGAAATTACGTGAACGGTCTTNATATCACCTCTGGGAANCAGGT^^ 

GTNACGGACNCTTTCAATCTAGAATAAAGCAGGANTAAGTTGCTAACAGTm'AT^^ 

NATTGArmTGTAGCATAT 

SEQ ID NO: 3642 ACACTTGAAACCAAAmCTAAAACTTGTrmCTTAAAAAATAGT^^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCrCACACTTCT 

TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTrTTTTTCT^^ 

TCTTCAGCAACTTGAGAGCTTTCTTCATGTTGTCAAGCAACAGAGCTGTATCTGCAGGTTC^^ 

CATAGAGACGATTTGAATTTNTTCCAGNNGATATCGGGCTCTAACTGTCAGAG 

AACATAATCCTGGGGACATACTGGCCNTCATGAGAAATGTGTTTGTCAGATGTTTCATAAAC^^ 

TTGANGAGGACAANCTGNTCTGCNATT 

SEQ ED NO: 3643 acgcgggggctgactctcttttcggactcagcccgcctgcacccaggtgaaa 

TAAACAGCCATGTTGCTCACACAAAGCCTGTTNGGTGGTCTCTTNACATGGACNCACATGi^ 

GGTGCCATGACrCGGATCGGGGGACCTCCCTTGGGAGANCAATCCTCCGTCCTNCTGCTCTTTGC^ 

CCCTGAGAAAAGATCCACCTACGACCTCNTGTOCTNATACCGACNANCCCNTNAIWCAT^^ 

CATTATCAAATCATGGNANCAACCTNTTTTACTCAT^^NCTN^ 

CACTTTCTCCTTTAAATCTTGGCACTACNCATTTAATCT 

SEQ ID NO: 3644 ACAAATAAAATCAAAAAGGGCAGTGTTCTGTTGTATTCATTTCTGCATGTATA 
GCTTTATTAATTGCTAATGAAAATTAGAACTTTTCTGGGATCTTCTGACA^ 
TAAAATGCCTTTTCTTCAGTGAAGCCATCrn'GGAGTTAGTCATTACTCTCACCT^^ 
TGACTTCAACCTGATATTCCTCTTCTTTTGGTCCAGACCCTCAAATTTTA^ 
AGGAAAGGTCATTTTTCCACAGTTCAGTTCTCTGAAAAACTTCCATCrC^^ 
CCAGGAGTGAAGTAATCACATGCTAGAACATCAGGGCCAATTGGAAAGTCATTATGAACACITGC 
ATTGGTCGATCTTATTTATCACCACAAGCCTGAAAATGCAATGTCCTGAAAAAGGTGACCTCTCTG 
TGCACACGTAATTTTTAAAAAGGAGAGGGTAATATGAAGGGGACTGAGGCTTGATCACCAAA/^ 
CAGCACAATGAAAACAAACAATAATGAATAATGAGCACTAGAATTCAAATTACCCAGA TGTTT CA 
AAGAGATGGGGGTGCCAANTTTCAATTNCCGTTTTGAACACCACATTACAAAAGA^ 
AAAATAA 

SEQ ID NO: 3645 ACACATCTTATAGCTGAACGCCTATGTAAGAGTGAAAAATTTITAGCAGOT 
TGCGGCAGGAAAGTGGATACTAACCAAGGACTATATAATTCATAGTGCCAAAAGTGGCAGATGGC 
TTGATGAAACAACTTATGAATGGGGATATAAAATTGAAAAAGATTCCCGTTATTCACCT^ 
AATCTGCACCTAAAAGATGGCGTGAAGAACTGAAACGCACTGGTGCTCCAGGAGCCTTCCACAGA 
TGGAAAGrrGTCCTCCTTGTTAGAACTGATAAGCGAAGTGATTCTCTTATAAGAGTm 
GGAAAGGCAAATGTTATTTTACCAAAAAGTTCACCAAGTGGAATAACTCATGTGATTGCCAGTAA 
TGCAAGAATTAAAGCTGAGAA.\GAAAAAGATAACTTTAAGGCTCCATTTTATCCAA^^ 
AGGGGATTITCTTTTANAGAAAGAAATTCAGAATGATGAAGATTCCCAAACCAAT^ 
TGAACCATAGCAATGAAGAAACAAACAAAGATTTCAGGAAAGATCAGGATrrCTTGGA^ 
GGNGCCTTAAGAAAAACCTGTNTAGAACCCANAAAGAAATGCCAAATCATGAAGATGT^ 
GGTCTATTTTGATT 
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SEQ ID NO: 3646 actgtaaaagttctgacacaagacagtggcagtggttacttttcatcgac^ 

AGCATOTGATCTCAGGGACTCAGACATACGTCTAAGTTCTATTCTGAGTTTTGGCAAC^ 
GACAGATATTTCTGAATGAACAATTTTTAGGTGTTTTCAGCCAm 

TATTTGGTGTTAGCTCAACAGTCACGTTGTGCCAAGAATTAAAGAACTCTAAAGTCTACAAACAT^ 
TTACTTCACCAAGACTAACTATAArrGAAGGGTTTACTATTTGm 
CTTTTATCCAAACAGCAACTACTACAAAAGGAATGACAAGAAAAAAAATGAOT 
ACTAAAAAATCTGACAATGrrATGCAAGAACCGCAAAGTTTTAGTGm 

ACTGACCAAGGTCCAAGATGTGAAGATACCATGTTCAAGAAACTGGGGGGAAATCACTCTACAAA 

CTAACTATATACTAGGTGATAAAGAATGCATACTTTATAATATAAAACCTGTCT^ 

TAGCTGCAAAAGCCATTCGATCTTCTCTTGGGCriTGGAAA^ 

TACCACCCTTCA 

SEQ ID NO: 3647 AC'l Ul l" l 'i'1114"ilUll"ll l l"lU'lTrri' l >CCCTCAGTCTTTTANACTCTCAGCT^ 
CACAACTTGATTGCCTTACAAATGACCTGCTTCAAGGATGTGGAAATTCCTAAm 
CCTTCTGTGACACCTTCACCAGGAACATCAACATGTATTTCCCTGCTGCCGTATTTGGTTTTCT^ 
CATCTCGGGGACCCTTTTCTCTTACTGTAAAATTGTTTCCTCCATTCTGAGG 
GGGAAGTATAAACCTTCACCACCTGTGGGTCTCACCTGTCAGTTGTTTGCTGATTTTAT^^ 
GCGTTGGAGGGT 

SEQ ID NO: 3648 ACGCGGGGGGTGGAGGTGGTAACCGTGATAGTAGCAGCTCCGGCGGCAGCA 
ACAGCGACTACGAGGGATGGCGGCGGCTGCAGCAGGAACTGCAACATCCCAGAGGTTTTTCCAGA 
GCTTCTCGGATGCCCTAATCGACGAGGACCCCCAGGCGGCGTTAGAGGAGCTGACTAAGGCTTTG 
GAACAGAAACCAGATGATGCACAGTATTATTGTCAAAGAGCTTATTGTCACATTCTTCTTG^ 
TACTGTGTTGCTGTTGCTGATGCAAAGAAGTCTCTAGAACTCAATCCAAATAATTCCACTGCT^ 
CTGAGAAAAGGAATATGTGAATACCATGAAAAAAACTATGCTGCTGCCCTAGAAACT^ 
AGGACAAAAATTAGATAAGACGGGGTTTCATCGTGTTGGCCANGCTGGTCTCCAACTCTTGACCTC 
AAGTGATCCACCTGCTTGGACTCCCAAAGTGCTGGGATTACAGGTGCAAATGCTAATT^ 
TGGGATTAAAAGGTGTCAANAAGCTCANAATGGCTTCAGAATCTGAGGTGTGGACTTC^^ 
AAAAATCAAAGTATGACmGGTATAAAACAGAATCrCAAGNAGTTCATTAC^ 
NAATNGTTCAAANG 

SEQ ID NO: 3649 ACACTGTTGGAGAGATGAGACAGTCACACCAGCTGCCCCTAGTGGGGCTCTT 
ACTGTTTTCTTTTATrCCAAGCCAACTATGCGAGATTTGTGAGGTAACTTCAACATCTCCG^ 
AGCCTATAACTGTGACACCTCCTGACTCACAATCATATATCTCCGTCAATTACTCTGTGAGAATC^ 
ATGAAACATATTTCACCAATGTCACTGTGCTAAATGGTTCTGTCTTCCTCAGTGTGATGGAGAAAG 
CCCAGAAAATGAATGATACTATATTTGGTTTCACAATGGAGGAGCGCTCATGGGGGCCCTATATC 
ACCTGTATTCAGGGCCTATGTGCCAACAATAATGACAGAACCTACTGGGAACTTCTGAGTGGAGG 
CGAACCACTGAGCCAAGGAGCTGGTAGTTACGTTGTCCGCAATGGAGAAAACTTGGAGGTTCGCT 
GGAGCAAATACTAATAAGCCCAAACriTCCTCAGCTGCATAAAATCCATTTG CAGTC 
GTTTATTGGCCTTATGCCTTCTTCTTCATTTATCCCAGTACTTTi- 

GGCAAAAAAGGAAGGTTTAATGAACCCTGTCCAGGGCCCCTTAGGNGGGGNACCTCCTTO 
GCCCTTNTC 

SEQ ID NO: 3650 ACTTTTTTTTTITrTTTT^^ 

GTAGCTCCCTGGGCCGGGCCTGGCTGCriTAGGCCAGTCTCTTGCTCACGC^ 

TCCGATGGTGGANACCTCCACCAGCTTGTCAa:CACAATCTCTGAGGTCTGGTGATAGTTGGGGAA 

ATTCACCACCAGCTTCCCGCCCTCCATCTGCACAGTGGCCTTGAACATCTTGCCCCC 

ATGTTACTTTCCTTGCCAACAGTGAACrTGTTGGGNCATGGNGTGGCCCCCGGAGTNAGTGCT^ 

ACCAAGTGAAANTCCTGCCCATCCTGCTGNANCTNa^GTGACAATCITA^ 

CATTACATTNCTGGAGATNCCAAGGGACITCTGAACCCATCATAATTNAT^ 

SEQ ID NO: 365 1 acagcataaAgtctactttcctcttgaagaaaccataagctaccagcto 
• ggaggggctggagcagagcatatgaagcagctgcaaaaaaaaaaaaccaaatcaaactcatgac 
gcagctggccaagcaagccaagtgctgtcccgccatcccagcacagaactggaaggcaagtgact 
ccctgtccctgttggtggaacccaggaaggaacctcagcagagccctccgtgtcctcctctc^^ 
cagcccctgcmcccttctgctgacaggacaaccaggcctgtgtcacctactccatc^ 

TGGCTCCTTCAGGACCTCATAAGCCTTTAGAACCTTGGCAAATTGGATTCTATGC^ 

TAGTAAAAGTAAAATTTGCAATAATGAAAAGGCTGACACAGTAGCACAACATGGTTT 

CAGCAGCTTAAAAAGGAACAAAAAGGAAACCTCTCATGCANACACATCAGGTGGCATAAAAC^ 

TAGGCAATTCCACGCGGAGCATCATTAGCCATTCTCTCTGTCCGCACACAGGACTCTGGCTGCACC 

TTNAGGGGCAGCAACTGCTTTCAGGTCAAGTCTCrrCACCCTCTACCCrAAGCA 
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CCGGCCTNNGCTGA 

SEQ ED NO: 3 652 ACGCGGGGGTTGTGAGTTTTGTGGACCTGGAACAATTTAACCAGCAACm 
ACCACCATTCAAGAGGAGTrCTATAGAGTTTACCCrTACCTGTGTCGGGCCTTGAAAACATTCGTC 
AAAACCGTAAAGAGATCCTCTTGCAAGGATTTTTATGTTGCATTCCAAGACCTGNCT^^ 
GATTCGAAAGCTCACCTNATNCAAAATTGGTTTGCTCACTCGCATNAATGGGCAAGTGGTGC^ 
CTCCCCANTTCACCCANACTTGTGANCGGAACTTTTCTGTGCTTGGACTGTCANACAG 
GGATGTAAAACAACAGTTCAAATTCACACAhfNCCAACATITGGCNAAAITCAAT^ 
GAGGAGATTCITACTGGATACAAATAAATCAAGANTTGTTNATTTTTAAV^^ 
ANAACCAANCTGAGCNTTCTCGAAGGAATNTTCCCCNCANTTTANAAATAANT^ 
CTTTGNGAATTANTCAA 

SEQ ID NO: 3653 ACGCGGGGAGCGGTAGCTGGTCTGGCGAGGTTTTATACACCTGAAAGAAGAG 
AATGTCAAGACGAAGTAGCCGTTTACAAGCTAAGCAGCAGCCCCAGCCCAGCCAGACGGAATCCC 
CCCAAGAAGCCCAGATAATCCAGGCCAAGAAGAGGAAAACTCCCAGGATGTCAAAAAAAGAAGA 
GAGGAGGTCACCAAGAAACATCAGTATGAAATTAGGAATTGTTGGCCCCTGTATTATCTGGGGGG 
ATCAGTCCTTGCATTATCATTGAAACACCTCACAAAGAAATAGGAACAAGTGATTTCTCCAGAm 
ACAAATTACAGATTTAAAAATCTTTTTATTAATCCTTCACCTTTGCCT^ 
CAAAAGAAGTCTOGCTNAACATGTTAAAAAAGGAGAGCAGATNTGTTCATGACA 
GTTCTGCATTCTGACTTTGGAACCACAGATGAAGNNCATACITCTAACTGGCTTT 
GAAAGTNTCCCNCriTrATAGGGGAAACATTTTT>^ 
GACCCNAAAAGGGTTTTAATTAAAATTTTGCTTNAACTTATTGG 
CAACTT 

SEQ ID NO: 3654 acgcgggaccgaatagaatcgaatggaacaatcatcgaatggactcaaatg 

GAATTATCCTCAAATOGAATCGAATGGAATTATCGAATGCAATCGAATGGAATCATC^ 
TCGAATTTAATCATCGAATGGAATGGAATGGAACAGTCAATGAACTCGAATGGAATCATCAT^^ 

atggaatcgaatggaatcatcgagtggaatcgaatggaattatgatcaaatgqaatcgaatgtaa 
tcatcatcaaatggaatcaaaaataaccatcatcaattggtattcaatggaattgtcatcaaatgg 

AATTCAAAGGAATCATCATCAAATGGAACCGAATGGAATCCTCATTGAATGGAATTGA^ 

atcatctaatggaatcgatggaatcatcatcaaatagaatccaatggaatcatctcaaatggaat 

CTAATGGAATCTTGACAANAATTTGATGGAATCCGCATCCAATGAATTGAATGCCATCATCGAATG 
GGCrCNAATGGAATCATCTTCrrATGGAATGGAATGGAAAACCGGAATGGAATOT 
GAAAAATGTTAGGGANACCGNAATGGGAATGTNAAAATNTATTTGGAATTrGG/^ 
AATGAAAANGAATT 

SEQ ID NO: 3655 ACGCGGGGAATCTGCCATTTTCTGTCCCTGAGTGAGTCTCTGGCGTCCCAAAT 
TGCCTGTTTTTCTCGCAGGCTCTATTCCGTTCGCTGGTTCGCCACCTC^ 

GAGTCCACAGCCACTGCCGCCGTCGCCCGCGGACTGGTTTCTGCCCGACAAAATTGAAAATGTTCC 

TGCTCCITCTACATCTGCAGATAAAGTGGANAGTCTGGATGTGGATAGTGAANCTAAAAAACTAT^ 

GGGTrrAGGACAAAAACATCTGGTGATGGGGGATATTCCAGCANCTGTCAATGCATTN CAAGA ^ 

CANCTNGTNTTTTANGTAAAAAACATGGANAGACACTAATGAGTGTGGANAA^ 

TGGGAAAATCACTTCTGKJAGlTGGCAAAAATGGAAAATGGGGTOTTGGGAAACCCCm 

GGTGO^TTTGGAAAAGGAAAAAGGGAAAAAAACCCNANAATGATTT^ 

SEQ ID NO: 3656 acgcgggggacacaaaggactctcgacccaaactgccccagaccctctccag 

AGGTTGGGGTGACCAACrcATCTGGACTCAGACATATGAAGAAGCTCTATATAAATCCAAGACAA 

GCAACAAACCCTTGATGATTATrCATCACTTGGATGAGTGCCCACACAGTCAAGCTTTAAAGA>^ 

GTGTTTGCTGAAAATAAAGAAATCCANAAATTGGCAGANCAGTTTGCCTCCTCAAT^^ 

AAACAACTGACAAACACCmCTCCTGATGGCCAGTATGTCCCCAGGATTATGm 

CTNTNACAGTT>rNANCCNATNTNACTGGAAGANTTTCAAATCGTCTC^^ 

ATACAGCTCTGTTGCTTGCAACATGAAAAAAGCTCTCAAGTTGCTGAAGACTGAATTGTO 

AAAAAATCTCCAAGCCCTTCTGTCTGGCANGCCTTGNACTTGAAACCCNAANAAGTO^ 

TGGCTAGTGTGGAAGCATTGTGNACACCCTGATTAGGNTATGGGTTAATGTTACAACACTNi^i^ 

AANAAAAACATGTTTTAAAAATTTGGNTCAAGTTG 

SEQ ID NO: 3657 ACTCAAACAAAACAAAACAAAACCGGAGTAAAAATTACCGrrAAATTIT 
TTATAACACAAATGACAAATGAACAGAAAACTGTGAGGCTGAACTTGCCAATATCTACAACT^ 
GAACTTCTGACTTCTGCTCAGCTTCATACATGAGAAAATTTGCTTAGCCGTCCTCTGT^^ 
CATAAACAGGCAAAGAAAAACAAAACACGAATGAGGGTAGCTTGCAAATTTGGAATCAAATTGT 
GATTCTTATGAACAAGGCCATTTCAGAACACGAGGCCACATACCTGGAAACATGAAGACGAGGA 
CTGCAGGTCAAAATAGTATGAGGACCAGGTTGGTGGGCTGATGCTCANACACAGAAGCAATGATG 
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GGGATGTTGGTCTTAACATAAGAAGGATCCACAGTGAGTAGAAACCCCCCGTCCAAGGGAATTT^ 

TTTCTNTCACATGAAGGANAACCTCCACGGNGAANAAGACAAACCTNCAAANANATGAC^ 

GAAAANAACTTCNAATTGGGTGGGGAATCACTTTGCTGCCCAGGNAATNCCAAANG^^ 

GCANAAAACAAANAACCCAAANGCNGGCCTOGCTmTATAAAANGNCCGGCTNGAANCrc 

NGGAANNTTNGCTTTA 

SEQ ID NO: 3658 ACAAACATGTGGCAGGTTAAACATCAGGCATAAAAAGGACAAATACTGAAA 
CGCAAAGGGCCAGAGATCCCAAGTCGCTTAACACTATTTAAAATGTTATTCAACTGCAAAC^^^ 
CCTCTGGAACCATCCCAGATAGCTACTGGTTAAGTGATATGAAGTAGGAAAAAAGAAGGTTCTGG 
AAGGTCTCAAGGAGGCTGTGCAGAAAACAGCACCAGATGGAAGCAAGAGCAGAAAGAAGTCACG 
TCCCATCTTTATGACAAGACACAGAACAAGATGGTCACAGTTCTTCATGGGTAAACAAi^ 
GGAAGACACAGCCAACCTTAAGTAACTTCTACTGACGGATGCTCTGGCACTTCTCTATGTAGAGAA 
GGAGAGAGAAAGAGAGCACCAAGCAGGAATGGGGGACTCTATATTTATTGTCTGCCANTTCXm 
ATATGCATTGNCTTATTTAAATCCTTATGACACCTTATGAAGAAGCCAATGATTTATTN 
GAAACCCTAATNCCGATTAAAGAAACTTGAGACTCANAATANGGGAGTTTT^ 
ACTGCrrANTGGGGATGGCANAAATTCAAACTCAACTTTGGGAAAACCCCTG 
TTAATCAAA 

SEQ ID NO: 3659 ACGCGGGGGTGTCATGGCCGGCTCCTACCCTGAAGGTGCACCTGCAGTCCTC 
GCCGATAAGAGGCAGCAGTrCGGAAGCCGGTTCCTGAGAGATCCGGCGCGCGTCTTCCACCACAA 
TGCCTGGGACAATGTGGAGTGGTCGGAAGAGCAAGCCGCGGCGGCGGAGAGAAAAGTCCAGGAG 
AACAGTATCCAGCGGGTGTGCCAGGAAAAACAAGTTGATTATGAGATCAATG CCCAC AAATACTG 
GAATGACTTCTACAAAATCCACGAAAATGGGTTTTTCAAGGATAGACATO 
CCCrrGAGCTGGCACCCTACCCAAAATCAAAATCATTTTGAAGGACrGGTTOT 
ATTGAAGTNCCTGCCGGGCCGNCCGCTCAANGGCGAATTTCCACCACACITGCGGCCC^ 
ATTGGATCCCAAACTTNGGTCCCA 

SEQ ID NO: 3660 ACTCGGTGAAGTCTmGAAGAGCTTCGGCTGAAACCACAGCrrCTCCAAGG 
AGTCTATGCCATGGGCTTCAATCGTCCATCCAAGATACAAGAGAACGCATTGCCACTGATGCTTGC 
TGAGCCCCCACAGAACTTAATTGCCCAATCTCAGTCTGGCACTGGTAAAACAGCTGCCTT^ 
AGCCATGCTCGGCCGAGTGGAGCCATCAGACAGATACCCCCAGTGTCTGTGCCTCTCCCCAACAT 
ATGAGCTGGCGOTCAAACAGGAAAAGTGATTGAGCAGATGGGCAAATTTTACCCAG^ 
CTTGCCTATGCCGTTCGAGGCAATAAATTGGAAAGAGGCCAGAAGATCAAGTGAGCAGATTGTCA 
TTGGCACCCCTGGGACCGTGCTGGACTGGTGCTCCAAGCTCAAAGTTCATTGATCCCAAAAAAATC 
AAGGNGTTTGGTCTGGATGAAGGCTGATGTCATGATAGCCACTCANGGCCCX;CAAGAATCAAAAC 
ATTCNGATTCCAAAGGATGCTGCCAAGAACTGGCCAAATGCTGCTTTT^ 
NTCTTGGTGGNAATTTGNCCAAAAAAGTGGGTCCCAAANCCAAAATGTTNTTA^ 
GAAGGAAAA 

SEQ ID NO: 3661 ACAGTCmCATTAAATAAGAATACTTACACATACATmCAGATATTTCTAC 
CTTCCTGTATGTGTTTGGAATTGTATGTAGGTAGCCACTGAAAGAATTTGGGCCCOT 
GGCAGTGGAAGTCCATGAAGTAAAGAGCATTCTTTAAAAAGCAGATTTGATTGCAT^^ 
TATTTGAGATTCTGAGAATTCTGATAAACCCCAAAGCAGAAAGATTCCTTATACCCTTG 
GGAAAGGTGAGGGAAATATTTGAAGCAGGGTCAGAACATCCACTAAGAACATAGCACCTCAGTA 
GAGCTTACATTATAGTGCCAGGGTAGAGTTATTACTGAATAGCTTAGGATGATGAACATTAACOT 
CCTACAGGAGTAGTAGCAGCTGATTTTGGTGACCATCATTGGTCACCTmAGTGGAACTGCAAAC 
TAAAAACAATTATGGCTTGACATATACTCCATGTAGGGGAAGTGATGGGAGAAGCANCCTCTGTG 
TGGGCCATTCCTTGGGAAGCACTGGCTCGATTTTGCTNCCCT^m^ 
ACTTTTCCTTGNTAACCNGNTTCANTAAAATCTTGGATGGCCCTCC^ 
NTTTT 

SEQ ID NO: 3662 accaccctgagttcctgtccaggcctatcaagccctccccaccatactttggc 

CTCCTCCTGGCCTCTGTGGGGCGGCTCTCACATTACCTCCAGAAAGGCTGCAGGCTCTCA 

GACACCTATAGTGACAGGAGTGGAAGCAGCTCCCCTGACTCTGAAATCACCGAACTGAAGTTTC^ 

ATCAATAAATCATGACTGATCnTGTAGCGGATGATTCTTCAAGAGACCC TTCAA ACTTGGGT^^ 

TTrACAGCTCTGACTTTACACTCGGCTTTGGAGACriTCTTTAAAT^^ 

TATTATGCGGAAAGGTATITGGGAAACTTGTCACTTGCATGTCCCATCACGTGT 

SEQ ID NO: 3663 ACT ri - il - lUnUU 'll l ' r i lN TGGGAATGAGAAAATAACTTTATTTCATTGNGGGG 
AGCAGGGCCNGATGTCCANCCTCAANAANTTATGGAACTGCTTCTTGGTGCCGACAGCCTTG^^^ 
ACCTTGANCACGTTGAAGCGCACTGTCTTGCTCANAGGCCGGCACnT^GCCCACTGTAAra^ 
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AOTGATCTGGAC^rrCCCTGAAACANGGGGACAGGTGTACTACNTT 

SEQ ID NO: 3664 ACAAAATATCCCCACTTCCCTTGAGAAAGAGTATATCTAAAATACACTTTGAT 
GAACACAGAATATTAAAACATTATATGCTATAGAAGTGAACACAAATACATTTTCT^ 
AATAGTCTAAGGAATATATAAGCATTGATAATATGAAGGAAAATGTTTGGATTTATT^ 
AAAATTCTCCTGGTTGTAGAGGTGAAAAAGTAATCAGGTTACCACCAAAAGAAATGTGTCTG^ 
TGTTCTrGATGTCrCCTCTCCAGCTGAGTAAATGGATTACTGCAGTTTANACCTGGTG 
AGGTGGCATTCAGTAGTGAGACITITATTATCAAATAGTTCTGACTGAGAAAT^ 
CTTCTAAGATGCTATTTTGTGAAACTGGATTITITr 
TAATTGAAGAAATCAGGT 

SEQ ID NO: 3665 ACCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCTGCACAGCCCCGTC 
CTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCATTATTTCAGAGGGGAAACCT^ 
AGTGATAAGGGCCCTACTACACrGGCTTTTTTAGGCTTAGAGACAGAAAC^ 
TAGTGGCTTCTAGCTCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTTOT 
CTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATCTGCAGCCTATGAAACAGCTGGGTCm 
GGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTCTAGACCCCT^ 
NAGCTTCTACACCTTCTGCCCTTTNTCATTGCCTGGACCCCACCCCAACCACTCAACT 
TTCCTTTGGCCATAGGAAGGTTTACCAGTANAATCCTGCTAGGTTGATGGGGGGCCATACATTCCT 
TTAATAAACCATTGNGTACCCTGCCCGGGCCGGCCGNTCCNA 

SEQ ID NO: 3666 TTCGGCCGAGGTACGCGGGGGAGGGTTCCAACTnTCTGCTTATCTGGGAGGT 
GTTGGGCGCGGACAGTCGAGATGTCAGAGAAAAAGCAGCCGGTAGACTTAGGTCTGTTAGAGGA 
AGACGACGAGTTTGAAGAGTTCCCTGCCGAAGACTGGGCTGGCTTANATGAAGATGAAGATGCAC 
ATGTCTGGGAGGATAATTGGGATGATGACAATGTAGAGGATGACTTNTCTAATCAGTTACGAGCT 
GAACTAGAAGAAACATGGTTATAAGATGGAGACTTCATAGCATCCANAAGAAGTGTTNAAAGTAC 
CTAACTTGACCCTGCTTAATACATTCTAGGGCAGAGAACCCCAGGATGGGGACACTAAAAAAATG 
TGGTTATTTCATTATCTGCTTGGGATTTATTTGGGTTm 

NTTCAAAANAAAAAANAAAAAAAAAAAAGTACCTGCCCCGGGCCGGCCCGNTCNAAAGGGCC^ 

SEQ ID NO: 3667 CGTNCGCGGGGGACGCTGAGGGGTCCGAGGAGACCGTGAGGCTNTGGCCTG 
CATCTCGCGCCGCCATGGACGCTGCCGAGGTCGAATTCCTCGCCGAGAAGGAGCTGGTTACCATr 
ATCCCCAACTTCAGTCTGGACAAGATCTACCTCATCGGGGGGGACCTGGGGCCTTTTAACCCTGGT 
TTACCCGTGGAAGTGCCCCTGTGGCTGGCGATTAACCTGAAACAAAGACAGAAATGTCGCCTGCT 
CCCTCCANAGTGGATGGATGTAGAAAAGTTGGAGAAGATGAGGGATCATGAACGAAAGGAAGAA 
ACTTTTACCCCAATGCCCAGCCCTTACTACATGGAACTTACNAAGCTCCTGTTAAATCATGCT^^ 
ACAACATCCNGAAGGCANACNAAATCCGGACCCTGGTCAAGGATATGTGGGACACTTCGTNTTAG 
CCAAACTCCGAGTGTCTGCTTGACAGTTTThrrGANACAAGCAGGAGGNACATGC^ 
TACTTGGACCTTTGATGGGANATCAACACCAGCGGGGACTTTTCnTAANACAAGCCCTTC^^ 
CTTGTACCTTGCCCCGGGOSrGGCCGTTCNAAAAGGGGCNAATTITCCAACCCNACT 
TTAATTAGTGG 

SEQ ID NO: 3668 ACAAATAAAATCAAAAAGGGCAGTGTTCTGTTGTATTCATTTCTGCATGTATA 
GCTTTATTAATTGCTAATGAAAATTAGAACTTTTCTGGGATCTTCTGACAAGATTm 
TAAAATGCCTrrrCirCAGTGAAGCCATCTTTGGAGTTAGTCATTACTCTCACCTT^^ 
TGACTTCAACCTGATATTCCTCTTCTTTTGGTCCAGACCCTCAAATT^ 
ANGGAAAGGTCATrmCCACAGTTCAGTTCTCTGAAAAACTTCCATCTCCCACTGAA^ 
TCCAGGAGTGAAGTAATCACATGCTAGAACATCAGGGCCAATTGGAAAGTCATTATGAACACTTG 
CATTGGTCGATCTTATTTATCACCCAAGCCTGAAAATGCAATGTCCTGAAAAAGGNG^^ 
TGCACACGTAATTTTTAAAAAGGAGAGGGTAATATGAAGGGGACTGAGGCTTGGTCNCCAAA^ 
CAGCCCAATGAAAACCAACCATTOTGAATAATGAGCCCTAAAArrCAATTACCA 
ANAAANGGGGGCCAGTTTTTAAATTCCGriTGGACCCCCCCbriTACAAA/^^ 
A 

SEQ ID NO: 3 669 ACTCTTGATGAAAGACCGTGAAACCAACAAATCAAGAGGATTTGCTTTTGTC 
ACCTTTGAAAGCCCAGCAGACGCTAAGGATGCAGCCAGAGACATGAATGGAAAGTCATTAGATGG 

aaaagccatcaaggtggaacaagccaccaaaccatcaittgaaagtggtagacgtggaccgcctc 

CACCTCCAAGAAGTAGAGGCCCTCCAAGAGGTCTTAGAGGTGGAAGAGGAGGA AGTGG AGGAAC 

caggggacctccctcacggggaggacacatggatgacggtggatattccatgaattttaacatga 
gttcrrccaggggaccactcccagtaaaaagaggaccaccaccaagaagtggggggtcctcctcc 
taagagatctgcaccttcaggaccagttcgcagtagcagtggaatgggaggaagagctcctgttc 
acgtggaaagagatagttatggaggtccacctcgaagggaaccgcttgccctcrcgtagagatgt 
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TTATTTGTCCCCAAAAGATGATGGGmrrCTACTAAAGACAGCTrm 

AAGTTCTCCGTGATACTAGAGATTATTGCNCCACCACCCChn^AGATIOTAC^ 

TTGGTCATTCC 

SEQ ID NO: 3670 ACTTCITGCCCTTGTAGAATTTCX:TGAGGriTTCTTTCTGAGT^ 

CTGTGAGAACACGGGCAATGGATTTCCGGACGACTCGGATCTTAGAGAGCTTGGAGGCCGCACCG 
CCTGTCACnrrGGCGACGCGCAGCTGGGACAGCTCCACCrrCAGGTCGTCCAGCTGm 
TCCTCCTTCTTCTTCCCGCGAAGATCTCGAGCCTTGATCTTGGCCATTGCTGCACAAGC^^ 
CCGCCGCCCGCTCCGAGGGAAAGAGCCCCGCGT 

SEQ ID NO: 367 1 ACCCTTGGAAGATGGGAAAGGTGAGGGAAATATTTGAAGCAGGGTCAGAAC 
ATCCACTAAGAACATAGCACCTCAGTAGAGCTTACATTATAGTGCCAGGGTAGAGTTATrACTGA 
ATAGCTTAGGATGATGAACATTAACCTTCCTACAGGAGTAGTAGCAGCTGATTTGGTGACCATC^^ 

tggtcaccttttagtgtaactgcaaactaagaacaattatggcitgacat^^ 

agtgatgggagaggcagcctctgtgtggtccatccttggaagcactgcatcgattttgctcccctc 

tggttttaaggagatcttitagaccltrccttgctaactgcm 

attctggattcagacatctcttcrcacccrrctttttcattgtac^ 

aaattggcttgcaggaataattcaagtttttctaaagaccttggattaacaggt^^ 

ctgatgcaggt 

SEQ ID NO: 3672 acaatacagaaatgctgatgtctgagctgagtcctgaagaccagagagtatt 
caactttgacgtgcgccagttgaactggttggaatacaitgaaaattatgtttt^ 
atacttattgaaagaggatatggctgggatcccaaaagcaaagcaacgcttaaaaaggctccgaa 
atattcactacctcmaatactgccctcttccttatcgcctggcgcot 

GATGGCTCGGAATGTCTGGTTCITCATTGTAAGCITCTGTTATAAATTCCTCT^ 

TCCAGCACGCTCAAAGTTTAAGAGCATTTAGCCATCGCCrmATCTGGAACC^ 

TAAAACAGCAAACTGTGATTCTCAAGATTAGAAAGTACAAGGAATATGCCCAAACTGTCAAAT^^ 

CACCTGTTATGTATTCGTCCCTATTCCTTAACTATGTATTTTTATTTTCAGTGAG/^^ 

TGTAAACTAGCCCATAGTCACCTATATTTTAQGGGAAAAAAAATCCCAAATTGGm 

TTAmTATGCCCTTGGCGTATTAAAACGTGAAAAGTACCAATGGGCCCACAACAGGCTNATAGGC 

CAGGAC 

SEQ ID NO: 3673 ACGCGGGCTGGAGGAATGGCATCAGACCCATGCCTCTGTGATTGCTCCCAGC 
CCATCCAACCACAGCATCTATGTTCTGCCTGGGACCAGGGCCAGGGAGCATGGTACACTGAGCTG 
AGTATAAGGAGAGTGGAGCAGGCCACTGCCAGCCCAGAAAAirrTGGTCAAAGTTGCCTGAAATC 

ttctcagccttcgattcacagctgctctctgctgctctggggccatgcagaccagttcagaaaaga 
gttaatttgttggggcagttggaggcaggtggactgccagctttgacaccttcccagcccacag^ 
tgctgcactggggctgaaggcgtggctaacccctgcacacctagagagtgacagagatgccaga 
tgggcagcaggaaggcaagaggattaagagagagcttctggctgaaagccacactcggttaacc 

AGGAAAAAGCCCTTGGCACGAGAAGACTCAGTGGGCCrGAGGGACTGAGCCTTGGITTGTTGGGG 

catctgctgcataanccatncatgtgtgacaatagaagtgtattccaacccacttgtgggaacatt 

GGGTGCCTGAAAGACCACATTGGGAGAAGGAACAAGTGAGNTGCTTGACAAAGGGCTTACCCITG 
ATCACTTTTGG 

SEQ ID NO: 3674 ACACTTGAAACCAAATTTCrAAAACTTGTTTITOT 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCT^ 

TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTTTTO 

TCTTCAGCAACrrGAGAGCTTTCTTCATGTTGTCAAGCAACAGAGCTGTATCT^ 

CATAGAGACGATTTOAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC 

ATAATCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTCAAGTTGTTTCAT AAAC CAGATT 

GAGGAGGACAAACTGCTCTGCCAATTrCTGGATIT(nTrATTTTCAGCi^ 

TGACTGTGTGGGCACTCATCCAAAGTGATGAATATCATCAAGGGTTTGTGCTTGTCTTGGATTATA 

TAAAACTCTTCATATGTCTGAATCCANATGAGTTNGGTCACCCCACCTCTC^ 

AATTTNGGGTCGANAAGTCCTITGGGNCCCTTTTTGGCTCCAGGTTrGACTGNGGGGAT^ 

SEQ ED NO: 3675 actccctgtcgtcaaagtgcttccctctggtaaatacacgggtgccaacttaa 
aatcagtcattcgagtcctgcggggtitgctagatcaaggaattccttctaaggagctggaga^^ 
ttcaagaattaaaacctttggatcagtgtctaattgggcaaactaaggaaaacagaaggaag 
agatataaaaatatacttccctatgatgctacaagagtgcctcttggagatgaaggtggctatatc 

AATGCCAGCTTCATTAAGATACCAGTTGGGAAAGAAGAGTTCGTTTACATTGCCTGCC/^ 

actgcctacaactgttggagacttctggcagatgatttgggagcaaaaatccacagtgatagcca 
tgatgactcaagaagtagaaggagaaaaaatcaaatgccagcgctattggcccaacatcctaggc 
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AAAACAACAATGGTCAGCAACAGACTTCGACTGGCTCTTGTGAGAATGCAGCAGCTGAAGGGCT^ 

TGTGGTGAGGGCAATGACCCTTGAAGATATTCAACCAGANANGTGCGCCATATTTCTTC^^ 

TTCACTGCCTGGCCAGACCATGATCACCriTCTCAACCCAAATGATCTGNTTACT^ 

TGANAC 

SEQ ID NO: 3676 aCTTCATGAAATTATAAACATGTITTTAAAGOT 

TTGATCAAACAAGGCCACCTGAGTGACATCTTCAATGATGGTAAGCCCACCATTTTAAAGGAATA 

ACATCrrTATTTAAAAGCCTAATTATTAATATAAAAAGGAAAAGAGTTTAT^ 

TAGTAAGATTGCAATGGGACAGCCCTTTGATGAAAAATCTAAGGAGGGTA AAGC CAATGTAACTG 

AATTAGAACAAGAGTTCCAATTTTGAGCTACCATCCACCAAATAATTTCCCTm 

ACACAGTGAAAATAAACAGTTATATTAAGAGCACITCAGTCCCACAAGGTAGGAriTAAG 

TGAATAGGTGTAAATGGCCCTGTAACAATATGATGCCTGCAAAAATACATTCAACTGAA^ 

TGTCTGTTCTTTAACTAGTAAGGAAACGGGAAGCTAAAGTGGTCCCACT^ 

ACATAACAAAAACCACTGGTTTATCCTTTCTGGAAAGACTACCAAAGCCAAAGA^ 

TATTATAAATTTAAAACTGCATACTTTTACTCATCTrTAC^ 

TACTCATG 

SEQ ID NO: 3677 ACGCGGGGCCTTTCTAACTCCGCTGCCGCCATGGCTCCTGTGAAAAAGOT 
GGTGAAGGGGGGCAAAAAAAGAAGCAAGTTCTGAAOTTCACTCnTGATTG 
AGATGGAATCATGGATGCTGCCAATTTTGAGCAGTTTTTGCAAGAAAGGAT^ 
AAGCTGGGAACCTTGGTGGAGGGGTGGTGACCATCGAAAGGAGCAAGAGCAAGATCACCGTGAC 
ATCCGAGGTGCCITTCTCCAAAAGGTATTTGAAATATCTCACCAAAAA^ 
ATCTACGTGACTGGTTGCGCGTAGTTGCTAACAGCAAAGAGAGTTACGAATTACGTTACTTC 
TTAACCAGGACGAAGAAGAGGAGGAAGACGAGGATTAAATITCATTTATCTGGAAAATrTO 
GAGTTCTTGAATAAAACTTGGGAACCAAAATGGTGGTTTATCCTrGTATCTCTG^ 
AACAGAAAATTGGAAATCATAGTCAAAGGGCirrCCCITN 
AACTTGACCTTCTTTTirmTCTGCTTTAAAAAm 
AAGGAGANGG 

SEQ ID NO: 3678 ACCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCTACACAGTCCCGTC 

ctotcctttctgctagcctggctaaatctgctcattatttcagaggggaaacct^ 

agtgataagggccctactacactggctttittagacttagagacagaaactttagca™ 

tagtggcttctagctctaaatgtttgccccgccatccctttccacagtatccrrcttccctot 

CTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATCTCCAGCCTATGAAACAGCTGGGTCm 

GGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTCTAGCCCCCTT 

CAGCTTCTACACCCTTCTGCCCTCTCTCCATTGCCTGCACCCCACCCCAGCCACTC^ 

GTTTTTCCTTTGGCCATGGGAAGGTTTACCAGTAGAATCCTTGCTAGGT^^ 

TCCTTTAATAAACCATTGTGT 

SEQ ID NO: 3679 ACAGCATCAAAGTGAAGTrrATACTTGACAGAAAGCAGAAAACAAAAGTTCC 
AGACATGATTCTGTCATACAGAGTTAATAATAAGATTGCTTTTTAGGACTAGATTGACTC 
CCCTGGAGAGCCCATGGGTCTCCTTTTGAGCAAACATAAACAAATCCAAACTCTGTCCTAGAG 
AATACTTAACAAATGGTGGTTGTCTAAATGTATATGTAGGACATATTTAACATATACT^ 
TGAAAGATAATGAAAAGCTATGGGAAAGATAACTTAGAAACAAAGAAGGCATGGATCCTCA^ 
CTGGGAACTTCAAATTCATTCCATCTGCTATATAAGAAACACTAAAATAAATAAAAA 
TACX}CGGGGGACATTTTCTCGGCCCTGCCAGCCCCCAGGAGGAAGGTGGGTCTGAATCTA^^ 
ATGACXlGAACTAGAGACAGCCATGGGCATGATCATAGACGTCmTCCCGATATTCGGGCAGCGA 

ggacagcacgccagaccctgaccaagggggagctcaagggtgctgatggagaaaggagctacca 

ggcitcckx;agagtggaaaagacaaggatgccgtggataaattgcto 

aatgggana 

SEQ ID NO- 3680 acctgtgaaccaagtgtttgggcaggatgagatgatcgacgtcatcggggtg 
accaagggcaaaggctacaaaggggtcaccagtcgttggcacaccaagaagctgccccgcaaga 
cccaccgaggcctgcgcaaggtggcctgtattggggcatggcatcctgctcgtgtagccttctctg 
tggcacgcgctgggcagaaaggctaccatcaccgcactgagatcaacaagaagatttataagatt 
ggccagggctaccttatcaaggacggcaagctgatcaagaacaatgcctccactgactatgacct 

ATCTGACAAGAGCATCAACCCTCTGGGTGGCmGTCCACTATGGTGAAGTGACCAATGACi 1 1 rr 

catgctgaaaggctgtgtggtgggaaccaagaagcgggtgctcaccctccgcaagtccttgctgg 

TGCAAAraAANCGGCGGGCTCTTGGAGAAGATTGACCTTAATTTCATTNAC^^ 

nggccatggcccgtttcaaccattggaggaanaagaaaagcattcantgggacccacttgaato^ 

AAGACCGAATTGGCAAATGGNAAGAAAGGANCTTNATTGCCANACCTGCCCT 
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ataaaagggnc 

seq e) no: 368 1 acccaaaccctccaatccccaacgcggtctcaagttcagactgggctccagc 
ttctgtccacagccacccccacatmctttttgtatrrgtctgc^ 
taagaatttctgcctgcagctccagtatcaaggatatggggcagcacagccacaacacagagct^ 
gtgctcctcacacgtcctcttggcaatgtcctcgttgataatctcaagcagctcaggaggtggggc 
gttatcagaaaacaaatcaagggcccgggacacgatgtcggatcttgtccgcccaccgtcataat 
ccacaggagactcgcctttctgaaatatcttgattgtaggaaatcctctaatcccgta^^ 
ccagaacctgattgactgtagcatccacagctgccagtttcactcttcctttcgtc^^ 
. ttctgaagctgcggcagccx:actctggctctaggtttttgcagtgtccacac^ 
ctcaaccatccaaacatcntcactgtccagaacattctttatcaaagctgtcgtct^ 

ATCACATCCirCTTACTTGAACTATCACrrCTGCCTTGTTITCCAGA^ 

SEQ ID NO: 3682 ACGCGGGGGTGATTGAGAGAGGGGTTAGAGGCGGGTCCCAGCGCTGCCGCA 
CCATGGCGGACCAGCTTTATCTGGAAAATATAGACGAGTTCGTCACGGACCAAAACAAGATCGTG 
ACATACAAATGGCTGAGCTATACACTAGGGGTTCATGTTAACCAGGCCAAACAGATGCTGTATGA 
TTATGTTGAAAGGAAACGAAAAGAAAATTCAGGAGCCCAACTGCATGTTACCTAOT 
GCAGTCTCATTCAGAATGGACATTCCTGCCACAAGGTTGCAGTAGTGAGAGAAGATAAATTGGAA 
GCAGTGAAGTCCAAGCTAGCTGTGACTGCCAGCATCCATGTGT 

SEQ ID NO: 3683 ACTATCTAGTGTCTAAAACATTATTCTCCAGAAAAATCAATCATTTTCTAG 
CTCTCCCTCAGTCCTTTATTGTCCATTCCAATACATTGAACACATrrCCTTTACCCT^ 
CTTCCAAAAGGAAGCACCCGITGAGTCCTTTTGAGGGTGATITGTOT^ 
AGGAATTTAATTAGGTCATATTTGGTGATGAGACTTATGGAGTGTGCCTCTCTCTCCCAA 
CTTAAAATGCAAGGACAAGCAATTAGAAGCCATCCTAAGGTGCTTACCTCACACGCCACCCATGA 
GGCTTGTGGCCACAGTGGCACTTGGGTGTGGCTCCTCTGTTATTTGTCCTCATGTG AGAA ^ 
TCATCTCCAAATCTTGCCATTTGTATACTTTTGGTGGAGACTTGGATGTCATATCT^ 
GTTTTCTTCCCTAGCTTATTTTGTGGCTTTTAi^ 

SEQ ID NO: 3684 ACNCGGGGTTNTroAAGCACTTTTrACCAACGGNCAG 

TANCGTGO^AAAGGCTTCNAGATGGNAGACCCATCTCTCTTGTGCTCCANACTTC^ 
NTTTTNATCAAAGAGGGGAAAACTCATGCCTTTCCTTGGTAAAAAATO 
TACGTNACTATACATCTGAGCrmATAAGCGCC(>IGNAGGAACANTAGANCITGGTO^ 
NCATCGNA 

SEQ ID NO: 3685 ACGCGGGGGGAAAACTCTGAGGACATGAATAGTCGCCAGGCTTGGCGGCTCT 
TTCTCTCCCAAGGCAGAGGAGATCGTTGGGTTTCAAGGCCCCGCGGGCATTTCTCGCCGGCCCTGC 
GGAGAGAGTTCTTCACTACCACAACCAAGGAGGGATATGATAGGCGGCCAGTGGATATAACTCCT 
TTAGAACAAAGGAAATTAACTTTTGATACCCATGCATTGGTTCAGGACT^ 
GACAAAACACAAGCAGAAACAATTGTATCAGCGTTAACTGCTTTATCAAATGTCAGCCTC^ 
TATCTATAAAGAGATGGTCACTCAAGCTCAACAGGAAATAACAGT 

SEQ ID NO: 3686 ACCNTNNGC^O^^AACANAAAAGGCG WGATNATAAACTACATCA^ 
AAGTGAACTGCACTATGCCCTTGTGGCCAGAAAATTGCATTTm^ 
TATTCTTTTTGGACTTCCAAAGCTTGATTTATAGTAACTACAAACATCACGAC^^ 
AGAAAAACTCTTAGAAAAAATAATTrCAAACATTAGTCACACTGTTGA^ 
ATAATTNGTTCTTCAATAAGAAATANAATTCTTCATAACAAATGCCCCAGAGT^^ 
TACCACAGGCCATTAAAAACCAAACTGTTAGCGAAATAGGCAAAATGTTTTTGTAGTOT 
CACCTGTAGCTCTATCCGCTATTGTOCCAATAGGCACCATGTTTATCTGAAT^ 
ANCATCTGACAGATACTAACACTAAGCTTACATACTGTATGAAAAAGCCrrAACACTG 
GCrrGCTATAGGGTGGCTNTNTTTTrTTANGANANNCT^ 
TTITAAANGGGGGGAGTAAANCCATTTANAACTGCCAAATCCTNTO 
CNAAG 

SEQ ID NO: 3687 ACGCGGGGGACGCTGAGGGGCCCGAGGAGACCGTGAGGCTCrGGCCTGCAG 
CTCGCGCCGCCATGGACGCTGCCGAGGTCGAATTCXrrCGCCGAGAAGGAGCTGGTTACCATTATC 
CCCAACTTCAGTCTGGACAAGATCTACCrCATCGGGGGGGACCTGGGGCCim 
CCCGTGGAAGTGCCCCTGTGGCTGGCGATTAACCTGAAACAAAGACAGAAATGTCGCCTGCTCCC 
TCCAGAGTGGATGGATGTAGAAAAGTTGGAGAAGATGAGGGATCATGAACGAAAGGAAGAAACT 
TTTACCCCAATGCCCAGCCCITACTACATGGAACrTACGAAGCTCCrGl^ 

AACATCCCGAAGGCAGACNAAATCCGGACCCTGGTCAAGGATATGTGGGACACTCGTATAGCCAA 
ACTCCGACGTCTGCTGACAGCTTTGTGAGACAGCAGGAGGCACATGCCAAGCTGGATAAACm 
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ACCTTGATGGGAGATCAACACCAAGOSrGGGACTTTrCC^ 
CCTTCGGNCCGNGAACCACNCTTAAGGGCGA 

SEQ ID NO: 3688 ACAACAGCTCCAAGACAACACITAATTCCAGCTATACGTGGCAAAAGATGTr 
ATGGCAGGGAATAGAGAGGTTTAAATACAGATGAAATAAAGGGTCACCATCTCCTCAGGCACAAG 
GAACAGCTTACTTTTTGCCAGATTTCTTAATTCCACCTGTGGCCA^ 
CGCTTTTAGCTCCTCGAGTTTCTTCTGCTCCTCT^^ 
CCATCrCCTTGGCCrGCTTCTTGGGCTGTITCAGTGGCTTCTTCTTGC^ 
GGCAGCCTGCCGCCCCCCGCGT 

SEQ ID NO: 3689 ACCTTCTTTCTGGCAGTGAATGGTCTGTATTCCTCTAGTGATGATGTGATCGA 
ATTAACTCCATCAAATTTCAACCGAGAAGTTATTCAGAGTGATAGTTTGTGGOT 
TGCTCCATGGTGTGGTCACTGTCAAAGATTAACACCAGAATGGAAGAAAGCAGCAACTGCATT^ 
AAGATGTTGTCAAAGTTGGTGCAGTTGATGCAGATAAGCATCATTCCCTAGGAGGTCAGTATGGT 
GTTCAGGGATITCCTACCATTAAGATTTTTGGATCCAACAAAAACAGACCAGAAGA^^ 
TGGCAAGAACTGGTGAAGCCATTGTAGATGCTGCGCTGAGTGCTCTGCGCCAGCTCGTGAAGGAT 
CGCCTCGGGGGACGAAGCGGAGGATACAGTTCTGGAAAACAAGGCAGAAGTGATAGTTCAAGTA 
AGAAGGATGTGATTGAGCTGACAGACNACAGCTTTGATAAAAATGTTCTGGACAGTGAANATGTT 
TGGATGGTTGAATTCTATGCTCCITGGGGGTGGACACTGCAAAAAACCCTAAAACCANANTO 
CTGCCNCANCITCAANAAGTAAAAAAACCOT^ACCAAAAGGGAAA^ 
GGATGCCTACAN 

SEQ ID NO: 3 690 ACAATrrGGTGTCATCCTTGAAGCATACTGCCGGGGAAGTGTGGGGCACATG 
AAAGTGCTTTCTAAGCAGGTTGAAGCACTCAATAAGTTAAAAACTT^ 

AATGCCGTGAAGTTAAACAGAGCCAAAGGGAAGGAGGCCATGCATACCTGTTTAAAACAGAGTG 

CTTACCGGGAAGCCCTCTCTGACCTGCAGTCACCCCTGAACCCATGTGTTATCCTCTCAGAACTCT 

ATGTTGAAAAGTGCAAATACATGGATTCCAAAATGAAGCCTTTGTGGCTGGTATACAATAACAAG 

GTATTTGGTGAGGATTCAGTTGGAGTGATTmAAAAATGGTGATGATTTA^ 

ACACTCCAJ^ATGTTGCGCTTGATGGATTTACTCTGGAAAGAAGCTGGTTT 

CCTTATGGCTGTTTAGCAACAGGAGATCGCTCTGGCCTCATTGAAAGTTTGTGAGCACCTCTGAAA 
CAATTGCTTGCATTCAGCTGAACAGTANCAATGGTGGGCITGCTTGCACAACOTCAAA 
GCCCTTCTGAACTGGCTTAAAAAATACAACTCTGGGGGATGACCTGGACCCGACCCATTGAGGAA 
TTTACA 

SEQ ID NO: 3691 ACTCATCTTCTGACTTAAAGCAGATTCCCATTAAATGGACAGCACCAGAAGC 
TCirAATTATGGGAGATATCATTCTGAGAGTGACGCACGGAGCmGGCATCCTCCTCT^ 
TTTCGGCTTAGGGGGTCTGTCCATAGCCTTGATGGAAAGTCTTCATrATCT^ 
CTTATGATGTTTCTTCTCTTCCTTTCCTCCTGAACTGTCCCGTTT 
ATCATGCCTGGACTCCTTTTTGCCACCGGAGGAGATTTrATCCATOT 
ACTGCCTTTTTCTTTGCCTGAAGArmGACTT^ 

GGTAGGCGGGCGACTCAAATCCCAGCCAGTGGACTTAGCCCCTGTTTGCTCCTCCGATAACT^ 
GTGACCTTGGGTTAATATTCACCAGCAGCCTCCCCCGTTGCCCCTCTGGATCCACTGCTTAAAT^^ 
GGACGAAGGACAGGGCCCTGTCTCCTCAACTTTCAGGCACCACNATGACCTGGGACAGTGAATCA 
CAATGCCGNTTTTGGCTCGNGGGGCATCCTCCTGCTTGGCAGGCCTTGTGCCT^ 

SEQ ID NO: 3 692 ACACTTGAAACCAAATTTCTAAAACATGTTTTTCTTAAAAATAGTTG 
CATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCT^ 
CTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTTT 
CITCAGCAACTTGAGAGCTTTCTTCATGTTGTCAAGCAACAGAGCTGT^^ 

ATAGAGACGATTTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAACAT 

AATCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTCAGTTGTTTCATAAACCAGAT^^ 

GGAGGACAAACrGCTCTGCCAATTTCTGGATTTCTrrATrTTCAGCAAACAC^ 

ACTGTGTGGGCACTCATCCAAAGTGATGAATAATCATCAAGGGGTTTGTTGCmTOT 

TATAGAGCTTCTTCATATGTATGAGTCCAGATGAGTTGGGTCACCCCAACCTNTGGANCCGCGTAC 

CTGGCCCGGGGCGGCCCGTTCGNAAAGGGCG 

SEQ ID NO: 3693 ACGCGGGGAATGAAGTGAGGCCAGTGGCCGGGTCTGCTGAGTCCGCAGCACT 
CAGACTACGTGCACCTCTGCAGCAGGTGCAGGCCCAGTTGTCACCCCTCCAGAACATCTCCCCATG 
GATTCTGGCGGTATTGACTCTTCAGATTCAGAGTCTGATATCCTGTTGGGCATrCTGGACAAOT 
GACCCAGTCATGTTCTTCAAATGCCCTTCCCCAGAGCCTGCCAGCCTGGAGGAGCTCCCAGAGGTC 
TACCCAGAAGGACCCAGTTCCTTACCAGCCTCCCmCTCTGTCAGTGGGGACGrc^ 
CTGGAAGCCATTAATGAACTAATTCGTTTTGACCACATATATACCAAGCCCCTAGT(nTA 
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CCCTCTGAGACAGAGAGCCAAGCTAATGTGGTAGTGAAAATCGAGGAAGCACCTCTCAGCCCCTC 
AAAGAATGATCACCCTGAATTCATTGTCTCAGTGAAGGAAGAACCTGTANAAGATGACCTCGTTC 
CGGAGCTTGGTATCTCAAAATCTGCTTTCATCCAGCCACTGCCCAAAAGCCATCTrCCTGCCTACT 
GGATGCTTACAAGTGACTGTGGATACCGGGGGGTTNCCTTTCCCCATTCAGNGACATTGTCCTT^ 
TGTTG 

SEQ ID NO: 3694 ACCATGATCCTGACCAATTTGACGGCGCTGCACAGGGACCCCACAGAGTGGG 
CCACCCCTGACACATTCAATCCGGACCATTTTCTGGAGAATGGACAGTTTAAGAAAAGGGAAGCC 
TTTATGCCTTTCTCAATAGGAAAGCGGGCATGCCTCGGAGAACAGTTGGCCAGGACTGAGCTGm 
ATTrrCTTCACTTCCCTTATGCAAAAATTTACCTTCAGGCCCCCA^ 

AAGTTTAGAATGGGTATCACCATTTCCCCAGTCAGTCACCGCCTCTGCGCTGTTCCTCAGGTGTAA 

TATTGTTAAGAAAGAAAGGGGCAAGGAAAGTAAGAAGACATGGCACCGTCTTCTGAAACCACTO 

GTGTCTGCTCAGATGTGTTGGGACAAAATGAAAGTGACTTTCAAGAAAGATCAGAGGAATTTGAC 

TCAGAGAAAACTAGATCCAAATCCCAGCTCTACTGTCTCGTCCGAATTAGTCTTGGGAAAATCATT 

TATATGCTAAATAATTTACCTTTTTATCTAGGAGATGAAAAGAGGAT/^^ 

AAAGTTCTTGTAAGAATCAAAAGAAATGGGTGAGCTITAAGTGGTTTGTAAACCATAA>^ 

NATTAAA 

SEQ ID NO: 3695 ACGCGGAGAAOAAGGCCAAGAAGCCTGCACTGGTGGCCAAGTCCTCCATCCT 
GCTGGATGTCAAGCCTTGGGATGATGAGACGGACATGGCTCAGCTGGAGGCCTGTGTGCGCTCTA 
TCCAGCTGGACGGGCTGGTCTGGGGGGCrrCCAAGCTGGTGCCCGTGGGCTACXjGTATCCGGAAG 
CTACAGATTCAGTGTGTGGTGGAGGACGACAAGGTGGGGACAGACTTGCTGGAGGAGGAGATCA 
CCAAGTTTGAGGAGCACCGTGCAGAGTGTCGATATCGCAGCrrrCAACAAGATCTGAANCCTGAG 
TGTGTGT 

SEQ ID NO: 3696 ACGCGGGGGTCAGACCCAGTCAGGACACAGCATGGACATGAGGGTCCCCGC 
TCAGCTCCTGGGGCTCCTGCTGCTCTGGCTCCCAGGTGCCAAATGTGACATTCAGATGACCCAGTC 
TCCTTCCACCCTGTCTGCATCTGTTGGAGACAGAGTCACCATCACTTGCCGGGCCAGTCAGAATAT 
TACCAGCTGGTTGGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAACCTCCTGATCTATAAGG 
CGTCTAGTCTAGAAAGTGGGGTCCCATCTAGATTCAGCGGCAGTGGATCTGGGACAGAATTCACT 
CTCACCATCAGCTGCCTGCAGCCTGATGATTTCGCAACTrATTACTGCCAACAATATCAC^ 
ACGTTCGGCCAGGGGACCAAGGTGGAAATCAAACGAACTGTCGCTGCACCATCTGTCTTCATCTTC 
CCGCCATCTGATGAGCAAGTTGAAATCTGGAACTGCCTCTGTTTGTGTGCCTGCTGAATAAC^ 
ATCCANANAGGCCAAAGTACCTTGGCCGGGACCACGCTTAAGGGCN 

SEQ ID NO: 3697 ACACITGAAACCAAATTTCTAAAACTTGTTTTTCTTAAAA^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCTCA^ 

TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTTTT^ 

TCTTCAGCAACTTGAGAGCTTTCTTCATGrrGTCAAQCAACAGAGCTGTATCTGCAGGT^^ 

CATAGAGACGGTrTGAATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC 

ATAATCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTCAGTTGTTTCATAAACCANATTG 

AGGAGGACAAACTGCTCTGCCAATTTCTGGATITCTTTATTTrCAGCAAACAC 

TGACTGTGTGGGCACTCATCCAAGTGATGAATCCCGCNGT 

SEQ ID NO: 3698 ACTTTTTTTrTTTTTTT^^ 

ATCTTTTTTATTTGAGCCTATGTGTGTTGTTGCTGTGANATGGGTCTCCTG^ 
GGGTTTTGACTCTTTATCCAATTTGCCAGTCTGTGCCTTTTAATO 

ATAAATATATGTAACCCATCACATAAACAAAACCATTGACAAAAACCACATAATTATCTCAATAG 

ATGCAGAAAAGGCCTTTGATGAAATTCAACACCCCTTCATGCTAAAACCTCTCAATA^ 

ATTGATGGAACGTATCTCAAAATAGTAAGAGCTATTTATGACAGACCCACAG(XAATATCATACT 

GAATGGGCAAAAGCTGGAAGCArrCCCTTTGAAAACTGACACAAGACAAGGACTCCCTNTNTC^ 

CACTCCTATTCAACATAATATTGGAAGTTCTGGCCGGGGCAATCAGGCAANAAAAAANAA^ 

GGGGTAGTCAAAATAGGAAGAGAGGAAGTCAAATTGTGTCTGTTTGCANATGACATGATTGGTTT 

TTTANAAAACrCCATNGNCTTAACCCACAATCTCCTTAAGCTGNTAAAGCAAOT 

CAA 

SEQ ID NO: 3699 ACTAATTGAACCGATGTTGACAGTTGTGGTTCTGAATTCACCj^AGTTCTC^ 

catcaggacggcagttctcittcagaaatctcctgtaatactccagaggttccacggt^ 

cagccgccatcttcccgcccgcgcgtctccccgcgtacagaaagttatagagattatattgtgatg 

ctggaacttggagtgagacacacatcatitggcatttgagttgaatggtaattcacagtaatgct^ 

ccgttgttcgggacttaaagacaotgacctgtttgggctgttgccacttaaaagto 

aaatgtccacagtgtcttcctctgaggaaactcgaatcctgaaatggaaattcntrgtggcagata 
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ACTGGCTTATGACACCTTGAAAAGTCAAGTGCTCATATAACACACCACACTGAACCCCTTTTC 

CAGCAATATGTTCACTATGrrACCAATTTGCAACTTGTGCTTCAATAGTGGAATCTAC^ 

TrAACACTGAGCTAAAGAAAAAAAAGCCGTGTGTTTTATGAATGACCTTATCTGT^ 

ATACCTTraAAAAAATAATGTCCTNANTTCAGGCGTTGGTGGGTGCGT^ 

TTT 

SEQ ID NO: 3700 ACGCGGGGGGCTTCCTCCTCCTGAGCAGTCAGCCCGCGCGCCGGCCGGCTCC 
GTTATGGCGACCCGCAGCCCTGGCGTCGTGATTAGTGATGATGAACCAGGTTATGACCTTGATTTA 
TTTTGCATACCTAATCATTATGCTGAGGATrrGGAAAGGGTGTTTATTCCTCATGGACTAA^^ 
ACAGGACTGAACGTCTTGCTCGAGATGTGATGAAGGAGATGGGAGGCCATCACATTGTAGCCCTC 
TGTGTGCTCAAGGGGGGCTATAAATTCTTTGCTGACCTGCTGGATTACATCAAAGCACTGAATAGA 
AATAGTGATAGATCCATTCCTATGACTGTAGATTTTATCAGACTGAAGAGCTATTGTAA^ 
GTCAACAGGGGACATAAAAGTAATTGGTGGAGATGATCTCTCAACTTTAACTGGAAAGAAATGTC 
TTGATTGTGGAANATATAATTGACACTGGCAAAACCAATGCANACTTTGCTTTCCTO 
GTATAATCCAAAGATGGTCAAGGTCNCAAAGCTTGCTGGTGAAAAAGGACCCCACGAANTGGTTG 
GATTTAAAGCCAAAATTTTGTTNGGATITGAAAATTCNAGAANAAGm 
CCCTTG 

SEQ ID NO: 370 1 ACGCGGGTCAGCATTCTTGCTCCnTGTGGCCCTCTCCTACACTCTGGCCAGAG 
ATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACTGCCCCAGAC 
CCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAGCTCTATATAAATC 
CAAGACAAGCAACAAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCACACAGTCAAGC^^ 
AAAGAAAGTGTITGCTGAAAATAAAGAAATCCAGAAATTGGCAGAGCAGTTTGTCCTCCTC/^^ 
TGGTTTATGAAACAACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCCCAGGATTATGm 
ITGACCCATCTCTGACAGTTAGAGCCGATATCACrGGAAGATATTCAAATCGTCTGTATGCTTA 
AACCTGCAGATACAGCTCTGTTGCTTGACAACATGAAGAAAGCTCTCAAGTTGCTGAAGACTGGA 
ATTGTAAAGAAAAAAAAATCTCCAANCCCTTCTGTCTGTCAGGCCTTGAGACTTGAAACCAG^ 
AAGTGTGANAAAGACTGGCTNAGTGTGGAANCATTAGTGAACACCACTGATTTAGGGTTATTGG^ 
TTAAATGTT 

SEQ ID NO: 3702 ACGCGGGGGCCAAACTTGACCGCGCGTTCTGCTGTAACGAGCGGGCTCGGAG 

gtcctcccgctgctgtcatggttgg1tcgctaaactgcatcgtcgctgtgtcccagaacatgggca 

tcggcaagaacggggacctgccctggccaccgctcaggaatgaattcagatatttccagagaatg 

accacaacctcttcagtagaaggtaaacagaatctggtgattatgggtaagaagacctggttctc 

cattcctgagaagaatcgaccmaaagggtagaattaatttagttctcagcagagaactcaagg 

aacctccacaaggagctcattttctttccagaagtctagatgacgccttaaaacttactgaac^ 

cagaattagcaaataaagtanacatggtctggatagttggtggcagrrctgtttataaggaacca 

tgaatcaccccaggccatcttaaactattttgtgnncaagggatcatgccanactitga^ 

acgtttttttccaaaaatttgatttgganaaaatataaaacm^ 

totcttgatgtccaggaagganaaaanggcattaagtoccttgcccgggccggccgtt^ 

ggcnaattcc 

seq ed no: 3703 acaaagcatttctgcttccaagagaaatatcattgctacaaaaaactggcac 
attattctgtgaaaaaagacatgagtttttagttgtgttctacagctagttcccgac 
acatatccacacgaaagtaaaaggcaggtaagacaaaaagggctgtagttttrrctgaaataact 
caagtctrcaaaatatagcttttatattctttgtaaagtgggattagcatatrgc^ 
gttctaacaaacaaaaattccagtctggataataattctatggtagacaaaagaattctcagcctc 
ttgggtttcctggaattctttgcttcatgctctcctcttccactatgcagctaaggtag 
ttaagatgtcaccccttggccatccccctttagaacgtatcttaatgtgaacataaattgttc^ 
tgatgcttaaaagcttacatataattttcattcttagaaaaacgccacam 
tttctgaatatcatgattggaaaaaacaaaacaaaaaatgaaccccaaatcaaagtgtgg^ 
cttatatganaaagaatttttcaaccagatgggtcattcaaaaaagtttc 

seq id no: 3704 acgcggggacatatccactcctgctctccctcctgcaggtgaccccagccatg 
aggaccatcgccatccttgctgccattctcctggtgggcctgcaggcccaggctgagtcactccag 
gaaagagctgatgaggctacaacccagaagcagtctggggaaggcaaccaggaccttgctatctc 
ctttgcaggaaatggactctctgctcttagaacctcaggttctcaggcaagagccacctgctat^^ 

CCGAACCGGCCGTTGTGCTACCCGTGAGTCCCTCTCCGGGGTGTGTGAAATCAGTGGCCGCCTCTA 

CAGACTCTGCTGTCGCTGAGCTTCCTAGATAGAAACCAAAGCAGTGCAAGATTCAGTTCAAGGTC 

CTGAAAAAAGAAAAACATTTTACTCTGTGT 
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SEQ ID NO: 3705 ACGCACCTGGGNTNAAAATGCAGGCGATTCCTGAGGACGCCATCCCTGAGGA 
GAGTGGCGATGAGGACGAAGACGACCCTGACAAGCGCATCTCGATCTGCTCCTCTGACAAACGAA 
TTGCCTGTGAGGAAGAGTTCTCCGATTCTGAAGAGGAGGGAGAGGGGGGCCGCAAGAACTCTTCC 
AACTTCAAAAAAGCCAAGAGAGTCAAAACAGAGGATGAAAAAQAGAAAGACCCAGAGGAGAAG 
AAAGAAGTCACCGAAGAGGAGAAAACCAAGGAGGAGAAGCCAGAAGCCAAAGGGGTCAAGGAG 
GAGGTCAAGTTGGCCTGAATGGACCTCTCCAGCTCTGGCnTCTGCTGAGTCCCTCACGm 
CAACCCCTCAGATTTTATATTTTCTATTTTTCTGTGTATTTATATAAAA^ 

CCCCAGGGACAGAAACCAAGGCCCCGAGCTCAGGGCAGCTGTGCTGGGTGAGCTCTTCCAGGAGC 
CACCTTGCCACCCATTCTTNCCCGTTCTTAACTTITGAACCATAAAGGGTGCCAGGTC 
AAAGGGATACTTTTATGCCAACCATAAACAAACTCCTGAAATTGCCNAAGTGCCCTGCTTAGTA^ 
TTTTGGAAA 

SEQ ID NO : 3706 acctgtattggggaaacatagcatacaagcaagaagcttacagcctcagtgg 

CGAAAATTTTTTCATGTCAGAGACCGAGAACTCTTGCAGTCGTTTATGTCATCCCTTCr^ 

CAGAAGATACCAAAAAGTTGCAATCAAAGATCTCTTCATCTTATTGATAAAGCCACTAAT^ 

AAAATGTCTGTCAATGTCAACCGCAGCGTGTCAGACCAG1TCTATCGCTCAAGATGCCCCGTCTGA 

ttgccaaggttgagggcaaaggcaatggaatcaagacagttatagtcaacatggttgacgttgca 

aaggcgcitaatcggcctccaacgtatcccaccaaatattttggttgtgagctgggagcacagacc 

cagtttgatgttaagaatgaccgttacattgtcaatggatctcatgaggcgaataagctgcaaga 

catgttggatggattcattaaaaaatttgttctctgtcctgaatgtgagaatcctgaaacagam 

gcatgtcaatccaaagaagcaaaacaataggtaattcttgtaaaagcctgtggctatcgaggcat 

tgcttgacacacatcattaaactctgcacattctrtctcaaaaacccacctgaanaatagtgacag 

tggtacc 

seq id no: 3707 acgcacctggggtccaaatgcaggcgaltccrgaggacgccatccctgagga 
gagtggcgatgaggacgaagacgaccctgacaagcgcatctcgatctgctcctctgacaaacgaa 
ttgcctgtgaggaagagttctccgattctgaagaggagggagaggggggccgcaagaactcttcc 
aacttcgaannacnnacaacattccantcatagatnggacctcattagaacagt^^ 
cttgcaataaccatttttgcgagctgtggatctagaggaaactctgccatcatggatcccaat^^ 
gtcagatctccatcatcatttaaagcagccaggtaattcaaaagttccagggctctcatcagagtt 
tcaggagctggtggatccataaaatcaaaatgtacatcatttccagagcaggcactggcagcgag 
atagggttggaggagaagtagcgccgggacttccggatggcaaacttctctgtgggtagagattt 
ccagcaatcttgagcttcaggcctggcacagctcgaaataattccacttcgtccgtccccgaacng 
gcttgtggtccttcttcccaaacatgctgaggtagccggcx;tttcattgtaaatgtanggtggcct 

TTTTAAAG 

SEQ ID NO: 3708 acgcggggggagccagggccggaagtagagcggaggtggtggcggcggagg 
cmggcagctcgggactgagtgcaagaatcagcatgattcrrcagaggctcttcaggttctcctc 
tgtcattcggtcagccgtctcagtccatttgcggaggaacattggtgttacagcagtggcatttaa 
taaggaacttgatcctatacagaaactctttgtggacaagattagagaatacaaatcta^ 
agacatctggaggacctgttgatgctagttcagagtatcagcaagagctggagagggagcttttt 
aagctcaagcaaatgtttggtaatgcagacatgaatacatttcccaccttcaaatttgaagatccc 
aaatttgaagtcatcgaaaaacrccaggcctgaagaaataaagtaaaattaatctggtaatm 
tcacggattaattgt 

seq id no: 3709 actttttcmgaagttttagcggtcaatttgccr^^ 

ttatactgtggctatgcaacagctctcacctacgcgagtcttactttgagttagtgccataacana 
ccactgtatgtrtacttctnaccatttgagttgcccatcttgttrcacactagtc^ 
aagtgcctttagttttaacagttcactttttacagtgctatt^ 
ctaaaatacgtaaaaaaaaaaataaacaannaaaanaggt 

seq id no: 37 1 0 accgaccatagagcaagaatcaagattctgctaactcctgcacagccccgtc 
ctcitccttrctgctagcctggctaaatctgctcattarrtcagaggggaa^ 
agtgataagggccctactacactggcttttttaggcttagagacagaaactttagcattggc^^ 
tagtggcttctagctctaaatgtttgccccnccatccntttccacagnatccttcttc^^^ 
ctgtctctggctgtctcgagcantctagaaagagtgcatctncancctatgaaacagctgggtct^ 
tggccataagaagtaaagat^^^gaagacagaaggaagaaactcaggagtaagcntctagccccc 
ttcagcttkracacccttctgccctttctccattgnctgcacccca^ 
cttgtttttcctrrgnccatgggganggtttanccantagatnccttgn 
tanattccttaaaaaaa^ina^r^gngnaccmccccgggncgggc^^^ 

CCCNCT 
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SEQ ED NO: 37 1 1 ACCCAAGGGGGTGAGGATGGCATTGTGGGCGATCAGCTCCCATCTTCCCTCT 
GGCTTGCCCCTGTCTITTTTCCCCTACAAATCCCCTGGTTCCrGGTGCC^^ 

CCTTCCTCAGGGCAGGTAGCTCCTCTTCTCAGTCCTGGCCTCCCATCCAAGCCCATGAGGGCCCCT 

TCCTTGCTGGAAAAACTCTGTCTGTGTTAGTCCTGGAGGAGGGAACANATAGrrCCCC^ 

GGGCCAAGCCTCCCTTGGGGAGCAACTCCAGCCCTTCrrGGTCTGTCCTATCTGCTCAGrc 

mGGGGCCTTCCACCTGACCAGCTGATGGCANGGGTa:TGTCACAANGGCCAGGGCTATGGGTT 

GTGTCCCCCACTTGTGTTGTTGTNTmGCANGGCATGTAGGGGGANCAANTTTAACCAATGGC^^ 

AAAGGGGCTGGTGTCTTACTAAN 

SEQ ID NO: 37 1 2 ACTTGCTGGTCTCAAATTTCCACAAGGAGATATCAATGGTGATACCACGTTCA 
CGCTCAGCTTTCAGTTrATCCAANACCCAGGCATACTTGAAGGAGCCCTTTCCCATCT^ 
TCCTTCTCAAATTTTTCAATGGTTCTTTTGTCNATGCCACCGC 

GTGGTGGACTTGCCCGAATCTACGTGTCCAATGACGACAATGTTGATATGAGTCTT^ 

ATTTTGGCTTTTAGGGGGTAGTNTTCACGACACCTGTGTTCTGGCGGCA 

AACAANACAAAAAAAANNGTACCCTNGGGCCGCTACCCACNCTTAGGGCG 

SEQ ID NO: 37 1 3 ACACAATGTGCTTCCTTGTTTGTATTATAACACATTTCAAATAGGGACCm 
ACAGGGCAGAAGCATGAAAGGACACCAACCACGAGTGACACACTATACTATGGCTGTCCTGTGTC 
ACATGCTATTTGGCCTGGGGATATAGCAAA.GCCTAATGTAACTAAACAACAAAACCCC^ 
TTCATGTCGTCAGGAAGCITACTTTAAAAGAATAGCTTGGCCTGCTGGTGGCTCAC 
CCAGCACTTTCNGAGGCCAAANGTGGATCACCTGAGGTCAGGANTTCGAGACCAGCCTGACCAAC 
ATGGGGAAAACCCCGTCTCTACTAAAAATACAAAAAAATNANCCAGGCCGTTGGTTGGC^^ 
CCTGCAATCCCAGCTCTTCCGGAGGCTGAGGCACCGAGAATCCGCTTNGAACCCAAGAGGCGGAG 
GTTGCANTGAGCCAAGATCACACCAACTACACTCCAACCTAKGCAACAGAACAAGACT 
AAAAAAAACAAAAACAAAAAGGAGGCACCAAACCCCAGGCTTNAAGTO^ 
AATTTTGGGCACAANANGGACTCANACAGGCACTGTGTGNGCACCNAGGTm 
TCAGACCTNAGGCTTAAA 

SEQ ID NO: 3714 ACATTTCACCCTGATCATAAAAGAGGGACAAGGGAGCACTGGGCTCTACTGG 
ATAGCCTTTCTTTTAGATAAGATGCTTITAAAAGTTAAACA 
CAGCAAGCAGCACACAATTCCAAGTCAGCTTGTAAAGCTTTTGTTATCT^ 
TGGATTTTGAACGAAATTGATGGAGTACCrTAAATTTGCATCTTCCCT 
ACAATTGAAATATTTTGTCTTAAATCACTTGGTTCAATACATGCTTATT^ 
ATCAAACTCTCTCTCTAAATTTAAAATGCrGTTGAATATGATACTTTTG 
NAACTTAGACGGGATTTGGTAGGCCAAGTATGCTAAGTGT 

SEQ ID NO: 37 1 5 ACGCGGGGACTGCGGGCTGGTCCGGGCTCCTCAGGTTCAGACCCGACCGTTA 
TCCAGTCGGTTCGTGGAGAGGAGAGGTGCACTTTACAGGTCCCCGATGAACCAAGAGAACCCTCC 
ACCATATCCAGGCCCTGGTCCAACGGCCCCATACCCACCTTATCCACCACAACCAATGGGTCCAG 
GACCTATGGGGGGACCCTACCCACCTCCTCAAGGGTACGTTAGTGTTCGCGATTTTAAAGGCAAA 
GTGCTAATTGATATTANAGAATATTGGATCGATCCTTGAAGGTGAAATGAANACCAGGAAGAAAA 
AGGTNTTTCTTTAAATCCAGAACAATGGAGCCAGCCTGAAGGAACAGATTCTGACA 
CANTAAGAAAACTGTAAAATTCGAGCCATTTAAATAAAACCCTGTACCmC^ 

SEQ ID NO: 3716 ACGCGGGGGATGCTGCGCCTCTCCGAACGCAACATGAAGGTGCTCCTTGCCG 
CCGCCCTCATCGCGGGGTCCGTCirCTTCCTGCrGCTGCCGGGACCTTCTGCGG^^ 
AGAAGGGGCTCAAAGTCACCGTCAAGGTGTATmGACCTACGAATTGGAGATGAAGATGTACGC 
CGGGTGATCTTTGGTCTCTTCGGAAAGACTGTTCCAAAAACAGTGGATAATTTTGTGG^ 
ACAGGAGAGAAAGGATTTGGCTCAAAAACAGCAAATTCCATCGTGTAATCAAAGGACTTCATGAT 
CCAAGCGGGAACTTCACCANGGGGAGATGGCACANGGAGGAAAGANCATCTACGGTGAGCCCCT 
CCCGATTAAAAACTTCAACTTGAAGCACTTACGGGCCTGGCTTGGGTAACCATGGCCCACCGCA^ 
GCAAANACACCAACCGGTTCCCAGTTCTTNATNACAACNrrCAAAGACANCCTGGCTNj^ 
AACNTTGTGGTGTITNGNAAAATTTCTAAAGGCCATTGAAGTGOTCCC^ 
AAANAAACACCCNGGAATAAAACCCM^GAANGATNTTNATCNTNCAAAN^ 
GNNGANAAACCCCTTTTCCT 

SEQ ID NO: 3717 ACCCTTGGAAGATGGGAAAGGTGAGGGAAATATTTGAAGCAGGGTCAGAAC 
ATCCACTAAGAACATAGCACCTCAGTAGAGCTTACATTATAGTGCCAGGGTAGAGTTATTACTGA 
ATAGCTTAGGATGATGAACATTAACOTCCTACAGGAGTAGTAGCAGCTGATTTGGTGACCATCAT 
TGGTCACCTTTTAGTGTAACTGCAAACTAAGAACAATTATGGCrrGACATATACTCCAT^^ 
AGTGATGGGAGAGGCAGCCTCTGTGTGGTCCATCOTGGAAGCACTGCATCGATTTTGCTCCCCTC 
TGGTTTTAAGGAGATCITITAGACCTTTCCTTGCTAACTGCTTTCAGT/^ 
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ATTCTGGATTCANACATCTCTTCTCACCCITCTTTITCATTGTAGCA^ 

CAAAAATTGGCTTGCAGGAATAATTTTCAAGTTTTNCTAAAGACC^ 

CTTAATGCTGATGCAGGTACCTCGGNCGCNACCACCCTAANGGNCG 

SEQ ID NO: 37 1 8 ACAGAAAGTAAAAATGCTGTTACAATCTCAGTGTAACTGGTAGCCACAGACG 
GTTGACTCACCAATCACAGCTATCCACGCTACAGCAAGAATCTTACACAAAAACCTAACAACGCT 
TACAATTTTGTTGGGGTGAAGTCCACAAGCTTGGTGGTAGATTACCAAGAGGGACTACAT(^ 
GAAGGCGGAATGTCACAGGAACTATTCTTrGGCTCTTTGGGTGGGTGGGTGTCCTGGGG 
CACAGCTACCTTTTTTriTmAAACAGCTACAAAATCCATG^^ 
GTTCrGTTTCTGCAGGTCATCTGGGGTCTTAATCATTCTTCTNC^ 

SEQ ID NO: 3719 ACTCCAAGCTGCAGGATTTCATAACTGGGGCAATTGTGGTTGGATTrCATCCC 
TTACTGGTCTCAACCAAAATATTCCTGTTGGAATCATGATGATAATCATAGCAGCACTm 
CATCAGCAGTCATCTCACTAGTTATGTTCAAAAAAGTACT^lTTTrmTT^^^ 
CNCGGAAACTGGGTTTCACCACCGTNGGCCAGGATGGTCTCGATCTCCTGACCTCNGANANCCCG 
CCCNCCTTGGCCTCCCAAAGNGCTGGNATTCAGGTGTGAGCCACTTGTGCCTGCCCANAT^ 
CCTTTNTTTCAANAAAOTGAATCCmTGAAAATAATTA 

SEQ ID NO: 3720 ACGACTTANATAAATATGATGAGGAAGGTGACCCAGATGCTGANACTCTTGG 
TGAATCTCTCTTGGGTCrrACGGTCTACGGGAGTAATGATCAAGATCCTTACGTTACTCTGAAAGA 
TACAGAACAATATGAACGTGAAGATTTCTTGATTAAGCCCAGTGATAATCTTATAGTTTGTGGCCG 
AGCTGAACAGGACCANTGCAATTNATAGGCGCATGTrTATAATCAAGAANAAGACTCT^^ 
TACATGGCATTTATAATGGGAGCCCITITATTGGGAGAGAGGGAAGAGAACACATC^ 
AGGTGAAATATGAAGAAGCTCACAGCACACCTTITGTTCCTCTACCGTGTAATCCATGCCGTC 
TGCTCCAGCCCTCACAAGAGGGGGACCCCATCCCCNCCACCAGGCANGGGAGCATTTAAAGCATT 
CTCCACNAACAAATTTmAATGCATANTTAGAACCTCTTCCCNTO 
CCTGCTNNGANTAGCTAGANCTGTAGCTITAANCTCACCrmrCAAAA^ 
ACACNTAGGTTAACCTTA 

SEQ ID NO: 3721 ACCACGAAGTCAATACTTCTCAGTGAGTAGGGAAGGCAAAATACTTCCTCAA 
TAGCAGGGAAAAAAAAACAAGAGAAAGACTGAATTTANAAAAAAAAGACAACTGTTAAAGA^ 
ATGirrAAGTCAACTTTTTTTTTTTTTTA^^ 

CATTCTTTAGTAGQTCCITAAGGCAATATArrNAAATTAATTTACy^ 

TTTTTAAACATCACCTCAAAATAAATGCTCAAAATAATGCCCTCACAAT^ 

ATTCNAGATGTTTAGATAAAAAATAACTGGTGGGCTTTATATTN 

ACTATANTTAAGGNTNTACCANGITNCCAGNAAAACCACCTGTTTCTTANGGAGm 

AGATNCNTrTGGCTTNACCNATTGGGGAGCNCACAAAAAAACAATTTNTATCAN^ 

TirmGTACCTCGCCNGGGCCGGCCTTT 

seq ed no: 3722 actgggataaatgaagaagaaggcataaggacaataaacatggaactccac 
tgcaaatggatttratgcagctgaggaaagtttgggcttattagtatttgc^^ 
agttttctccattgcggacaacgtaactaccagctccttggctcagtggttcgcctccact^ 
gttcccagtaggttctgtcairattgttggcacataggccctgaatacagg tgat atagggccccc 
atgagcgctcctccattgtgaaaccaaatataotatcattcattttctgggc^ 
aggaagacagaaccatttagcaccagtgacattggtgaaatatgtttcattgattctcacagagt 
aattgacggagatatatgattgtgaagncangagggtgtcacaagttataggctcatcaacggga 
gatgttgaaagttaccttgaac<;aaaaacgcaagaaagagctttgtta^ 
tccawcaggggcngtgtaacacctggggctgnaaccgtttgnattgntgatgctc^ 
tttccngtgagcnttgtatncangagcttgnnggcaantccancnatnta 

SEQ ID NO: 3723 accactgtcttggaaatacagctttgagacaccaaataagatgtcagaagcc 
atcaatgtcttggcagaaggtgacaggaagagagaagaatgaaacaatatttattaacagcg 
agcctcctcxtgtctrcaagaacacctcctcccntgccatctgi^^ 

TGGTGGGCCTGGAATTCCC<XjAGTAGATTCGTCCACACAAATGGCCCCCTAAACCCATAAAC^^ 

AATGGCAAAACTTCAAATGAAAACGAAAGGAAAAATACAGTTTCTATGTCATGT AAAAT^ 

GGGTTGGCTGGAGGAAGAATGGAGCTCANGGCCAAAGTTCAGAAGTTACTTCCTCTTT^ 

GGGTGCAGAGCTGTGTCTGGAACCACGGhrrAGAGCCACGGCCGAACTGTGCACANATCCTTCCTC 

TTOCGGCTNATACTTCACTTrGTrCTTNTTTCTTCCTC 

SEQ ID NO: 3724 ACTGAGACCTATTGGAGCTTGTGGCCAGCATCCCATCTGCACCGTTGGTCAGG 
TCACTGTCACGTATGGCTCCAGAGCCTTGGGGA(XAGACTrCTGGCCATTTTAAGGTClTCGGCAG 
AGCAACTCCCCAACTCCAGCCCGCAAGCCATGCCCATCCGCTTCAGCACTTCTATGTCATC^ 
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TTTCAAATGTGTTGTCAAAATTAAACANGATACTCAAATTTGCATTGGAGATGCTGCAAGTCCGAT 
GACCGTCTTCTTGCTGGTGTTGACGATGATGAAGGGGCANGTGGATGACTGAGTTGGGTGGCGGG 
TGGCCGGCTTGGTCTGCTGCTCCACATGCC 

SEQ ID NO: 3725 ACGCGGGGTTGGCGATGGATATGTGGTTCATCTGGCCCCTCCAAGTGAGGTC 
GCAGGAGCTGGTGCAGCCAGTGTCATGTCCGCCCTGACTGACAAGGCCATCGTGAAGAAGGAATT 
GCTGTATGATGTGGCCGGGAGTGACAAGTACATCAAATGCCCGGAGCACTTCANGGCTCTTG^^ 
CCAGTCTGAAATCTCAAGTCAACATCTCTCTCTCTCAAAATCTCTCTTCTACCCTC^^ 
CATAAGGGCCTTGCTCAGTGGGGCCCATTTTGGCCCCTATAATGGAAATCGGGCATCACCTGCTCC 
ACCATCAGGTCTATAATCAATTTCCTCCTGCCGTCTTCTATCCCGTGGCCGAAGTTGCCTGGGGA^ 
GTTCGNTCTCAGGACCCGCCTATATGTCTCAGCACATGAACTGATTTTCCAGCCNCC^ 
ACTCAGGATTGGCTAAATGCCATAATTATCAGCACTGGGAATATGAACAACCGCTGGAAm 
CATAGAGTGCmAATAAATCCCCAGrrCOTTCCNATATGCNTCAATGGCn^ 
GTGNATGGAAACTGCANGTTGCCTTTT^TT^mAGGGAANNAACNAAAAACG^ 
AANCCCATAATA 

SEQ ID NO; 3726 ACTTTTTAATGGAAACAACTTGACCAAAAATTTGTCACAGAAT^ 
ATTAAAAAAGTTAAATGATNAAAAAAAAAAAAAAAAAAAAAAAAANGTACTGCA/^^ 
GACTTCAAACTGCACAATTCAGATAGCAACACCTGGGAAAGGCAAGAAGTCAACACCAAAACCC 
ATCCCTATTCTAGCTGCTGGTTTTTGCTCANACAAAATGTCGTTGTTGC^^ 
TTCANCCTACTATTGAGCCGAGTGGCTTTAAACTCCAAAGAACCTCATATGTGTTTAGTAAN 
ATTTCAAACTGCTGGGCCCCCAAAGTANAAACAGOTATAACAAAGGTGAGGACACCAGTGATi^ 
TTCTGAAGCAAAATTCTGGTGCCTGGGATrCCTGGTCANCATGCAGCTATNAAGGCCGCTCC^ 
CAAACCGAGTNAATTAGAAAGC 

SEQ ID NO: 3727 ACGCGGGGATATTGGAGCAGCAAGAGGCTGGGAAGCCATCACTTACCITGCA 
CTGAGAAAGAAGGCAAAGGCCAGTATGCACAGCTTTCCTCCACrGCTGCTGCrGCTGTTCT^ 
GTGGTGTCTCACAGCTTCCCAGCGACTCTAGAAACACAAGAGCAAGATGTGGACTTAGTCCAGAA 
ATACCTGGAAAAATACTACAACCTGAATGATGGGAGGCAAGTTGAAAAGCGGAGAAATAGTGGC 
CCAGTGGTTGAAAAATTGAAGCAAATGCAGGAATTCTTTGGGCTGAAAGTGACTGGGAAACCAGA 
TGCTGAAACCCTGAAGGTGATGAAGCANCCCAGATGTGGAGTGCCTGATGTGGCTCAATTTGTCCT 
CACTGAGGGGAACCCTCCTGGGAGCAAACACATCTGACCTACANGATTGAAAATTACACGCCATA 
TTTGCCAAGACANATGTGGACCATGCCCATTGATAAAGCCTTCCAACTCTGGAGTAATGTCACACC 
TCTGACATTCACCAANGTCTNTGAGGGNCAAGCAAACATNATGATATCITTTGNCAGGGAGATCA 
TCGGGACAACTCTCCTTTTGATGGACCTGGAGGAAATCTrGCCrCATGCTTTNA^ 
GGTATTNGGAGGGGA 

SEQ ID NO: 3728 ACTrCCTGTTCTGGCTTCCCAGAAGTGACCTCAGCTTCATTCGGTCCrrCTGGT 
GCTGCTCCTCCTTTTGCCTXjn'CAGAGGmCAGTTAACCAATCTAGAGTC^ 
GTGGCTCCTCTGCANATTTACTTATATCCATATCITCTCTrCCACCC^ 
TCTTTATCAGTTTCAGGCTTTGCCAAAGACTTGTCTTCTGNTTTm 
GCGTCATAAACCTGTTCTCTCAACTOTCCCTTGCTTCCTAATCTATGTTATCATTA^ 
AGATTCATCTTCTGATNTTTCTCCTTCTTNCTCrnSICACATGCACAC 
CACCATTCTCCATTCTTGGCAACTCCAGAAGTGATTTCCCCATNAGAAAAAAGAAN^ 
NCmATTAGCCTGACTITCCATACTTTCriTACCTAAAAAG^ 

CAAGCTGCGTGGAAATATNCCCCATCACCAGATNGTNNCTGNCCTAAACCCAATAATTTTCT^ 
NTTA 

SEQ ID NO: 3729 ACGCGGGGGTGGTGGAGAAGGACGTGCCGTGCCGCTGGGTTCTGAGCCGGA 
GTGGTCGGTGGGTGGGATGGAGGCGACCTTGGAGCANCACTTGGAAGACACAATGAAGAATCCCT 
CCATTGTTGGAGTCCTGTGCACAGATTCACAAGGACTTAATCTGGGTTGCCGCGGGACCCTGTCA 
ATGAGCATGCTGGAGTGATATCTGTTCTAGCCCAGCAAGCAGCTAAGCTAACCTCTGACCCCACTG 
ATATTCCTGTGGTGTGTCTAGAATCAGATAATGGGAACATTATGATCCAGAAACACGATGGCATC 
ACNGTGGCAGTGCACAAAATGGCCTCTTGATGCTCATATCTGTTCTTCANCAGCCTGTCATAGGAA 
CTGGATCCTACCTATGTTAATTACCTTATAGAACTACTAAAAGTTNCAGTAGTTAGGCCATTCAm 
AATGTGCATTANGCACTTTrrCTGTrrATTTAAGAAATCAATTGC^ 
NATCAAAGATATTTATTAAAGAANANGATCArrGTTTTNAAACACCAGGTNCAAG^ 
ATATNNAATTTGCTGNATTCAATAAATNTGTTTTGAGTAAAAAAA/^^ 
CTTANGNNTCNAC 

SEQ ID NO: 3730 ACCACCTCAGGTGTTTGCTGTCTrTCTTCAGGTTCCrCTACTTOT 

TCCTCCTGAGGCTCAGTGACAAACCCACCAAAGACCTCATCITGGTATCTGAAGATATCATTGTGA 
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ACATAGAATTTATTTGCAACAGACCCCTCAGGAGCAAGGACAAACGTTTGCATGAATCTCCTCA^ 

AGCCTGGTTGTTGTTAGAGAGAAGCCCCATCACCTGGACTACCACACCATCATTTACCNGTGGC^^ 

GAGCATCAACATGGCGAATCrTGGTGTGGCAGTTGGTGAAAGrrTTGTGACATCAC^ 

ATTTCTTTCTGTCCGTAGACTGCATCTGCTGGCTTCCATTTGAATCCAATCCCCCATGGACATi^ 

AGAGTT<>JTTTCCATAAAATCTATGCANCATGTCTGGGGCCTGGTTCAACAGTGNGTAATAOT 

T 

SEQ ID NO: 373 1 GTCGGCCGAGGTACCAAGAAAAGGGTGTCCGTTGCTAGAGAAACTTGGTGTG 
ACTGTTCGATAAACTCCAAAGCTTGTTCCCATCATACCCGTAATGCTTTCm^ 
AATCTTTGAACTCATTCAACTCCAGGACATGGAAGAGGCCTCTCTCTGCCCTTTGACTGGA^ 
CAAGTTITATTCTGGTGTCTATGGGGAAGCITrGTTAAAT^^ 

TAATTCAACTGCTTCCCTACATANACTAGAGGGCTAAGGATTCTGTCTGCTGCTTrG^^ 
OTAGGCATTTAGATCArrCOTGTAGGCTTCCTATTTTAACm 
TAGTCrmATCACACTANGTGGTTGCCTTTNTTAGCANA/^^ 
NCCNGGCGGCCGTTCTA 

SEQ ID NO: 3732 ACACAATGGTTTATTAAAGGAATGTATGGCCCACATCAACCTAGCAAGGATT 
CTACTGGTAAACCTTCCTATGGCCAAAGGAAAAACAAGCAGGAGTTGAGTGGCTGGGGTGGGGTG 
CANGCAATGGAGAGAGGGCAGAAGGGTGTAGAAGCTGAANGGGTCTANAAGCTTACTCCTGAGT 
TTCTTCCTTCTGTOTCAAAATCTTTACTTCTTATGGCCA/^ 

ATNCACTCTTCTAACTGCrCGAGACAGCCAGAGACAGGGGAGGAGGGAAAGAANGATACTGNNG 

GAAAGGGATTGGCGGGGCAAACATTATAGCTAGNAANCCNCTACTGGGCCAATGCTAAAGTTTCT 

GCTCTTAAGCCTAAAAAAAGCCA^fNGTANTAAGGCCCTTATCACTCTTAGTm 

CCTTTGAAATATGAANCANATTTACCCNTGCTAGCACAAAATGAAGAAGGACGGGGCT^ 

AGTTAACNANAATCTTTATTTCTAGCTNTATGGNCXjGTCCT^^ 

CAANT 

seq id no: 3733 acatcatctggttctcccagacccmtaggactgaactgtgcit^ 
tacatgtctgcctgttacacagtggaaactagtitccntcagagggacacgaaaccacagatc^ 
gccacaaagaaagcattctcotccacaagcaactggctgagaaaacagaaccaggaaaa™ 
ccttgaggaaacaatgtcagcttcctgaatcatccanggagaaagggggagagaatagatm 

AATGCTTAAAAAATGGCCTTTCAGAGTGCTCTGTTCCAACTGTTTGTGTGT^ 

ataatacttgcct(^aacagaaacacctggcccccagccatcgtcaacacacarrgggcccc^ 
tccaaaacacaagangctgattcctctcctgccctgntgagaagcgcctgccaccagagngnatg 
gggagacagggaanggggaagttctttcctgagctnaanctctcnancttggaattgcctgttgc 
aacatggccagggaattgacmrtgacccctngaaggctttmcct^ 

SEQ ID NO: 3734 acatagacaagtttotgtaagacagaaaacagagaaatccacagtaactct 

AACACATCCCTTAAGGAATAAGCATGTATTTGTAGGAAGCAAACAAAGCITTCCATAGAGAAACC 

ACrrrCCCCGCGTACTAAACGAGCAGGTGAAGGAGGCTGAAGGATCGTCTGCTGAATACAAGAAA 

GAAATTGAGGAACTAAAGGAACTGCTACCCGAAATTAGAGAGAAGATANAAGATGCAAAGGAGT 

CTCAGCGTAGTGGGAATGTAGCTGAACTGGCTCTGAAAGCTCTCTGGTGGAGAGTTCTAC™ 

TTTCACTCCTGGTGGAGGAGGCTCTTCAGTCTCCATGATTGCCAGTTGAAAGCCAACAGAC 

TTCCTCATCAAATTGTGTGACTGATATNrrCCCACCTTGTCANAAAGAAGAGGAAACCAAGAGG 

GAGAGTCCCCGGAAAGATGATNCATANAAAGCCATACATGGACCNNGAGGTGAAACGGGANGCA 

NNGGGGGATGCCTGTCCCCA 

SEQ ID NO: 3735 ACTTITTTTTTTTTTTI^^ 

AATTTTGAGTAGTCAAAGTCANAGCAGTCAATCTGTGTTGTGAGCCGAGGCACANCTGCANAANC 

GTGTCATGAGGTGTCCGGTGGAGGTGGCAGCCNANCTCTGGGACTAATCACCGTGCTGGGGACGG 

CACCGCTTCAGGATGCAGGCNGATCCCTGCANAAGTGTCTAAAATTCACACTCC^^ 

GACNGTCGATGGTATTAGGNTANAAGCACCAGGGGACCCCACNNAAasrGTGTbTO 

ANCCCrrATTTACACACTGGGAGGGCNTNn^CNCCKTGAAAACA^ 

GCCNACTGGTACCTCTGGNCNTAACCNCNCTAATGGCGA 

SEQ ID NO: 3736 ACGCGGGATCCCAAGTCCAGCGTGAAGGGCCACAGCCCCTCITGGCTGCCAA 
GCACGCAGATCCCATGGACATTTGGGGAAAGGGCTCCTTGGGCTGCTGGTGAACTTC^^ 
CCACCTCCTGCTCCTGACCTCCCTGGGAGGGTGCTATCAGTTCTGTCCTGGCCCTTTCAGTm 
AGTTGGTTTCCAGCCCCCAGTGTCCTGACTTCTGTCTGCACATGAGGAGGGAGGCCCTGCCTGTGT 
GGGANGGGTGGTTACTGTGGGTGGAATATTGGAGGCCTTCAAOTGATTAAACAAGGCCCNCCAC 
ATCTTGGANGGCATNTGCCTTACTGATTAAAANTGTCAATGCANTAANAAAAAAATAA^ 
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SEQ ID NO: 3737 ACAGCGGGGGCrmGmGTGTCCCTGGCCATGGCGCTGCAGCTCTCCCGGG 
AGCAGGGAATCACCCTGCGCGGGAGCGCCGAAATCGTGGCCGAGTTCTTCTCATTCGGCATCAAC 
AGGATTTTATATCAGCGTGGCATATATCCATCTGAAACCTITACTCGAGTGCAGAAAT^^ 
ACCTTGCTTGTAACTACTGATCTTGAGCTCATAAAATACCTAAATAATGTGGTGGAACAACTGA 
AAITGGTTATACAAGTGTTNAGNTCANAAACCTGGTTGTANTTATCTCAAATAITGAAAGTAGGTG 
AGGTCXTTGAAAAGATTGCAA 

SEQ ID NO; 3738 AGCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCTGCACAGCCCCGTC 
CTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCATTATTTCAGAGGGG/^ 
AGTGATAAGGGCCCTACTACACTGGCITTTTTAGGCTTAGAGACAGAA^ 
TAGTGGCrrCTAGCTCTAAATGTTTGCCCCAGrcATCCCTTTCCACAGTATCCrrCTACCC^^ 

ctgtctntgcctgtctcgaccagnctataaagagtgcatctccagcctatgaaacagctggntctt 
tggccataagaagtaaagatttgaatgacatgatggaagaaactcaggagtnt4gcttctagaccc 
nttcaagcttntacacccttntgccctcirrcattggctgcnccnncancc^^^ 
ggtttgttttttnctttggccnntangaaaggttta(^^ 

SEQ ID NO: 3739 ACGTCAAGCAGGAGTGCAATCGCACCCACAACCGCGTGTGCGAATGCAAGG 

aagggcgctaccrrgagatagagttctgcttgaaacataggagttgccctcctggatttggagtgg 

tgcaagctggaaccccagagcgaaatacagtttgcaaaagatgtccagatgggttcttctcaaat 

gagacgtcatctaaagcaccctgtanaaaacacacaaatttgcagtgtctttggtctcctgctaac 

tcagaaaggaaatgcnacacacnacaacatatgttccggaaacangtgaatcaactnaaaaaat 

gtggaatagnatgttaccctgtgttgaggaggcattcttcangtttgctgttccrracaaagm 

gcctaacctggcttaattgtctatggtaganaaattntgcctggcaccaaagttaacgcagaana 

GTGTTANAGANGGATTTAAACGGGAACTCAGCrrCACAAGNAACAGTACTTTCC^ 

nttatngaaaacatcaaactaangacc 
seq id no: 3740 accatrggtggtggtatctttcaagcaatcaaaggttttcgcaattctccagt 

GGGAGTAAACCACAGACrACGAGGGAGTTTGACAGCTATTAAAACCAGGGCTCCACAGTTAGGAG 

GTAGCTTTGCAGTTTGGGGAGGGCTGTTrrCCATGATTGACTGTAGTATGGTTCAAGTCAGAGGAA 

AGGAAGATCCCTGGAACTCCATCACAAGTGGTGCCTTAACGGGAGCCATACTGGCAGCAAGAAAT 

GGACCAGTGGCCATGGTTGGGTCAGCCGCAATGGGTGGCATTCCTCCTAGCTTTAATTGAAGGAG 

CTGGTATCTTGTTGACAAGATTGCCCTCTGCACAA^^mCCCCAATGGTCCTCANTTTGCAA^ 

NCCCTCCCAGTTGCCTTCAACTCANTrACCITCCTCACCCTTTTGGAAACT^ 

AGGGACTTCTTTNCTAANGATTTCTrrTAACAAAACAAGTTGTGGGTTC^ 

SEQ ID NO: 3741 ACAATrCCTTGTTTTCAAGGGTAAGTTCX:AAGACTTCCAGACTCAGG 

GTCAGGTCCTGGTGAAGGTCGTCATGCTGTTGTGATAGCGirrAGGTCCTNCATTAGTCCTrCCAG 

GTGGGCCATCTCTTTGGTCAGCTCATCTGGTTCATANCTACTCTCNGAGTCTTNC^ 

CTCCTGCNCTTNAANGGGCACTGGNAACAACCACTGGCATAGNAGGCCCGGCTCCTIWCTA^ 

CCCNTNTGGATGCTTGTCTTANNCTAGANCTGAANTGTGNATGGANCNCAACCTTGCTGGG 

CTTATCGNGNNNTTCTTGNCTANNGCTAGATCATCAAGTAGGNGGT 

SEQ ID NO: 3742 ACTTTITrtTTTTriTm 

TAANACATTTANAACACCAATTTGTGAGGATAAATTCCATTCGTCAAAGCAAACACANGATCGCA 

GGTNNCCCTGGAGCTGAGGAATACCTTTGATTTTTGGTAAAAm 

CAATCTTGCNCTGCTCCAGTAATCNCATATITCTCTTTTTCTOGGTCAAAAATCT^ 

GGTCTGGGCTTCCGCAGCITCITCTTTITGAAGT/^ 

TGNTGANTCNATTITOGTTGAACNGGCAANTGNAAATTNNNG 

NATTGNGGCCCANAGGTCCCGrrCCNAAGG>nSfTAAGCCACTTNCCNGCTGCTT^ 

NCCCCTnTCCCCCNGTGGCG 

SEQ ID NO: 3743 ACTGTAmATTTCTTATTTTATACAACTCTTACTCTTTACAAAAATC^ 

GTCATGCCCATAAAAGTGTAACCCATGGCATTTGAAAGCAAACACTCTTTGTAATTATTTAAAAGG 
GCCTTTTTAAAAATTATAATACAAAGGTCTGTGCTCTATAAGCAGTCCTCCA^^ 
AATAAGGCTTACTTTTAATACAAAGACTAAGCGGTTTGGGAGTTATGAACCACTGGATTT/^^ 
ACATCATTTTATAATTGCTTTTTGCATCTTTmGT^^ 

ANCAGGCTTGGAGTTCTACAGCAGATTCTTCTTTCTCTGGACTCTGAATCGCm 
GACTTTCANGATCAGGTGAATCATCATCNGCCATCATCA 

SEQ ID NO: 3744 ACTGGAGATGTATTTGATAACCAAGGTTTTAGGTAAATTTTCArc^ 
TTCTArrTGCAAACTGAAAAATGTTGTAGGCTTAATGTAAAATA^ 
CTCTTAGAAGAAAGGCCATATTTTGCTCCTGCITCTGTAAAAATArrAm 
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AATGGTAGTGTGACCTTTCACTTAATTCCTACTCCCTTAATGTGAGAGAGACA^ 

AAGGAAAAATTCTGGAAGTTACACTCCACAACCmGAACATACCTGAC^ 

GACAACNATTTTCTCCATGCCACCCATGCTCTAAATGCCTTGTGGGATCACNGGACAACCCT 

GCCAAGCTACAAGCATCACNGATGTTNATCTTGCANCAAANCACTGCAAGANTAAATGACANW 

TNAACTGGTNCCTGGGGTTTTTGCCATCmWCAC^^ 

ATAATNANTNCTTCTGCCTCCAGNATTGTA 

SEQ ID NO: 3745 ACACGAGAAGCTCCGAGGATGGCTGAAGTCCAACGTCTCTGATGCGGTGGCT 
CANAGCACCCGTATCATTTATGGAGGCTCTGTGACTGGGGCAACCTGCAAGGAGCTGGCCAGCCA 
GCCTGATGTGGATGGCrrCCTTGTGGGTGGTGCTTCCCTCAAGCCCGAATTCGTGGACATCATCAA 
TGCCAAACAATGANCCCCATCCATCTTCCCTACCCTTCCTGCCAAGCCAGGGACTAANCAGCCCAN 
AANCCCAGTAACTGCCCTTTCCCTGCATATGCTTCTGATGGTGTCATCTGCTCCTTCCTGTGGCCTC 
ATCCAAACTGTATCTTCCCTITACTGTTATATCTTCACCCTGTAATGG 
CTTCTCCACrrACTATAATGGTTGGGAACTAAACGGTCACCAAAGGTGGCIT 
AAAATGGAAAGGCGTGGGTGGGATTTGCTCCCTGGGTTCCCTAGGCCCTATTGANGGGCANAAGA 
AAAAACCAKrCCTCTTCCTTTCTNAANCNGNNGAGGCCAA^ 
TGCTNCCCCTTTCCNATGGNTNCCCCmCCTTTTNrrC^^ 
GGAAANAAA 

SEQ ID NO: 3746 ACTTTGTTTGCTGATACAAGGTGAGCCAAAGGGGTGGTGAAAAGAACACACN 
AAAAAGGATCTTTGTCTAACCCACAAAGGTTGAGAAATTTGCACTGATACTTAANTTCT^ 
ANAANGGCTGCTAATCAAAGCTGGNGATCATTITTGCANGACANCCTTACTT^ 
TTATATTTAGTTAGTTGATAACCNAAAAAANTGTCCANNGAGTNAATACCCTACTG 
GGTTANAANCAANTCCCTGGGTCCTTGNGACCTAC 

SEQ ID NO: 3747 ACTlUlC M 'lU"riUlll'14'l' 1 14TTTGCTGTTGTCCCAGATITATTG 
CAGCACTACAGAAAAAATTCAAAAAGGTCCCCCGAGGCGTTTTGAAATTCATCCC^ 
CTGAGTGACCTGAANGTTGGACAAGAACTGGCCGAAAATCCNAAAAANCTTCAGCATT^ 
NNGTCAGGGATCTTACTTTNAATAATCTCCTGAATCCCAAGGOT 
TTGGCNTCTCCmCTTNTTNCTCCTCNTGCANCNTTGATGGGN^^ 
TTNTGAAITCCGTTNANTCAAAANTCCNTNACATNAACCT^ 
TNCTGGGGGAATATNGCGAATNTCCTTCNNNAACNCCCCTTGTTC^^WNT^^^ 
AGCCCCCTGTANTANCCTTGGGCCCGGAACCCCCCNTATGGGNGNAATTTCA 

SEQ ID NO: 3748 ACTGGCCAGGAAGGTGGAGTAGGTTTCAGGCCCTGGGGATTTCAAGTGCAGA 
CTGATGGCCTGGGAGGGGCCAAAGAGACCAGATCCTGGCAGCAGCTGAGGAGGTGCCCAAGGGC 
ACTTTCAG 

SEQ ID NO: 3749 ACGCGGGGGTGCCAGGCGATCTTCCTGGAGGGTCATTAGCAGCATTGAGCAG 
AAAACCATGGCTGATGGAAACGAAAAGAAATTGGAGAAAGTTAAAGCTTACCGGGAGAAGA 
AGAAGGAGCTGGAGACAGTTTGCAATGATGTCCTGTCTCTGCTTGACAAGTTCCT^ 
GCAOTGATTTCCAGTATGAGAGCAAGGTGTTTTACCTGAAAATGAAGGGTGATTACTACCGCT^ 
TAGCAGAGGTCGCTTCTGGGGAGAAGAAAAACAGTGTGGTCCGAAGCTTCTGAAGCTGCCTACAA 
GGAAGCCCTTTTGAAATCANCAANAGAGCAGATGCAACCCACNCATCCCATCCGGCTGGG^ 
CCCTCAACTTCTCCGTGTTCTACTATGANATCCAAGAATGCACCTGATCAAGCCTNCCTCTT^^ 
AAACAAGCCTTNATGATGCCATACTGANCTNGACACACTAACGAGGATTCCTATANGGACT^ 
GCTGATCTCCNNTTTGCTGNAANACAACCTCACCCTTTGGACGANCGACCAGNAGGA^ 
CAGGATAANGCAACTGANGATCCTTCANANTCCCTTGNCCTTTCrrmCC™ 
CCNTATCTTTCCTT 

SEQ ID NO: 3750 ACGCGGGGGCATCCTANCCGCCGACTCACACAAGGCANGTGGGTGAGGAAA 
TCCANAGTTGCCATGGAG^^AATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCX;CT^ 
TGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACT 
GCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCANACATATGAAGAAGCTC 
TATATAAATCCCANGACAAGCAACAAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCACAC 
CAGTNAANCITTAAANAAAGTGTTITGCTNGAAAAATA 
GTTTGTCCTCCmAATNCTGNmATGAAANAACTGANCANACA^^ 
ATGTCCCCAGGNATTATGTTTmrGACCCTTCTCTGACNAOTTTAGANCCNAT^^ 
ATTTTCAAACCGTCmiQTATTGCTTACCNAACCrGNAGATOCAAGC^ 
AAAAAAGCTNTCAACTTTGCTGAAGAATTAATTNGTAA 
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SEQ ID NO: 375 1 ACACTTGAAACCAAATTTCTAAAACTTGTTTTTCTTAAAA^ 

ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCTCAC^ 

TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCnTGGAGATTTTT^ 

CrrCAGCAACTTGAGAGCmcnTCATGTTGTCAAGCAACAGAGCTGTO^ 

ATAGAGACGGTTTGAATATCTTCCAGTGATATCGGCTCTNAACTGTCAGAGATGGGTCAACNAAA 

CATAATCCTGNGGACATACTGGCCATCAGGAGAAANGTGTTTGTCAGTTGTTTCATAAACCAANAT 

TGAGGANGACAANCCTGCTCTGCCAATTTTCTGGATTTCnTrATTT^ 

AAGCTTGACTGTNTGGNCACTCATCCAATNNGATGANTAANCATCAAGGGm 

GGATNTATATAAAGCTTCTCATATGTCNNGATTCCANATGAANmAGTCAGCCCNAACCm 

AGGG 

SEQ ED NO: 3752 ACTATAAGANTAAAGAAAATTGTGTTGTGGATAACATCAAAGTGTGCAGTAA 
TGACACTGGGAGTGQAAAArrCAAGTGTGmGCATCACTATGAGAGTGCCTCGGAACCCAACTA 
TCGGANATAAATTTGCCAGTCCNCATGGGCAGAAGGGCATTTTAAGCACATTGTGGCCGGCTGAG 
GAa^TGCa^^^T^TACTTGANAGTGGGATGGTTCCCGACmCTGT^ 
ACTNGAGTNACCATTGCNATGTAA 

SEQ ID NO: 3753 ACTCATTCCTCTTCAAGAGCACCACTGGAAACATCTTTGGAATTGATGGGACA 
AAACTTATTTGATTATAGTGAAGGATATTTCTCTACGAAAATGGCTTCAAAATAGQAAAAAA 
GCTACCACATCTTCCAGGAAATCCAATAACAGGTCAACTCTATTGGTAAACAAGCAAGTAAACAA 
GCAAGCAAACAAACCAGCATGTGACAGAAAACTGGCTAATATTCTGGGACGCTTTCCAGAGAAAC 
ATTAAGCCITTTAAGTGTGTATGAGCCACTTTAAGAAACTTCATAAGTGTTGCTAAATTC^^ 
GTCTGCTTCACTGAACTGTGACATTATAATATTCTTCTGGTGTCACGATTm 
GTCCAGGACCCAGATATTGGTTATATTCAACTTAGCAAAGAACCAATTATGGCTGGAAGGCCAAG 
GTTTCATCTTCANGGTGTCGAATGAATAACGAATGCTTTGCAATCCCGCGT 

SEQ ID NO: 3754 ACGCGGGGGAATCATCGAATGGAATGGAATGGAACAGTCAATGAACTCGAA 
TGGAATCATCATTGAATGGAATCGAATGGAATCATCGAGTGGAATCGAATGGAATTATGATCAAA 
TGGAATCGAATGTAATCATCATCAAATGGAATCAAAAATAATCATCATCAATTGGTATTGAATGG 
AATTGTCATCAAATGGAATTCAAAGGAATCATCATCAAATGGAACCGAATGGAATCCTCATTGAA 
TGGAAATGAAAGGGGTCATCATCTAATGGAATCGCAGGGAATCATCATCAAATGGAATCGAATGG 
AATCATCATCAAATGGAATCTAATGGAATCATTGAACAGAATTGAATGGAATCGTCATCGAATGA 
ATTGAATGCAATCATCGAATGGTCTCGAATGGAATCATCTTCTAATGGAAAGGAATGGAATCATC 
GCATAGAATCGAATGGAATTATCATCGAATGGAATCCGAATGGTATCAACACCAAAAAAAAAAA 
AAAAAAAAAAAGTACCTGCCCCGGCCGGCCGCTCNA 

SEQ ID NO: 3755 ACAGATGCAGCCGTGCAGGAGCCGAGCCAATTAATTTCATTAGAGGAAGAAA 

accagcgcaaggaatcctctagttttaagactgaagatggaaaaagtatttta^^ 

aaggctctacacatactgcatgctcaggacccatagatgaactattagacatgaaatctgaggaa 

ggtgcrrgcctgggaccagtggcagggaccx:cggaacctgaaggtgctgacaaagatgacctgct 

gctgttgagtgagatcttcaatgcttcctccttggaagagggcgagttcagcaaagagtgggccg 

ctgtgntggagacggccaantgaaggagccagtgcccactatggccctgggagagccagacccc 

ANGGCCCAGACAGGCTCANGTTTCCTTCCTTCGCANCTTTTAGACCA^ 

gcctcgctacaagaacctgctaaggctgcctcaaacctgacttgcctggttc^^ 
accttcgacccactctcaaattcctgatgctggngggaaaacccggantcta 

SEQ ID NO: 3756 acgctggcagggccagtggcaggaagggagggacaagtggacagtggtgtg 
tctgaattgtanagtgttagattccaggtcattcccatctgctcaattccatcagccgcagagttc 
aagctgtcatctaagcccagggaggtgtaaggaacaggccacatctggttcttggaagtatatgc 
ccaggcatttaaagcggatcmcctcritgggccatgggtggatttcctgctcaagcaacot 
tctcttaaagctgtgttgctggaatctggactttccattcangctggggccca 
ngctcctctacatgacatcgaaacgcttcctgcagctcatctgggacggantggccattcactgtc 
ccggcacatccaganccctggcatggactggctacrranagcanngaactctcccttctctgctcc 
aggcctgaagacrrgcagtctgctttaccatcatcctctttcttc^ 
gtcttgo^agttcantcctnaatccancmnmccctgggaacaatactgt^^ 
aacaatcantaaaaattttantgggcccnatncccgcnctttaacttct^ 

GNG 

SEQ ID NO: 3757 ACTTCTTGCCCTTGTAGAATTTCCTGAGGTTTTCTTTCTGAGTCT^ 

CTGTGAGAACACGGGCAATGGArTTCCGGACGACTCNGATCTTACAGAGCTTGNANGCCGCACCA 
CCTGTCAArrrGGCGACGCGCAGCTAGGGACAGCTCCACCTTCAGGTCCTCCAGCTGTTTCAGC^ 
CTCCTGNTTCTT>riTNCCTGCGAAGGATCTTNANCCTNGATOT 
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natacgccaasrtgccgg>n'a>innctanntaccgtgccangggcgct 

seq id no: 3758 actaaaaattttaaacgtcaccttggtgttttm 
cacgtaaggcatttccatgtcataaattaattctaccatatttitatgtaa^ 
gttgtgtctgcagctgtxk:tgtgattatgagacagatacaatgattcacactggctggaagct 

TCTCACGTCCCAGGGATCAGGGCAGTCCAGCCCGGTGCTGCCACAAAGGGTGGTGGTATCAGGTC 

acacctaggacctgccactttanactgaccttgtttgatccacggttgtgaccatgtatt acca c^ 

aatitaacaataaaaaatngititaagagtatctgagaaaggactaagcagtcattcca 

tccttcccttgttitaaaataagcaacgtcctatacttccatcam 

gttcaagtcttgggcccatgtcatccttccagaacaagccatcctangattttaaatgggtct^^ 

aaataaagtgagaagattgaccccaagtcccgcgtacctcggccgnnaccacnctaagggc 

SEQ ID NO: 3759 acgctttgccagtgtggctgggtccacctgcatctagagaaaaagaaaacac 
tatgagggtcagaccatgctgtctcctgtgtgcacaggtcttcacttgttgacgttaaaaagccag 
atggtaaacagtggttgagacaatggccccggtatctagaagacattcctggaaaggcacaggat 
itctgtgcaacattttcaattgtttttctttgacaato 

tcttaaggaaaatgcctgttgacacacagmgtaagtgtggatgccatctgaaaccactgttta 

ttgataatgaactctggcatttgcccaccaagaaattctgtcagtcacctatagtgtatatqgtga 

ggtotatacatcttaattcaattgaaacagagaaaccgagagaaataaaatttcctt^ 

aaagaagacgactttaccatgtgaacccccgtggcttcctcagagactgtctgcctggcacatgct 

cctccaggtcrrccangtcacctaggaaacagaagtcgttataagcacatgagctcnctgagntg 

aatccctcaggtggncttggagctgtngacccnggctggcnatgtgtngaaaaggtgtggtn 

ncacanaactcacttgctgnccggtccatcctccntngtngaaagg 

seq id no: 3760 catcctnctattgatattanctantgtcattaagccntancanacagtaangc 
tlsotatnctcttcmaanaaacanaattngtc^ 

aaaatacacattttccccctaccagatccatgatggctaccttagaaagaggactgatataatrgt 

atataaacttggctttgaactattncacaggtaaaccaatggaaaactgccngt^^ 

canagaaaacatgtangaattctcccacagaaatttaccgncctacgtnaanca^ 

accactgatgnatccaacaacctataacatgttggcactgactttatngcataacatttot^ 

gttgcttaacttntnctttcccaaaggaccctnatnantcgtnctgatac^ 

tgttttttcctttggngaaaccnnctcrmaatagaatgccm 

seq id no: 376 1 acttttttttttttttttt^^ 

gaaacttanaccanacctggcaatcaaggggtgaggtactggccaggaaggtggagtaggtttca 
ggccctggggatttcaagtgcaaactgatggcctgggaggggccaaananaccaaatcctggcan 
cagctgaggaggtgcccaagggcactttcaggcactggggccatcanctggttctcattgacctct 
ccatatncggtgactcattgtagtcattcatntngtccatgtnctgnatatcctcntcatnct^r^ 

GTCCTCTTNAATNTCCTCATGNGNTTCATCATCCmiTTT^ 

ggnttgccx:antaggaccatgttgt 

seq id no: 3 762 accctttggamcaaacagtaacatcggatgtaaacaaacttagttcct^ 
actcactgaaactaatcaagcggctctacgtagacaaatctctgaatcrritcta 
gctctaa^aagagaccctatgcaaaggaattggaaactgttgacttcaaagataaattggaagaa 
acgaaaggtcanatcaacaactcaattaaggatctcacagatggccactttgagaacatm 
tgacaacatgtgtgaacgaccagaccaaaatccritgtggttaatgcttgcctactttgt^ 
tggatgaagaaarrtnctgaatcanaaacaaaanaatgtcctttcagagtcaacaagacaaacac 
caaacctgtgcagatgatgaaacatggnggccacgttctgtattgggaaacattgacagttcaaa 
ttgtaagatcattagagcttcctttcaaam'aagcattctcagcatgttcattccta^ 
gatgtggnangatgantccacnggctgggagaatattggaaaaacacnncaacttc^ 
gncacantggactaatgcccagcaccatggcctatgcccanngtcaaactotcatttccaaat^ 

AGGTGGA 

SEQ ID NO: 3763 ACCrrCTGGGGCATACAACATGGCAGCAGGGCCTTGGGAAGAGGGGTNGGA 
NGACCGAGCAGCATTCTCTGTAGAGGAAGACAGGAAAGGANACCCTCTTGGCACACATTTATGGA 
GGGTTGTCCNTGAACAGAAGGGCAGGTGGGAGAGGTTCCCNTGNTACTTAANANAAGGCACCAGT 
TNGCATAGAGCNCAATGNACAGGATGATGATNNATAACAATCCACNGATAANGGACAAT^WTC 
CNCGTTNTTCCACCAGAATTTTCTNGCCTCCTTTT 

SEQ ID NO: 3764 ACCANATACTTGCATGTAGAAGGAGGTAATTTTCATGCCAGTTCACAGCAGT 
GGGGAGCCrriTrTArrCATCTCTTGGATGATGATGAATCAG^ 

ATGGCTACATCCATTATGGACAAACAGTCAAACTrGTGTGCTCAGTTACTGGCATGGCACTCCCAA 
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NATTGATAATTAGGAAAGTTGATAAGCAGACCCGCATTATTGGATGCAGATGATCCTGTGTCACA 

ACTCCATAAATGTGCATTTTACCTTAAGGATACANAAAGAATGTATTTGTGCCT^ 

AATAATTCAAATTTCANGCCACTCCATGTCCAAAAGAACCAAATAAAGAGATGATANATGATGGC 

CGCTTCCrGGACNATCATTTANCACAGATAANGCATNAGTATACATCOTATGAG 

CTGTCCTTGCCCCANTCNCTCCTGNGCCTGTGGTAAGAGAGCCTTTAGTTTNAATGGCGGGT^^ 

ACCGTANCATTGCTTGAACTTACAGGACANAATTrCACTNNTAATTT^ 

SEQ ID NO: 3765 ACTITAAAATCTGAGAAAGAATTCAAGAAAAAAAGATTACATGTCATGTATT 
TTTTCAAATAAAAAAGGAAAACTAAGTGAATAAATTTTAGGCATACTAAGTGTCCCT^ 
GAATGCTAAAAGGACTTTTGATGAAATCACAGCCTAAAAACCTCTGCATCAGAAACT/^ 
ATAAATTCTCTCTAAAGGTAACTGTGGAGAATATTAAGTGTAACATATTAACAAGAGCCTTC^ 
ATAATATACAGTAGACAAGAATGTTAATCCTACAATGCTGCCTTCTTGTACTGGTTGGACTGTGGG 
GCATGTGGCCGCTGCAGTTCCAGTGGTTATTTCTAAGTCTATGACAGGACAGGCTTGTTCTTGCT^ 
ANAACCTTCTCTGACAGACACGGTAACTAAATGTGAAAAACCAATAAGCTGGTGACTCATGAATA 
CACACGANGAAAAGCAGGAGGTNTATTTTATCTGCCTTTTCAACATTTCTT^ 
ATNGGGTCAQATGTCTTTGANNAAGTGTTAAANCTAAATTCACATGGGTAAGTGTAGGGC 
TTACAACCTACCATCNTAATGTGTOTAGTANACTTTGGGNAAAAGCGATTT^^ 
TTC 

SEQ ID NO: 3766 acgcgggggcggncggcgtgtttgaaagcgaggccaaagtgggtgggagcg 
cgtgctgttgggagttgcttggaggttggcggcgcggggctgaaggctancaaaccgaccgatca 
tgtcgcacaaacaaatttactattcggacaaatacgacgacgaggagtttgagtatngacatgtc 
atgctgcccaaggacatanccaanctggtccctaaaacccatctgatgtctgaatctgaatggag 
gaatcttggcngttcagcaaagtcaaaggatgggtccattatotgatccatgaaccaagaac 
acatnttgcttgttccggncacccactacccatagaancnatatganaatgaagcctggcangct 
c(ntitnagcctnaatnctttacacaatctgtccttaot 
ttantgttngccrrcttgtctctnactatnatatttaaaaaaatg^ 
gctngtnaactgct 

seq id no: 3767 acgcgggggctgactctcttttcggactcagcccgcctgcacccangtgaaa 
taaacagccatgttgctcacacaaagcctgtttggtggtctcrrcacatggacgc^ 

GGTGCCArGACTCGGATCGGGGGACCTCCCTTGGGAGATCAATCCTCCGTCCTCCTGCTCTTTGCT 
CCCTGAGAAAGATCCACCTACGACCTCAGGTCCTCAGACCGACGAGCCCAAGAAACATCTCACCA 
ATTTCAAATCTGGTAAGCAGCCTCTTTTTACTNTCrrCTCCAACTTCCC^ 
TTTCTCCTTTCAATCTTGGCACTACACTTCAATCTCTCCCTTCTCT^ 

TGGTAGAGACAAAAAGANACACTTTTTATCCCGTGGACCCAAAAACTCCGGCGCTGGTCACAGAC 
TGGNAAGGCANCCNTTCCTNGGTGTTTAATCATTGCAGGGGATGCCTCTCTGATTATACACCCACG 
TTTCAAGGGGTTGTNAGACCACGCAGGQACAACTGCCTTGGTCCnTNANCCTTN^ 
CCGCTITNrrrGGGGAAAGGNNCAAGTACCTTTGGCCGNGAANCACGC^ 

SEQ ID NO: 3768 ACGCGGGGGAAAATGGAGGTATGAATTTGGGGTAAGAGGAAGTGAGATCTC 
CGCTTGCAGGTCAGCCCCTGCCITGCAGGGCGGGCTGQCTrGACTCANGCCCTGT^^ 
GCCCAGCCCANCCCCACCCACAGATCCCCTGCTCCTGTTGTGrrCTGTTGTAAATCATTTGGCGAG 
ACTGTATTTTATTAACTGCTGCCTAACTTCCCTGTGTTCTATTTGAGAGGCGCCTGTCTG^^^ 
TTGTCTTGAAATTTCAA 

SEQ ID NO: 3769 ACACTCrcGTAATTTTAATAATATTTTAGGCAAGTCCTATGAC^ 

ACAAGTTTCTTCAACCCCACCACCACCCCACCATCTCTATG(7mGCTTGCCCAGACTCCOT^ 

CTTTCCTCTGCTATTACCGCTTATGGTTGAAAGTAAAAAGCATCTCCAGATTACAGCAAATTCAGT 

GGCCCTGCTCCATANCCTGGCCTCTGCTGAAGGAAGAAAAGGGGTGAAACATGAACAGCCATCAT 

GGGTTGTGATGAGGCACTACCAGAAGCCCAGCTTGTGTTTTANAAACATGACTCTNTGGACAC^^ 

CCANCAGATAACTCTTCACTGGGCTTGGAGAACAAGGAAGGACTATTTCACCAAACACCACTGTG 

TTTGAATTTCTAGAGGCTCCCCAGGCCAATNATTACATTTCTGCAANTGC^ 

TTNNNGCAGTATTCATTGTCCCAGTCCTTACTCACAATTGGGNAGGGGTTCTCAN^ 

TACACATCGGAGGCTGGGGTGAGAAGTGCCNCANGATAACCCCNTCANCAATTGAACATTCATTT 

AAAAAGGAGACNCAACCN^a^GGAACTTC^^^CTGNGATCNTC^TO 

GGCCT 

SEQ ID NO: 3770 ACGCGGGGAGGCCCCAGCCAGCTCAGGCTACACTATCCCAGGATCAGCATGG 
CCGTCCGCCAGTGGGTAATCGCCCTGGCCTTGACTGCCCTCCTTGTTGTGGATAGGGAAGTGCCAG 
TGGCAGCANGAAAGCTCCCrrTCTCAAGAATGCCCATCTGTGAACACATGGTATAGTCTCCAAOT 
GTTCCCAGATGTCCAACCTGGTCTGCGGCACTGATGGGCTCACATATACAGAATGAATGCCANCTC 
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TGCTTGGCCCGGATAAAAACCAACNAGGACATCCAAGATCATGAAAGATGGCAAATGCTGATCCC 
CAAGAGCCACCTCAAGCCATGAANTGTCAGCTGGAGAACANTGGTGGNGCATNGGANAGGATTN 
TGACATGAAATAAAAGATCCANCCCAACTTGA]SrrGAAI^rajGTCCAGTGTCT^ 
GTTTGNAGANGGGATAGNGANGTG 

SEQ ID NO: 377 1 ACCGGCTTCTTCTCTGGGGTAGGGGTAGCCTCGCCCGGAAGCAAGGCCTCTG 
GAAAACCGCGGCCCCTGAGTTGCAAACAAATGTCAGATCCCAGATATTAAGGCTAAGACATACTG 
CATrrGTAATACCAAAGAAAAACGTTCCTACCTCAAAACGTGAAACTTACACAGAGG 
AAAAGCAGATTGAAGAGTTCAACATAGGAAAGAGACATTTAGCCAACATGATGGGAGAAGATCC 
AGAAACirrCACTCAAGAAGATATTGACAGAGCTATTGCITACCm 

GAAACGAGCCAGGCCAGTTAATGAAGCATCCTGAACAGATTTTTCCAAGACAAAGAGCAATCCAG 

TGGGGAGAAGATGGCCGTCCATTTTCACTATCTCnCTATACTGCAAACAGCATACTAT^^ 

TGCATGGATGTATATGGAATGTTACTCAATTNAGAAAAACATCAAAGTCCCTTGC/^ 

TCTGCTCCCAGAAAAAACTGTAACCAGAGACATGATTGGCACCATATGGCTTGATTAAGGAGGAC 

TAGANAAAATGTTAGNGGAAAAACTGCAGATCTANANTATNTGCNTTNTTCGGGNTGCT 

GNTATTT 

SEQ ID NO: 3772 ACCTCATAGTCAATNTNATTNTTNGCCGCGTTTANATGATOTAANGCAN 
ATAATACATAATGCAGrrGATATTAAATATCTTGAGGAATGTCAATAGAACTACTTTCACTC^ 
GCATTAACTGATCACTTATAAATGTTCTGTTTATCCACTTI^ 
GACTTTANTGTTGTGAACTAOTCATCATTATCTGCTAmnvrr^ 
NCAAAAATAACTT>n^TAAAGGTCTTTCCTTANATATTAGNTT 

SEQ ID NO: 3773 ACGCGGGGCrnTGTCTGCGGGCACGCGCCGCTGCGGTGCTCAGGAACAGCC 
CATGGAAGAATCATATGAAGAGGTGGTGACTGAGGTCGTAAGCAGGAGTGGACATGTTTGGATTT 
CCAACAGCTACCCTGKn'GGACTGTCATGGAAGATATGCCCAGAATGTAGCGTTOTCAGTGAGTGA 
GTCANTTGATGGCTGGATGCGCTCANCTCACANATATTCANTGTCCCTGTTCTCAGCACTGGTCGG 
CAGTCTGANGAAGCAGANNGAGACCTTGCTTraATTGTGATNACTGAATNCCC^^ 
ACCNArrCTGANGCTACANGATCCTCAAGCTAGGATbrrACCAAAATTCTTTN 
NGGAAACNAAAATCCCCATAATTCNNAATACACTNCOTGACGNATTCNNCTG>^ 
NGGNCCNTTCACTGCCCGAANTTNTGGAAAANCCTTCTCNNAAATNAGGTO 
NNGNGAGTATTNCCACCTGNGCAAA 

SEQ ID NO: 3774 ACCAGGTGTTAGCTGTGACCTTCAATGACACAAGTGATCANArrATTTCTGGT 
GGAATAGACAATGATATCAAGGTCTGGGACXrrGCGCCAGAACAAGCTAACCTACACCATGAGAGG 
CCATGCNGATTCAGTGACTGGCCTGAGTTTAAGTTNTGAAGGCTCTTATCTTTTGTO 
GGACAATACAGTTCGGTGTCTGGGATGTCCGGCCAirrGCCCNCAAAGAGAGATGTGTAAAGATT 
TTTCAAGGAAATGTGCACNACTTTAAAAAGAACCTNCaJGATATGTTCTAG 
CATAANANCACCTTGGNTCAGCCCXjNCAAGTTNNGTTATGTGTGGGATNCCACAAGCAGGA 

SEQ ID NO: 3775 ACCAC(XrrGAGTTCCTGTCCAGGCCTATCAAGCCCTCCCCACCATACTTTGGC 
CTCCTCCTGGCCTCTGTGGGGCGGCTCTCACATTACCTCCAGAAAGGCTGCAGGCT^ 

gacacctatagtgacaggagtggaagcagctcccctgactctgaaatcaccgaactgaagtttcc 

atcaataaatcatgactgatcttgtagcggatgattcttcaagagacccttcaaacttgggtagag 

tttacagctctgactttacactcgggatttggagactttctttaa^ 

tnntantatgcngaaaggtatrrgngaaacmgtnacttgcatgtccc^^ 

ccgctgaccactctaagggncg 

seq id no: 3776 acgcggggcagcggctccagctaanaggacaggatgaggcccggcctctca 
tttctcctagcccttctgttcttccttggccaagctgcangggatttgggggatgtgggacct^ 

ATTCCCANCCCCGGCTTCAGCTCTTTCCCANGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTC 

NGTCGGGCTCCAGCTCCAGCCGCAGCTTAGGCAGCGAGAGGTTCTNGTGTCCCANTTGTm 

TNTCACNCGGCTCCGTGGATGACCCGANGGACCTGCCATTGCTCTTGTTTTCCTGCCAGACACCAN 

CTTTCCNNTTGGACAAGAGTGGGATCGCTTTGGAAATNCACANCTO 

NANAAAGAACTITTCCAANAGTGANGGAATATGNCasfCATTAArrAGCGGTGTATGA^ 

ACTGTTAACCNTNOSrraca^AAATTGACATTACTGGTANAAAGA^^ 

GGANTTTCGAGCOTGATCNANGGTANAAACTNAAGGAAATNGANAAAACT^ 

AGGAGANTTTGGGGGGAAGCCCNNNAAATTNNTGGACO^ 

TGANNCTN 

SEQ ID NO: 3777 GCGCGGGGCGCATGCGCGGGGGCCATATTATCAGCGGTTATTCGGTGAGCGG 
TGGTGGTTTATTCTTCCGTGQAGTTAAGGGCTCCGTGGACATCT^AGGTCT^ 
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GGAACTATATAAAGTTCAGAAAACATGTCTCGAANATATGACTCCAGGACCACTATATTTTCTCCA 
AAAGGTCGCTTATACCAAGTTGANTATGCCATGGAAGCTATTGGACATGCAGGCNCCTGTTTGGG 
AATNTTAGCAAATGATGGTGTTTTCTTGCAGCAAGAAACACCCNAClsrrCCANAANCCT^^ 
AANN>fNTTTTTTTC^^GAAAAAATTO 

SEQ ID NO: 3778 ACACCAACAGCAGGTCACACATTAACAGTTGTAACTAAGCACTGTGACAAAT 
TAGCCAGTTCTTCCCACATTAGTCCCTATTAAAACAAAAATGGGGGGAAGGGAGCAAAATAAGTT 
GCTACAAAATGGGCAATATAATTTTGCCACAArrGCCAAGGTTTAAAAAGCAAAGCAATTGTAGC 
TAAAGACAACTGAATAAAATGTTTGCAAGTGTTTTAAGGA^ 
TGCCTTTTAGTATAAGGTCAATACTGCCAATITCATCTGTGACAATAGCAC^ 
TGCCTCCATTATTGATCTGATCCTCCTCAATCCTCCTTTTGACAATCCT^^^ 
TCCACAATCCTTAATCCAAAGCATTCrrAATCCATCTCrTCAGATATGTCTACAGAAA 
ANAAGCNANGATAAACATAAGAACTTCCCCCAGGTAAGATAACCACAAAACACCCCCAACATCC 
CANTTCACANAAACCAGGGCAGGATAGACNTTTACACCATThrrCACh^ 
TTAACTAAATTTAAACATTCCTGTCTOTGACATTCNCTTTAAAATACC^ 
AA 

SEQ ID NO: 3779 ACACTTGAAACCAAATTTCTAAAACATGTTTTTCTTAA^ 
ACATTAAACCATAACCTAATCAGTGTGTTCACTATGCrmCACACTANCCAGTCT^ 
TCTGGTTTCAAGTCTCAAGGCCTGACAGACAGAAGGGCTTGGAGATTIT^^ 
TCTTCANCAACTNGAGAGCTTTCTTCATGTNGTCAAGCAACAGAGCTGTATNTGCAGGTO^ 
CATAGAGACGATTTGAATATCrrNCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAAC 
ATANTCCTGGGGACATACTGGCCATCAGGAGAAAGGTGTTTGTNAGOTGTTTCATAAArc^ 
GAGGAGGACAAACTGCTCTGCCAATTTCTGGATTTmTTATTT^ 

CTTGACTGNGTGGGCACTCATCCAAGTGATGAATTATCNTCNAGNGGTTGCTGCM^GTCTTGNATT 
TATATANANCTTOTTATATGTCTGATTCCANATGAANTTGGTCCCCCAACCTCTGNAGAGGGTCCT 
GGGC 

SEQ ID NO: 3780 GTACACGCITrTGGCCCCNACCAATGAGGCCrrCGANAAGATCCCTANTGAG 
ACTTTGAACCGTATCCTGGGCGACCCAGCAANCCCTGANAGACCTGCTGAACAACCACATCTTGA 
AGTCANCTATGTGTGCTGAANCCATCGTTGCGGNGCTGTCTGNANAGACCCTGGAGGGCACGACA 
CTGGAGGTGGGCTGCAGCGGGGACATGCTCATTNTCAACNGGAAGGCGANCATCTNCAATAAANA 
CATCTCTAGCCAIWAACGGGGGTGATCCACTACATTGATGGANCTACTCATNCGANACTCANCCA 
NAACACCTATTTAGAATTCGCTGANCANTOTGOTIWGCTNCACAA^ 

SEQ ID NO: 3781 ANTGANTCTCNGCAAANAAAATCTTGCANAGTCCTCCAAACCAACAGCTGGT 
GGCAANCAGATCACAAAAGGTCNAAGTTGCTCAGCGGAGCCCAGTNNATTCNGGCACCATCCTCC 
GAGAACCCACCACNAAATCCGTCCCAGTCAATAATCTTCCTGAGAGAAGTCCGACTGACAGCCCC 
AGANAGGGCCTGAGGGTCAANCAAGGCCGACTTGTCCCAGCCCCAAAGCTGGACTGGAGTCCAA 
GGGTAGTGANAACTGTAAGGTCATTGAAAGCACTTTNTGTGTCAATACCTCTGGCCGCM^ACCA 
GCTATAAGGCGAATTTCATCACCCNTGGCGNGNCTTACTAGTGGGATCCGANCTT 

SEQ ID NO: 3782 ACTCCAATATCTTTGTTACCTCCCCAAGCCCTGATAACCTCCATACTCTGT^ 

aatttgtatttaaaggatttgatgctctgcaatatcaggaacatctggattatgagatt^^ 

ctctaaatcctgaatttaacaaagcagtgatcagagtgaatgtatttcgagaacacaggcanact 

attcagtatatacatcctgcagatgctgtgaagctgggccaggctgaactagrrgtgattgatgaa 

•gctgccgccatccccctcccttggtgaagagcctacttggcccctaccttgttrrcatggcatc^^^ 

catcaatggctatgatggcactggccggtcactgtccctcaagctaattnatcagctcctgtcn^ 

agancgcccatagccanggtcaggcacccactgctgataataagancactacgacanncnnattg 

gcattcancncggacactgcattgaggtttccctccangattcantccganacacccctgtggatg 

cagtggaaaaatggntgaatga 

seq id no: 3783 acrgamaatcagtataaaatcgaaagagctttagatctgtaataaaaatc 

CAAATTTGGGGAAGGGCAAACTTTAAAACAGCAGCCAAGTAGAGAAGGGTTGGGGAAGGGAGAG 

GTGAGTGAACAGGCACATTGGAACTGTGGGGACATGTAACTGACCACTGTCAAACCAGTGTAAGT 

TTCTTGGTTTCTCATGGAAAACATTTTATACACCTATTTTTOT 

GTCCAAATACGTCTGTTCCATTTTTTTGTCTGATTITTGTTGGGATA^^ 

TTACTCCAGTTCCTGCTCCATGTGAAAAAAATTAGATGATTTCAAATACTTTOT^ 

AAGCTGCCAGTCACTCITCTGCATTTCATCTCTGAACCAGTCTTTGAGNGCATCAATGGGACTGNC 

GGCTGATCITACTAC 
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SEQ ID NO: 3784 ACTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACrGAA 
GTCCACCTGGGCATCTCCATTGGCGTCCAGGTCCITGAGCAATTTATCCACGGCATCOT 
CCACTCTGCAGGAAGCCTGGTANCTCCnTCTCCATCANCACCrTGAGCTCCCCCTTG 
TGOjTGCTGCCCTNGNTGCCCTAATATCGGGAAAANACGTNTATGNTCATGCCCATGGCTGTCT^ 

aktatcgovrtggtgctagattcagacccaccrtcctcctgggggctggncagggccn 
gtnccccgcgttacctcnggccg 

SEQ ID NO: 3785 actccctgtcgtcaaagtgcitccctctggtaaatacacgggtgccaactta^ 
aatcagtcattcgagtcctgcggggtttgctagatcaaggaattccttctaaggagctggag 
ttcaagaattaaaacctttggatcagtgtctaattgggcaaactaaggaaaacagaag 
agatataaaaatatacttccctatgatgctacaagagtgcctcttggagatgaaggtggctatatc 
aatgccagcttcattaagataccagttgggaaagaagagttcgtttacattgcctgccaaggacc 
actgcctacaactgttgganacttctggcagatgatttgggagcaaaaatccacagtgatagccc 
tgatgactcaagatgtngaagggagaaaaaatosraatgccancgctattggccccaacatnctan 
gcataaacaacnatggtcagnancagactncnnactggctcttgtganaatgcagtcaactgaag 
ggttttgn 

SEQ ID NO: 3786 accgaccatagagcaagaatcaagattctgctaactcctgcacagccccgtc 
ctcttcctttctgctagcctggctaaatctgctcattatttcagagg 
agtgataagggccctactacactggcttttttaggcttagagacagaaacm 

TAGTGGCITCTANCTCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTTCTTC^ 

ctgtctctggctgtgtcaagcaagtctagaanagtgcatgtccagcctatgaaacagct^ 

tggccataagaagtaaagatttgaagacagaaggaagaaactcaggagtaagcttctaaacccct 

tcangcttctacacccttctgccctctctccattgcctgcacccnaccccnanccactc/^^ 

gcttgtttttccctttaggccataggaangtttacccattnaaatcc 

catacanttnctttnantaaaccantngngtacotggnccgggcng 

TCCAACCACACNGGGCGGNCXnTTGCmrrnGGATTCW 

SEQ ID NO: 3787 ACTTTTTAAATCATGTTCCCCCTAAACATGGCTGTTAACCCACTGCATGCAGA 
AACTTGGATGTCACTGCCTGACATTCACTTCCANANAGGACCTATCCCAAATGTGGAATTGACCTG 
CCTATGCCAAGTCCCTGGAAAAGGAGCTTTANTI^GTGGGGCTCATNAA^ 
TCCAGCCTTAIGGGAAGTCCNGGCANANTTTTTGNAAGCCCITGCACANGNT^ 
CATTATANCGCTATGAGTTGAAATGTTTGTNAAATGGTGTOTACOTCTA^^ 
TmTGGGGCNCTGT 

SEQ ID NO: 3788 GNACTTGGCCANGCGCTCANATCGGCAAGGGGCACCAGNC1TGATCTGCCNA 
GTGCACAGCCNCACAACCAGGTCANCNATGAAGGTATCTTCANTNTCNCCCNAACGATO^ 
CATGACNCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGAANAGACTNGGTCACGGAGCCAA 
TCTGGTTGACTTTGAGCAGGANGCAGTTGCAGGACTTCTCGTTCACGGCCrrTGGCNATCCTC^ 
GGTTGGTCACTGTGAGATCATCCCNACTACNTGGATTCCTGCACTGGCTTNTGA^ 
TCTCCNCAGTCNATCCTGGTCAAAGGGATCTTCCATAAGACACCACTGGGGTAGTCCTTTGATGAA 
GGACITGGNACCTGCGNCCGCCACCCACTCTANTGGGC 

SEQ ID NO: 3789 ACGCGGGGAGCGGCTCCAGCTAAGAGGACAGGATGAGGCCCGGCCTCTCATT 
TCTCCTAGCCCTrCTGTTCTTCCTTGGCCAAGCTGCAGGGGATTTGGGGGATGTGGGACCTCC/^ 
TCCCAGCCCCGGCCTCAGCTCTTTCCCAGGTGTTGACTCCANCTCCAGCTTCAGC TCCAG CTCCAG 
GTCGGGCTCCAGCTCCAGCCGCAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTT 
CACCGGCTCCGTGGATGGCCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCANACACCACCTTTCC 
CNTGGACAGAGTGGAACCCTTGGAATTCACANCTCATGTTXnTTCTCACAAl^ 
NNTCCTAANTGAAGGGAATATGTCCAATTAATTTANTGTGTATTGAAAANA^ 
CTGGCCCGAATTGACNATCATGGGAGAAAGATCC^^mCTTNCNCr^GNATNTO 
TCAAGC 

SEQ ID NO: 3790 ACAGGGTTTTATCAGTCCACCTTCCCTCCAAGGAGATAAAATAAATGAAATC 
AGCATTTCTTCCAACTCCCTTQGCATAAATTATTACTTACCCTAAGCAAGATGACTGA 
AAGCTCTTTAATTATATGCTAAATGGATTAGTGGCTGAAGGGAAGTCCTTTTAGAGCAGTG 
GGATTATTCCACAAATACCTGTATGTTGAGGTGGGTACTGTACTCTCTGACTCCTTACCT 
GAATTTGTTACATAATCTTCTACATGTATGATTTGTGCCACTGATCTTAAACCTATG 
TTCTTACCATATAAAAACGATAATTGCTTTATTTGGAAAAGAATTTAGGAATACT 
TTTTTATAGACAAAGTAAAAAGACAGATATTTAAGAGGCATAACCAAAAAAGCAAAAC^ 
CAGAGTAAAAATCTTTAATATrrCTAAAGACATACTGTTTATCTGCTT 
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SEQ ID NO: 3791 GGTACTACCACTTTTATGCTAGTTGGCATCTCCCmCTATACA/^ 
TTAATCGACATTGACCTATTTCCAGAAATACAATTTTAGATATCATGCAAATTTCATGAC^ 
AGGCTGCTGCTACAATGTCCTAACTGAAAGATGATCAmGTAGTTGOTrAAAAT^ 
TITCCAAAATGGT CTCTA ACATTTCCTTACAGAACTACTTCTTACTTC^ 
AAAAAC]^^ 

TirmriTmGAGACGGAGTCTNACT^ 

CACTGCAACCTCCGCATCCGGGGGTTAAGCCATTCTCCTGCCTCAGCCTCCCAAGTAACTGGGATT 
ACAGGCATGTGTCACCATGCCCAGCrAATTTTrrTGNATTTTTAGT^ 

SEQ ID NO: 3792 GGTACGTrrCATGACCAAAATTTATCATCCTAATGTAGACAAGTTGGGAAGA 
ATATGTTTAGATATTTTGAAAGATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATC^ 
ATCCAGGCCTTGTTAAGTGCTCCCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTG 
GAAGACCAACGAAQCCCAAGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCATGAAT 
AATATTTAAATTGATACGATCATCAAGTGTGCATCACTTCTCCTGTTCTGCCAAGACTT 
TTGTTTGCATTTAATGGACACAGTCTTAGAAACATTACAGAATAAAAAAGCCCAG 
CCTTTGGTOATTAAATGCACATTAGCAAATCTATGTCTTGTCCTGATTCACT^ 
GCAGAGGCTAGAAGTATCATCTGGATTGTTGTGAAACGTTTAAAAGCAGTGGCCCCTC 

SEQ ID NO: 3793 GGTACTTAGAGCTAATTCGCATATATACAGGAAGGGCTCTTAGAATCAGTTT 
GTGGGCACAGAGCCTCAGGAGTAAATGAAGTTACTAGGGCTGTTCTTACCATCTCCTTCTGGCCAA 
ATAGCACAACATTTCCTCGTTCTGCTCTGACCTCTTAGCTTAGAAGGA^ 

CTAAGAAOGTTGTCCTTGCCTAATGCTCTGATCTGTAAGTGAATAGGGCAGAACAGTTCAGCCTTG 

AGGTTAGAATTTAGCAGGAGCTATCCTGACTTAATATCCAGTTGTGGGGTTTGCAAA^ 

GCTGTATGTAATCATTGCCACTAGTTCCATCTAGAACTCCTTTCTAGm 

ATACATAAAACCACCAAAATACATAGCTTCGACAAGATGGAAGTTTATTTCrCTCTCCCATAACAG 
TGCAGTGATAGTCAGCTGGTCCAGGCCAGGCAAGGGGCTGGTCCATGA 

SEQ ID NO: 3 794 GGTACAGCTGACTATCCAACATGATTCCTATGGAAACAGAAGGGGCAGAGTC 

ctggtttgctggcttattgagggcitggcagagaagctaaagctccaaagtgactacagattctct 

gcaaccggctttgacccatggaaacaggagccagattctcactctagagatagtgagggggccaa 

acctactcataccacatgcattagtcctggtcatcctccaggaccatgcgtatgatgggcaactca 

taccaggcaggggaagggagctgattagggaagaagggaccatttttcatctm 

tttatattttaacctcaaacatattatcagtgcctcagatataatttaatcttaagtcm 

cccctgaaacaaaagtatacrrtttatrraggcttcctcactttctggtagtc^ 

tccagttttaccctgacttagagtccacaaacttcatcagaccctctgtg 

seq id no: 3795 ggtacgcggggagtgtgaaatcitcagagaagaatttctctttagttc^ 
aagaaggtagagataaagacactttttcaaaaatggcaatggtatcagaa 
tggtitattgaaaatgaagagcaggaatatgttcaaactgtgaagtcatccaaaggtggtcccgg 
atcagcggtgagcccctatcctaccitcaatccatcctctgatgtcgctgccttgcataaggccat 
aatggrraaaggtgtggatgaagcaaccatcattgacattctaactaagcgaaacaatgcacagc 
gtcaacagatcaaagcagcatatctccaggaaacaggaaagcccctggatgaaacactgaagaa 
agcccttacaggtcaccttgaggaggttgttttagcrctgctaaaa^ 
tgatgaacttcgtgctgccatgaagggccttggaactgatgaagatctctaatt 

SEQ ID NO: 3796 acatccccagtcgtggccctctggacaagtggcgggccctgCactcatgagg 
gcttccaatgtgctgccccc ctcttaa tactcaccaataaattctacttcctgtccacc^^ 
aaaaaaaaaaaaaaaagtmcttttttttt^^ 

tggtccaaggcttgttaggatagttaaaaaagctgcctattggctggaggganaggcrraggc^ 

AAGCCCTATTACTTTGCAAGGGGCCCTTCAAAANTCGNTGGGCTCAAAAGGC^ 
GANAGTGAGCCTTTCGAANAGATACTTGCCCANCCCANCNTTCGGGCCACCCATNCTGTGGAGGT 
TGGTCAGGTGGTCACCCATmTCTTGATAAGCTTCACTTCCTTAAm 
GANNTCACAGANATGGGGGTTCGTGCNGGCAAAACCCAAGGCATGAAA 

SEQ ID NO: 3797 ACGCGGGGGGTCTTCGCTGGACACCATGAATCACACTGTCCAAACCTTCTTCT 
CTCCTGTCAACAGTCjGCCAGCCCCCCAACTATGAGATGCTCAAGGAGGAGCACGAGGTGGCTGTG 

ctgggggcgccccacaaccctgctcccccgacgtccaccgtgatccacatccgcagcgagacctc 

cgtgcccgaccatgtcgtctggtccctgttcaacaccctcttcatgaacccctgctgcctgggcttc 

atagcattcgcctactccgtgaagtctagggacaggaagatggttggcgacgtgaccggggccca 

ggcctatgcctccaccgccaagtgcctgaacatctgggccctgattctgggcatcctcatgaccat 

tctgctcatcgtcatcccagtgctgatcttccaggcctatggatagatcaggagggcatcactgag 
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GCCAGGAGCTCTGCCCATGACCTGTATCCCACGTACC 

SEQ ED NO: 3798 GGTACGCGGGGAGTGTTTGGGTTTCTTCGCGGCTGCTCAAGATGAACCGACT 
CTTCGGGAAAGCGAAACCCAAGGCTCCGCCGCCCAGCCTGACTGACTGCATTGGCACGGTGGACA 
GTAGAGCAGAATCCATTGACAAGAAAATTTCTCGATTGGATGCTGAGCTAGTGAAGTATAAGGAT 
CAGATCAAGAAGATGAGAGAGGGTCCTGCAAAGAATATGGTCAAGCAGAA^GCCTTGCGAGTTTT 
AAAGCAAAAGAGGATGTATGAGCAGCAGCGGGACAATCTTGCCCAACAGTCATTCAACATGGAA 
CAAGCCAATTATACCATCCAGTCTTTGAAGGACACCAAGACCACGGTTGATGCTATGAAACTGGG 
AGTAAAGGAAATGAAGAAGCATACAAGCAAGTGAAGATCGACCAGATTGAGGATTTACAAGACC 
AGCTANAGGATATGATGGAAGATGCAAATGAAATCCAAGAAGCACTTGAGTCGCAGTTATGGCAC 

SEQ ID NO: 3799 GGTACCAGAACACATCACTGAAGAAGAGCTCAAAACCCTTATGGAATGTGTT 
TCTAACACAGCAAAAAAAAAATATTTAAAATATTTATATACGAAGGAAAAAGT^ 
GGCAAATAAAAAAGGAAATGAAAGCAGCAGCAAGGGAAGAAGCAAAAAATATCAAGCTGCTAG 
AAACCACTGAGGAAGATAAACAGAAAAACTTTCTATTmACGACTTT^ 
ATAGCAATGGGCTGGAAGGGTGCCCAGGCCATGCAGTTGACAACCTTTGGTTTTTGACATG 
CGAAAATTATATGAAACGAAAAGAATTGCAGAATACTGTTTCCCAGCrm 
GGAACAGAAGAAATGTTGATCCTTTCCATATTTATTTCTGCAATCTAAAAATAGATGG^^ 
ACAGAGAGTTAGTTAAACGGTATCAAGAAAAATGGGACAAATTGCTTTTAACATCAACAGAA^ 

SEQ ID NO: 3800 GGTACACCACACTTACAGCCCTTGTCAGATATTATCTCCAGGTGTGTCAGAGC 
TCCGGAGGAATTCCAAAAAATATGGAAAAGCTGGTGAAGCTGTCTGGTTCTCATCTGACCCCCCTG 
TGTTATTCTTTCATTTCTTACGTACGCGGGGACAGACGAGATCTCGATCGAAGGCGAGATGGCGGA 
CGTGCTAGATCTTCACGAGGCTGGGGGCGAAGATTTCGCCATGGATGAGGATGGGGACGAGAGCA 
TTCACAAACTGAAAGAAAAAGCXjAAGAAACGGAAGGGTCGCGGCTTTGGCTCCGAA^^ 

ccgagcgcggatgcgtgaggattatgacagcgtggagcaggatggcgatgaacccggaccacaa 
cgctctgttgaaggctggattctctttgtaactggagtccatgaggaagccacccgaagaagaca 
tacacgacaaattcgcagaatatggggaaattaaaaacattcatctcaacctcgacaggc 

seq id no: 3801 accacgagcaaagttgacaaaggagatctcatcgaagcggatgtgcacagg 
tggcttgtggacgtagatgaagccccgcrccagcgggtagagcagtcctgagcttgccttgtagg 
aacaggtaatgcactgggcccctgagtgcccttggaagttgcctggcactgtgatcttgcggttta 
ccagtgctitcatgacccggctgaccatctcatagagggatcctgacatgttcttggtgagccgac 
cctcaaagcgcttctccacttcttcctcgttcatgttcagagtcaacgaaatgtcctcgtc^^ 
gaagaggaggatcaggaagtggtagcgagtttggccatgcttgattgggggatccaggctgatca 
caaagaacatctggcgctggtccttgtggggtaacaaaaacagacgcagtacctcggccgngac^ 

ACGC 

SEQ ID NO: 3802 ACGCGGGGCTCTTCCTCGGCGCTGCCTACGGAGGTGGCAGCCATCTCCTTCTC 
GGCATCATGGCCGCCCTCAGACCCCTTGTGAAGCCCAAGATCGTCAAAAAGAGAACCAAGAAGTT 
CATCCGGCACCAGTCAGACCGATATGTCAAAATTAAGCGTAACTGGCGGAAACCCAGAGGCATTG 
ACAACAGGGTTNCGTAGAAGATTCAAGGGCCAGATCTTGATGCCCAACATTGGTTATGGAAGCAA 
CAAAAAAACAAAGCACATGCTGCCCAGTGGCTTCCGGAAGTTCCTGGTCCACAACGTCAAGGAGC 
TGGAAGTGCTGCTGATGTGCAACAAATCTTACTGTGCCGAGATCGCTCACAATGTTTCCT^ 
ACCGCAAAGCCATNGTGGAAAGAGCTGCCCAACTGGCCATCAGAGTCACCAACCCCAATGCCAGG 
CTGCGCAGTGAANAAAATGAGTAGGCAGCTCATGTGCACGriTICTGm 

SEQ ID NO: 3803 GGTACGCGGGGGCTCTCTGCTCCTCCTGTTCGACAGTCAGCCGCATCTTCm 

tgcgtcgccagccgagccacatcgctcagacaccatggggaaggtgaaggtcggagtcaacggat 

ttggtcgtattgggcgcctggtcaccagggctgcntttaactctggtaaagtggatattgt^^ 

tcaatga(xccttcatrgacctcaactacatggtttacatgttccaatatgattccacccatggca 

aattccatggcaccgtcaaggctgagaacgggaagcitgtcatcaatggaaatcccatcaccatc 

ttccaggagcgagatccctccaaaatcaagtggggcgatgctggcgctgagtacatctctctccct 

cctcccccacatgcacaaggctcacatctcattatggtgcggcccatgtacccacaaggacattca 

ttagtgtttatggacatgaacatgtggaagcctcctgaataattactttttt 

seq id no: 3 804 acgcgggatgacagggaacaggttaagaaaactatgtaaatgtaggaaaga 
tgggaggaaataaagccctgttctggatcccccatcccx:tccagaataagagcatgttctgcatgt 
attaatcrmatgctgtttatgaaacaggcaagataagtctgtttttcot 

GTAACCAGATTTTCATCTACAGACAAGTGGTAGTCATTTGTGTTTATCATGCAACT^ 

caccaagatattaattgctgcaacttgatgtcaaatcacattactgggtaattm 
actagacaacgtgtcttgcaaaggatccgtatcaaccaaatctgatacaaccaagtcaggatcta 
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ggatctcatgggtcccattagtggacagcctgnaggaacgtctcctcgataagtcmgtcagtgt 
tttccgagtagttttttgactmcctgggatgaatt^ 

seq id no: 3805 accaagctgrrggtgaactgccccaggccaacagggacacattagctttcct 
catgattcacttgcagagagtggctcagagtccacatactaaaatggatgttgccaatctggctaa 
agtctttggccctacaatagtggcccatgctgtgcccaatccagacccagtgacaatgttacagga 
catcaagcgtcaacccaaggtggttgagcgcctgctttccttgcctctggagtattggagtcagt^ 
catgatggtggagcaagagaacattgaccccctacatgtcattgaaaactcaaatgcctt^ 
accacagacaccagatattaaagtgagtttactgggacctgtgaccactcctgaacatcagcttc 
caagactccttcatctagttccctgtcactgagagtccgrrccaccctcaccaagaacactcctag 
atttgggagcaaaagcaagtctgccactaacctaggacgacaaggcaact 

seq id no: 3806 accctatgaacctgactctgtggtcatggcagaagctcctcctggggtagag 
acagatcttattgatgttggattcacagatgatgtgaagaaaggaggccctggaagaggagggag 
tggtggcttcacagcaccagttggtggacctgatggaacggtgccaatgcccatgcccatgccca 
tgcctatgccatctgcaaatacgcctttctcatatccactgccaaagggaccatcagatttcaatg 
gactgccaatggggactratcaggcctttcccaatattcatccacctcagataccagca^ 
catcgtatgaatctattgttggtcctggacccaagccagaagcctctgcaaagcttccttc^^ 
ctgcagataactatgacaactttgtcctacx:agagttgccatctgtgccagacacactaccaactg 
catcttgctggtgccagcaccrcacatctgaagacattgactttgatgatctt 

seq id no: 3807 acaacacaaaactaattgaaaattctctccacctrrcccaaatcaaccactg 
atatgagagtgttatgcatttacaacgtgcmctacgtgaagggttgatttttta>^ 
agatgactttgaaaacaagtgtacttttttttttttt1"1u"11 
tattagccctitgtcagatgagtaggttgtgaaaatttrcrrcccatt^ 
ctgatggtagtttcatttgcagtgcagcagctntttagttgaattanatcccam 
cttttgttgccattgcttttggtgrmanacgtgaagtc 
atcccgcgtacc 

seq id no: 3 808 ggtacttgcttagggtggccaacaggctctttgggg aaaagtcttgtgat^ 
ctctcatcttttagagattcctgccaaaaattctaccaagcagagatggaggagcttg 
agcgccgtagagaagtccagaaaacacataaacacctgggtagctgaaaagacagaaggtaaaa 
ttgcggagttgctctctccgggctcagtgggatccattgacaaggctggttctggtgaatgctgtc 
tatttcagaggaaactgggatgaacagtttgacaaggagaacaccgaggagagactgtttaaagt 
cagcaagaatgaggagaaacctgtgcaaatgatgtttaagcaatctactitraagaagacc^^ 
taggagaaatatitacccaaatcttggtgcttcatatgttggcaaggaactga^ 
gcttccggacgagaccactgacttgagaacggtggagaaagaactccttacgaga 

seq id no: 3809 acaagtatttatatcaatgaaaatttccattggtgatttm 

ggtcttgactctgtggaataaatgacgacgtaaacgtagctgcacaggggtgttcccgtataatg 

cttgaatcaattgtgtgtgaaagcatcatgcaaatggctaattaaattgggtgatgact^ 

ttataaatccttcattccagctccacgagcagatccccttctccaactgtgtctccagot 

gcacagatttcaccgtgccagttttcccagctgtcatactattctgcattttcatggot 

acaaatttcnnrgaccttctgctaccgcgtctccaggcrrgacagagacggccacccca 

atcggggaacgcagaacactgcitgtgtcctcagtcactttttccagcataa^ 

gcggcaagtctgggttaagatattcaccttgtacctcgcccgcga 

seq id no: 3810 acgcggggttttgaccaatccagtgtgcacagaggtaggagaaaaaattgcc 
cttagccgaagagttgaaaaacactggcgtttaattggttggggtcagataagaagaggagtgac 
aatcaago^aacagtagatgatgactgaagaataccagttaaataacacattcggatggatttgg 
aagttggaattcctcttaacaaccaaggggtrrattttcaaagcaatattc 
agttcgttaccrtagtaggtaacggtaaggttattctttt^^ 
ttagggactaaaattaatataaaaattggcataatgttggattgaatctacattttgg 
aaacattcccacataatgtcaaaattatacatcatgcagttctgttttm 
ttgtttttgagtctggctctgtcgcccaggctggagtgcag 

seq id no: 3811 ggtacacaagctttgaggaagtgcaaaggactgacctctaggccagaacaa 
gatggaaaactaccaggcccatcaggcctataacccagacaccagcatggacaaaactcagttat 
actgaattcagagacaaaattcagtgacactcttctaccacttatttagggttctacagcam 
ctgagcagacttagttttttgtttttgttttacaaacc^^ 

aaactaggacracgatgttaagacaaccactagcagacagctgcggacagttactgggtctgaag 
gtgaggcttcccaactcaaacagagaagtcattgggataaactctgcctctttcatctttc^^ 
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CCATATCCTTAAAACAGGACAAGATGGGATGACCACACCAGGTAACATCACCGAGGTCTCAAAGT 
AGCAGCTAAAATGTGCCAGAACTCTTCACCCTGTCTAGGACCTATGTTCCA 

SEQ ID NO: 3812 ACGCGGGGCTTTCAACCTTGTCAACCCGTCGGCGCGGCCTCTGGTGCAGCGG 
CGGCGGCTCCTGTTCCTGCCGCAGCTCTCTCCCTTTCTTACCTCCCCACCAGATCCCGGAGA^^ 
CGCCATGGCrrTACTTACTGCGGCCGCCCGGCTCTTGGGAACCAAGAATGCATCTTGTCTO 
GCAGCCCGGCATGTCAGTGCTTCCTCCACGAATTTGAAAGACATATTGGCTGACCTGATACCTAAG 
GAGCAGGCCAGAATTAAGACTTTCAGGCAGCAACATGGCAAGACGGTGGTGGGCCAAATCACTGT 
GGACATGATGTATGGTGGCATGAOAGGCATGAAGGGATTGGTCTATGAAACATCAGTTCTTGATC 
CTGATGAGGGCATCCGTTTCCGAGGCTTTAGTATCCCTGAATGCCAGAAACTGCTACCCAAGGCT^ 
AGGGTGGGGAAGAACCCCTGCCTGAGGGCnTATTTrGGCTGCT 

SEQ ID NO: 3813 ACGCGGGCCAGAATCTCTGGGACACATTTAAAGCAGTX3TGGAGAGGGAAATT 
TATAGCACTAGATGCCCACAAGAGAAAGCAGGAAAGATCTAAAATTGACACCCTAACATCACAAT 
TAAAAGAACTAGAGAAGCAAGAGCAAACACATTCAAAAGCTAGCAGAAGGCANGAAATAAGTAA 
GATCAGAGCAGAACTGAAGGAAATAGAGACACAAAAAACCCTTCAAAAAAATCAATGAATCCAG 
GAGCTGGTTTTTTGAAAAGGTCAACAATATTGATAGACCGCTAGCATGACTAAAAAAA^^ 
AAAAAAAAAAAGGTCATNTTATGAAGACCCGTAAATTAAAGAATTACTGGTTTAAAGT^^ 
TATGAAACAAAAACATATTCATAGTTTCAGCTTCTTTIT^ 
ATTTTACGTTTCTGTGTTGAAGGCnTrGGTCCTCGTTTCTI^^ 

SEQ ID NO: 3814 ACGCGGGGGAGCTCCCAGAGGTCTACCCAGAAGGACCCAGTTCCTTACCAGC 
CTCCCTTTCTCTGTCAGTGGGGACGTCATCAGCCAAGCTGGAAGCCATTAATGAACTAATTCGm 
TGACCACATATATACCAAGCCCCTAGTCTTAGAGATACCCTCTGAGACAGAGAGCCAAGCTAATG 
TGGTAGTGAAAATCGAGGAAGCACCTCTCAGCCCCTCAGAGAATGATCACCCTGAATTCATTGTCT 
CAGTGAAGGAAGAACCTGTAGAAGATGACCTCGITCCGGAGCTGGGTATCTCAAATCTGCTTTCAT 
CCAGCCACTGCCCAAAGCCATCTTCCTGCCTACTGGATGCTTACAGTGACTGTGGATACGGGGGTT 
CCCirrCCCCATTCAGTGACATGTCCTCTCTGCTTGGTGTAAACCATTCTTGGGA 
CAATGAACTCTTTCCCCAGCTGATTAGTGTCTAAGGAATGATCCAATACTG 

SEQ ID NO: 3815 GGTACTGGAGTTGAAAGAGAACTGAGATCACCTAGTTCAGAGACTAACGGGA 
AAOJTTAGGTCATTTATGGCrCTCAGACACGTTAAGCCTGCATAACATTT^ 
TAGTTCCCAACACTTAATTGAGAGCTITCACAGAAAAATCTAGATATGTGGCTTAT^ 
CAGAAGATTTGCTGACTTAAACACTGCAAAATTAAACATTTATATCTGAATC^^ 
GCTTTGTAAATGAGGTCAAGGATGAAATCCAGATGTTCrrCCTTTGTATTTGT^ 
GAATCTAGGTATATCCAAGGAAAAATAAACTTAAGATAAGCTTATTTTCC^ 
AACAAAAACCCAGAAGATCTAACAACCCTGGACCTGTAGAGTGGCAGCAACAGACCAGAACTCA 
GCAGTATAGTCCGAGCTGGGGCACCTGCTCGCCAGTGTTACTGTCTCCATCA 

SEQ ID NO: 3816 ACCTGTGGAGCGAAGGAGAGGTTGTCATCTGGGCCCCTGTTTCACCAGTCAA 

aacctatggagcagtcagaagcccgaagcatccaagcccagtggccaggaagcttctgaaataa 

acggcactgggttgggaaggatgagaaagccaagacaggtgggccaggttctccgaacatgggg 

caacagtgacctgggcaaaagcctggaactgagcggctgataaggccacaattactgccaccacc 

tctcagtatctacacgggggcagacgttaggcagtgggaactgaggactggagggcgtgtggcaa 

acacgggatggattttatcagaaaagagaggacacccaaacacgtgcctgccctatttatcttgc 

atgtcctgtgccaggtgaaggacgctggagcatgggtaaggttcctgagaacaactggtgggcag 

aaacgggctgggatgggcactgctccacaggcaacattaagcctctgagggggac 

seq id no: 3817 acccatgatttggacactttgtcgggcccatttacttatactgcaaagcgtcc 
ttcagatatcaaattcaagcctctaaataagaccaaggagtatacagcctgtgaactgatgaaca 
tatacaagactgacaatcacctgaaacattatttacatatcattgaaaacaaaccccrgt^^ 
ttatctatgatagcaatggtgtcgtccrrrcaatgcctcccatcatcaatggggatcattccagaa 
taacagtaaatactagaaatatttttattgaatgcacgggaactgacm 
gttcttgatattattgtcaccatgttcagtgaatattgtgagaatcaatitacggtcgaagct^ 
gaagtggtttttcctaatggaaaatcacatacctrrccagaattagcttacc 
agagctgacctaattaacaaaaaagttggaatcagagaaactcca 

seq id no: 3818 ggtacatgggtgtttcaatgcctccattcctaaacctgagcagttgtcagctg 
agcagtggcaaaccatggagataaacatgggtgatgaactagaairrgaagtatttcgtttac^ 

TCAGATXjCTGCTGGAGTATTCTGCATTCGGGGAAAACTAAATATCACAAGTTTACAATTCAA^^ 

tctgaagmctgaagaagttacagaaaatggcactgaggaagctgctaaaaaacctaaaaagaa 
gaaaaagaagaaagacccagagacatatgaagtggacagtggtacctaactcccagtgctggtc 
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GTCTCAGGTAATAAGATGGCATTTCCCTGCTGTTAATTTCTAACCTAAAAAAAGGAA 

CTCTGAGTTCAGAGTCCTTTCCACTAGGTATACAAATGAGGATCCTTGAGCAGGAAGGATCCAGTT 

TACCCTAAGTATCCTCAAGAGGAAAGAGAAGGCTGGGAGCCAGCCACAAAGA 

SEQ ID NO; 3819 GCTTTTTTTITITITITI^^ 

GTGGGTAAACAAGCTCCTTAAACTCATCCACCAAGGAGCCCAGTCTTTTAITCATTGOT 

TGGGCAATGTCAGGTCCACTGCTTGTTCCGGCTCCATCAAATCCAAGGCCAAGGCCTCCAGGTTrc 

TGAAGTGCTGCTGCAGCACGGGGTTCTCAAAGCTGTCACTTCTGTATGTGAAGCGAAGOT 

CGATAGCCITCATCTTGCCCACCTGCTCTGGAGTTGCCATGATTTTTTC^^ 

TTTATCATCAGCAAAGGGTAAAAAGACCAGCTGGAAGCCTGGAGGAGTCACCTGAATm 

CATCCAACTCTTCTTCCTGTGGCACCAAAGCCACAAAATAAGGAGGGATGTTCCTGCGGGGTGTC^ 

ATCTGCACAATGCTGCAACCTCCTTCTCCAGACACTTGATGAG 

SEQ E) NO: 3820 ggtactacgatggaagatgcaaccccaatattagaaaggcagcttgatgagc 

AAGATGGCACACATGCAGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAA GGAA 

GCCAAGTATTAAGAAATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGrrGTCTGAGAACTT^ 

GACTGTGGCCCTGGACTTTGCTGTGCTCGTCATTTTTGGACGAAAATTTGTi^ 

AGGGACAGGTCTGCTCCAGAAGAGGGCATAAAGACACraCTCAAGCTCCAGAAATCTTCCAGCGT 

TGCGACTGTGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGATTA 

AGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGCAm 

GAGCTCAATCTTGGACATTAACTTTAAAATGCTTCCATACAGATCTCTTC^^ 

SEQ ID NO: 382 1 GGTACTCAGCAATTCACAGACATQACATAAACATG ACATT TTTAAGACATAA 
ACAAAGACTGCATGTTGGCAGCATAGGGGACAGTCTTGCTCTTCCTCATTTTTTGAAGCAGAGA^^ 
TTTCATATTTCATAGTTTCTGAACTTCCTCAGGAGTTCTGAGGGAGCATCCTTGGACCAACGGA^ 
CGAAGGTGCCAACCATTCACATCTGCAAAACCTGAAGAATCTGAAriTrGCGGGGAGACTGAOT 
CTCTAGGGGCTTGGGTTCrrrCTGCTCCGTTrrCTCTGCGGTTGAAGCCAGCT^ 
CTCTTTGCTCATTACATTCCTCATTTTCCATCTTGGTGGTATGAGTCTTGTCAGAT^ 
AGGTCTTTTTTACCTCTGACAACCCAGTGCATTGACGTCTTCACTGAGTCATTCA^ 
GCTGQAATGGCATCGACATCTGGCTCCTTCGCCAGTGGTCATA 

SEQ ID NO: 3822 ACACTAACAAATTGGATAACTCGGAAAGATGGATTAATTCCTACAAACATAT 
AATCTACCAAGAATGAATCATGAAGAGATAGAAAGTCTGAACAGACTGATAATTGGTAAGGAGTT 
TGAATGAATGAAAAAAACCTTCCAAAAAAGAAAAGCTTAGGAATATATGGCC^ 
CAACCAAACATAAATAAGAATTAACACCAATCCTTCTCAAGTTCTTCCAAAAAA 
GAACACTATTGAACTGATTTTATGAAGCCGACATGACCCTGATACCAAAGAAAGGCAAG^^ 
ACTAGGAAAGAAAATTATGAGCCAATATCCCTGAAGAACATAAATGCAAAAACCCACAACAAAA 
AGCTAGCAAACTAAATTCAACAAGATAACAAAAGGGTCATACATCATGACCAAGTGGAATTAC^^ 
CTAGAATGCAAAAAAAAAAAAAAATAATATAGGAAGCTATCAGTCAATGTGATT 

SEQ ID NO: 3823 GGTACTITTTITrTTTTT^ 

TTGAGAGAGTAAAGGCCCAATTACAAAATTTTTATACTTAANATCTGT^ 

TTTATAAGGACTGTAGrrrCCATAACTGNTAGGCTGAACCCCATGAAACTCCTTCTTTGTAGAT^ 

AAGCCAGCCTAATATTGGCAATTTCACGTGGTTCAACCTCACATACAATGAGCCACTGCCTAGGAT 

TNITITGGCAGCAAQATGTTCCCAGTCACATGTCTGTTTATATTATAAACAGA TGTCC ^ 

AATACATCCCIOTTGANAAAAACATCCTGTGCITATGTATACTGGGAATGG 

GAGGCAGCGCANCTCTGGCTTTGCTCTTCCCAGCTTGCCCGGCCACCCGATTCCTCAGTTTCA^ 

CAGGAGGATTCTTTACAGATGTCAGCCCCCGTCCAAGGCCACGCA 

SEQ ID NO: 3824 ggtacagtatgggggttgtaaattggcatggaaatttaaagcaggttcttgt 

TGGTGCACAGCACAAATTAGTTATATATGGGGATGGTAGTTITITCATCTTCAGT^^ 

AGCTTATACGAAATAATTGTTGTTCTGTTAACTGAATACCACTCTGTAATTGCAAAAAAA^ 

TTGCAGCTGTTTTCTTGACATTCTGAArGCITCTAAGTAAATACAATTT^ 

CCTTrrcATAGGTCTGAAATTTTTCTTCTTOAGGGGAAGCTAGTCT^ 

CACATGAATTATTACAGTGTTTATCCrrrCATATAGTTAGCTAATAAAAAGCrmGTCT^ 

CTGCATATCATAATGGGGGTAAAGTTAAGrTGAGATAGTTTTCATCCATAACTGAACATCCAAAAT 

CTTGATCAGTTAAGAAATTTCACATAGCCCACTTACATTTACAA 

SEQ ID NO: 3825 GGTACACATANTGCTTCTGCCACATGATAACGAGCGCGGTGAAACCGATGAA 
NAACATGGCACCGCCCACAACCGTCTTNCACTCGTNCGAGCCCCTGTTCATCTCANCANAGCTCT^ 
CTTGAACrTAATGCGATACAACTCNACTTTNTCATACATNGAGAGGCTGT™ 
TCCTTCTCCTTTAKTGCCTTCTGGOTGGCATGACANGTGCTTTGAC^^ 
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GGTGGTCACGCCCGATCCATATAAGCTGGGAGCGAAAAGTTCTTCTCTTCTTCACAACACTT^ 
TGAGCTCGTTCTGTTGGGGACTATGAGCTCmCTTNGGANGCTGNGGATC^^ 
GAGTTNCTGAAACTGGNTTGAIvrrCAANCCTTGTCCAGAAGTTCCCCTGCTGGAC^^ 
ACTCCCCCCCATCCTTTACCTTGNCCATTGANGGACCTTTGTNACCAAACTCCCTO 

SEQ ID NO: 3826 ACGCGGGGACGTAACGGAGTGGCCAACGGCCTGCAGAGCAACATGCCCAAG 
TTTTATTGTGACTACTGCGATACV^TACCTCACCCATGACrCTCCATCTGTGAGAAAGACACACT 
AGTGGAAGGAAACACAAAGAGAATGTGAAAGACTATTATCAGAAATGGGATGGANTAGCAGGCT 
CAGAGCCTGATTGACAAAACAACGGCTGCATTTCAACAAGGAAAGATACCTCCTACTCCATTCTCT 
GCTCCTCCTCCTGCAGGGGCGATGATACCACCTCCCCCCAGCCrrCCGGGTCCTCCTCGCCCTGGT 
ATGATGCCAGCACCCCATATGGGGGGCCCTCCCATGATGCCAATGATGGGCCCTCCTTCTCCTGGG 
ATGATGCCAGTGGGGCCTGCTCCTGGAATGAGGCCTCCCATGGGGAGGCCATATGCCAATGATGC 
CTGGGCCCCCAATGATGAGACCTNCTGCCCGTCCCATGGATGGT 

SEQ ID NO: 3827 GGTACAGCGTCCTCGAAACCACNAGCAAGTGAGCAGATCCTCCGAGGCACCA 
GGGACTCCAGCCCATGCCATGGCGGATTCTGAGCGCCTCTNGGCTCCTGGCTGCTGGGC 
ACCAACTTCTCGCGCACTCGNAAGGGAANCCTCCTGTTTGCTGAAATTATAANATNCC 
TCCTGATCTGCTTNAGTTGCCTCCACACCAGGCTACTCCTCCCTGTCNGGAGATTTGAGATGATCCT 
TGCTGCTATTATCTTTGTTGNCTACATGTGTGACCTGCACACCAAGATACCATTNATC^ 
CTGTAGTGATTTTTTTCCGAACCCCTCATAGCGGAAATCCTCTACCTNATCACOT 
CCTTGTTTGNNGAGAGGAAACCACTCCCAAAATTNGTTCGCANGGGTACCCTGC^ 
CCaJTTTCGGAAAANGGCGAAATTCCAGCTACACTTGGGCGGGC 

SEQ ID NO: 3828 ACCCAAATGCTACCACTGGAGAAGGAATGAGAGATAAAGAAAGAGACAGGT 
GACATC TAAGGGAAATGAAGAGTGCTTAGCATGTGTGGAATGTTTTCCATATTATGTATAAAAATA 
TITmCTAATCCTCCAGTTATTCTTTTAmCCCT 

TATATTAAATAGGGTATTGGTAAAGAAACGGTCAACATlCTAAAGAGATACAGTCrrGACCTTTACT 

mcrCTAGTTTCAGTCCAGAAAGAACTrCATAmAGAGCTAAGGCCACTGAGGGAi^ 

TAGCTTAAGTCTCTCTGTAGACAGGGATCCATITTAAAGAGCTACTTAGAGAAATAATT^ 

GNNTCCAAACCGATAGGCTCAAACACTANGAGCTG^fTAGTAAAAAGAAGACCCANATGCm 

GAATTATCATTTTTITCAACTGGGAATAAAAACACCAGGTTTGT^ 

SEQ ID NO: 3 829 GGTACTTCTTGCCCTrGTAGAATTTCCrGAGGTTTTCrm 

TAACTGTGAGAACACGGGCAATGGATTTCCGGACGACTCGGATCITAGAGAGCTrGGAGGCCGCA 
CCGCCTGTCACTTTGGCGACGCGCANCTGGGACAAGCTNCANCTTTAAGGTNCGr™CAGC^ 
TNAGCAGCTTCCTCCrrCTrCTTCCX:GCGAAGATCTCGAGCCTTGATCTTG^ 
CCGNCAACGCCGCCGCCCGCTCCGAGGGAAAGAGCCCGCGT 

SEQ ID NO: 3830 ACTTTTTrTTTTTTrr^^ 

AACACACCATGGCTCTGTCACAATAGGGACATKTAAGCTCTACATACCATTTmT^ 

CAGGGCCCCATAAAAGCCTCCAAACCACGTGTANATGTGANACANATTTGAAAAAAACATTTC^ 

CTCATAGCTTATAATGATGCCATTTCTCCAGCTGNGCAAGGGCTITACAAAAACTGNGCCANGAC^ 

TCCCATGAGGCTGGATTGCTTGATTCATGTTTTATGAGCCCCACAATACTGAAGCTCCT^ 

GACTTGGCATNTGCAGTCAATTCCACATTTGGGATAAGTCCTCTCTGGAAGTGAAATGTCAGGCAA 

GTGACATNCAAAGTTrTTTGCATGCAGTGGG>rrTAACACCCATGTT^ 

AAAAGTACCTTGGGGCGGGAANCCCCTNANGGNGAAATTTCCA 

SEQ ID NO: 383 1 ACGCGGGGGAAGAGGCCGGGCTACGTCGTGCCCTGCGCGTGAGCAGCTGCA 
GCGGCAGAGGCAGCATCCAGCGGCGGCGCCAGCAGTTCCAGTCCGTTGCTTTACTTTTTGCT^ 
CGACATAGTCATTATGCCGAAGAGAAAGTCTCCAGAGAATACAGAGGGCAAAAGATGGATCCAA 
AGTAACTAAACAGGAGCCCACAAGACGGTCTGCCAGATTGTCAGCGAAACCTGCTCCACCAAAAC 
CTGAACCCAAACCAAGAAAAACATCTGCTAAGAAAGAACCTGGAGCAAAGATTAGCAGAGGTGC 
TAAAGGGAAGAAGGAGGAAAAGCAGGAAGCTGGAAAGGAAGGCCAAAAACTGAATCTGTAGAT 
AACGAGGGAGAATGAATTGTCATTGAAAAATTGGGGTrGATTTTATGTATCTCriTGG 
TAAAAAGCTATTTTTACCAAGTATTTTGTAAATGCTAATTTT^ 

SEQ ID NO: 3832 actaacttgtagcaaccacgtgtccgtgcagtgccacaggagctagagcagt 
gacaatgctggtggcaacagggcagtgtagcaggtgcttcatgttcaccttttcaaccrm 
taattgtcacaactcggaggtggattctgttagggacaggctgccccaggaccactccgcccccg 
ctaactcaatgcagctgacccttaccctgaatactctgcagctgcattcctgaaccgttatctac^ 
cgctatagcaaggtcaccagacttgctacaccgaagccctctgggtggcacgggggaggtcatga 
gaaacgtggattacacccccrrgtaaattcctatmcacaagataatatattgta^ 
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gagarratatgtogtaaagttaattgactaacaaccccangggtctctthrrccccata^ 
tcattttgtaaagctcaagggccgccaccctccgActggtggaaaaaccc 

SEQ ID NO: 3833 gcgtggtcgggccgaggtacgcggggctctgcgagcgttatttcaaaagaag 
ttgagaaccagagaaaccgacctaaggggattctcccatttggcccgtcct^ 
acctgctgcttttctggagcgcttaccagtgaccaagaggaacagaacacagagcagcctggcag 
tgtccnaagcaacaagcctccgctcctccttcctgcaccctggggctcctgaaact^ 
ggagggctgtctgagattcgagggaaacaagctctcaggacitccggtcgccatgatggctgtgg 
gcggtaaacgcggttagtgcaagcatctgggceatcttcaatggtaaaaaagatacagtaaagac 
ataaataccacatttgacaaatggaaaaaaaggagtgtccagaaaagagtagcagcagtgagga 
agagctgccgagacgggacagtggatccagtaggaacatagatgcatccaaactcattagact 

seq id no: 3 834 ggtacaccatataaacagcagatgaagtcggagagatagtctaatacactta 
gatcatgttccaccacaatgatatatctatctggatttattagagatcgtatagtaatagcagcct 
ttaaacgctgcttgacatctaggtaactagaaggctcatcaaacatgaaaatatcagc^ 
tgcaaacgacagcacaagcaaatctctgcaactctcctcctgaaagatcitc^ 
ttaggtgggttaaatcaagctgctgacatacaattgcctgtgtctttgtttca^ 
aata gatcccactgtccccrrtgcagccttaggaatctggtctacatattgaggm 
ttttaagggcatcttctagaatctttgnaaaagtaattttgtaattcagaatc^^ 
gtcaaaatctnccggcngtcangganggatcattcgtacctggncccggcggg 

seq id no: 3835 accgcgcttggcggtagctggccccagacrrctgtcttttcagctgcagtgaa 
ggctcggggctgcagaattgcaaccttgcx^aatggaccrgatcggttttggttatgcaccc^ 
gacatttggaagcatttttggatataagcggagaggtggtgttcccgtctttqaatggctggn^ 
tttgtnggatgtttggccggctatggagcttaccgtgtctccaatgacaaacgagatgtaaaagtg 
tcactgtttacagctttcttcctggctaccataatgggtgtgagatttaa^ 
atgcctgctggtttgottgcangttraagcctcatgatgatcctganactagtcttg^^ 
gagcatctggaggaacangaaaactaagttcatgtcatcctgctgtaatgggcaaaacatatm 
tttttgnatttaanaagataaacctninaattatgggaatg 

seq ed no: 3836 acttgttgactagagttcactctagggtgatctatctgtgtcatttgctgggg 
cacccttcatgrcagcttctgattacrtttccctttt^ 

tgtctggtgggaggaggtctgtatgccacrrctgtgagaaaagtganggaaagaagggcagto^ 

gatttcagtattcagtacgcggggactgcgataggaatcatgtctggtcgcggcaaaggcggaaa 

aggcttggggaagggtggtgctaagcgccatcgtaaggtgctccgggataacatccagggcatta 

caaaaccggctattcgcccgtitggctcggcgcggtggcgtcaagcgcatttccggtcntatcta 

gaggagactcgaggtgtgcttaaggttttcttagagaacgttattcgaaacgcc 

gagcacgccaacgcaaaacttgtcacaacccatggatgt 

seq id no: 3837 ggtactacttctccaaactcatagaatttatggacacttrcrtot 

gcaagaacaaccaccagatcacggtcctgcacgtctaccaccatgcctcgatgctgaacatctgg 

tggtttgtgatgaactgggtcccctgcggccactcttcagtttgtgctgacaatcatccaa 

agcttgcgggggtcatctggccgtgcacattccctcttggrrggttgtatttcca 

gatttccctgattgctctcttcacaaacttctacattcagacctacaac^ 

aaggaaagaccacctgaaggaccaccagaatgggtccatggctgctgtgaatgggacacaccaa 

cagcttttcacccctggaaaacaatgtgaaagccaaggaagctgcggaaaggat^^ 

aattgaaaccctccaaccacgtcatctgattgtaagcccaatatga 

SEQ ID NO: 3838 actccttcctgaactgcctccaggtcagcccctgccacggctggatgtcitcc 

AACAAGACGTTGCX3GACTCTCACTTCAGAGAGAGCAGAGCAACTCTCCAACACACTGAAAAAAAT 
TGCAGCCAGTGAGAAATTTACAAACTTCAATCITTTCTACATGGATT^^ 

acaggagtggcagaagagaggcggacagcx:ctggcagctcatcgagcccgtggatggattccac 
cccaacgaggtggctttgctgttgttggcggatcatttctggaaaaaggtgcagctccagtg 

CAAATCCTGGGAAAGGAGAATCCGTTCAACCCCCAGATTAAACAGGTGTTTGGAGACCAAGGCGG 

gcactgagcctctcaagagcatgcacccctggggagcacanggaggcagangctttgggtaaact 
cattccacaaaccctatnggggctgncacgttacangcccaaagg 

seq id no: 3839 ggtacatgtcacatatatagtcaatgtaaagcaattcta(nttgcatccctta 
gccacaggcataaggcagaacacagatatttctgtgttctgtgaatatctgtggaatgataot 
atatgtggatcttgagtaaagctgaattcatccagaarraaactttttgcattatt^ 
tagacatcatcrtgattaaacaagatatttgcttccacaagtgtagactgacaacatat^^ 
cctgaacacaccatgtcagaattatggctaatcccaaacttgtaatagagggatatctgtgtato 
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GGGATACAGCK}TCACTTGCAGTATTCTCTAGCCATCTGCCAAGAACTACTT^ 

TTTTCTTGGTGTTTCTGCTTAAAAACTAATGTACGCCGGGGG^^ 

GAGATGGTTGATTTNGCCTCITAACCAAOANATTCATTGGCAGCT 

SEQ ID NO: 3840 ACACTGGCTGTCCACAAGGCCGCACCrCACAAATCCGGGTTTCTTTCAC^ 
GCGGCACTCAGGGTTGTCATTGGTAACTCGTGTGGAGATACCAGTTCCACAGGTCTTTGAGCACTG 
GGACCATGAAGTTGTTTGAACAATACATTTCTGGCCTTGTAAAGGGTTGTAT^^ 
CATTCCAAAAACAGGGAGCCGCTTCAGTGAGCTGCCTTTTCCAACTGCAATC^ 
CGTCAACTCCACCTCGGAGGCATCGAATCCCAGCTCCrrGCCAAGGAGGCCGTCCTGGTCCTCCAT 
GGGGTCCTTGATACTATCCTCGTCACAGACCCACrCCTCGCAGCACTGCCCGGTAACm 
CCCGAGGGTTGGGACAAGCCCAAGTTGGGGAAGAGATAGTTCTTGGGGACACAGANGAATGCAG 
CCCACGGCGCCATCAAATACATGTGCACTTGATGTTTACAAGTTGGG 

SEQ ID NO; 3841 GGTACTTTTTTl 11 r'l'ril"ll"ll-ll"l"ll-llUlTGGNATATAAACTATTTATTAAC 
AGACAAGGCCTACAGACrrAmCTTCTTGGACACACCCACGGNGCGGCCACGGCGGCCAGTGGT 
CTTGGTGTGCTGGCCTCGGACACNAAGGCCCCANAAGTGACGCAGCCCTCTATGGGCCCGAATCT 
TCTTCAGTCGCTCCAGGTCn-CACGGAGCTTGTTGTCCAAACCATTGGCTAGGACCTGGCTGTAm 
TCCATCCTTTACATCCTTCTGTCTGTTCAAGAACCAGTCTGGGATCTTGTA^ 
CTrGCCArrCTTGATCAGCTGGACCCTTACACACTTCCTAATGGGCAGAATTTGGGCTGm 
CAACTCCTAATTTTTCCAGCACGATTOCTTTTGCATTGAAAAGCA 
TTTANGGCTGNGCCCCAAAAGANCCTTCTTATACTNGTTTATTAATO^ 

SEQ ID NO: 3 842 ACXirrTTTTTiTm^^ 

AACArrAAGAATTTACCTACATAGrrGAAAATATTCACAAAGGACTTGATCATTCACACTCATAC^ 
CAGAGAAAGTCTGCTGAATAAAAAAATGCTCCTTTACCCATTCGACCTCCTCTCAGGGGCTGCAGT 
GGGTTTGrcAATTCCACTGTGGGGCTTCAGTCirCATGAATOTTGAATAGm 
NATCTTCATGCTGCTCACCGTCCTTAATGATOTCCTTGCTATTGGCAT^^ 

TAGCACAATCCCCACGACCACTTTGTCCCTGAGGTAGAAGATGACACCTTTGCCGTAGTCCTCCCC 

CTGGACGGGAGCCTGTGGAACTGa:GGGGTGCTGGGANGAAATAGTAATTTTT^^ 

CTTTGGCTCACTCTNTGAACGGATCCAGTTCCTGA 

SEQ ID NO: 3843 GGTCGGCCGAGGTACI T TT l 'i 1 1 ' n ' i - lT rn'AATGTTGCACTTGTAGTTTCATT 
ACAAAAGATCAGATCATGAAAGGCAGTAACTCTCCAGGACTGGAATATCTGATTGCTCAGTGTTA 
ATAGTAGTTCATGCTGTGGTGAGATTG^TAAAAGGGTGCAAGACTG^^'GCTTCTCTT^^ 
ATTTTTCTATCTCTCACTrCTCAGGGATGA.\ArrCTTm 

GCCATGATGTGAGTGGTTATCCCTAGATAAAATTAAAAGGATTTTTAAAAAGTAATTACTGC 
AAAATGATAAATAGGTAATTTGAATAATTTTATTTTAAGCTCCTO 
CTCAGCTATAAATTCAAATITATACATACTATTGAGTATTAATATrCTCTGATITC^^ 
TGTCAGTCACATGATGATTATGTTTTTGTTTAA 

SEQ ID NO: 3844 ACTCTGCCAGGCATTTAACATACATTATTTCACTTGTTTTCATGA 

GGCAGGTTCAATC ACAGA AACCATGCAGTGGACAAAAGAGAGAAAAAGTTCACCATGAGGAGAT 
TCTAAAAACCTGCCTTTTGACrrGTCTATTTTrATCTC^ 

TGGACCCTTTGTCTGACTGTTTGAAATTAAGTTGCAGGTGAACCCTCCATTACCC^^ 

AAATAGACATAAGCACAGAACCrCACTAACTCCCCAAATTTTTATTm 

AATTTCTTGGTAACAATGTAGAATTTAATTrrGTAATTAAAAC^^^ 

CTGATGACATAAGGATAACACCAAGTTATGTGCACAAACGACAAAAAATGCCAAACTAACTGGAA 
TGATCATCCTTCCTGAAATTCTCTTAAAAAGCA 

SEQ ID NO: 3845 GGTACGCGGGGCTTGCGGTGCTGGGCAGCAGACCGTCCAAACCGACACGCGT 
GGTATCCTCGCGGTGTCCGGCAAGAGACTACCAAGACAGACGCTATGACTGAGGCTGATGTGAAT 
CCAAAGGCCTATCCCCTTGCCGATGCCCACCTCACCAAGAAGCTACTGGACCTCGTTQNfAGC^ 
ATGTAACTATAAGCAGCTTCGGAAAGGAGCCAATGAGGCCACCAAAACCCTCAACAGGGGCATCT 
CTGAGTTCATCGTGATGGCTGCAGACGCCGAGCCACTGGAGATCATTCTGCACCTGCCGCTGCTGT 
GTGAAGACAAGAATGTGCCCTACGTGTTTGTGCGCTCCAAGCAGGCCCTGGGGAGAGCCTGTGGG 
GTCTCCAGGCCTGTCATCGCCTGTTCTGTCACCATTCAAAAGAAAGGCTCGCAGCTTGAAACA^ 
AGATCrAATCCATTTCAACAGTCCATTGAAAGGCTCTTAGTCTNAACCTG 

> 

SEQ ID NO: 3 846 GGTACTTGCACAGGAAGTGTTGGCGCTTGTTGCATTCGTTGCTGCTCCAAGTT 
AAAAAGTTGTTATTGGAGCTCATCTCAGCACAGTGCITGrrCCCACCCATGGAOT 
GATCTGTACACAATGAATTGCTTTTATTTCGGTATGCATCCACATTTCAGCATIT 
GAACAGCAAGTGGGAAAGACGCAGCAATTTGCCAGGAGGTCAAGCCCACCAATTTCGGGGATCTG 



596 



wo 02/29086 



PCT/USOl/30732 



CTGTGCACACCGGGTTCCTTCTTAATCCCTGCTGAGGATCTTGGGAAGCAGCAGCAGC^ 
CCAAGGCATGCACCGGATTCAAGGTTCTTTTTGTTCCAGTTGTCAGATTCCAA^ 
GGATTGCAAGGATGACCAAATGAAAGCCCTGTTrAAAACTTCTTCAATTm 
GGTTACCAGGGAAAAGTAAAAACAArrCAGANGGATCATGTGTGCTTACA 

SEQ ID NO: 3847 ACATAATCGTTTTGTGGAGTCGGCACAGTTCAGGTTATGGAGGCACGTAATTC 
ACCAAAGTGCAAAAAAGGCAAAGGAAAACACGCTGCATTGTAGAATAAGGCATTCAAATGTGCT 
GTTAACGTTTAAGGCAGCTAATGGCCAAAACAGGCAAGTCAAGAAAAGTGGTCTGGNTTGGGAGG 
GTGArrmGCATCTAGAAGGCATTCTCITCTCGTGACCTCAAAGACTGAGC^ 
CTTCTTCCTCAAGGCCAATGATACTTCAGATACCAGATGGTTTCATTTITCAATO 
AGAGGGTTGAGrrGGGCCAGAATTGCAATCAGCCAAAAGAGAtAGCAGCAAACTGAACAGGTCA 
CCAACATGGTAATGATAACTCCCCGGTTAGGACCCITAAGGGAtGAACCAAGGCCCAAGAAGCCC 
GCGAAGCCCCAAACACGCTCATCACAATGAGANGCCCAGTGGAGGC 

SEQ ID NO: 3848 GGTACTCTGTGAAGAACAGAAATGATCATATTCTTATGCATCTATCTGTATGG 
GTCTGAAGGTGTATATACAAACTGAGATGAGTCCITATGACTCTrGATAAGCCTGAGTT^ 
AACAAAAATGCCAAGTTGTCCTGAGCCCrrCTGCGTTGTTATGCCACTTCCCTACTG^ 
ACGCTGGCTCCCCTGGGCACGCAAGGATGAGTATGGGCCATGGGCCCCTGTAGAGCTGCTTACCT 
GGTGATGACCATGCACCTTACAATTTCTGAACAGTTAACCCTATAGAAGCATGCm 
TCTTCTGGGAAGAGGAACCTTCTTAATCTCTTCTGTGGGATTTTCAAAATGCTAAAGA 
GCAGCAATCATCCCAGATGATTAAATTCAAAGAAATAGGTTCACAAACAGGAATATACTGAAGAA 
CTAGAGTGTCACTGCTGGTGAACTGTGGCACNGGTGCTCAACACAT 

SEQ ID NO: 3849 GGTACCAGGTATGTCACCTTTTGGGTTCCGTCACrrGGTGTGTAGGTTATCTC 
TACTTTTCCAGGCCCAGGAACAACAAAATCAGTTGCTCTGTATTGATCCCCATAAGCATGACGACC 
TATGATGATAGGTTTTACCCATCCACTCACAAGCCGGGGGATATTTTTGC^ 
TGAAGACCGTGCCACCCAGAATATTTCGTATGGTGCCATTTGGTGATTTCCACATTTGTrrC^ 
GAACTCCTCAACCCTCTTCTCATCAGGAGTGATAGTGGCACATTTGACGCCAACATTATGCT^ 
ATAGCrrCTGCAGCATCCTTGGTGACTTGGTCGTTGGTGGCATCACGATTCTCTATGCCTAAATCAT 
AGCTATGTAGATCCAATTCCACGTAGGGAAAAAATGAAGTTTCTCTTTTAATCAAT^^ 
TTCGTGTCATTTCATCTTCCTTGCATTCTNTACCCACAAGAACCG 

SEQ ID NO: 3 850 GGTACrcCCAATATACGATAGGGTTCATGTTATAGGATTCAATTGTAACATTA 
GrrGGTGTAGGCACTGAGGACGGCCCCAGATCCGCGGTGCCCATCTCAGCCCTGCTCACACCCTGC 
ATGACAAGGGGTAGGAGAAAGAGGAGAGCCATGCTGCTACCGACGGTCGCTGGCTCCAACCCCG 
AGCCCCCGCGTACTTIT^TTITIOTAl'rrr^ 

GGAAACAAANNTTGATGGCAATTGTNTAGGACNCACWACAGTTTCCTTCTAGGTCATA^ 
ATNATTGAATNTATirrcnTCTCTGGTCTTAATCCCTCANA 
CCCATNACAGTGCAGGTCCCATAGGAATATTNTGTNGAATAACAGGACCITNTT 
TTGAAACNCCTTGTTGAGCCCCAAAGCTCITANAACriTANCCTrGCATCC^^ 

SEQ ID NO: 385 1 ACTGTTGTCCATTTCATGAGAGTAGGCTTGAGGACACCATGGGCAAGGATCT 
GATGGTTGCCAGCCTAAGCGTTTTAGACTTTTGACCCAGAGATTTm 
ATTTTAGAGGATAGGGTCTCAAGATATAATCCTTITrATAGGCGGCAGGTCTTT^ 
TCCAGAGGAACGGATGAAGCCTGCTTGGAAGCATGCTGGGATGGCCATTTGGAAGGAGTGTTGCA 
AGGAAGCATGGCCrrGGCTGGGCGCTGCCAGGAGCTTAAGGGTTGTAGGTTGTTTGTCTGA 
TCTCGGCCTAATCrrGTGGCTTCCAGGAGGAAGAAGAGAGATCAAGCrGGCTGTCTGATGGGCAT 
GGCTTTATTCTGGAATGGTGAACCCAATGGAAAAGGTCCTGCAANATGGACCGCAGTTGGTGTGC 
TATAAATGACCCGGTAGGGGCTGGTCCATTGAGGCTGTANAGm^G 

SEQ ID NO: 3852 ACTGGCACGCTCTATAGCCAAGGAAGGCTTCGAGAAGATTAGCAAAGGTGCT 
AATCCAGTGGAAATCAGGAGAGGTGTGATGTTAGCTGTTGATGCTGTAATTGCTGAACTTAAAAA 
GCAGTCTAAACCTGTGACCACCCCTGAAGAAATTGCACAGGTTGCTCCNAATTTCTTGCAAACGGA 
GACAAAGAAATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTCATCAC 
AGTAAAGGATGGAAAAACACTGAATGATGAATTAGAAATTATTGAAGGCATGAAGTTTGATCGAG 
GCTATATTTCTCCATACTITATTAATACATCAAAAGGTCAGAAATGTGAATTCCAGGATC 
TTCTGTTNAGTGAAAAGAAAATTTCTAGTATCCAAGTCCATTTGGACC 

SEQ ID NO: 3853 ACATCCAAAACCATAAGGAAATATTCTGATGCCCAGATGATGAAGACTGGGG 
TGAArrAAGTCCACACATTTATTTCAAGTTGTTAAAGAGTTTGTGGGCCACGCAATTO^ 
ATGCAAGAAGTCAAAGAGCTCCTCCGTGCAATCCTCTTCTGTATGGGAACCGNAGAGGATACACG 
CTCATCACAGAGCTCTAGCCGCTCCCGGGCCTTTACACATTTCTCCAACTGCTCGCATTGCTCT 
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ACTGTTGTTAGGGGATCCACTAATTCCTCCTCTTCCTCITCCTCCTCCTCA<^ 
TAAGCATCITITGCTCGTCCTCCAGTCCCATGTCTGGCTACGGTTCTAGA 
ACAGCGGCACCITCCGCGTACmGCAAGAATCAAGAATCAAGTCCACAAATAGAGm 
ACACCCAAGAAAAGACAGGGAAGCTCTATTGGGACAAGAAGATCAGCTT 

SEQ ID NO: 3854 GGTACCTGCTA(nTITATGCTTGTGTCTTTAGATGA^ 

TCAAATACCAGTATGCTGAATTAGGCAAATTATACTCAGTAGTGTCACAGCTGATCCGCTGTTGCA 

ATGTCTCTTCAAGAATGCAGTCTTCAATCAATGGTAATCCTCCTCTTCCCAAACCCT^^ 

CTAATTTATCACAACCTATAATGCCAATTCAGCAGAATGTGGCAGACATTTTAm 

GTTATGTGAAGAAAATCATTGAAGACTGCAGTAATTCAGAGGAAACCGTCAAATTGCTTCGT^^ 

GCTGCTGGGAGAATCCTCAGTTCTCATCTACTGTCCTCAGTGAACTTCTCTGGCAG 

CTATACCTATGAACTGCGGCCCTATTTGGATCTGCTirrGCAAATCTTACT 

CAAACTCACAGAATTCATAATGCACTGAAAGGAATTNCAGATGACCGAGA 

SEQ ID NO: 3855 ACACCTGTGAATTCAGAGGATTCAGATACCAGACAAACTTCCCATTTACAAG 
CAAGATCTCrrrCTGAGATAAATAAGCCAAATTTCTATAATAATGACTTTGATGATGAm 
ACAGAAGTTCAGAAAATATATTAACAGTGCACGAACAGGCCAATGGTGGAATCTTTTA^ 
AAAACAGAATTGTAAGGATITGGATGAAGATGCCAATGGAATAACAGATGAGGGGAAAGAAATT 
AATGAGAAAAGTTCTCAGCTGAAGAATCrmCTGAACTrCAGGACACTAGCCTT 
TCTCAGAGACATTCAACCCCCCAAAAAAAAAAAAAAAAAAAAAAAGTACTTGAAGCTGAGA 
CATATGACGTGGCCirCGTGTTGTCAGANAGTGTCTGGAACTGCTGTTGCCATCI^ 
ACCTTCACCCAGAGCCCCATGGAGAGAGGACATTTGGNCTCTGNTTCTTT 

SEQ ID NO: 3 856 GGTACTTGCCCCTTCCCCAGAAAAGCGGGACTTGCTGCTAAGGGTGAAGGAC 
CAAGGCAGTTGTCCCTGCGTGGTCTGACACCCTTGAAACGTGGGTGTATAATCAGAGAGGCATCC 
CTGCAATGATTAAACACCAAGGGAAGGCTGCCTTCCCAGTCTOTGACCAACCGCCGGAGTm 
GTCCACGGATAAAACGTGTCTCTmGTCTCTAGCAGAAAATGAAAGGAATTGAAATTA^ 
GGGAGAGATTGAAGTGTAGTGCCAAGATTGAAAGGAGAAAGTGGTTGAGGGATAGTGAGGGAAG 
TTGGAGAAGAGAGTAAAAAGAGGCTGCTTACCAGATTTGAAATTGGTGAGATGTTTOT 
GTCGGTCTGAGGACCTGAGGTCXSTAGGTGGATCnTCTCAGGGAGCAAAGAGCANGAGGACGGAG 
GATTGATCTCCCAAGGGAGGTCCCCCGATCCGAGTCATGGCCCAAAriTCATGTGCGTCCATGTGA 

SEQ ID NO: 3857 ACGCGGGGCTCTCTGCGGGGCTCACTCTGCGCTTCACCATGGCTTTCATTGCC 
AAGTCCTTCTATGACCTCAGTGCCATCAGCCTGGATGGGGAGAAGGTAGATTTCAATACGTTCCGG 
GGCAGGGCCGTGCTGAITGAGAATGTGGCTTCGCTCTGAGGCACAACCNACCCGGGACTTACCCA 
GCTCAACGAGCTGCAATGCCGCTTTCCCAGGCGCCTGGTCGTCCTTGGCTTCCCTTGCAACCAATT 
TGGACATCAGGAGAACTGTCAGAATGAGGAGATCCTGAACAGTCTCAAGTATGTCCGTCCTGGGG 
GTGGATACCAGCCCACCTTACCCTTGTCCAAAAATGTGAGGTGAATGGGCAGAACGAGCATCCTG 
CCTTCGCCTACCTGAAGGACAAGCTCCCCTACCCTTATGATGACCCATTTTCCCTCATGACCGATC 
CCAAGCTCATAATTGGAAGCCTGTGCGCCGTTCAAATGTGGCCTGGNAACT 

SEQ ID NO: 3858 ACGCGGGGTCTTCCTGCGGCTGAACCGCCCGGCTGAGCCGACATTGCCGGCG 
TCTTGGCGATTCGGCCCGACGAGCTCCGCTTTCGCTACAGCATGGTGGCCTACTGGAGACAGGCTG 
GACTCAGCTACATCCGATACTCCCAGATCTGTGCAAAAGCAGTGAGAGATGCCACTGAAGACAGA 
ATTCAAAGCAAATGCTGAGAAGACTTCTGGCAGCAACGTAAAAATTGTGAAAGTAAAGAAGGAA 
TAATCTACCCTGACTAAAGCTTGAAATGCTACATTTCCAAGGTGAAGATGTGTGGGCACATGTTAT 
GGCAGATTGAAAAGGATCTCATTCCATGGGAAAAAAAAAATCCTGTCTTGTTCATAAATTGACA^ 
TGTCAATAAATTGAAATATGGTTCACTGGTACTCTTGGAAAAAAAAAA 

SEQ ID NO: 3859 GGTACAGGGGATTGGAAACATGCTCCGCGCCTCCAGAGAAAAGrrGCTCCCG 
AGGTCCATGCCCCTGGAACGTGTTCCTATCACTCTGGCTGGTTGGGCTGGTCCTTAGACTGGGTGC 
TTATGATTAAAGGGTOrTGGTTAGCCCACTTTCCCTCTCCATGTGGAGATGGAAAGGGTA^ 
GGATACAGTGTCTATCCTCAAGTTGCTACGGTTCAGTGAOAGAGGCAGACATCTGAACAGGCAGG 
TAGGATTCAGTGTGCTCAGTGCACTGGGGATTTGGAGAGAGATGGGCTTGCTCTCTCTGTGCACCC 
AGGAGGGCCACGCACITAAAACTGTGTTrGTGGATCAAAGAAAGCTTTATAGCACANGGGGCAT^ 
CAGATGAGTCTTAAGAGGAAAGAAAAAAAACATGGCAAGCCAGATTACATCTGAGCCCGTTGNAT 
TGNGTirrTCTTrCTTCCCATGGTTATTTCTAAGAACTACCT^ 

SEQ ID NO: 3860 ACTTTTTTTTTTTT^^ 

GAAACTCTTCACATCATGGNGAGAGTTTGTATGATTAATAAGAAGCAGCTTTTO 

GGAGGNGAACGAGTTCTCAGCCTGTGAGAT(XGACCATCCCATTAACTTTTGAAGm 

TAATAGAAGAAAAAAGGGGAGGGTGAAGAAAAGGAGGAACATGCTAAAAACCTTATGACAATCA 
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TCCAAATGTGAGGAAAGAACAACCGA1TCACCAACTCCA.CTTTT^ 
CTCACTCTTGATTTTGGCCTTCCTGGCTGAAACAGCCTGGCAGTCCCT 

CCTGGTTCTTCAAAAGACAGAGGAGGAGAAACCCTGCAGGATGCGCTGCCACTTTCCCANANAAC 
TGACAGNCCGTGCTCCNAAAGTTTGACCCACACCTAATNTGNAAA 

SEQ ID NO: 3 86 1 GGTACTTGCTTTTGTAGCTGTTTTTTTGTAGGACATGTGATCAT 
CX^TTCTTCCAAAACTTCAGTmATTAGATGTTACTCCTGCCA 
CTTTTTCCTCCCATGTGGTCTrrGTAGGT(mTCrrTGTG^ 
TCACAGGCCTTGTAATTTTCCTAGGTGTTTCTCCTGCTGACTCITCAATC^ 
CCCAAACATTGAAOTCGTCAGATGTTCCCTCCACCAAGGGTGCAGCTTCGTGTTGGAAAT^ 
TTTGGTGCCACAAAGAAGAGTTAGCACTCCAACAACAAGTTAACACTCCAGCAAGGCAAATTNCT 
TCTATAGAAGGGTGAGTCTTGCAGATGGAGCAATNGCAAAGGCCCCCCGCGTACCTGCCCCGGCG 
GGCGCTCGAAA 

SEQ ID NO: 3 862 ACACCGTGGGCACCAAATACCGCAGCGAGTCCTATACGTGGTTCCTCTTCATC 
TTCTGCTTCATTGTGCCTCTCTCCCTCATCTGCrrCTCCTACACTCAGCTGCTGAG^ 
CTGTGAGTGGCATTTGATAGTCAGGGAAGAAGGGGTTCGGGGCTNCAAATTGAGAAGGAAGAGT 
GCTCTGAAACATAAGATGCCTGGAAATGTCCATAGCCAGAGAGGGTATCTAAAAGCAGCAAAGG 
AAGTAGGAGGAGGGAGAATGATGGAGATCCAAAGGAACTANGCCAGGAGATGGGACAGAAAAG 
AGGCAATCAGAGTGGATGCCCCCTCCCCCATCCCACAGAAAAGCATCCAGAGACCGGGCGCAGTG 
GCTCACGCCTATAATCCCAGCACmGGGAGGCCGAAGCAGACGGATCACCTGAGGTCAATAGTT 
CCAGACCAACCTGGCCTACATGGCAAAATGCTNAAATGCGAAAANTAGCTNGGCATGGN^ 

SEQ ID NO: 3863 GGTACTGGGGGTGGAGGGGTCCCCTCTTGCAGTGTGGGGTTACTGTTTGGGT 
AAAGCGAAGTCCCAGGCAGTTTCCTGTGCACATTTCCACATGGCCTGCATGAGGCGAGTGAAACC 
CATGTCTTTGGGCTTTTCCAGGCCTCTCATCAGCCGTTGCTTCATGATGGAAACNA/^ 
GCTCAG1TCGCCATTGCCATCACAAGTCAAAGAGTGCAAACACCACATCACACACGTGGTCTGAG 
AGCTCCACTTTAGCCACTGTCCTGGCCACCTGCTGCATGGTCACTrrATCAAGAGATGCTCCAGCC 
ATATGGTAAAAACTCAATGCAGTGTCCACATCArTAATGTTCTITAGGAAAGTAAAGAAGTO 
ACCTCCTGAAATGTCAGACCCTTTCCTTCmGAAGTGCTTCTTGAGCTGCCT^ 
GCTTCTTGGACTGCACCCCACTGTAGGCAAAGTAGCATGCCACCAAACTGGCTNTNAA^ 

SEQ ID NO: 3864 ACGCGGGGGCCCCATCCATGGACTCTTGCCTCGGTGCAGTTTCCACTCTTGAC 
CCCCACCTCCTACTGTCTTGTCTGTGGGACAGTTGCCTCCCCCTCATCTCCAGTGACTCAGCCTACA 
CAAGGGAGGGGAACATTCCATCCCCAGTGGAGTCTCTTCCTATGTGGGTCTTTTTC^^ 
CCCCACATTGGCCAAGTGGACTCATCCATTCTTTGGAACAAATCCCCCCCACTCCAAAGTCCATGG 
ArrCAATGGACTCATCCATTTGTGAAGGAGGACTTCTCGCCCTCTGGCTGGAAGCTGAT^ 
GCACTCCCAGGCTCATCCTGGGAGCTrrCCTCANCACCTTCACCTTCOT 

CAATNGGGGGCTGGACCCTTTAATTCAAAGGTT TAAT GCCTGCCCTTGCCCAAATGCCCAAGGGTC 
GGTGCCCrmriTGGATACCANTTAATCTCACATTTTTGGGTT 

SEQ ID NO: 3865 GGTACAACCAGCCAGATTCCAAGCGGCGCCAGACCAATAATCAGAACTGGG 
GCrCCCAACCCATTGCTCAGCAACCGCTCCAAGGTGGTGATCATTCTGGTAACTATGGrrACAAAT 
CTGAAAACCAGGAGTTTTATCAGGATACrmGGGCAACAGTGGAAGGTANAAACAGTAGGGCCT 
CTGTAAAATTGGAGACTGATAGGTTGATCAGAAACTCACCCTAAATCTGAACGGGTGCCGCTATA 
ATTTGTGACATCTGGCAAGATTTCCCTTTATGTATATATTTTAACi^ 
AGCCACACTTCTAACTGCTTCTGGCGAACTGATmATTmAAITITm 
AGATACTGAAAGAAATAGTTAATGAGTTTGCATTTGTGCTTGAGAAAATTTGGCTC^ 
GGCTGTAGTGTCAACGATGmCCAGTAGTGTTTAGATTTGGTGTCTTCAAAGGTAGT^^ 

SEQ ID NO: 3 866 GGTACTGATACAATTGAAGGCCCTTCCACTATAAATAGGATGGAGGATGGGT 
CACTGTGTCCGTATTACCAATGACAGTCACCCCAAGAAACACAAGCAGCTGCATCCACCCTCTTTC 
AGGGGGTAGAGCCACTATACTTCTCATGTAGATCAGCCACATTGTCACTGGAANAACTCGGATCC 
AGCCATCCTCCCGCACGTGGTAGAGGTTGACTACACCTCCTGAGTAGGCATCTCTGTAGGTGGCTT 
GGTAGATGGCTCGACGGGCCAGATCATAGGCCTGCTCCACTTCCAGGTCATAGGAATAGCCCCGA 
TCCATGACCCCATATGCATACACAGAGCCAGAACCTACAGAGAAGGTGGCCCCTGAAATCCGGTT 
CCCTTCACTGTCCACGTAGTAGAAGGCCAGGGCCTCTCTTATCCCAGCCCAGATCATGGTGCCCAT 
GGACAGCCCATGCCnTGGACTTT 

SEQ ED NO: 3867 acggcagccacgaggatgcgatgtatgggacaaaactggagaccatccgga 

AGATCCACGAGCAGGGGCTGATTGCAATACTGGACGTGGAGCCTCAGGCACTGAAGGTCCTGAGA 
ACTGCAGAGTTTGCTCCTTTTGrrGTTTTCATTGCTGCACCAACTATTACT^ 
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ATOAATCTCTTCAGCGTCTGCAGAAGGGATCrrGACATCTTTACAGAGAACATATGC^^ 

CGATCTCACAATTATCAACAATGAAATTGATGAGACAATCAGACATCTGGAGGGAAGCTTTTGAG 

CTCGTGTGCACAGCCCCACAGTGGGTCCCTGTCTCCTGGGTCTATTAGGCCTCTCCCAGATATCTG 

AGCATACTGGGGAACACCTCATTTOTGGAAAAGCCTTTTTGTTATCGGCCr^^ 

TGGTCCCTAAAAACTACCTATTTGTAGTGGTGACCTACArrATAATTATTGTCATGTCCG 

TNGGAGGAGAAAAACAATTNCCCCTTA 

SEQ ID NO: 3868 GGTACATCACAAACATCGGACAATGCAACAAAATCACCAAATGAAGACATC 
CTATCAGCCCCTCTTGCTCTTGCATATGCCGCTGAGATGGGTGTGAGGGTm 
ACCATGCAGACTTTGGCCTCATCrTCACTGAGTGGAATTCCAACAGCAGCACCTGCTGGGCTGA 
TGTTTGAAAGAGGCAGCGGCTGGAATACCTAAAGCCTCCTTGAGTTCCTTCACCAGCTGCCGGG^ 
TTCAAAGCATCGCACAAGTITATAAATCCAGGGGCTCCATTTAGAACTGTGATGGGAAGCrTGC^ 
CTGCAGTGTGTACATACTAAGCATGGAAATTCACCAGTAAAATCTTATTTGTCATm 
TACCTTGACATTATTCTTCCATTTCATAATCAACAAGGTTCTACTTTCTGGGTC^ 
GCCTTGCATTCCTGGAGCTTNACATTTTAAGTCCAANTATTGGCCACAAAGAACCA 

SEQ ID NO: 3 869 ACATGTCATTACAGTTGGTITAGATGGTTAATATAGGAATAAATAAAATAAC 
TACAGTTCAAACAAAGGAAATTAAAATGAGATTAAAGAGCCCCGATCAAAAAAAAAATACTTAC 
AACmCAGCTGAAAAACTGTAAAAGTTACATACATACATACCTAATGGCAOT 
GAAATATTTAACCCTATACTAAACACTAACCTAACACTGATGAACCTTAAAAGACTAT^^ 
TCCATTTTCATGCTACTGATACAAATATACATGAAACTGGGCAACTTACAATAGAAGGAGGTTAA 
ATTGGACTCACAGTTCCATGTGGCTGAGGAGGCCTCGCAATCATGGTGGAAGGCAAGGAGGAGCA 
AGTCACATCTTACATGGATGGCAGCAGACAAAAAGAGAGAGCTTGTGCAGGGGAAACTACCCCTT 
GTAATACAGTGTNGATCTCATGAGTCTCATTCCCTATCCAAGAACAACAG 

SEQ ID NO: 3870 GGTACGCGGGAACTTATCTATAAACTATAACCTCTCCTTCATGACAGCCTCC^ 
CCCCACAACCCAAAAGGTTTAAGAAATAGAATTATAACTGTAAAGATGTTTATTTCAGGC^ 
ATArrTTrTACTTTAGAAGCCTGCATAATGTTTCTGGATITC^ 

GGAGAAAATGGGTTTATTCACTGAACTCTAGTGCGGTTTACTCACTGCTGCAAATACTGTATAT^ 

AGGACTTGAAAGAAATGGTGAATGCCTATGGTGGATCCAAACTGATCCAGTATAAGACTACTGAA 

TCTGCTACCAAAACAGTTAATCAGTGAGTCGATGTTCTATTTTTTGTTT^ 

ATTCCCAAAAATTACTTTGGGGCTAATTTAACAAGAACTTTAAATTGTGTTTT.^ 

GGCAGGGGGTGGAATTATTACTCTATACATTCAACAGAGACTGAATAGATT 

SEQ ID NO: 3871 ACCAGGGCGGCGCGTGGTCTACGCCGAGTGACAGAGACGCTCAGGCTGTGTT 
CTCAGGATGACCGAGTGGGAGACAGCAGCACCAGCGGTGGCAGAGACCCCAGACATCAAGCTCT 
TTGGGAAGTGGAGCACCX3ATGATGTGCAGATCAATGACATIT<XCITGCAGGATTACAT^ 
AAGGAGAAGTATGCCAAGTACGATTTCGTTCTCTGATTCCAGCAGAGGGAACGCTATCCAAAA^ 
TTGGCAGCAATAAAACGTAAAATTCATCCTGATCAGAAAAATATTAATGCCTATGTTGTGTTTA^ 
GAGGAGAGTGCTGCCACGCAAGCATTGAAAAGAAATGGGGCCCAGATTGCAGATGGATTTCGTAT 
TAGAGTTGATCTCGCATCTGAGACCTCATCTAGAGACAAGAGATCGGTTTTTGTGGGGAATCT^ 
TTATAAAGTTGAAGAATCTGCCATTGAGAACACTTTCTGGACTGTGGAAGTAT^ 

SEQ ID NO: 3872 ACGCGGGGAAGTCAGTTTTCAAAGGGCTGACCCTTGGCCTATTATGTITCT^ 
TACATCTGTTTAAGCAAGCGGCTCCTAAGACTACACTAAGGTTTCrrCCATCCCCTA^ 
AGTTGACAATTCCCAGTTCTCTCAGCACCAGTGATGCCTGCCCTTAAAGAACACmTGCTG™ 
CAGTGCTTGAATACTGACGGCTAATlXJrGATAAATTTCTCAGGCTTAm 
TCACATTGTCCCCTAACACATCCCTGTTAACTGAAATGCATTGCCACCATAAATCCTCACGTm 
TAGGTAACATCACTTGCTAATGTAATTGTCTACTTGGCTATTTTATI^ 

CCTAGTAGTCCATATTATATAAGTATACATATGCAATITGTATTAATTGTATAAGATAGTTATACAT 
AGCACCATATGGGAAACTGCAGTATGGAGTTTCTNCCATGGGGAGGGTATA 

SEQ ID NO: 3873 ACATGAAATJTrGCAATACTACATTCACTTTATTGTTCTTGTGTm 

TATTCTTGATATCAGTCACAATATGTGTGTATAAAGTCTTTCGCAGAAG'fTrATCATGGCAA^ 

AAAGTTCAAAGAAGAGTTCTAGCAGGCnTGATGGATTGATGAGATTCTTATTrCT^ 

AAAQCmGCAAAATGTCATTCGCAGATCTGGATCCAATACGGTATGATTGTAGGGAGAGAAGAT 

CTTTCACCTCTTGAGGAAAATTACnTAGGTACATACTTGTTTCAAGGCAGCC^ 

CAGTTGGTTTTGTATTCTGTAGCCTCAAGTAGCCTGCrrAATAAAAGTT^^ 

TGCmCTCTACTTGTTTGGGTCAGATGATGAAAGTTTTAAAT^^ 

TTTTITAACCCAAANTGNATCCATNimAAACXNCCCCCTTO 

ATNCCC 
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SEQ ID NO; 3874 GGTAa"rrrri"ll"i"lU'ril''l"l-rri"117"l"17"ri€ATGTTNGATGTTCCAAGATGAC 
TATTTGTANATCTTCCATTCTCTTTTGATmANAGGCCGCCTTCTTA 
ATCTGrrrCTGAAATGTATGTAACAGGATCCTGCCGNGAAATriTACCAGGTTTAAAA^ 
CAGTTmCTGGNGGCTTTTTGGCATTTTTTACCT^ 
TANATGANCCTCnTrTTCTTGCTTGGTTGCrTTTGm 
CTTGATTTCCTTTATTCCTTTCTTTGCTTTTAAA 
TTACTGGTTCACTTACAAAGTTTTCTTTCCACTTGGGGATTACT^ 
CATCCGCANCCCCAAGAATNAANGCNCCTNGCTGGCTGGCCG 

SEQ ID NO: 3875 ACGCGGGGTCTCGCGAGATCCCTACTGGCTATAAAGGCAGCGCCCCGGAGAG 
CTCTTGCGCGTCTTGTTCITGCCTGGTGTCGGTGGTTAGTTTCTGCGACTTGTGTrG 
TAGGAAGATGTCTTCAGGAAATGCTAAAATOGGCACCCrGCCCCCAACTITCAAAGCCACAGCT 
GTrATGCCAGATGGTCAGrrTAAAGATATCAGCCTGTCTGACTACAAAGGAAAATATGTTGTGrrC 
TTCTTTTACCCTCTTGACTTCACCTTTGTGTOCCCCACGGAGATCATTGCm 
AAGAATTTAAGAAACTCAACTGCCAAGTGATTGGTGCITCTGTGGATTCTCACTT 
CATGGGTCAATACACCTAAGAAACAAGGAGGACTGGGACCCATGAACATTCrrm 
CCCGAAGCGCACCATTGCTCANGATTATNGGGGTCTTAAAAG 

SEQ ID NO: 3 876 ACGCGGGTATAGGATTCGTGTTCGCCGTGGTGGCCGAAAACGCCCAGTTCCT 
AAGGGTGCAACTTACGGCAAGCCTGTCCATCATGGTGTTAACCAGCTAAAGTTTGCTCGAAGOn^ 
CAGTCCGTTGCAGAGGAGCGAGCTGGACGCCACTGTGGGGCTCTGAAGAAGTCCTGAArrCTTAC 
TGGGTTGGTGAAGATTCCACATACAAATTTmGAGGTTATCCTCATTGATCCATTCCATAAA 
ATCAGAAGAAATCCTGACACCCAGTGGATCACCAAACCANTCCACAAGCACAGGGAGATGCCGT 
GGGCTGACATCTGCAGGCCCGAAAGAAGCCCGTGGCCTTGGAAAGGGCCACAAGTTTCACCACAC 
TATTTGNTGGCTCTCGCCGGGCANCTTGGANAAGGCGCAATACriT^ 
AATATTAAGTAAAAGTTGTAAAATTTATTNCNTAATAAACAATTTANGANAGTC^ 

SEQ ID NO: 3 877 ACAATACCAATTGATGGAAATTTTTTTACATATACAAGACATGAACCTA^ 
TGTATGTGGCCAAATCATrCCTTGGAATTTCCCX}TTGGTTATGCTCATTTGGA^ 
ACTGAGCTGTGGAAACACAGTGGTTGTCAAACCAGCAGAACAAACTCCTTCTACTGCTCTCCACGT 
GGCATCTTTAATAAAAGAGGCAGGGTTTCCTCCTGGAGTAGTGAATATTGTTCCTGGTTATGGGCC 
TACAGCAGGGGCAGCCAnTCTTCTCACATGGATATAGACAAAGTAGCCTTCACAGGATCAACAG 
AGGTTGGCAAGTTGATCAAAGAAGCTGCCGGGAAAAGCAATCTGAAGAGGGTGACCCTGGAGCTT 
GGAGGAAAGAACCCTTGCATTGTGTTAGCTAATGCCGACTTGGACAATGCTGTTGNAAm 
CCATGGGGTATTCTANCANCAAGGCCAGTNGTGTATANCCCAT 

SEQ ID NO: 3 878 ACAGGCCnTATGTGTGGCATCCCCTTGGCTATTGTAATACTGCATAGGTGGC 
TGGGTCCTATCGTATTCAGAGCGAGGGTCTCTGATTCCTAACAGTGCTGGTGAGGAAGAGGTGCTG 
CCGACTGACTGAATGATGGGTGACATCTGTTGGTTGGTGGTAGGCAAGGAAGGATGTGATTTGAA 
TCGGTCTCGCrCCAATGTCGACACAGGTGTAATAAAATGGTTCTGATrCCACCCATCCTTm 
ATGCTCCGGAGGTCCCGATATTGCCATAATGTATTCAAGACCTGGGCTGCTGCCTTCACCACTTTC 
AGAGATGATCTGTCGCOCCTGCCTTTGGTTATGTTCACCAGCITCTCTATGCCTCCTGAG 
AGGCTTTTGCCGTTCTCCATGTTTTTGCTGGTGACCTCGTGCAGAACACAACANATGGCTGNCATC 
GGCTCATCAAAACAAGACACTGGGGCCArmCCNCCGGGAACCCGbrrGACCAAGGTCTTCCATTG 
GNGTATTTGCCTATAACCTCITGTNCGAACATTTAGTGNCNTATTCCrCAAGGGTGTC 
NAAANAAANT 

SEQ ID NO: 3 879 GGTTCCTTTTCTATGTTTAGATACACAAATACCATTGTTAAAATTACC^^ 

AGCCCAGGCGTGGTGGCTCGTGCCTCTAATCCCAGTACACAATGGTTTATTAAAGGAATGTATGGC 

CCACATCAACCTAGCAAGGATTCTACTGGTAAACCTTCCCATGGCCAAAGGAAAAACAAGCAGGA 

GTTGAGTGGCTGGGGTGGGGTGCAGGCAATGGAGAGAGGGCAGAAGGGTGTAGAAGCTGAAGGG 

GGCTAGAAGCTTACTCCTGAGTrTCTTCCTTCTGTCrrCAAATCTTTACTTOT 

CAGCTGTTTCATAGGCTGGAGATGCACTCTTCTAGACTGCTCGAGACAGCCAGAGACAGGGGAGG 

AGGGAAGAAANGATACTGTGGAAAGGGATGGCGGGGCAAACATTTAGAGCTAGAAACCACTACT 

GGGCCAATGCTAAAAGTTTCTGTCTCITAACCrAAAAAAGCCAGNGGAATM 

TTANNTNGCTAGGGTTTCCCTTTGAAATAATGAGCNGATTTACCCCCGCTT^ 

GACCGGGGC'mTGCAAGAAATA'mAAA 

SEQ ED NO: 3880 GGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGC 
GATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGG 
CTACGAGCGTOTCCTGAAGATGCAAAACCAGCGTGGCGGCO^JCGCTNTTT^ 
AGCCAGCTGAAAGATGAGTGGGGTAAAACCCCAGACGCCATGAAAGCTGCCATGGCCCTGGAGA 
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AAAAGCTGAACCAGGCCCTmGGATCrrcATGCCCTGGGTTCTGCCCGCACGGACCCCCATCT 
GTGACnTCCTGGAGACTCACTTCCTACATGAGGAAGTGAAGCTTATCAAGAATATGGGTGACCAC 
CTGACCAACCTTCACANGCTGGGTKGCCCGGAAGCTGGCCTNGGGCGAAGTTTCTOT 
CTCACTCTTAAGCACGANhriTANTACCC^^ 

SEQ ID NO: 3 88 1 ACCAATGTCTAAATAAGGTTTAGAATAGATATAGCTTTCATTrrACTGCTCCT 
CAAATCAA'ITCAAAAGTITAACTrAATCAATATAAAATTTACTTATGAAC^^ 
TTATAGGTGACTATATAAATTAGAGCTTCAAATCATCTTCATAGGTCTTTI^i^ 
AAATTAAAATTGATTTTTACACAGTGGTTTCTTCAGGAAAAATATGAA 
TACAATATTCGAAGCAAAAAAGTITCCACATATAGGAAATAATGAATTTCCCTGATCA 
TTCCCAGCTATTTCCTTCTAGCANATATATTCACTGAGAAGAAAAGCTTTAATC^ 
TACTTTGAGGGGTCAAGCTAATGAAATTTTAGATCTATGTATTCTTAAAGATO 
GAAGAGTATCTGTTCCAACCAAAGAAAAATTTCOTCTCAATTTATT 

SEQ ID NO: 3882 ACGCGGGGGTCGGTAGAGGCAGAAGGAGAAGGTCGGGTTGTAGAAGCTGGG 
GTGGCCGGCAGCTCGCTCATCGGTGTTCGTGGGCmGTCGGTCCGTGCCTCGTCTCTCCCTGG 
AGGGAGGGAGGCTTCGACGTCGAGAGGGAGCCGCTGCCGCGTTAGTTNCCAANCTTGAAGTCACT 
AGGACTTCTCrCAAACTTGTGTGCTGAGGAGACTCAGATGTTGGCCTCAGCTCCTAGGCTGAACTC 
AGCAGATCGGCCCATGAAAACITCTGTATTGAGACAAAGGAAGGGATCTGTCAGAAAGCAACACT 
TGTTATCTTGGGCTTGGCAGCAAGGAAGAGGACAGGTAGTGGAGATCCTGCAATCTGAAAAGCAG 
ACTGAAAGGTGACAAAGAAGCTGAAGATGGGTGGTGGAGAGAGGTATAACATTCCACCCCTCAAT 
CTAGAAATGTTAGTAAGAACCAACAACAGCTTANCAGACAGAAGACCAAGGAACAGAATT 

SEQ ID NO: 3883 CTCGGTCAAGTCTGCACGAGACAGCTTCCTGACATTCTCACTGGGCCCCTCCA 
CAGCCTGGGCTAGAGGTGTGCCCTGGAATGCTGTGGCATGCAGGTAGTTAAAGACCACATCTCGC 
ATAGATOCATCATTCTCCTGCATCTCCCGCAGGATCACATCACGTTCCTTCTCAATCTGTGAGTOT 
CCAGACTACAGTTCTGCACAATGTCACCCAGGAGCTCCACAGCITTCGGCAGATCCTT^^ 
GCCTTGATGTAGTAAGCTGTGTGCTCCCGGGTGCTGTAGGCATTAAGATGGGCCCCCATGCTCTCC 
ACCTCCTTCTCCAGGGCACTGCC^GGCCGATTCTTrGTTCCCrrGAAAGCCAGATGCTCCAAAAAG 
TAGCCTGCCCCATTATTCnTCTCAGTCTCAAAACGGNTGCCAACATCAATCCACACTTCCACC^^^ 
CCAATGGGCTTANAGGACTGCTCGGANGCCACACCCAGGCCGTGTCCACANGCTAACTTNGT^ 
CGGACAANTGGNCGCCrrGA(>rAANGTGCCGACTTGGCCGGACACOTAAGGGGAArrCCCCCCC^ 

SEQ ID NO: 3884 CCCGGATGTTGCGGACAGTATGAGGCAAGCGCAGGGGGACGGGGACCAGCA 
GCTGTCGCCGCCGCTCTCAGGGTGAAGAGGGAACAGAAATCTTTGCCCCCTGACTTTGGAAATCTC 
GTrrAACCTTCAAACTGGCGATGTCAAGGGrrCCAAGTCCTCCACCTCCGGCAGAAATGTCGAGTG 
GCCCCGTAGCTGAGAGTTGGTGCTACACACAGATCAAGGTAGTGAAATTCTCCTACATGTGGACC 
ATCAATAACTTTAGCTTTTGCCGGGAGGAAATGGGTGAAGTCATTAAAAGTTCTACAT^ 
GGAGCAAATGATAAACTGAAATGGTGTTTGCGAGTAAACCCCAAAGGGGTTAGATGAAGAAAGC 
AAAGATTACCTGTCACTTTACCTGTTACTGGTCANCTGTCCAAAGAGTGAAGTTCGGGCAA/^ 
AAArrCTTCCATCCTGAATGCCAAGGGAGAAANAAACCAAACTNTTGAGAAGTCACNGGCATATA 
NGTTTTGTNCAANGNCAAAACTGGGGGATTCAANAAATTATTCGTAGANAAT^^ 
CACCGGCTTTTTCCTN 

SEQ ID NO: 3885 ACGCGGGGGATAAATTCATGAAAGAAGCCACGACGAATGCACCATTCAGATC 
GAATAAGAAAGACAGAAAAACTGTGAAAATGATGTATCAGAAGAAAAAATTTGCATATGGCTAC 
ATCGAGGACCTTAAGTGCCGTGTGCTGGAACTGCCTTACCAAGGCGAGGAGCTCAGCATGGTCAT 
CCTGCTGCCGGATGACATTGAGGACGAGTCCACGGGCCTGAAGAAGATTGAGGAACAGTTGACTT 
TGGAAAAGTTGCATGAGTGGACTAAACCTGAGAATCTCGATTrCATTGAAGTTAATGTCAGC^^ 
CCAGGTTCAAACTGGAAGAGAGTTACACTCTCAACTCCGACCTCNCCCGCTANGTGTGCAAGGAT 
CTTTTAACAGTAGCCAGGCTGATCTGTCTGGCATGTCAAGGACCNNAGATATT^ 
TGTNCNCAAGTCCTITITGGAAGTGAATGAAAAGGAACCAAAGGCNG 
TTNAACTTTTTTCATTTTGATGNCCCAAAAAATTTACTNGCGACCATCArrCN- 
NAATCCNCNG 

SEQ ID NO: 3886 GGTACTACACTGAAGACAGGTTGCTCACATACTCTAAAGCACATTCTTGATAC 
AGGAAGAAGGGCTTGTGGGGAAA.GCGGCGATTTGGTATTGGGCAAGAGCCACGTTAAGCCTTCAT 
GAGGGCAGCCACCACAGCCTTGGCATTGAGGCTGCGTAGGTCACAGTAGGTCTGG CCGGTG CCCA 
CAAAGACGATGGGTTTGCTTGTGATGTACTAGGTCTTATTCATCCTTTCTAACTAT^ 
CATTAGCCATCCCCAACTCCACCCCTCAAAGTTATACTrTCTAAATAACTAGCTCTGAAAGCATCT 
(nrrACCGAGGCTTAACTGTGTGTGTTATGGAGAAAATTGTGCTATTGCTCTGAC^^ 
AGTCCTACCCCCTAGCATTTAANTAATATGACCATATTTTGGAGAAGANGGTCTITAAA^^ 
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ArrAAAACTAAAATGAGGTCATAAGGGGTGGNrrCTTAATCCATATGAAANCTKTCT^ 

AGGANATNTAACCCAGAOTCCCNAGGGANAAArmTrTA^ 

CCCAGGNGAANAGNTTTT 

SEQ ID NO: 3887 GCGTGGNCNCGGCCGANGTTNNCNGGATCGANATGAAGGTNCAAGACCNGA 
CANACAGCCTGAGTTTACTCANATNGACATATAGATGTCATTTGTAGACCAGACTGGGATCCANA 
GTTTAATTGAGGGTITGCTCCAGTNTTCCTGGCCCAATGACAAAGATNCTGTGGTTGTO 
TACTATGACTTTTGCTGAGGTGCTGGCCACCTATGGAACTGATAAACCTNACACTCN^ 
GAANATTATNGATATCNNTGATGTGTrrANAAACACNNAGATrGGATTNCT^ 
GTAAGCCCCATGGAACTGNGAAAGCCATATGTATCCTGAAGGAGNAA^^raCTTAAAAAGG^^ 
ACATrONATCCATTAAAAACTTTGTAGCTGACC^riTOT 

AACCCAATAGAAACTNGGAATTOTCCACCTGCTTAATTTCATAATGGAGTCNCAAACCCCTGGAA^ 

TTATCCCACTAATGGNGANCa^ACANGGAKAAGGGGGCCCANTACTGNTTGGAAACACAAANAA 

AAATTCTCTTirrANGGAAAAACCACTNGGATGTC 

SEQ ID NO: 3888 AC nU " i " riU "i l ' i U' ll ' l ''i l TTTTTTTTGACCACACCTGCCCTTTATTGGTCTCT^^ 
AGCANAGTGGCTCCAGGCCCTTCACGCCTOTCANACACCACCCATGAGGGTTTAGGAAGGTGCCA 
TCATTCTGTGAAGGCCCANAGCTTACCCAAGTCrTGGAGCCCAAGTTGAATCACCAACCAGAG 
TTGGGAGAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGAT 
ATNAAGGGAATGCTGAGGTCCANCAGTGTCTCCTGAAGGCATGCTGCATCCTAAGGCTCCTCAGG 
ACTGGATGGAGTAGGANATCTGTGTGTTGAGCAGTTCACATCTATATGGCAACTTTAAGGAGGCG 
OTGATGTCAGGCTCAATGTTGATGGTTTGGGAAAGTGCCGCTGTAACGTTCGAAAGGG^ 
TCCGCCCTATGANGAACrrm'CAAANTTCCAGCCACATTrGAGCGGGGCCCAAGG^ 
GANCTTTGGAACCGGCATTAAGGNAAAATGGGTCATATTAAGGGGGGGGGACTT^ 
GCGAAAANANGAT 

SEQ ID NO: 3889 GGACTTTTAATCACrrACTGAGAATATTTCAAATTTATATTCTCATCA^ 
ATGTAAAAAAGATACTGATATATATTCTATGATTTrTATGCCACAAACACCCCTG^^ 
TTTAATAGGAAAACAATTATACTGACATCTCITGTTCCATTTTCCC^ 
TCAAGATAGrrTAAAAATAATCArrCAGGTTATTCTAAAATTTTGCCATAAAA^^ 
CCrrCTGTTAATTGCTAAAACCAATATTTCACATATAAAAGCATGTGAT^^ 
CCATirCTGAAAATTAAAAACCTATCATTTCATGGGCTAAATTATAGAACT^ 
ACTACTTCATAAGACTGTTATGACAAATGCmTTATTACATAATATTTAAGTAGGTCATGCTC 
CAACAANGNCAACAGAAACTTGTGAATATTCTGNTTAAATAATCTAGCACACTAATACTACC^^ 
TTTTTAGGGCAGTACCATGGAANAATCCTGNATCCTTATTCAAACAGGTTTGGAA^ 
ATAAGANACCA 

SEQ ID NO: 3890 ACTTGGGGAAGCCATCTCTATCAl'l'rri'CrrGTAAAATCI^ 
TGGCTCTCATCTTCAGTGCTTTGAGATCATTTTTCAGTTCATTTGTCAT^ 
AACCAGCCATCCXCTGCTGTTITITGTCGTTCIT^ 

ACTATATGGTGGAACACAGTGGTTTTTITCAAAATCAGGTGTAATGACGGCm 

ATTirrCTTirrCTCCrrGATCTGTGTTAGGGTTCTCTTGTTAGACTGT^ 

TAATATACAAACCACCCAACTGCTTGATACTCANACCAGGGTCTATGCTGCTGC^^ 

GAAGTTTAACCTTTNGCTATTANGAAGTCTTCTTTCATCACTAAACTCATC^ 

NCTGATGAAACTTCTTCACITITITCAACCCCTT^ 

CTTGGNTTCCCTTTNCAAGNAAAAANTTTTTNNCNCN^ 

SEQ ID NO: 3891 GGTNCAACCAAGTTACTTTTCCTCTTACTGAATTTCATTATGTTTCA^^ 
GCAGrirrTATTCAGCAGAGAATAATTCTTAAAATCTTrrCT^ 

gtagcttcacattatactaccatgtrtcacccttcgtttcccaaatacacaaaaaac^^ 

tttcaaaaatgaaataagatgcccacattaaaaaaataaagcctacaaaaagtt c^^ 

aaagattattcatatggcacaaagtgatctcctactagtccaaagttcaaaaacat^ 

tccatttatatatattttgnttgctmcaatgaaatgctacattr^^ 

ttttaagtctttggagtcaacaagtggtcaccagaaagcctccaaggatngcttgaagtcccaccg 

tctctggatgccggnggctcanaaccccccgtntcattttatgggaggctctggggactgggggc 

accctgcaaggagctggccagccagccx:ggnngnggatgggttccttgggggggggggggttnc 

CTCANNCCCGANTTCTGGGNAATNATTAATG 

SEQ ID NO: 3892 ACTGTGCATTICCTCTACTTGCATGGCCAATAAATACA GCTACGA CCTGTTT^ 
AAAAACAAAACAGATGTGAACCTCTGGCATGTGCTAGCTAGTGAAAGGTTCTTT^ 
GTAGCAATGATAAAACAGATTAGAAAGATACAAAATTATCCTAAAGGCATAAATGCAAATAAATT 
rmATGGCAAATACAGATGAAAATATAATGGGCTAATTTATTAATGATTAAGAATATAT^^ 
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AGCGTmCATGGATITGGGAAGTCrTGTTCCTATATTAGCTTTCAT^^ 

ATAGGGAAAGGCAATANGCAAGTTTTTTCCAGTCATTAAGTGTGACITITATC 

ATGCAAGAC ATCTTTTCCTAAAGTrrAGTAAAACAACTTTAATAAG^ 

AAGCnrrGGTAATATTAATNNACCTGGTATTTCCACCTGTCTCm 

CAAAAGTNTirCACrAAAAAATAATAAAAGTCCrnTm 

GGGATCCNAAATGTTTCATOTGT 

SEQ ID NO: 3 893 GTACGCGGGATTGGCACATGGGTGGACACGGATCTGCTGGGCTCTGCCTTAA 
ACACACATTGCAGCTTCAACTTTTCTCTTTAGTGTTCTGTTTGAAACTy^ 
TTTGTGTTCATTTCATTTCAGGGTCTTGGCTGCCTGTGGGCITGCCCAGGT 
AAGGGAAGTAACAGACACACNATGTTGTCAAGGATGGTTTTGGGACTAGAGGCTCAOT 
GAGATCCCTGCAGAACCCACCAACCANAACGTGGTTTGCCTGAGGCTGTAACTGANAGAAAGATT 
CTGGGGCTGTGTTATGAAAATATAGACATTCTCACATAAGCCCAGTTCATCACCATTTACTGC^ 
ACCTTTCAGTGCAGTTCTTTTCACATTAAGCTGTTGGTTCAAACT^ 
TCTCTGGGAAAGTGGCAACGCATCCTGCANGGCTTTGTNCTACTGTGCTTTTTGGAANA^ 
T^rrTNTNANGGGCTNTANGGACTTGCCAGCTGTTTAACCAGGAA 
1TNNAAT 

SEQ ED NO: 3 894 GGTACCCCAATCTGAAGTCAGTAAATGAACTAATCTACAAGCGTGGTTATGG 
CAAAATCAATAAGAAGCGAATTGCTTTGACAGATAACGCTITGATTGCTCGATCT^ 
CGGCATCATCTGCATGGAGGATTTGATTCATGAGATCTATACTGTTGGAAAACGCTTCAAAGAGGC 
AAATAACnrCCTGTGGCCCTTCAAATTGTCTTCTCCACGAGGTGGAATGAAGAAAAAG 
ATTTTGTAGAAGGTGGAGATGCTGGCAACAGGGAGGACCAGATCAACAGGCTT ATTAGA AGAATG 
AACTAAGGTGTCTACCATGATTATTITrCTAAGCTGGTrGGTTAATAAACAGTACl^ 

SEQ ID NO: 3895 GGTAC' riiTinuH4 - iiin ' iU Ti T i un T iu - iTi 'TTTTin"ri'rriTATTO 

GTTATTTATCAAGAGGAACTATTTNTTAGCCCACATATTCATGNGTCATAGTTCAGGAACACAA 

CAGNGACAAACTTCTAGGNAATTNAACCTGAAAAAATTCTTTATATTCCAA 

TX^AAAGANACCAGCCTTCCTNATTTCCTCAAAATTmrCATGACATO 

GNGTATGCCTTrTTTCTTTGATCAGCCACNCGAAACTTNTACAAAGCT 

NCGAATGCTACANCCOTT^TNATTTCTCAAACNCCTGGCAAAAGGCC^^ 

AAAACTTNGGGNGCCATGGGAANTACTGGGCTTGANACCmTGCCAACCT^ 

CTGANTAAANGGAAAACGAACCGTCCCCC 

SEQ ID NO: 3896 GGTACTGTCTGTCTTCACATTCATATTCCAGATTTATATTTTCTGGAGTTAAAT 
TTGGATGATTTCTAAATTATCACAAAGTGGGACCTCAGCAGTAGTGATGTGTGTGTCTCATGAGCA 
GTGAGCACAGTCTGCATTCATCATGAAACACTATCTTCTACCAGGAGGAGGTTAATGTAAATCACC 
AAATCCCAATGCCTTCTGACTITCATAGGATTCCTGATCATGCATOTTGATGTACAAT^^ 
GAGTTGATATrAATAGAAATTATTCCAAAATTArrCTTGTCACAAGTAACTACrATATCCCACAT^ 
AAAAGGGAAAAAATCCCACCCAATCACAGAAAAGGCATCCTCTGTATGTTTCCGTGGCAATGC^^ 
TGTITATGTATTCTCAAATrTTGTCTGGCTAGTTATCCACCGCTTCTCAATGGArrCAT^ 
TTGGAGAACCATATAGACTAATGACAGCATCTTGGGACACACCGGNCGTATCAAGTTCATGGNGG 
ACATTCCTATTATAAAAAAAACTCANGNCTTGGAGAAATTCCTGGGCTGGAAGGATCCCAAATO^ 
TTXTTTAAAAACATTTNTCTT 

SEQ ID NO: 3 897 ACTTTTTTTTTTTTTITm 

AAACTGAGAATACAGCAAGTAGGGAAATCCCAC^VTCAATGGAACCATCACACAGATGCCTTTCTG 

GAACCCCAACCTTCTATGATCCCCAAAAATGTGCTTTGNGGCTTTAGCATAAm 

GAGGGAAAATACTGAAAAAGGCCACTATTTAATGGTGAAAGAATGAAGCTGTAGAGGTCCCAAC 

CAGCCTAGGGCCAAAAAAGAAAATTAAAAATNTGCACAGAGCAAGCAAGCCTCTGACTGCTGAG 

AGTAAGGCATTCAGGCGCCAACCTGGTGAGAGTTCTCAGCGAGCACTGTCAGGTCAAGTGCACAT 

GAGGCTGTTGGTAGTGAGCTGCCCGTAGACACACACATAGAAAATGGGCACATGCCCCATACCAC 

CCGAAAAANGGAGATTTACTGGATTACACATCAGNGATGGTAATTCATTTTTCCTAAGGG^^ 

tatottgaaccaaancttccacttttgggnggggggactgnacccaaagg™ 

GTGGGA 

SEQ ID NO: 3898 Acrrr ri ' rr i'riu^iii i i-i i iui ' ii^uu ' iN GAAAATGTTTA 

AATTTTTATTTTGGTTTTCrrACAAAGGTTGACA 

AAAATTCAAATTTTTGGGGGAGCGAGGGAAGGAGTTAATGAAACTGTATTGCACAATGCTCTGAT 
CAATCCTTCTTTTTCTNlTITGCCCACAATTrAAGCAAGT^ 
CAGCTTTCAGTTAAAAAAGAANAAGAAGAAATGGCNAAGAGAAAGTTTTT^^ 
TTTAATTTAGATTGAGTTCATTTATTTCAAACAGACTGGGCCAATGrc^ 
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GCACCACCCGATGTCCAAAGGTCCMTATCAANGAAGGGCAGGCGTGAATGGCTTAm 
NATTCCAANGATGGCTITCCCATTCATTGNCOTmAGAGCANCCa^TTTCCA 
GTGAACCTGCTNTTGNOCTTCAACACCAAGTTCAACATNATTTANAACCC^^ 
TTTTAGGTCG 

SEQ ED NO: 3899 TACAAATATTTAAGAGTGTTGATTGGGAGTAAGGGAATGTCAACTGCCAATA 
AAGTGGAAGATGAAAGAATAGGACTTTACACAGAGCATATTTAGrTATGGGTCTCTGTCTCCTCC^ 
CACACAGAAAAACTCCCAGACATTCATGACnTCATCCACCCTGCCTGGCAGATAGC^ 
TGACCCCCTGGCTCTTCACCTCAAAAGGTTTATCmCCACCACACTAGCAAAGACCC^^ 
CACAACTCATACTGCCAGCTAAGCATCTACCCCGAGGGACAAGGCAAGCACACACTAGGGCAGCT 
GGGCCATCCTGGCCCTAATCCCTCCAGCGGTGTCCACACTGAGCATTGCAGCACTTGTAGAAGGTG 
GTCATCGGCTCATCTGCAGAACGGGTCTGAACTGCATGAAGTAAGCACGANGATGTTCGCATTTG 
GGACACCACrCTGCAGTAAAAGTCAACATTTTraCCANGCAGCTGNTCCACC^^ 
CTThrmCAATTTTNGGTNCCTNCCCGGAAAAGCTCCNm 
CCCCCTCAT 

SEQ ID NO: 3900 GGTACTITITrTrrTTTTTT^^ 

ACAGGGGTTGCATTGAATCAGCAGATTGTTTTGGTATATAGATATTTTAATAAm^ 

ATCCATGAGCTGGAATAGCTTTTCTTITACTTTTrATTm 

TGTTrrATCATTTTCmTGTAGAAATTACAAATAAAAAGATNTG™ 

TTGAGTANAATTAGTTAGTmAGTTCTTTCTTAAATTTATGGTAGAATTCAGC^ 

GATCCTATGCmCCTTTGAGGGGAGACTTATTATTAC^^ 

TCTCAGGNrnrCTGmCCTCCATGGNrrAAATCTTGGTAGGOTGTATGGNTCCCA^ 

ATTTCTTCmGGGTTTTCCAATTTA^ 

TT^^TTITGCAAATTTTTTACNT^^m 

AAATAANTTTT 

SEQ ID NO: 3901 ACATGGCCGCCGTCCTGGAATACCTGACAGCGGAGATTCTGGAGCTGGCTGG 
CAATGCAGCGAGAGACAACAAGAAGGGACGGGTCACACCCCGGCACATCCTGCTGGCTGTGGCC 
AATGATGAAGAGCTGAATCAGCTGCTAAAAGGAGTCACCATAGCCAGTGGGGGTGTGTTACCCAA 
CATCCACCCCGAGTTGCTAGCGAAGAAGCGGGGATCCAAAGGAAAGrrGGAAGCCATCATCACAC 
CACCCCCAGCCAAAAAGGCCAAGTCTCCATCCCAGAAGAAGCCTGTATCTAAAAAAGCAGGAGG 
CAAGAAAGGGGCCCGGAAATCCAAGAAGAAACAGGGTGAAGTCAGTAAGGCAGCCANNNCNTNC 
TGCACAACCCGAGGGCACACCTGCCGACGGCTTCACAGTCCTTTNCACCAAGACCTOT 
CAGAACTGAACCTTATTCACAGTGAAATCAGTAATTTAACCCGGCTTTTGAGGTO^ 
TCAATCCTACCAATGCTGACATTGCCTTNAAGAATAACTTAGGAAACCCCCTGGAAAAAAAAGGN 
GGCAANGAGTTTNTNGNAACTT 

SEQ ID NO: 3902 ggtnctttcctgccttttagttcctgtgcacagcccctaagtcaaot 

TTCTGCATCTCCACITGGCATTAGCTAAAACCTTCCATGTC 

GCAGTGCCAGGAACCCTTAAACAGrrGCACAGCATCTCAGCTCATCTTCACTGCACCCTGGATTTG 
CATACATTCTlp\AGATCCCATTTGAA 

TATTTATATITrGCCTGTTAAAAAGAAAGTGAGCAGTGTTAGCTTAGTTCT 
TTATGATTAGCTTTGTCACTGTTTCACTACTCAGCATGGAAACAAGATGA^ 
AGTGAGACAAAAITGATGATCCATTAAGTAAACAATAAAAGTGTCCCA 
NNNNNNGTCCTll"rri"lTITn'lTTTTll"i-m 

TTTTTNGGTGGTACCCTCGGNCNAAATAACNCAAAATTTCCAAATGTTCAAGGTCC^^ 

SEQ ID NO: 3903 GGNACAGTGATTTGGCTATAGACTCTCGCCCCTTCAGGGCANACTGTCCTCAG 
TTCATCCTTATTGAOAGAGAAAAGTTGTGCACCATTTAATACTCCAAGACTATTGACAGTCACAG 
GTTGAATCCCrrrGACTGTAACCACGTCTTCACATCCTCTGGTGTGGAGTCGNAAGTGATATTGAT 
AACTGGCACGTTCTGCCGTGGCACATGGAATTTOTTCTGAGCGGCACTCCGACCAATC 
GTGGATGAGrrCATCTTGCACTTCTCCATCTGAGATTTCCTTCGGTCCACCGGAAGTTGm 
TCTGGCTGNCTCGCACGATACTGCCACCACTGTCACTGGAGCTGCTGTTTTGACGTGTTATAm 
TGGGACCTTTGACACATGAACAGGTGCTGGAGTGGAAAGGGGGAAGGGGAACANGAACAGGAAC 
TGGTGTTTGGANGAAGTGATNGAGCAAGGGGAGTATCANCTGGTCTTGGCCATACTCATCCTTG 
TCTGNATAGGATGANTOTAAGGGGGATAAGGCCCCCAATCCANATTTTGGNGNCTAA/^ 
AAG 

SE Q ID NO: 39 04 GGTACTTCTTTTTTTTTT^ 
TTirnriTITrTAAAAACCNNTO 

TNTTCCTCCTGAAATTACATAAACAAATGCAAANGGAAANAATCCAAGTNTAAATTO 
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AACAGCCCTCC^™ACAAAAGCGNGTAAAATTACAANAACNCT^^I^ 

ANAAAACNATAATNTNGAAAACCACAAAATTGCCAAATTGNTCCCTAAACTN 

OSITGACTAATGAATGAGTTTGTTTrGTAAAAAAAAATCATTCAAATAAAT^ 

AGATGCAAAGGimGGCTTCmcrrCCTATCTAOTGGAm 

AAGAAAAANTACCCmTTTTGAAAGNGGCTNTTTGTTGNAAAAGCTGGN^ 

CCITGGTAATGAAAGTTTNNriTCCTGGG(>JGNCC^^ 

CCNTTTTTT 

SEQ ID NO: 3905 GGTACACTGCCCAGGCAAAGCGTCCGGGCAGCGTAGGCGGGCGACTTAGATC 
CCAGCCAGTGGACTTAGCCCCTGTTTGCTCCTCCGATAACTGGGGTGACCTTGGTTAATAT^^ 
AGCAGCCTCCCCCGTTGCCCCTCTGGATCCACTGCITAAATACGGACGAGGACAGGGCCCTO 
CTCAGCTTCAGGCACCACCACTGACCTGGGACAGTGAATCGACAATGCCGTCTTCTGTCTCGTGGG 
GCATCCTCCTGCTGGCAGGCCTGTGCTGCCTGGTCCCrGTCTCCCTGGCTGAGGATCCCCAGGGAG 
ATGCTGCCCAGAAGACAGATACATCCCACCATGATCAGGATCACCCAACCTTCAACAAGATCACC 
CCCAACCTGGCTGAGTTCGCCTTCAGCCTATACCGCCAGCTGGCACACCAGTCCAACAGCACCAAT 
ATCTTCTTCTCCCCAGTGAGCATCGCTACAGCCTTTGCAATGCTCTCCTGGG 
TTACGATGAAATCTCGAGGGCCTGAANTTCAANCTCCGGAGAATTCCGGAGGNTCAAA^ 
AGGNTTCAGGAAC 

SEQ ID NO: 3906 ACTTTTTTTTTTTTTTTTTTTTTTT^ 

TTTCTCTGTGTAAAACCAGTGAATATAACTAAAGTGTTAGTGGATTGGATTAAAANAAACTTA^^ 

GGCAAGAACAGGTAATGTAGTTATCCATGACTACTTTTAACCATGCANACTAATAATAT^ 

GTTTATAGCTCGGCACCTTCACCTTTTTTCACTGGTATTTCATGTAAGGCATC^ 

GTCCTTGTANATCraTGCAGGAGCGGGTGAANACTCATGTCTGTCTCCGTCTTCTO 

AGTAGCTGCACATGTTCCGTGACAATCTGTCTGAGGTCTGTCATTTTCTGGAGC^ 

AGCTGTGAGGACTCAGGGTGGrrCAGCTTCAGCTGGAGCTCCAGGGCTTGTAGCAGGTTGTCTT 

ATGTCTTCAATGGGCTTCACATTCAGCAAACCTGGGCGGTCTTCACTGAGAATAATGACAGC 

AATATTGCCAAGTCCCTGTCATNTAATTCCAGGCATTTGAACTTTAANAAAANTTO 

NCA 

SEQ ID NO: 3907 ACACAGAGTAAAATGTTmCTTTTTTCAGGACCTTGAACTGA^ 

GCTTTGGTTTCTATCTAGGAAGCTCAGCGACAGCAGAGTCTGTAGAGGCGGCCACTGATTTCACAC 

ACCCCGGAGAGGGACTCACGGGTAGCACAACGGCCGGTTCGGCAATAGCAGGTGGCTCTTGCCTG 

AGAACCTGAGGTTCTAAGAGCAGAGAGTCCATTTCCTGCAAAGGAGATAGCAAGGTCCT 

CTTCCCCAGACTGCTTCTGGGTTGTAGCCTCATCAGCTCTTTCCTGGAGTGACTCA^ 

CAGGGCCACCANGAGAATGGCAGCAAGGATGGCGATGGTCCTCATGGCTGGGGTCACCTGCAGG 

AGGGAGAGCANGAGTGGATATATGTCCCGCGTTCCACAATGGTITATTNAAAGGAATGTATGGCC 

CACATCAACCTANCAAGGATTCTACTGGTAAACCTTTCCATGGCCAAAGGAAAACAANCN^ 

TGATTGCTTGGGGTGGGGTGCNCXjCAATGGAAAAAGGGCAAAAGGGGTATAACTTAANGGGGOT 

naacttactct 

seq id no: 3908 ggtactttttttitttit^^ 

ccaaaagatggcagaaagaanaaattcatcctgaaagtatagtttggtgcgattctgttganatg 

gctcttccctctgaacgtgctctcctactgaccaccccactggagtcttgtttgtcttgcagcagw 

tctaaacacttcactgattcccacgtganaaggcaggagccatcttcaaatccacagattccaag 

gagagagtaacgtatntctcanaagaacagcatcttctatctcagaagagtacaaaaggtgag^ 

gtgaggagatagggagltcrrccttggctggctggcttcataatccctgggccccgcagataatta 

aatcgactttttctgtctcaggcatttgtatgacctcttttggaggttc^ 

tgnatctgatggacccatctcaatttaaaaactctgncaggttcgggaggtcatgcttgtcatn^ 

acccttrrggangctcaaaggngcccriggcttgancccaaaatttganac^ 

ggngaaac 

seq id no: 3909 acocgggggcagagctgcggaagatgaatgccagaggacttggatctgagc 
taaaggacagtarrccagttactgaactttcagcaagtggaccttttgaaagtcatga^ 
ggaaaggtttttcttgtgtgaaaaatoaacttttgcctagtcatccc 
arrtccagctcaaccaagataaaatgaatttttccacactgagaaacattcagggtctatttgcr 
cgctaaaarracagatggaattcaaggcagtgcagcaggttcagcgtcttccarrrctto 
caaatctttcacrggatgttttgaggggtaatgatgagactattggatttgaggatarrct^ 
atccatcacaaagcgaagtcatgggagagccacactrgatggtggaatataaacitggtttac^^ 
taatagtgtgctgttcatggaaaccgagggctgcatctrggttatagtcatcritggacctcgc 
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SEQ ED NO: 3910 GGTNCCATCTGCGCCATCCTGGAGAACTACCAGACAGAGAAGGGCATCACTG 
TGCCTGAGAAATTGAAGGAGTTCATGCCGCCAGGACTGCAAGAACTGATCCCCTTTGTGAAGCCT 
GCGCCCATTGAGCAGGAGCCATCAAAGAAGCAGAAGAAGCAACATGAGGGCAGCAAAAAGAAA 
GCAGCAGCAAGAGACGTCACCCTAGAAAACAGGCTGCAGAACATGGAGGTCACCGATGCTTGAA 
CATTCCTGCCTCCCTATTTGCCAGGCTTTCATTTCTGTCTGCTGAGATCTCAGAGCCT^ 
CAGGGAAGCCAAGCACCCATTCATCCCCCTGCCCCCATCTGACTGCGTAGCTGAGAGGGGAACAG 
TGCCATGTACTATTTACrrAAAAAAAAATTCAACCAATTAGAGAATTTCm 
GAAGTGTTTCAAAAAAGAGACTTCCCCAACCCCCCAAGAAGGGGCTTTGTACCTGCCCNGGCGCC 
GCTCGAANGGCGAATTNCAGCCACTGGCGGGCGTACTATGGATCCAGCTTCGGGCCAAC^ 
NAATATGGNCTACTGN 

SEQ ED NO: 39 1 1 ACACAAAGAGGGGGTGGGTGTCGGATGCAGAGTGTGTGGCCTGATGCTCCAC 
GGCGTGCAGGACGGGGGGCTAATAGTAGGTTTCCTTCTCCACCCAGCCGCCAGGGCGTCGCCTGA 
TGATGAGTTTTCTGACTTCGTCATATACGAAGATGAGAAGAGAGTAGGGGAAGGCACAGAACCAC 
CAGGTAGGTTTGAGGGGATACATCCTAAGAGCAACACCCATTCCAGGGCAGTAGGAAAGGAAAG 
CAGCCAGGGCTGTCTCrTCAAAGAGGCCAAATATCAAGATCTTGTTCTTC^^ 
CCGAATTCCTCCTGGTCTTACAGATGACCAAGTCGGCCCACTGCACCACCACGATACTGACGAAA 
GAAGGCTGTGTGGGCAGGTGAACTCCACGATTITCCTCTGCTCATAGGTCCACTGCTGCC^ 
TGTCrrCCACATCGTTGATCCAGCGGTCATTCCANTCCACTCGGAGGCCCAACANGTGAATTGGGA 
NGAAGCCGTCTCAGCCAGAATCACAAAGTAAGTAAAGAANOTCCANGGNCTGGATCATTCAATC 
TGNCCTAGGCCTGCT 

SEQ ED NO: 39 1 2 ACCTTTGGGGCATGGGGGCATTACATGGGATGCTTGTGTAATCGACCACCTA 
GCCTTCTCTCTCCCCTCCCGTCCTCCCCCAGAATCACTTCCTAGGACACCCGAGCTGCTTGCCCAGG 
GTCCTGTTTCCCTGCTAACTCCAGAGAAGCATCCCAGGGCTTTGTGACAGTCTCTAATTCCC^ 
TTCTCGTTAAGAATCATATTGTATAGTAGCTrrCAGACCATACAGTATTCATTG(^ 
ATTATCAAGTAGCTGGAATTGTGAAGGTCGGAGTAGTTAGATCTTTAGCTTTTTTCC^^ 
GTATTACTCTCCATGTGTATAAATTATTGATCATGTTGCTGGCTTTTATA^ 
GGAGCACTGCCTCAGCCTTTGCACATGGTAATGAACACTGGTTTTAAATAAAAGAG 
G 

SEQ ID NO: 39 1 3 GGTACTATGGAATTCCATTTATGGAAAGTTCAACAACTGGCAAAACTCACCTT 
CGGTTACAGAATTCCAGACAGTTGTAACACTTGGCAGTAGAAGATGACTGAAAGGGAACATGAAG 
GAGGCATCTGGAGTTCCACAAATATTGTATTTCTGGATTTGTGGGCTGGTTAGATCTATATGTG^ 
GTTTGCAAAAATTCATTGAACTGTACCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCT^ 

cagccccgtcctcttccmctgctagccrggctaaatctgctcattam 
caaactaagagtgataagggccctactacactggcttttttaggot 

ATTGGCCCAGTAGTGGCTTCTAGCTCTAAATGTTTGGCCOjNCATCCTT^ 

TCCTNCTNCCCTGTCTOTGGCTGGCTCGAGCCAGCTAAAAAAAG 

NCTNGGTa^TTTGGGraTTAAAAAAGTAAAAAATITGAA 

SEQ ED NO; 39 1 4 GGTACAAAATTCAAATACCAACAAAACTGATCTGTGATGATAGAAAGCATAT 
CACGTTCAGGGAGAGGTGGGACAACTGGAAAGGGGCTGAAGGGAGATTTCCGCTGGGAAATAAA 
TATTCTTAACTTGTTTTGAGTCTATAAATTCTCAAAACTCAATTCAAGTGC^ 
rrArrGCATGCTTATTAAACAAAATAACAGAGTTTTTATTCTGAAACAAGAAA^ 
TTGGAATTTTTAGATTATGTCTCCCTTTCTCTACTTGCAAAACCCAGAACTGGG^^^ 
TCTTAATAAATGCTITrTGAATGAACAGATACATGCAAAACTAAAAGC^^ 
AGCAGAACCAGCCITTTCTGTGTAGTGGGAGTATCnTrATCACCTCCAGAm 
TNCTCTrCCTTNTNCGCCTCGNGGNGGGGTCOTGACTCNCTGNTCGCCTGTO 
TANGATTAACCGATTCTTGGCCAAAAA 

SEQ ID NO: 3915 ACAGTGTTCCAGCCATCCTGCTTGTTTTTTCCCTCCAATACCTCCCAG^^ 
AAACACTTGCATCGAGTCTGTTCCTAAGAACTAGTTTTGAAAAAGAAGCGATGTACTGTO 
ATGTAGCGTGTGGCCTCATCAGTCTCCCCACCAAAAGCTATCTTTTGTCCrTCAAGCA^ 
ATCCTCTTAAAATGACGAAGATTGATGATCCTTTCATAATCAGGAGACTCTm 
AAAATTCCTTCACTGTTTCCTTAATOTCCATACAATTTGATm 

ATAGTCGGGTGCAATGCAGGTTTGGCCACAATTCATGTATTTTCCCCAGGTTATGCGTCTGCAAAC 

AATGTCCAGGTCACAATCTTTATCAATATAACATGGACTTTTTCCTTCC^ 

GGTCAANATGCTTGGCAGCAGCTTTNCATGACAATTTGCCAACCGCAGTGGm 

SEQ ID NO: 39 1 6 ggtacttgggatcccccagttccaaagtgtcttaaagtgtcgacttcttccac 

TACAAAATCTCCAGCGTCCAGTGCCTCAGGTCCTAGGCCTACTTACAAGCCTCCAGTCTCAAATTA 
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TCCAGGATATCCTAAACCTGAGGAAGGAATACTTGACAGTTTGGATGTTTGGGTCATTGCTGTGAT 

TGTTAT^GCCATAGTTGTTGGAGTTGCAGTAATTTGTGTTGTCCCGNACTTT^^^ 

TTTTTTCATCNCTGAATTTAGGATTTACTTITCTGAAAAACm 

GTTAGGAAAGGAAAAACTTITACCCCCAACCCACTGCCCAAGGCCTGTCCTCAGNCCC^^^ 
lOTTTAGGGAAACTATTACNCAGTCCTAAAANGATGCCTGGAAACACTGNNNGAm 
TTCCCCACCTNAAAAGTCTTGCCCTANAACTGGGGGNAAAAATTTNCT^ 
TTTCANCTAACTTNANGAGCTTTTCCirnSI^^ 

SEQ ID NO; 3917 ggtacaaaggcaaagtagaataacaaaaaatattttactaaaacataagatt 

TACAGAAGTTTCCAGACAAGCCATACAAAATGGTCACAAGCTTTTTITGA^ 

CTTGACAGCAATGTTATTAGTGAGGGCTGTGATGTTTGTTTAATGTTCCCATTrrGGTTCC/^ 

CAAGCTTGTCCATCTACAGCGTCTAAATAAAGTTAGACTTGGCTAGAGCATATTCTAAAGACCTGG 

TTAGCTGCTTTTAACCAATGCAATTAGATCACCAAAAAAGGGGGAAAGGAGCCCATA^^ 

CTACCTCCCCCCTCAAAAATAAAATAAAATAAAATAAAATAAAAACACCCACACCCCT^ 

ACCTGACAACTACCTTCATTCACAGTGCTTTATACTTAAACCAGGATGGGGGGAAATGAATAAAA 

GCAGGGANGGGCCNCTGNTTTTNAACGTTTCCAACAATCCAATGATCrrTCTACC^ 

ATGACGGGGATNAGGGCAGAATTAATTTTGTTAT 

SEQ ID NO: 39 1 8 GGTACTTGGATTATGGAGAAGAGAATTGGAAGAAACAGACATCTCAGTGCTT 
GAAGAACCTGGAAACATTCTGTGAGGAGACCAGGAGGAATTTTGAAGCCACCITAGGTTGGCTA 
AAGAACATGCrrGCTCAAGAACTTATGGTCTAGGCACTCA<m*GCCriTGGGATO 
TTGAATCACTTTCTGACTCCACTATTTACATGGCATTTTACACAGT^ 

GTAACTTGCATGGACAGGCAGAGTCTCCGCTGGGCATTAGACCGCAACAGATGACCAAGGAAGTT 
TGGGATTATGTTTTCTTCAAGGAGGCTCCATTTCCTAAGACTCAGATTGCAAAGGAAAj^ 

cagttaaagcaggagtttgaattctggtatcctgotgatcttcgccgtctctggcaangatcttgt 
tccaaatcatctttcatattanctttataatcatgtggctatgtnggccgga 

GGNCTNCAm'TGGANANasfAATNGGCArrTTCTTCTTGAA 

SEQ ID NO; 39 1 9 ACATCTTCCAGAACGTAATGGTCATGATCCTGGTCGTGGACACCAAGATCTTG 
ATCCTGATAATGAAGGTGAACTTCGACATACTAGAAAGAGAGAAGCACCACATGTTAAAAATAAT 
GCAATAATTTCrrTGGGAAAAGATCTAAATGAAGATGACCATCATCATGAATGTTTGAACGTCACT 
CAGTTATTAAAATACTATGGTCATGGTGCCAACTCTCCCATCTCAACTGATTTATT TACAT A 
GCCCTGCATTGTTATATCAAATCGACAGCAGACTTTGTATTGAGCATTTTGACAAACm 
AAGATATAAATAAGGATAAAAACCTGGITCCTGAAGATGAGGCAAATATAGGGGCATCAAGCCTG 
GATTTGNGGTATCATTTCTATCACTGNCATTAGCCTGCTTTNCnTGCTAGGCGTGACTTGGTO 
TCATTAACCAAGGGTGCTTCAAATTCCTTCTTACATTCCTTGGTGCATTANCT^ 
AGTGGGGAACCCCTTTTTTANrTANTTGCCC 

SEQ ID NO: 3920 GGACCTCTGGACCCACTCCTGCTTGGGGTCGCCACAGAACTGCTGGCCCTTCT 
TGGTGGTGAAGATCACTCCTGCCTTGAGGCATGTGCTCCTGCTGGACAGCTGGTAGCTGACCACTC 
GGTTCTCAGGAATTCTCTTGGAAACAAAGAACATGCAGCAGGGAGAGGGGATGACCACAGAGCC 
CGTAGGGATGATGTGGTGGGCACAGACACCAAGGAACAGAAGGCTGGTTACTATGGTCATCAGGC 
CTGCCATGTCTCAGAGAGCAGAAGCACCAGCTCGGGGCTCAAAGCTGACGTGCAGGAGGAAAGG 
ACCCGGAGGTAACAGGGTGACAGCCTTGGTGCCAAGTGCAGAGCAGAAATGATCCCCGCGT 

SEQ ID NO: 3921 ACTITrTTrrTTTTTTT^^ 

ACTGGGAAATTCCATGTGAAAGTGAAACAAGCATGAGTCAAGTCAACCAGGGAAGGAATCTGGG 

GACAGGCCAAGGAGCGGGAGGTGGGGCAGCGAGGCANTCCTGCTGGTAGGAGCCCTGAGGATTT 

CCCAGCTrGTGTGCGCTGCCTCTGGCATCCTANAGACCCGGATTTACTCAGCTAGGAGAGAGGATG 

GATCACAGGGTCTAAGGGTGGCCATTCAGAGGTAGAAGATGGAGGGGCGGCAGATTCTGGCAGG 

GCAGCANAGGGCTCAOTGGCCATGGCTTGAGGGGTAAAAAATTNAGGACATCCCCCAGTGCTGCC 

TCACCAAGGCTTCTTCOjAANAAATNAmGAAAATTTNGTGGNGGNGGGACCT^^ 

GCCTTTTTGCNGGAGTTTTTTTCAACCANAACCCAANCA^ 

CAAANCCCCTNAAAAAACCTrGATTTT(XNTTCAAAGGA^ 

SEQ ID NO: 3922 CTTAGCNTGGTGCGCGGCCGNGGGNCGCGGGTATCANANCCATGCGNAGAGT 
CCGGAAGACAGACAATAATCGCATTGCTAGAGCCTGNGGGGCCCGGATAGTCAGCCGACCAGAG 
GAACTGAGAGAAGATGATGTTGGAACAGGAGCAGGCCTGTTGGAAATCAAGAAAATTGGAGATG 
AATACTTTACTTTCATCACTGACTGCAAAGACCCCAAGGCCTGCACCATTCTCCTCCGGGGGGCT^ 
GCAAAGAGATTCTCTCGGAAGTAGAACGCAACCTCCAGGATGCCATGCAAGTGTGTCGCAATGTT 
CTCCTGGACCCTCAGCTGGTGCCAGGGGGTGGGGCCTCCGAGATGGCTGTGGCCCATGCCTTGNC 
AGAAAAATCCAAGGCCATGACTGGTGTGGAACAATGGCCATACAAGGGCTGTTGCCCAAGCCCTA 
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NANGTCATTCCTCGTACCTTGCTTATGGCAAGCATTTNGTCTAmTGCANAATGTO 
ATANAAAAATiGTAAATCCTTGAAATGCTAAATAAAATGTTTTTAAATm 

SEQ ID NO: 3923 GGTACGCGAAGGATATCGGTTTCATTAAGTTGGACTAAATGCTCTTCCTTCAG 
AGGATTATCCGGGGCATCTACTCAATGAAAAACCATGATAATTCTTTGTATATAAAATA^ 
GAAAAAACCCATAAAAAAAA 

SEQ ID NO: 3924 ACAAAGCTAAAGAAAAATTGTGGTCATTGATGATGGAATATATGACTTGCAG 
GCTTrOAAGTCTGCAGAATTC\AGAAAAGAGCTGCAAATGCATTm 
ACATGCTTTACCCTAAAGAAAOTGGGGCTAGGGGAAATGAAAGGAAGCCTGAAGA 
AAAACATGCAATATACTTATTCACTGTCTAAGTCTGTAGTATAACATGAACTGGAGTCTCT^ 
TTTCTAAAATCGCATTrrGTANCC^JAAAAAAAAAAAAAAAAAAAA^ 

CAACAGTCTCCCATCTGTATTCAATGGCGCCCCCAATACAGTCCTTTGTTTGGATGCTGGGGAG 

TAATCCCTACCCCAAGCNCCATATAGATAAGAAAACCCTCTCCAGTTGAGCTGAACCACANACGG 

TTGGCTGATGTTCCCCCACCCCATGACACAGTTNCTTGAATNGGAAGAAGGNGGACAACAGGGNG 
TTTTGAACTTTAAOGGCTCCNCTNTT 

SEQ ID NO: 3925 GGTACAGACAGCTGGTGTCArnTGACACCTGGGGCCACACACAAAGAAAGC 
AGCCCAGCTGATGCTGCCAAGAGAAACTGCTGAAGGGCCAGACACATTCCAGCAAGGAGGGGTG 
TGGGGTCTTTCACGTAGATTGCANAGAGTCCACCTGGTATACATTCTGGTTGGAAGGGGTGGTAAA 
ATACAGCTGCTGTGCTGGAACCCTGTTGTCACATGGGCTGNTGAGTCCCAGCGTCCTCTCNAAGrc 
CAGCAGNTGACCCATGAAGTTGAAGTTAGGGGATATGTTGGATTTTTTCATTTrGACA^^ 
GGCATCGTTCATCGACAGATTGAGCTTCTGCATAAGGTANGCCACANTCCAGTGACTGANCGGCT 
AATGCCAGCCAACCNTTGTACOTAGNTAACCAA^ 

TATTACGTTTTCTGGACACCCCACACGGATTCGGTOTGGCACATTCCTTATO 
ANNTTTTTTOACNTTGTTTNANTNCCCCAAT^ 

SEQ ID NO: 3926 ACTTACTTGGAGAGACATATGTCTGAATTTATGGAGTGTAATTTAAATGAACT 
AGTTAAACATGGTCTGCGTGCCTTAAGAGAGACGCTTCCTGCAGAACAGGACCTGACTACAAAGA 
ATGTTTCCATTGGAATTGTTGGTAAAGACITGGAGTTTACAATCTATGATGATGATO^ 
CAITCCTGGAAGGTCnTGAAGAAAGACCACAGAGAAAGGCACAGCCTGCTCAACCTGCT^ 
CCTGCAGAAAAGGCTGATGAACCAATGGAACATTAAGTGATAAGCCAGTCTATATATGTAITATC 
AAATATGTAAGAATACAGGCACCACATACTGATGACAATAATCTATACTTTGAACCAAAAGTTGC 
AGAAGTGGTGGAATGCTATGTTTTAAGAATCAAGCCAGATGTGAGTTTTTTTC^^ 
AAACCTATATAATGGGAATACCATTTTTTCTTTGAAANGGGGCTGGTAT^^ 
GGATGGGGGTTCTAAACCNAAAGG l ' il-il ' rri ' N TAANAAAAATANG 

SEQ ID NO: 3927 actggatgtcaggtctgcgaaacttcttagattttgacctcagtccataaa 

ACACTATCACCTCGGCCATCATATGTGTCTACTGTGGGGACAACTGGAGTGAAAACTTC^ 

ggcaggtccgtgggaaaatcagtgaccagttcatcagattcatcagaatggtgagactcatcaga 

CTGGTGAGAATCATCAGTGTCATCTACATCATCAGAGTCGTTCGAGTCAATGGAGTCCTGGCTGTC 

cacatggtcatcatcatcttcatcatccatatcatccatgtggtcatggctttcgttggacttaot 

GGAAGGGTCTCTTGTrrAAAAGTCATTGGTTTCrrCAGAGGACACANCATTCTGTGGGG 

GATTCTGCTTCTGAGATGGGGCAAGGGTTANCCATGTGGNCCCCACATCTGGGTATTTGTTK^ 

AGCTGCTTTTTCCTCAAAACTTTCANAAAANACCTGTTTAACTNGGT^ 

nngaggcaaaacccaantacctggcaatttttatggng 

SEQ ID NO: 3928 ggtactgaggagaaacttcatgatgctgccagcaagctgcttaacacagttg 
aagaaactacaaaagatgtatctggtctccattccaaactggatcgtaagaaggcagttgacct^ 
cacaatgcagaagctcaggatatttttggcaaaaacctgaatagtctgttta^^ 

gatttttccccctgtgaacaggcatgttgtattatataacatatcttgagcat^ 

TGGCTGTCArmCTCAGTGGCAACTATTTACTGGTTGAAAATGGGAAGCAATAATATO 
CCAGGTTTCTCTTAAAGCCCTTTCATGATGATGATGATGGGACTCAATTGTrnTTTAT^^ 
CTCCCTGATAAAAAACAAAATTTGGAGAAATAAAAATATTCCAAAAAAA 

SEQ ID NO: 3929 GGTACTGGGAAAAGATCTAATCTGCCGTGGGCCTGTCGTGCCAGTCCTGGGG 
GCGAGATGGGGGTAGAAATGCATTTTATTCriTAAGTTCACGTAAGATACAAGm 
CTGAAGGACTGGATTGGCCAAACATCAGACCTGTOTCCAAGGAGACCAAGTCCTGGCTACATCC 
CAGCCTGTGGTTACAGTGCAGACAGGCCATGTGAGCCACCGCTGCCAGCACAGAGCGTCCTTCCC 
CCTGTAGACTAGTGCCGTAGGGAGTACATTATTTCCAACAAGCTTAAGACrTACCATGAATO^ 
CArrCATACAAAAACACACTCACACTAATTCTTTTAAAACAGTAGTGCATACATTAm^ 
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ATAAAGCCAACTTrGArrAAAAACCACTNGTTTTCAAAGCrCAAGTCTTTGAT^ 

CAAGATATCCCCCTATGATCCTACCATCTATTTTANGNCTTTTTGGAC^ 

GGCAAATGGTTGCAGCCCAANArrAAa^CCCCACTNATTGGNTTrrGG 

SEQ ID NO: 3930 ACATCTCCGCTGCAGATCGTTTCACACCTGCTTTTCCTCGGTTCACTTO 

CTGCAGCCAGGTTAATATCTAAAACCTGGCCAGCAATCATTCTGCCATCCTCTCCTGCTACAGC^^ 

CCCGGGCATTmrCTCArrAACATACTGAACGAAGGCNAAGCCCrrATC 

ArmGCCATACTTCGAAAAGATTGCCTCCACATCANATTTCTTGACCACAAGAGTGT^^ 

CCAATGAATACACGGGAGTTCATGGAGCGAGGATCTGTCTTGTTGGTAACGTTGCTGGCCATCGTG 

TTTGATGGTAAGGTTTCTCACAAAGCCCGAAAATGTAACTGAAGATCAAAAA^ 

GGGGAGGGGAGAAGAGATTNGNTTCTGAGTCTNCTACTCCCCGGGTCTGCNrmAA/^^ 

GCTGGTTGGAGGCCGGNAACGCGGCCCAAACGGTTANNTTITNGaS^ 

NCCTTTOTTTTOTTTNTTTNTCTT^ 

SEQ ID NO: 393 1 GGTACTTTTTTTTTTTTm 

AAACGTTTCTTACAAAAGAGCATTACATTCTGCACACTGCTCTGAACAGATGCCAGGGA^^^ 

GACTATTGTTACTTTTCCTCCCTGTCCCACCCCCCAAATGTTACAGTGACCACAAAGCAA 

CACAATAATTACATGGGGGGAATTTTTrAAACCACCAACAATAACGAAAAA 

TCTGCTGCTGmCAAAATTTCAATGTTAGTTTTTGCACGCCCTTCCCCCC^ 

AGGAACTAAAACATTACATOTGGTGAACAGCAAAGATTTCACraCACCTCAAA 

ATGAAGCCAGAGGAATGTTGGCTTTTTAAACAGAAGCNGATNAAAAAAAAGATTCAGGACTCCT^ 

CAGTTCTTCACTTGTCTTAAAAAAACTTTCCANAATACTGmrCACAC^ 

ArrxrmANTANAANOTCAAAAATTTNGTTCTGGTT 

SEQ ID NO: 3932 GGTACACCCCAACCCCCAACCTCAGTGGAAAACAATGCCCAGGGATTAGGCT 

atggaagggcaaaatggacccattcaaatttccrcccagggaccaggccctatt^ 

atgtccttagctggtgggggaaaggttggcgatcaggaatacatatgtgtagtttttgt^^ 

catccatagcacacccgagggatgaaaggcctccaagtgggaactggaacgatcaaattcttgct 

gattattaattgggccctaamcagaaagcaatgctgtttitacaa^ 

aacaaaacacaaaaaacaaaaagagtagaaaattttatgctaaaaaagattcgcaat^ 

tcagtagcacattgcatctgttaagtgtcccaanctccctgtaatggttatgtttccaacgg™ 

tctttccaagataatggtgtaggtgttacacccccaatctttcatgtncacattctgcaa 

TGCTTAATAACTT^m:ATTAAACCNGNC^TGGAGTTTNTGA 

SEQ ID NO: 3933 GGACAGACTGTTTTCCGGATGGACTGGTrTGGGAACACTGTGCTGGGGGAAG 
CTGCCCAGGAAGCGCTCCCGCTGCCGGCTTTCCGGAGGTCTCTOCCCAGTGCACT^ 
CTGGAGGGCCCATTTCTACCACCCTAGCATGTCTGACAGAAAGCCCTGCTGGACTCTGGGGTCCAG 
ATGTCAACTCTACATTGGAGGAGGCAAAACACAATCTAGAGGCACTGTCTGAACIT 
CAGGGAGATTTCTCCACTGCACACAGCACAAGTGTCCTATACATGTGTCCTGGTGGAGCAGAGGG 
AGCGGGAGAGGACCACGGGTCAGGATCCTGTCACCACCAGCCTGAACAGACAGTCCCATCrrTGT 
GATCCAGGTGACAAATAATCAATTCCCTTGTCCCCANCAATTGACCTTACCANATGGT^ 
AGCTTNTTCACCCTAAANAATTTAAGTCTGGTTTGCTAATTGAOT 
TGNGGAAGGAAAGAACCAAAAAlWGGGTNCN>rmANAAGGGGTTAACCAACCG 

SEQ ID NO: 3934 GGTACTmTTTTTITrnTITI^^ 

CATGATCCAGGATGGATTTTANATCITGTTGAAAGCAGCCACATCCATGGACTGCACATAGT^ 
AAAAGCAGNGATCTGCTCCTCCAGCATATNTGTTCCAACmATCATCTTCAACT^ 
rrGAAGmCTTAATTCCGTATCCCACTGGAACTAGTTTAAATGAGCCCCANACTAAGCCGT^ 
rrGAATGCTTOTGACNCACTCCTOTAATTTCGCC^^ 

AGTAAGATGGAANACTTGGCAACAAGGGCAGGNTTTTTGGCTITCTTTGAT^ 
AACGTTCTTCCCTTANCCCTCTTGCTTCTTCACITrcCTOT 

AATGGGCATCOTCAATTTTTACTATrCTGGNAGCTTCCACTTTCTGNAGNGGGC^ 
NNNGGACCA>rrrrTTCCCAAANNTTTTrrTA 

SEQ ID NO: 3935 ACCTGTGAACCAAGTGTTTGGGCAGGATGAGATGATCGACXjTCATCGGGGTG 

accaagggcaaaggctacaaaggggtcaccagtcgrrqgcacaccaagaagctgccccgcaaga 
cccaccgaggcctgcgcaaggtggcctgtattggggcatgacatcctgctcgtgtagccttctctg 
tggcacgcgctgggcagaaaggctaccatcaccgcactgagatcaacaagaagatttataagatt 
ggccagggctaccttatcaaggacggcaagctgatcaagaacaatgcctccactgactatgacct 
atctgacaagaacatcaaccctctgggtggctttgtccactatggtgaagtgaccaaatgactttt 
gtcatgctgaaaggctgtgtggtgggaaccaaagaaacgggtgctcaccctccgcaaagtccttt 
cttngtgcanaacnaancggcgggcttctgganaaaaattnaccttaaattca^ 
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TTCCAAATTTOGGCCATGGGCCGTTTCAAAACATGGANGAAAAAAAA 

SEQ ID NO: 3936 ACAAGCTGAGTATCCCTTATCCAAAATGCTAGGGACCAGAAGTGTnTGGAT 
TTCAGGTTTGGAATATCTGCATATACACAACGGGACCCGACTCTAATTACATAArrCATG 
ATCCACCTTATACATATAGCCTCAAAGTAATTTATACAAAAArmGTG 

TTAAACTGAACCATCANAAAGCAAAGGTGTCGGGTGGGGAATTTTCCACTTGTGGGGTCAAGTCA 

GNGCTCAAAAAGATTCAAATTTTGAAGCTTCTGGATTTCAAATT^ 

TNGTAGTATCATTCACTCTGGOTATGTATNGACCTTGNACTGGTTTTTCAT^ 

AAACCAGCTGTGGATTTCAAAACACAGTGTATTCTAANATCATCTAAAATCCATGCCGGATT^ 

TTGCACAANAATTAAGGrrGGAACTNTTTGAGCTGGNACCTCANCAAACTAAAAGTATATO 

TCAm-An'll'nTTGGAAACATTTTAmTAATGNNCCGGGG 

SEQ ID NO: 3937 ggtacagatatcttcaaaggaggaagaagaaagggaaagcagatggtggag 

CTGAATATGCCACTTACCAGACTAAATCAACCACTCCAGCAGAGCAGAGAGGCTGAATAGATTCC 
ACAACCTGGTTTGCCAGTTCATCTTTTGACTCTATTAAAATCTTCAATAGTNGTTAT^ 
CACTCTCATGAGNGCAACTGTGGCTTACCTAATATTGCAATGNGGTITGAATGTAGGTAGCATCCT 
TTGANGCTTCTTTGAAACrmGTATGAATTTGGGT^ 

ACTTANATTTATTGGACCAGTCAGCACAGCATGCCTGGTTGTATTAAAGCAGGGGATATGCTGTTT 
rrATAAAAATTGGCAAhnrTANNAAAAATATAGTTCACAATNAAAATI^ 
AAAGGGGCTTGAAATTITriTmGTCAAAAAATTAATGCCNCCT^ 
CTrrANAAAGGGllilUNTATATTCGNTNCNTTTGAAAAACC 

SEQ ID NO: 3938 GGTACATTTAAGAATAAACTTTTGTAAAAAAAGAAAAATCTTACAGTGW 
ATCATCTCTTTAGTTGTTTTCACTAAGTCATTCCTACCATAACTGTGAAT^ 
CANAATCTTGCCANAGTCTGTTCrrrGGTCCTTGTTCTACCCTAAACTTTGTATCACCTGAAAT^ 
ACCAACTCATTCGAAAAAAAAAAAAAAAAAAAAAAGTCCAAGCCTGGAACATTGAAGGAC^ 
ACATTATTTCAGCAGTATATTCCCTGTTTTCTGGAGTATTCAATGGGAGGTTCAGTCAAAAN/^ 
TAATGGTCITCAGGTTCTGCCCTTAAATATTTAAAGATCACTTGCrCCATA^ 
CCCAATCTTCAACTATACCATGGCGGATTGGCCACTTTTGTTNCATATGTANGGT^ 
TCATCCCAATGAAAAAAGTCTANGNCATCAACACCTTTTATCACCCCTCCTTG^ 
CACTTTTTNGTGACCTCCTTTATTGNCATCCCGGGAGG 

SEQ ID NO: 3939 ACATCAGCAGCAAACTCCTGTGCATCCCGGTAATCACGGTTCTCCATCTTCCG 

cttgacagtgctgaggtccatggggtgcttaatgatgtcatggtagtcatgcaggccaagtgcag 
aagcatccactggtttatagaaaggccaagcataggcagcatgcttcttagagagtaactccttc 
aaaatgccattgcaatgttttaactgttctgaaagcit^ 

gagagtcaggcaagtctttgcgtgggggcttgatggggcgaccactctctctacgcataggggga 
agccgtgctgccrrangctcaagactcccaagagggctagctggagaaccaagagccaagatggc 

TGTAGGTGTAAGGGGTGGGTAAOTATCTGCTTTTCCCTTTACGCCTTTT^ 

tggcctggaaggancctccagtaaccanccaaggagccgggggtcccaccaaaatncaaaggac 
tttgaaaantgggaaaggaaaataacitgatggggnggggnaatttto 

SEQ ID NO: 3940 acttgtctatattgcaaaagtcttgattgaggtggtggcatttcagctagttc 

CCGTGTTTCACTTTGATCACGATGAGACAGTTCTCTGATTAAGTCrrGAAACCTTGAGTATGTGGTC 

TCCACrrCATCTTCGGCACTGCTGGTGAGCTTCCAGTCTGCCCTCACTGGCGGCTGATGTAATCCC 

AGGGGCGGCTGAGGAAGATGCTGGAGGTGAGGATGGTGATGGTGTGGGTATGGAACGCTGCCCT 

GACTGAGAAAGGCATGATGCTCGCTCCACTGCTGGAACCGTGCCTGCTGCTGCCTTAATGTTT^ 

GAGCGACACTCCCATGTATACGATCTAAAGCCTCCCCTATAAGTAATCCCGAGGGGTTCTCATCAA 

TGAAAGCTATrCAAAATGTGAAAGGAAGAATGGGGCCCGGGTTGTGCAAGTGGCTCTCAAAANGA 

CTGTGGCCCTTCrrrGCACCATGGGAAGGGGCTTCTTCTCCGGAATTACAAGCT^^ 

GAAACTTGGGCCCATTAANCAACTNCCTNTTAAAAAA 

SEQ ID NO : 3 94 1 ACTGCATGTTCTGTTGTGGTGAGGGAAAGAAACATGCTTTGAAGGrrTTCCCl^ 
TGTCAACANAATGTGTGTCTGTAGCTGTGTATTGCGCATGTATTCATATATTm 
AAGGTTTTTGCTGACAGTGTTGGGAACCTCACATGCTTCTGAAGCATTAAATATTG^ 
CCTTTCANAAATCCTCAGGTTGGGAAAGACCCCACACCTTCTTTAAGGATCATTTGT^ 
ACAGGATCrrGGAAATGTTTCCTAGGGTGTGTAAAAATTAACCAGGGGGGAATGAAGCACAT^ 
TCTGGCAACCAAACTTGAGTTCCTCAGAGAACAGATGCAGAAAGACCTGCTCCTGCTTGCCCGGC 
TACAGGGGCCACTGTGGAAGTCACACTGAGGCTGTGACCCGGCCCTTAACCCCANGANANCCCGT 
GGCATCTGTNCCCXIAAGGCGCCCAGACCTTTTAACCGGAAACTTTCCCAACT 
CAACCCTGCAATGAAATNGhOTCCCCCAAGCTTATTNTTT 
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SEQ ID NO: 3942 TGTACGCGGGGCTTTmCTAACTCCGCTGCCGCCATGGCTCCTGTGAAAAAG 
CTTGTGGTGAAGGGGGGCAAAAAAAAAAGAAGCAAGTTCTGAAGTTCACTOT 
CCTGTANAANATGGAATCATGGATGCTGCCAATTTTGAGCAGTTTTTGCAAN^^ 
GAACGGAAAAGCTGGGAACCTNGGTGGAGGGGGGGNGACCATNGAAAGGAGCAAGAGCAAGAT 
CACCGGTGACNTCCNANGNTGTCTTTCTCCAAAAGGTATTTAGAAATATCTCNC(^ 
AAAAANAATANTTrACTGNGACTGGTTTGCCCCNTATTTGCTAACAGCTA^ 
ATTACTTTACCTTTCCANATTAACCCGGNACNrrANAAATANGNGGAAAW 
CTTTTCTCTGGGAAAAATTTTGTATGANATNCTNTGAA 
GGATTNTCCTTTTGANAn'CNNCTGNGGGGGATCNAN>nNA/^^ 
GTGT 

SEQ ID NO: 3943 GGTACAGAGAAGCACCTATTGACAAAAAGGGGAATTTCAATTACATCGAGTr 
CACACGCATCCTGAAACATGGAGCCAAAGACAAAGATGACTGAAAGAACTTTAGCTAAAATCTTC 
CAGTTACATTGTCTTACTCTCTTTTACTTCTCAGACACTTCCCCC^ 
GCANCTTAGTrrCACAGCTTTGCCTCTTCTTTOTGATGTAm 

ACTTGTATAATCANACTGGAAATGGGGATGATGGTGTAAATTGTNTTGAAAAAGATCGCGAATAA 

AAATCANCAAATGTNAAAGCCCAGAAAAAATOTATTCGNATTTCTGGTTTTG 

ATTTTTATAATNANAATGGTATTTGGAANAAAAGATTATGCTNGCTN^^ 

ANAATNNGTCNANACCCTCCCGAATGGTATTTTTTGTATTGGGTTC^r^^ 

CTANAACACrGGCTTNGGNTNGATTAATGAATNCCCTGGTTTO 

ACTTTATTTNCCTGATATGGGCTGNNTGNNATATAAANNNCNTTATA 

NGTNCTGNAATNAAATTGTNGGGGGC 

SEQ ID NO: 3944 GGTACTTnTITITITm^^ 

TATTAGGCAAAATTTTACATAAAATCAGAAATCTATGATCTGTCCCTC 

GCAAGATTCATCAGAAGCCACGTGCAGTCAGATCCCAGCTGGCCGGCGGTGCANATCTGGAGTCC 

AGCCTCAGGGATGCGCTACTTTCCATTCTCTGCATTGAACATTCGTTCTGTCAGCATCCGCTCCAGC 

rrCACTGCATCAGCGGCAAACTTGCGGATCCCGTNANAGAGCITCTNCACAGCCA^^^ 

GTTGTGCAACCAACGGAAAAGACTTCTCATCCAGGGTGGATTITrTCCAGGTT^ 

CGCCTTrGGCOTGANAGCACAAGCACCANCrTTGGCGm 

GGTGANATGGTGAGGAAGTCACAACCGGOnTAGGGCTTTTATCTCCCNCGTNTGCCGGAANGGAGG 
CCCNATGACAATOGGTTTGAACTTAACTTTTNGTANAANTGGANATm 
GGTTTCNAGGGCTANANGATTCTTNCGGGTTGGCACNTCCAANAAGGAGN^ 
NNANGTTAANCTCCTGGTAG 

SEQ ID NO: 3945 GGTACGTGCAGACGGTGGTAGTTCTGGAGTCCTGGAAGCCACGAGGTGCTCA 
TCCATCACAAGGCCATCACAGCCGGGTAGAAATGCCTGAGGAAAGCAGCGGAGCTGACCGTGCC 
AGCATTTACACAGGGAACACTTC1TGGGCAGCCAGGTGTCATGGGGCACAGACCCACAGTTCTCTT 
TGCGCACATCGTGCTCACAGTTCCGTCCGTAGAAGGAGGGAGGGCAGGCACAAAAGGACCCCAG 
CATGCAGGTTCCCCCATTCAGGCAGCAGGTTCTGTTTAGCTCCTTACTGTGCTGTATCCC^^ 
GGCACACGCTGGGAAGACCGAGGCCGAATTGCAGGCTCCTCCTGGGGCCAAATGCTGCATCTCTC 
CCGCGT 

SEQ ID NO: 3946 GGTACAAAA(lVCAACAAAGGTTCAAACATCGAGATGTTCCCrrTAGCAAGGCT 
GAAAATTTCAGTCTCTGGTATTTGGAATTTAGGCTGCAGTCCTTGTT^ 

TGTGGCACAGTCCATGCTTTTAACCAGATTTGAACAGAAGAATGGCCACTTGGCCCAGGTAGAAG 
TAGATGAAGTGTTTGGTTTCATGTGTCACATAACTACCGAAGTTCCTCCCCACGATGCAATGCC^^ 
GTGGGATTGTACGCGGGGGAGGCGCTGTTCCAGCCTTCCTTCCrGGGTATGGAATCTTTGCGGCAT 
CCACGAGACCACCTTCAACTCCATCATGAAGTGTGACGTGGACATTTCGa^AAGACCTGGA^^ 
TTTTTTTTITITITrTA 

NTCCCCCCCANAATNAAAAATNTGCO^CAGCCCCCCATTTTTTNCNTrA^ 
CATTNCTGAACAANTNCTCCANAmrCCTGCCCCTNGCCTITTGGGAG 
GGGAAAC^^S^CNGGCNCmGGGNAANAATGGAGGGGNTNGGGANTCCTNGNGG^m 
NTTTCCCrTGTTTTAGGNNNTT 

SEQ ID NO: 3947 ACCAAATGTCACACTGGCTGTAATCAAATCGCCCAAGAATAGTCACATTAGA 
TTCACTGGGTTTAATTTTATCTATATCATCTTCCATCTTGCCAGGTTTCCAGC^^ 
TCACAAGACTTAGAAAGTATCAAATCGCCTAACCATCGCACACAATCAACATAATTCCTATGTATG 
TCTCTGGTAGAAAAATCAGGAAAATGGATTTTCTGAGAAATAAATGGCCTGTTAGTm 
TTATAATCATAAGATTCCTTAATTGCATTCATCATTCTCTTTGAATTGATCCT^^ 
AATGATCCATACCACAGGACATTATTTmCACCCAAAAGATCATAATCAGCCrrAGAAOT 
TTGTGCCCTTCTACGCCTCCAAATATTGCCACCCAGAGTGNCCGNCTGGATATTCCATAATCGGAA 
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AGCATGACnTTACrrACTTACCAGAAGAAAGATTTGGATCTmGGATGGAAm 
TAGCArmCATrGGCCACATAAGGGCTTATCCCTGGATTGGTNTAGGATraTATCCT^^ 
TTOANCCNCTCCNNCCGCANAGGGANG^^ITGTATTGNTT^ATAAGGCCCT^ 
TTTTANCTTACNTCNCGTAGA 

SEQ ID NO: 3948 GGTACCTTCAGTCTACACTTTAATCCCACTGAGTTAACITGCAGCTCACAG^ 
CCAGAAAAGTATAAGGAGAAAGCTGAAATCAGAGGAATCGTGCCCTCNACTGCCAGNTTGCAGCT 
NAAAGCNCACCCAAGCTATAATTAAGATTACCirCTGACGATTACATAGGACrrGCA^ 
TATTGAAATAAAACTGTTITCGCAACTAATCATGATGAAAATACnTm 

AANAAATACATGGGNCANGNCAAGTNTTCATNCTGGGGCACCAGGGTATACTCCCACCGNATAAA 

GACACAAACATAAAAATCNTTGTTGTGATCTGAANTCCAATCCANACACGATGCTTATTC^^ 

CTCGCGTACCTTGCCOSrGGCGGCCGNTCAATAAGGGNAAATCCAmACACTGGTGGTCGTTAOT 

NAGGAATCCGAACCTCGTANCNAANCTTGGCGTAATCATTGGGCATNGNTAGATTNCNGOT 

ATTOGTNTGCTGGTNATTATTCC(>mACATCCAANCCCGGAAGNTO 

CCTTATGATTANCCNACNTACNTAATTGCGGTGNGCTNATTNCNGGTTOO^ 

TGNNCNCTAGATAATGANTNTGCTNCNCCNNGAAAGCTGGTNCT 

SEQ ID NO: 3949 GGTACTCCGGCAGGGAGGGTGACAAGCACACCCTOAGCAAGAAGGAGCTGA 
AGGAGCTGATCCAGAAGGAGCTCACCATTGGCTCGAAGCTGCAGGATGCTGAAATTGCAAGGCTG 
ATGGAAGACTTGGACCGGAACAAGGACCAGGAGGTGAACITCCAGGAGTATGTCACCTTCCTG^ 
GGCCITGGCTTTGATCTACAATGAAGCCCTCAAGGGCTGAAAATAAATAGGO 
CCTCTGGGGGTCCTCTCTGAGTCAAATCCAGTGGTGGGTAATTGTACTTAGGAAAGAAGCAGGGT 
CCACATirTGTGCAGCCACACATGATGACTCATTTTGACTAACATATAAAGAGAAGT^ 
CAATTTTGTCTCCTAATGCAAATAAGGTTGNTNAGCTNrrGCTCAATGCA 
CATCATITGGajGATCATATCTTTTTNAAAATGAAACAGGNAGGGGTGNCCACAA^^ 
GGATAATGTTCCTNCTTCAGGTCTTT^^^T^CAAGCATANT^CNAGCNTTC^^ 
TTGGAAAATTANTATNAAGN(XGTTTTGC^mACTANGGCTCCAGTO^ 
TTAAAGGGGAATTCANNNNTGGGGCT 

SEQ ID NO: 3950 ACAATAGAGTTAGAGCCAAGGTCCTAGAGGCGGATAGGTGGATTCCTGAGGG 
AGGAGGAAGGGGCTGAGGTTGCTGGAGCCTGGCAGCTTCTTCCGGAGCCATTGGCAGGACTGATG 
CAAACAGCTCTGGGTGGGAAGAGGGAACTAGGATATCCTCCTGTGTCCTTCCTTTTCr^ 
CCTGGGTGGCTGCCAGATGGAATTCCTTGGATATCATTGCTTGGAGGTCCCCTGCATGCCAGAAAT 
GGCTTTCTGGACACATTGGGTGGGGGACATGGTGCAGAAGGTGCATTTGGCTCTCACCAGAAATG 
GTTTGCTGGCTCCATGTGGCAAAGTCGGTCAGGATTAACGTGGGGGGATGAGTTTTCTCGGAGCT 
GGATCTTTGTTAAGGAGCTGGGGTTCTTOTAAAGCTTGGGGCTGGNTG 
CTCTTGGAATCAATGAATCrrCGATTTmCTGGGTTTCTGGGACACCTGG 
TNGGACACAACAACTTCCAACANAATTCCCCGCGTCCTNGGGNCGNACACNCTAAGGCGAATO 
CNNACTTGCNGa^fGTTNTAATGGATO^ACCTCGNACCAAtm'GGCNAAA 
NTNGGTGAAATTGTATCCNTTAAATTCG 

SEQ ID NO: 395 1 CGAGGTACGCGGGGCITGCAGGCAAGAGTGCTGGAGGGCGGCAGCGGCGAC 
CGGAGCGGTAGGAGCAGCAATTTATCCGTGTGCAGCCCCAAACTGGAAAGAAGATGCTAATTAAA 
GTGAAGACGCTGACCGGAAAGGAGATTGAGATTGACATTGAACCTACAGACAAGGTGGAGCGAA 
TCAAGGAGCGTGTGGAGGAGAAAGAGGGAATCCCCCCACAACAGCAGAGQCTCATCTACAGTGG 
CAAGCAGATGAATGATGAGAAGACAGCAGCTGATTACAAGATTTTAGGTGGTTCAAGTCOT 
CTGGTGTTGGCTCTGAGAGGAGGAGGTGGGTCTTAGGCAAGTGATGGACCCTCCATm 
TACCCTGGNCGCTCATAATGAGGCATCATATATCCTCTCACTCTCTGGGACACCATAACCCCTGCC 
CCTTNCCTGGATGCCCCAGTAATGTATGTCTACTGGNGGGGAGACTGGGAAGGATCCCAAGATTC 
ANTATTTCCTGGCCAAAAGGCCCTTTGCTGGCTTCTTGGGGTGGTNAGTT^^ 
TTNCCTTTTTTATGACTGGGNCCCTGGGTGCAATAAAATT^ 
ArrTmATlSnS[AAGGTNCTNCNNGC>JGC>^ 

SEQ ID NO: 3952 GGTACGAAAAGCGGCAGAACTAGCTCTGAAAACTCTGAGCAAGGTCTGTGTG 
AAAATGTGTGACCCTGCCAA/.GGAGCAGCTGGCCAGAGAACCATCGCTGCCCTTCTGCOT 
CTGGACAAAGGAATGATGAGCACCGTGACGGAAGTTCGAGCCCTCAGCATTAACACCCTTGTGAA 
GATCAGCAAAAGTGCAGGAGCCATGTTGAAACCGCATGCACCAAAACTCATTCCAGCTCTGCTAG 
AGTCCTTAAGTGTATrGGAGCCCCAAGTTCTCAATTATTTGAGCCTCCGGGCGACAGAGCAAGA^ 
AAGGCTGCGATGGATAAGTGCTCGGCTTAGTGCTGCCAAATCTTCrCCAATGATGGAAAC^^ 
ACATGTGCCTGCAATACCTTGATGTGTCAAGTGCTTGGCGAGCTAGTTNCTANGTTGTGTGAACT^ 
ATCAAGAAGTGGTGTAGGTCITGGAACTTAAGGGTGGCTGGGGCCAGNGGCATTO 
CTACTCANNGGNCCTCAGGCCTAACACCTTAACTTAAGGTAAACTTATGAGTGCT 
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GCCTGACANATCGGACCAGNGGGATrAAAAAATTTTGGCmrTTGCTNTTGGGCCATr;^ 
CTTACGGGGATACACNCTTGAAAACTCTTCAAAACTNATGGGGGGTATGGGA 

SEQ ID NO: 3953 GGTACGAGATGGCAGCCCTCCAGAGCCCCTTCTATGGAGATAAGATGAATCT 
CTTCTCCCTGTGCCAGAAGATCGAGCAGTGTGACTACCCCCCACTCCCCGGGGAGCACTACTCCGA 
GAAGTTACGAGAACTGGTCAGCATGTGCATCTGCCCTGACCCCCACCAGAGACCTGACATCGGAT 
ACGTGCACCAGGTGGCCAAGCAGATGCACATCTGGATGTCCAGCACCTGAGCGTGGATGCACCGT 
GCCirATCAAAGCCAGCACCACTTTGCCTTACTrGAGTCGTCTTCTCTTCG 
CCTAGAACAAGCTAAGACCACAGGGGrrCAACAAGGTTCCCCAA^VAGGCTGCCCACCT^ 
CAAGATGCTTGAAANGGCAGACCAOTGANGGGAGGGGCGCTTGGCCCATTGTC^ 
NAAGAATTCCAAAAGNCCTTTTTTATTACTGGTGGNGGAC^ 

NAGGTGGGTTAACCGANGCCCCGGANGCCCCTGGANTTTGGATTGGAATGNGNAATm 

ATTTCTTCAATGGACTNNCAAGGNTTATNTAACAGGA>n^^ 

GAGCCTTGAAANGGTAANACTTGCNGGGCGGCCTTCAAAGNGAATTCAuA^ 

SEQ ID NO: 3 954 ACGCGGGAGCTCATGTAGGTCTTGATTGGACACAGTGAGTTTCAGATGACAG 
CCTCCTGTCTCATGGGGTAGCCCCAAAGCCACAGGAGTCTGGTGATTTCCCTCITCCCC^ 
ATCTATGCCATCGGGGTGGGCAAGCTGGATGTGGACTGGAGAGAACTGAATGAGCTAGGGTCCAA 
GAAGGATGGTGAGAGGCATGCCITCATTCTGCAGGACACAAAGGCTCTGCACCAGGTCTTTGAAC 
ATATX3CTCGATGTCTCCAAGCTCACAGACACCATCTGCGGGGTGGGGAACATGTCAGCAAACGCC 
TCTGACCAGGAGAGGACACCCTGCATGTCACTATTAAGCCCAAAGAGCCAAAGAAACCTGNCGGG 
GGGCCCTCATCTCCGACCAATGGGTCCTGACAGCAACTCATTGCTTCCNGCGATGGCAACGACCA 
CTTCCTTGTGGANGGGCAATGTGGGAAACCCCAAATTCNAATGGGGCAAAAAATTOT 
AAGNCGGNGATCTCCCAAGGTrrGATGTCTTTGCAAAAAAAACAAGGGATCTNGAGT^ 
ATACATACTNTGTGAACTTGCCCAAAAAGNAAAATGTCACCCTGCAGGCCNTTTCTT^ 
GAGGCAATTGGTTTGNGAAC1TAGGG 

SEQ ID NO: 3955 ACAGACAGGCrrCTCTGCTATCCTCCAGGCAGTGTAATAGTCAAGGAAAAGG 
GCAACAGTATTGGATCATTCCTTAGACACTAATCAGCTGGGGAAAGAGTTCArrGGCAAAAGTGT 
CCTCCCAAGAATGGTTTACACCAAGCAGAGAGGACATGTCACTGAATGGGGAAAGGGAACCCCC 
GTATCCACAGTCACTGTAAGCATCCAGTAGGCAGGAAGATGGCTTTGGGCAAGTGGCTGGATGAA 
AGCAGATTTGAGATCCCAGCTCCGGAACGAGGTCATCTTCTACAGGTTCTTCTTCACTGAGAC 
GAATTCAGGGTGATCATTCTCTGAAGGGCTGAGAGGNGCTTCCTTCGATTTTCNCTACCACATTAA 
NTTGGGTTCTTTTGTCTCAAAAAGGGTANTCTTTAAAAOT 
AAAACGNAAATAAGTTCATTAAATGGCmCCACCTTGGCTGGATGNAC 
AAAAAGG/VAGCrrGGTAANGNACrnGGGTCCTTITNGGOTy^ 
TGNCAGGhrTTTNGGGAAAGGCTTTTAAAAACATTNC^ 
TTTCAACTNTGANTTGAAANANATTNCCCCNAATCTGNGGNAAT^ 

SEQ ID NO: 3956 ACNNGGGGGCNGCCGAGGCGTGCACATGCTCGCCCAGCCACCCCCAGGACG 
CCrrCTGCAACTCCGACATCGTGATNCGGGCCAAGGTGGTGGGGAAGAAGCTGGTAAAGGAGGGG 
CCCTTCGGCACCGCTGGTOTACACCATCAAGCAGATGAAGATGTACATAAAAGAAThmrr^ 
TTAAATAGATACAAATGTCTATCAACTTTAATCAAGTTGGAACTTATATTGAAGACA^ 
ATAATTAAAAATTATGACAATGTTAAAAAAAAAAAAAAAAANANAAAAAGTTCT^ 
NCNTTCTNAGGGCT 

SEQ ID NO: 3957 ACl'rri"riUlUU"l"i'lll'ri"ilUinUlUU"lTGCAAAGTGCTTrATTTACACAGAGC 
ATOTCCAGTGAATTCTCANAACAACCITATGAGGTAGGTATGATTTCAACT^ 
AAAAAACTGTTCCANAGAGGTAAAGTGGNTTGCCTGAAGTCACACAGCAAGAAGGAGCANANAT 
GGGAmGAAGTTGGCTACATGATGATTTCATAAAACTNTGGAGTGTGCCCrrrGT^ 
GGAGATrmATTTGTTCATTTCTGGCACAAAGATGATTCTTTCGGTGGCACT^ 
GNCCTCCCTTGGACCTTGGTTGNGTGGGGCCAGCTGATTATGGGCAGGGNGCTNGCCANGACTNC 
ATTGAGGATAACAGTCATTGTAACCCCNAACCNAANCGGNACTTTCCACCATTAATACATGGAGG 
ANGNTTTCGCACACTTTTTTGATNAATGGGGAGGNAAGGCCTATTNGGAANAT^^ 
ATTATTGGCANNAC^^GCN^rITGC^^^CNCCCTGGATTGGACACTGNT 

GGGGCNCANCCCAANANTGAGGGAAATTTTTTGAAGGCCANAGGAGCNCCNTCCAGGNCTTACA 
AGTNANCCTTCNGGCAANTNGGATATTCTTNNGGA 

SEQ ID NO: 3958 ACGCGGGGGCTGACTCTCTTTTCAGACTCAGCCCACTTGCACCCAAGTGAATT 
AACAGCCTTGTTGCTCACACAAAGCCTGTITAGGTGGTCTTCTATACGGACATGCTrGACACTTGG 
TGCCAAAATCTGGGCCAGGGGGACTCCTTNGTGAGACCGGCCCNCTGTCCTGGCCCTCATTCCGTG 
AAGAGATCCACCTGCGACCTCGGGTCCTCAGACCAGCCNAAGGAACATCTCACCAATTTCAAATC 
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GGATCTCCTOGGCTTAGTGGCTOAAGACTGNTGCTGNCCGATCGCCTCAGAAAGCCCC^ 

ATCACAGATGCCGAGCNTTGGGTAAATTOTACCGGGGGAGGATTCCCAACCATATGAAAGACAC 

CCTANCTGGACGATTAGTNCKTGNNAAAAGTCTGAACCCTTAAACTCTA 

NANCCTACCCCGGCATTTATTNNACACCAACTGGCGGCCATITGCAGGACNTNTC^^ 

CATTTCAAAATAAGCCTNCTCATAANAAGGNAANTTGGATTTTC^^ 

ATANGNCGAGAGCCGATTANAAAATAACTTAAAACCTNAATTTGGCATGNCCTANCAAGGTATC 
TTTTTGGANNTNTTTTNATGGNTCCCCT 

SEQ ED NO: 3959 ACTGNmACAACCACAGCTACTCTTCTCAAAATAGTCCTTm 

GTTrTGTTGCTTCCATCGAGGCGGAGCAGAGAAGGCAGCTGGAGTCATCCCAAAGGCCTGAGTGA 

AATCTTCAATGGACAGGTGTTCCTTCITCCTGCTGGGGTCCACACCCTCGGGGAGCTCCTC^^ 

GCTTGTTNACTAGCTGCTCCAGGGGGAAGATGGGCAGAGGCCCAGAACTGAGGTTGCTGTTAGCA 

TTGAANACGTCCACTTTGGGGCTTGTGACCTCANCNGTGATCTGGCTNCAGTCCCT^ 

AGNTCCGNTTCAGGTCCTCATAGGATTGGGGGTNCTCCANTTGAANGGATNNCAANNCAGG^ 

ATNCTGTGAANGGTGGGGGGCTTCGTGTCCTTGTTTAANCACAATTGANGGGGGTCT^ 

CGCCTTTTGGGATNGGTCTTNGANGATTCIOTGTGN 

TGGNNTGTTTAaSfAA^^^CACAANAAGATCTGGTNCTNGANATTTA^^ 

CCANNATTTTGNATNAANTAAGGNNTTTGTGCCCGAAANCCCCAATCTATGTGGANNTTA^ 

GGNGGTGANA^fNNTNlT^rrTCNAAA 

SEQ ID NO: 3960 ACGGCACTTGGCGTAAAGCCGCrrCCCTCAAGAGTAACTACAATCTTCCCAT 
GCACAAGATGATTAATACAGATCTTAGCAGAATCTTGAAAAGCCCAGAGATCCAAAGAGCCCTTC 
GAGCACCACGCAAGAAGATCCATCGCAGAGTCCTAAAGAAGAACCCACTGAAAAACTTGAGAAT 
CATGTTGAAGCTAAACCCATATGCAAAGACCATGCGCCGGAACACCATTCTTCGCCAGGCCAGGA 
ATCACAAGCTCCGGGTGGATAAGGCAGCTGCTGCAGCAGCGGCACTACAAGCCAAATCAGATGA 
GAAAGCGGGGGTTGCAGGCAAAAAACCTGTGGTAGGTAAAGAAAAGGAAAAAAGGGTTNT^^ 
GGTGGTNAAAAACAAGAAAAAACCimTGGTGGGAAAAAANGCAGCAGC™ 
CCCCTTGAAAAGAANCCTGNTNAGAANAAACCTTCnTWTGAAGGAG^^^ 
ACTCTTAAATTGATTATTNCTTAAAGGGCAATCATTITGG>^ 
' TTNTTTCCGGCNAAAAACNCAAAAAAAAAAAAAAAAAAAGhrrCTTTGGCCGGGA^^ 
GAATTCANNACTGGNGGCGGTNTATGGNNCCCC^G^^SrCCAACTNGGNA^^ 

SEQ ID NO : 3 96 1 ACGCGGGGArrACGAGATTGGCTTGGATTCTGTCGGATGGACTTGGGGCTAG 
CTGCGGCGGGGCTGGAGGAGGCCAGATAACCATGTCAGCCACAGTTGTAGATGCAGTTAATGCTG 
CACCCCTATCGGGGTCCAAAGAAATGAGTTTGGAAGAACCAAAGAAGATGACCAGAGAGGACTG 
GAGAAAGAAGAAGGAGCTAGAAGAACAGCGAAAATTGGGCAATGCTCCTGCAGAAGTTGATGAA 
GAAGGAAAAGACATCAACCCCCATATTCCTCAGTATATTTCrrCAGTGCCATGGTATATTGATCCT 
TCAAAAAGACCTACmAAAACACCAGAGACCACAACCAGAAAAACAAAAGCAGTTCAGCT^ 
TGGAGAATGGNACTTITITrTrrTTTT^^ 

TATT^^^^ATCATTTAAAAATCCAAGGNGGG^^ITAAGNG1WATTTO 
AACTNTTANACTTATTCTTAACCTGGTGAACAAGTTCTTTTAAAACCTTO 
CTGAATCCTGGTTTANNGGGGGGGGTGGTGAATAAACCCTGAATTGGAAAACTCAAGGC^ 
AAGGTTAATANTTTTNGGTT 

SEQ ID NO: 3962 ACTTTTmTTTTTITrTTTTGCCGm 

AGCAGAGTGGCTCCAGGCCCTTCACGCCTCTCAGACACCACCCATGAGGGTTTAGGAAGGTGCCA 

TCATTCTGTGAAGGCCCAGAGCTTACCCAAGTCTTGGAGCCCAAGrrGAATCACCAACCAGAGGG 

TTGGGAGAGGAAAAGGAAACAGGCAGAGGGGAAAGGCAAGGCTCTGCAGTGAAGGGGACTGAT 

ATCAAGGGAATGCTGAGGTCCAACAAGTGTCTCCTGAAGGCATGCTGCATCCrAAGGCTCCTCAG 

GACTGGATGGAATAAGAAATCTGTOTGTTGAACAANTCACAATCTATATGGCAACrm 

GGGCCCTTGGATGTCAAGCTTAATGGTNANTGGGTNGGGAAAGGTGCCGGCTGNNACCTmG/^ 

GGGTTTTCCTTCCGGCCTTTTAAAGGAATTNrmAAAGTTCCAGCCCCA]^ 

GCTTCCAATGANGANCNrrGGGAACCGGCATTAAGGGAAAANGGTTATTTTAAGGGTO 

TGTCTTTAAGNAAGCCAAAAAAAGAANNCGTCOTGCCATTANCTCAAAT^ 

GGGNTNGKrTCCCCCAGACGGAATNTTANATNTAAGATNTC 

SEQ ID NO: 3963 GGTACTTCCCAGGAACrGGGGACCTACGGGATATCGGGGCTGGCAAAGGCAA 
GTATTATGCTOTTAACTACCCGCTCCGAGACXSGGATTGATGACGAGTCCTATGAGGCCArm 
GCCGGTCATGTCCAAAGTAATGGAGATGTTCCAGCCTAGTGCGGTGGTCTTACAGTGTGGCTCAGA 
CTCCCTATCTGGGGATCGGTTAGGTTGCTTCAATCTAACTATCAAAGGACACGCCAAGTGTCTGGA 
ATTTGTCAAGAGCTTTAACCTGCCTATGCTGATGCTGGGAGGCGGGGGTTACACCATTC^^ 
TGCCCGGGCTGGACATATGANACACTGTGGCCCTGGATACCGAGATCCCTAATGAGCTTCATCNA 
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ATGACTACTTTGAATACmGGACCAGAATTCAAAGCTCCACATCAGTCCTTTCAATNTC^ 

CCANACACCAATGAGTCNNCmOTATANTCTTAATAAAAAAACA^ 

TACACATTACCACATTNACNCCCTTANGGACGNCGAATNNTAGGCTCCrrAA^ 

GAACAAAATGATGGThOTAAACCAAAAACTCACANCTTTTTTAT^ 

GGATTACCTCCANCCAAAATACCANNTNGGGGCTNG 

SEQ ID NO: 3964 CGAGGTACATTTCCCCGGGAGAACTCGTCCATTCGGTGGTGGOKjCCGTTATT 
GGTGGTGAAGACCCGTAGCAACAGTGGGCATGTCTTCTCGCGGTCGATCGGTTTCTCTGGCTCCTT 
CTTAATTTCCTCCTGGGTAACGCGCGACTCCACCGCCATCITCCTCCT^ 
CCCCGCGT 

SEQ ID NO: 3965 ACnTGTTGACATTAGAGGAGAAATGCCAAGGAATTTAAGAGAATACATGGTT 
GAAAAAATTTACCCACAAATACCTGATCACCTGAAAGAACCATTCTTAGAAGC^^ 
TCATCTGAGGTCCATGCCAGCAAGCTTCCrrCCTCCrrCATCAGTGAAGAAACGAGGTGTTOT 
TTGGGAGACGCATATAATATGAGGCATCCACTTACTGGTGGAGGAATGACTGTTGCIT^ 
ATAAAACTATGGAGAAAACrGCTAAAGGGTATCCCTGACCTTTATGATGATGCAGCTAT^ 
GCCAAAAAATCATTTTACTGGCAAGAAAAACATCTCATTCCTrrGTCGGGAATATCOT 
CrCTTTATGAATTATTTTCTGNCCAGATGATTCCCTGGATTAACTAAAA^ 
TTCAAACTTGGGTGGCGAATGTGTTGNCGGGGTNTGNTGGACCTGNCTTCTGGANTGGCTCCT 
CXirrCrrANGTTTTAAATGGACACTTTTTGGTGGTGCAATCTATC^ 
AACCrrGGATTACNAAACTTCGAGNCCTTITTNTOAGGGGGGCGGAATO 
GGTTTAAANATTTGCAGCTTGGTATTACTGGATATTG 

SEQ ID NO: 3966 GGTACATGACAAGGTGCGGCTCCCTAGGCCCCTCCCCTCTTCAAGGGGTCTA 
CATGGCAACTGTGAGGAGGGGAGATTCAGTGTGGTGGGGGACTGAGTGTGGCAGGGACTCCCCAG 
CAGTGAGGGTCTCTCTCTTCCTCTTGTGCTCITGCTGGGGCTGGTGGTCCA 

GAGGCCATGTGGGCCATGAGGTCCACCACCCTGTTGCTGTAGCCAAATTCGTTGTCATACCAGGAA 

ATGAGCTTGACAAAGTGGTCGTTGAGGGCAATGCCAGCCCCAGCGTCAAAGGTGGAGGAGTGGGT 

GTCGCTGTTGAAGTCAGAGGAAACCACCTGGTGCTCAGTGTAACCCAAGATGCCNTTGAGGGG^ 

COTTCGACGCCTGCITCACCACCTTCTTGATGTCATCATATTTGGCAAGGTTT^ 

GGTCAAGGTCCACCACTTACACGTTGGCAGTGGGGACACCGNAAGGCNATGNCAATGAA NCTT NC 

CGTTCACTCANGGATAACCTTGCCCAAAACTTTGGCAACCCCNATAAAAGCAAGGATTATT^^ 

GAAAA 

SEQ ID NO: 3967 GGTACTTTTTTrmriT^^ 

TGTAGACAGGTGTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGGGCAAC 

AGCCCCAGCGTGCAGGGCCAGCATTGCACAGTGGAGTGCAAAGGTTGCAGGCTATGGGCGGCTAC 

TAGTAACCCCGTTTTTCCTGTATTATCTGTAACATAATATGGTAGACTGTCACAGAGCCGAATAC 

AGTAACAGGATGAATCCAATGGTCATGAGGATGCCCAGAATCAGGGCCCAGATGTTCAGGOVCTT 

GGCGGTGGAOGCATAGGCCTGGGCCCCGGCACGTTCGCCAACCATTTTCCTGTCCCTAAAOT 

GGAGTANGCGAATGCTATGAAACCCANACAAGCANCAGTTCAANAAAAAGGGNGTNAAACANGG 

GACCAAAACGACATGGTCGGGO^CGGANGGTTTCCTTGGGATTGTTGANTCNCCGGGGGACC^ 

GGAAAGGA^TOGTGNT^GGGGGGNTTCCCCCAAAANAANCACOTA^^^^^ 

TTTTGGGTTTTGGGGAAAG 

SEQ ID NO: 3968 GGTACTGCAAGTCAAGGGGACTCTTTGCAGGCGTGTCnTAGAAGGGAGCTG 
TTTGATTGAAAGGAAAGAAACTAATAGAAAATTTTATTGTCAAGATATCCGAGCTT 
ATTTGGAGATACACCGCGGCCTGCTCAAGCCGAAGATCTTrATGAAATTCTTGATTCCTTTACT^ 
AAAGTATGAAAATGAAGGACAACGAATCAATGCAAGAAAAGCAGCAAGGGAGCAGAGGAAGTCT 
TCTGCTAAAGAATTACCTCCAAAGCCATTGTCAAGACCACAGCAGTCA TCTG CACCAGTCCAGCTG 
AACTCTGGCTCTCAAAGTAACAGAAATGAATATAAGCTCTATCCTGGACTTTCC^ 
GAAAAGTGGCAATTTGAATCAACCCATAGAAAGTGACAGCGCTGTATTCATTTTGAAGGAAACAG 
CCTGGGGGATTTGAATTTTTCAAGCTGGAGACCAGAATCCCAGTTTT^ 
TTTTTNGATTTGGGGGGGAAAGGAAACmCGAGGGCN^ 
ACCCTTGAANT^*lAACG 

SEQ ID NO: 3969 GGTACACTGTAAATGCTCAATAAATATTGATGATGGGAGGCAGTGAGTCTTG 
ATGATAAGGGTGAGAAACTGAAATCCCAAACACTGTTTTGTTGCTTGTTT^ 
TAAATTGGGAAATArrGGCCCTmGAATAATTGTCCCAAATATTACATTCAAAT/^^ 
GGAGGAAAAAAAAAAAAAAAAAAAANANNNNh^ 
a^CNCCTGCTGCCTTCCTTGGATGNGGNANACGTTTTTNAGGCm 

ATTCCCCGTCACCCGNGGCACCATGGNAGGCACGGNGACTNCCNTCGAAAGTTGATAGGGCAAAC 
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NTTCCAATGGGTCCGTCGCCCGCCACCCCGCGT 

SEQ ID NO: 3970 ACGCGGGGGAAACGACAGGGGAAAGGAGGTCTCACTGAGCACCGTCCCAGC 
ATCCGGACACCACAGCGGCCCTTCGCTCCACGCAGAAAACCACACTTCTCAAACCTTCACTCAA^ 
ACTTCCTTCCCCAAAGCCAGAAGATGCACAAGGAGGAACATGAGGTGGCTGTGCTGGGGGCACCC 
CCCAGCACCATCCTTCCAAGGTCCACCGTGATCAACATCCACAGCGAGACCTCCGTGCCCGACCAT 
GTCGTCTGGTOrcrrGTTCAACACCCTCTTCTTGAACTGGTGCTGTCTGGGOT 
ACTCCGTGAAGTCTAGGGACAGGAAGATGGTTGGCGACGTGACCGGGGCCCAAGCCTATGCCTCC 
ACCGCCAAGTGCCTGAACATCTGGGCCCTGATTCTGGGCATTCTCATGACCATTGGATTCATCCTT 
GTTACTGGGATTCGGTTCTGGGACAGGCTAACCATATTATGGTACAGATAATCANGGAAAACNGG 
GGTTCTAGTAACCGGCCATAACCTTGNAACCTTTGGATTCACTGGGCAAl^TTGGNCC^ 
TGGGT 

SEQ ID NO: 397 1 acactgcccaggcaaagcgtccgggcagcgtaggcgggcgactcagatccc 

AGCCAGTGGACTTAGCCCCTOriTGCTCCrCCGATAACTGGGOTGACCTTGGTTAATATTCACCAG 

CAGCCTCCCCCGTTGCCCCTCTGGATCCACTGCTrAAATACGGACGAGGACAGGGCCCTGTC^^ 

CAGCTTCAGGCACCACCACTGACCTGGGACAGTGAATCGACAATGCCGTCTTCTGTCTCGTGGGGC 

ATCCTCCTGCTGGCAGGCCTGTGCTGCCTGGTCCCTGTCTCCCTGGCTGAGGATCCCCAGGGAGAT 

GCTGCCCAGAAGACAGATACATCCCACCATGATCAGGATCACCCAACCTTCAACAAGATCACCCC 

CAACCTGGCTGAATTCGCCTTCAGCCTATACCGGCAGCTGGCACACCAAGTCNAACAAGCACCNA 

TATCTTrnrn-CCCAAtGAAGCAATCGNTrCAAGCCTTTC^ 

GTTGACACTTACCNAAGNAAATCCTTGGANGGGCCTGAAATTTNAACCTTACCG^ 

AGGNTTAAAATCTT 

SEQ ID NO: 3972 ACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGAT 
GATGTGGCTCTGGAAGGCGTGAGCCACTTOTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGCTA 
CGAGTOTCTCCTGAAGATGCAAAACCAGCGTGGCGGCCGCGCTCTCTTCCAGGACATCAAGAAGC 
CAGCTGAAGATGAGTGGGGTAAAACCCCAGACGCCATGAAAGCTGCCATGGCCCTGGAGAAAAA 
GCTGAACCAGGCCCTTrrGGATCTrcATGCCCTGGGTTCTGCCCGCACGGACCCCCATCT 
CTTCCTGGAGACTCACTTCCTAGATGAGGAAGTGAAGCTTATCAAGAAGATGGGTGACCACCTGA 
CCAACCTTCACAGGCTGGGTGGCCCCGAAGCTGGGCTGGGCCAATAT^rmTTTCGAAANGN^ 
ITITAACACGACTAANANCCITIOTGAANCCCACCGACTI^ 
AAGGGCTTTTGGCTTAAACCTTTTCCTTCAACCCAAAAGGGAANTTTrm 
CTTTGGAN 

SEQ ID NO: 3973 GCCTGTGCAGTGGGACTGATTGCCGTGGGTGTCGGGGCACAGCTTGTCCTGA 
GTCAGACCATAATCCAGGGGGCTACCCCTGGCTCTCTGTTGCCAGTGGTCATCATCGCAGTGGGTG 
TCTTCCTCTTCCTGGTGGCITITGTGGGCTGCTGCGGGGCCTGCAAGGAGAACTAT^ 
CACGTTTGCCATCTTTCTGTCTCTTATCATGTTGGTGGAGGTGGCCGCAGCCAT^^ 
TTTAGAGATAAGGTGATGTCAGAGTTTAATAACAACTTCCGGCAGCA 

AAACAACCACACTGCTTCGATCCTGGACAGGATGCANGCAAAATrrrAAAGTGCTGNGGGGC^^ 
TAACTACACAAGATTGGGAAGAAAAATCCTTTNCATTGTCGAAAN 

ctggattaaatggtactgggggctgggggaattantttcaacgaanaaaggcgatnc^^ 
gggtmgtggaaaaaaatnggggcttgcttaagnaaaantgot 

seq id no: 3974 acgcggggagaggacgaaaaaaataaccgtccgcgacgccgagacaaaccg 
gacccgcaaccaccatgaacagcaaaggtcaatatccaacacagccaacctaccctgtgcagcct 
cctgggaatccagtataccctcagaccttgcatcttcctcaggctccaccctataccgatgctcca 
cctgcctactcagagctctatcgtccgagctttgtgcacccaggggctgccacagtccccaccato 
tcagccgcatttcctggagcctctctgtatcttcccatggcccaatctgtggctgttgggccm^ 
gttccacaatccccatggatattatccaagtcgggcccatctatccacctggctccacagtgct^ 
gtggaaaggagggtatgatgcaagtgccagattttggagcttgggctactgntggcnaaaattcc 
tncttcancttctgganggccctcccaatgctggttaacttgnaagca™ 
ccttggnaacttaaacggaangggaacttttttatngggggggtaaaaagggggg^^ 
nggggaggna 

SEQ ID NO: 3975 accgaccatagagcaagaatcaagattctgctaactcctgcacagcccggtc 

CTCTTCCrrrCTGCTAGCCTGGCTAAATCTGCrCATTATTTCAGAGGGGAAA 

AGTGATAAGGGCCCTACTACACTGGCTTTTTrAGGCTTAGAGACAGAAACTl^ 

TAGTGGCTTCTAGCrCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCTT 

CTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATmNCAGCCTATGAAACAGCTGGGTCm 

GGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTCTAGCCCCCrr 
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CAGCTTCTACACCCnTCTGGCCTCTCTTCATTGGCTGGACCCCAACCCAGCCA 
GGGTTTNCITraGGCCATNGGGAAGGGTTACCAAGAAAAA 

ATACATTTCCTTTAANAAAACCATTGGGGNCTTTGGGCCGGAO^ACCrrAANGGGG^^ 
CAACTNGG 

SEQ ID NO: 3976 GGTACTTTTTTTTTTTTITI^^ 

TTGGTCCAAGGCTTGTTAGGATAGTTAANAAAGCTGCCTATTGGCTGGAGGGAGAGGCTTAGGCA 

NAAGCCCTATTACmGCAAGGGGCCCTTCAAAAGTCGCTGGGCTCAAAAGG 

TGANAGTGAGCCTTTCGAANANATACTCGCCCAGCCCANCCTCCGGGCCACCCANCCTG 

TTGGTCAGGTGGTCACCCATCTTCTTGATAAGCTTCACTTCCTCATCTAGGAAGN^ 

AAGTCACANANATGGGGGTCCGTGCGGGCAGAACCCAGGGCATGAANATCCAAAAGGGCCTGGT 

TNAACrmrCTCCAGGGCCATGGCAGCTTTCATGGCGTCT 

TGGCTTTTTGATGTCCTGGAAAAAAACCCCGGCCCCCCCCCTGGT^ 

TTGNAAACCCTTGCGCTTmCTTNGGCCAAATTNCGGGAAAAA^ 

CX:CATTATTG 

SEQ JD NO: 3977 CGAGGTACGCGGGGGAGATGGCAGATGAGATTGCCAAGGCTCAGGTCGCTC 
GGCCTGGTGGCGACACGATCmGGGAAGATCATCCGCAAGGAAATACCAGCCAAAATCATrTTT 
GAGGATGACCGGTGCCTTGCTTTCCATGACATTTCCCCTCAAGCACCAACACATTn^CT 
CCCAAGAAACATATATCCCAGATTTCTGTGGCAGAAGATGATGATGAAAGTCTTCTTGGACACTTA 
ATGATTGTTGGCAAGAAATGTGCTGCTGATCTGGGCCTGAATAAGGGTTATCGAATGGTGGTGAA 
TGAAGGTTCAGATGGTGGACAGTCTGNCTATCACGTTCATCTCCATGTTCTTGGAAGGTCGGCAAA 
TGCATTGGCCTCCTGGTTAAACACGTTTTGGGGATAATTTTCTCTTCTTTAA 
NGCCAATTTCCAGTATTGGTAAAGAACACACTTNATTTTNGCCTG^ 
AAATAATrrTNAAAANNGCTTCNTTAAATAAAAAAAAm 

SEQ ID NO: 3978 ACTTTirrTTTTTTTTTTT^^ 

AATTTTGAGTAGTCAAAGTCAGAGCAGTCAATNTGTGTTGTGAGCCGAGGCACAGCTGCANAAGC 

GTGT^m3AGGTGTCCGGTGGAGGTGGCAGCCGAGCIOTGGGACTAATCACCGTGCTGG^ 

ACCGCGTNAGGATGCAGGCAGATCCCTGCANAAGTGTCTAAAATTCACACTCCTmrCTGGAGGG 

ACCGTCGATGGTATTAGGATANAAGCACCAGGGGACCCCACGAACGGNGTCGTCGAAACAGCAG 

CCCTTATTTGCACACTGGGAGGGCGTGACACCAGGAAAACCACAATTNTGTCTTTCACGGW 

CACTGTCCTTGGCCGCGACCACGCTAANGGCGAATrrCAGCCACTGGCNGGCCGNACTAATGGA^ 

NCANCTTCGGGACCAAACTTNGNGGAAATATGGGCATAACTTGTTTTCITGGGNGNA/^ 

TTCCGTTNANA^VTTCCCACAAAAAAACCAAACCCGGAANCCTTAAAANGTTAA^ 

GNCTTAATN 

SEQ ID NO: 3 979 GGTACGCGGGCCTTGCTCCTGTGTGCTGTCTAAACCACTGGTGGATGAATACT 
TTTAACAATTCrrAATATATATTAGTGCAGTGCTGAAan'ATCAAAGTGGGTCCGAAGAAT^^ 
TATCAAATAGCAGTAATAGGCCAGGTGTGTTGGTCACACCTGTAATCCCGGCAGTTTGGGAGGCT 
GAGATAGGCAGATTGCTTGAACCCAGGACTTCAAGACCAGCTTGGGCAATAGAGTGAGACCTTAT 
CTCTACAAAAAAAAAAAAAAAAAAAAAAAANGTNCGCGGGGGAGGCCTGCCTGACCGACCTTO 
NCAGGGCTGNGGCTACCATGTTNTCTCCGCGGGTGTCGCTGGGCTGTCGGCCTGGACCTTC 
CCAATGGATTCAAAGTTCGAAATATGGGAACTTTGNAAGATATCCCNGGAAACTTAAGGTC 
AAAACATCCAAAAATTACCAAGTNThrrGAAAATGGGGCNGGGqCAAAATTTGCCCA 
AACTTGAACCCCTCGAAATTTTGGGTTGGGGACNTTACTTTGTTGAAA^ 

SEQ ID NO: 3980 ACTGITGTCCATTTCATGAGAGTAGGCTTGAGGACACCATGGGCAAGGATCT 
GATGGrrGCCAGCCTAAGCGTTTTAGACTTTTGACCCAGAGATTl^ 
ATTTTAGAGGATAGGGTCrCAAGATATAATCCTTTTTATAGGCGGCAGGTOT 
CCAGAGGAACGGATGAAGCCTGCTTGGAAGCATGCTGGGATGGCCATTTGGAAGGAGTGTTGCAA 
GGAAGCATGGCCTTGGCTGGGCGCTGCCAGGAGCrrAAGGGTTGNAAGTTGmTGNCTGATCGGC 
TCTTGGCCTAATCTTGNGGCTTCCAGGAGGAAAAAGAAAAATCAAACTGGCTTNCT^ 
TGCTTTATTCmGAATGGNGNACCCCATGGAANAAGGGTCCTCAAAATGGCCGGCAG TO 
CTATAAATGACCCGGTAGGGGCTGGTCCATTGAAGGTTGAAANTTTGANGGGGTAAAANTm 
AAAGGACTGATCGNCCACTrGGGGNGTTTTATTTGOTTNGGAATCCAAACTTOT 

SEQ ID NO: 398 1 GCGTGTCGCGGCCGAGGTACAAAGAAAGTTTTAAGTCAAGGCCTCACCAATT 
CCTACAGTATTAGTATTGTX5TCTCAATTCTCAAAACTAACTTTTAAAAAGOT 
AGGATTTTAAATGAAAATATAAACTAGAATGAACAAACATGAGAAATATTTCm 
GCTAGCACCTTTGAGTTTTCCAAAAAAGCACGTCTCCCCAGTGTGTTCACTGTGATC 
AGATCCACATTTAACATACITAAACTAOTAAACTTAGATAACATCACTCTG/^ 
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AATGTTAATTGAGAAAAGCTGAAAATAGTmAGTTTACTCATTATCACATGCm^ 

TGCATGAGAAAACACTGAAGAAGTAATTTTTTAATCCAGATTTTTCAC^^ 

GGGTCCCCCACTTCCTCTATTATAACTGCTCTTAATTGCTGGTGGCTGGCTGTGAAAATG 

AGTACCTCAATGGTCCTACCAAACTAGGCTTANCNNCCATGCTGGTCTGNCCCTGGA 

GAACAAC 

SEQ ID NO: 3982 GGTACGCGGGGGTGGCAGCTTCGGATAAACGCAGGACTCCGCCCGGCAGCCC 
GATTTCTCCCGGAACCTCTGCTCAGCCTGGTGAACCACACAGGCCAGCGCTCTGA^^^ 
GTGACCCTGGGCCTGCTTGTGTTCCTGGCAGGCTTTCCTGTCCTGGACGCCAATGACCT 
AAAAACAGTCCrrTCTACTATGACTGGCACAGCCTCCAGGTTGGCGGGCTCATCTGCGCT^ 
CTGTGCGCCATGGGCATCATCATCGTCATGAGTGCAAAATACAAATGCAAGTTTGGCCAGAAGTC 
CGGTCACCATCCAGGGGAGACTCCACCTCTCATCACCCCAGGCTCAGCCCAAAGCTGATGAGGAC 
AGACCAGCTGAAATrGGGTGGAGGACCGTTCTCTGTCCCCAGGTCCTGTCTCTGCACAGAAACTTG 
AACTCAAGATGGAATTCTTNCTTCTTTTGCTGGGACTCCm 
TTTGCAAGAAGGGKrrmGGTCNAATITrm 
AAAA 

SEQ ID NO: 3983 GCGTGGGTCGCGGCCGAGGTACTGTTTATTAACCAACCAGCTTAGAAAAATA 
ATCATGGTAGACACCTTAGrrCATTCTTCTAATAAGCCTGTTGATCTGGTCCTCCCTGT^^ 
TCTCCACCTTCTACAAAATGGGTGGTCmTTCrrCAmCACCTCGTGGAGAAGACAAm 
GGCCACAGGAAGTTATTTGCCTCTTTGAAGCGTTTTCCAACAGTATAGATCTCATGA^ 
TCCATGCAGATGATGCCGTATTTACCAAGAGATCGAGCAATCAAAGCGTTATCTGTCAAAGCAATT 
CGCTTCTTATTGATTTTGCCATAACCACGCTTGNAGATTAAGTTCATTTACTGACr^ 
TACCAGCATCCCCAGCGTCTGGCATTCCATGTTTCTGCTCCTGNGGCCTCACGGNGCAAC AAG^ 
GC^JGGTTACTTGGACCITGNCTCATCTTCTTTNTTT^ 
TTCACTTGGTTNT>TOGGGCCAAAGGTTTCCAAAAAAATGGNG 
ANCTTGNCCNG 

SEQ ID NO: 3984 ANTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACTGAA 
GTCCACCTGGGCATCTCCATTGGCGTCCAGGTCCITGAGCAAriTATCCACGGCATCCT^ 
CCGCTCTGCAGGAAGCCTGGTAGCTCCTTCTCCATCAGCACCTTGAGCT^ 

TGCGTGCTGCCCTCGCTGCCCGAATATCGGGAAAAGACGTCTATGATCATGCCCATGGCTGTCTCT 

AGTTCCGTCATGGTGCTAGATTCAGACCCACCTTCCTCCTGGGGGCTGGCAGGGCCGAGAAAATGT 

CCCGCGTACTriMlUnn'lU-riinHl'ri'n'lllU N GGCATGCAACGAAACCTTrATT^ 

ANGTTCAACTATTAACTGAAACTTGNAATTTCTAAACITAAATO 

AAANAATGNCTTAATNGGCANTGGNAATGCNTGATTGAAAAATNANANCNCNCOT 

GAACAGGGCAGGGCCAATTTTTITGGATTTGAAmAAAAAAAANGGGGOT 

SEQ ID NO: 3985 ACTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACTGAA 
GTCCACCTGGGCATCTCCATTGGCGTCCAGGTCCTTGAGCAATrrATCCACGGCATCCTTGT^ 
CCACTCTGCAGGAAGCCTGGTAGCTCCTTCTCCATCAGCACCTTGAGCTCCCCCTTGGTC^ 
TGCGTGCTGCCCTCGCTGCCCGAATATCGGGAAAAGACGTCTATGATCATGCCCATGGCTGTCT 
AGTTCCGTCATGGTGCTAGATTCAGACCCACCTTCCTCCTGGGGGCTGGCAGGGCCGAGAAAATGT 
CCCCCGCGTACC 

SEQ ID NO: 3986 ACCGACGTTGAGGTGGCTGCTGACCTTGGGTCTCATCTCCTTGATTTTC^ 
CTTCTTCATTGCCGTCCTCTCTAGGCTGTCTTTGGCGAGGAGGGCCCCTGCGGAATCGTGGTCTATA 
TCCCCGATACATATTCTGCCTCACTGGTCTACCITGTTCTCCrGCACCCTGGTrGTC^^ 
ATCACTTCTCCCTGCACAGGAGGGTTGGAATACTQTGGTCGACGCCCATAGGGTCTCCGCATGTAG 
TAAGGTGGGAACCTTCGCCTGCGGTAGGGCCGGCGTTGTTGGGCCTGGCCTTCGGGAGCACT 
GATCCCTCGTTCTTTTCCCCACTCTCACTATTCTGGTAATTTTC 

TACGACGTGGATAGCGTCTATAATGGTTACNGNCTGCTGCATATTTAATGGCTTGAACCCGCGT 

SEQ ID NO: 3987 GGACGNGGGGTCCTCTCTNCGGTCCGTGCCTCCAAGATGANAAAGAAAAGAA 
GGAACAATGGTNGTGCCAAAAAGGGCNGCGGCCACGTGCAGCCTATTCGCTGCACTAACTGGGCC 
CGATGCGTGO^CAAGGACAAGGCCATTAANAAATTCGTNATTCGAAACATANTGGAGGCC^ 
AGTCAGGGNCATTNTTGAAGCGAGCGTNTTCGATGCCTATGTGNTTCNCAAGNTGCATGTGAAC^ 
TANATTANTGCGTGAGTTGTGCAATTNACANCNAANNAGTNAGGAATCGATOT 
NAAGGANNGAANACCCNCACCNCGATmTANGACCTGCGGGTGCITCCCCACGTCC^^ 
AACCCATGTAAGGANKTGAGTTNTGTAANGANTGAAGACAGGCTTTTNTC^ 
AGGTTATTTmGTT^mn'CTTGGNGGCrrTTTAANGGCCAA^ 
TTANAGAATCTTNNTTNCGGTAAACCAATTTTGTTAAATNAANGGC^^ 
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TGGTTTTNGTTT 

SEQ ID NO: 3988 GGTACTTTTTTITITn^ 

TGTTTCATAGGCTGGAGATGCACTCTTCTANACTGCTCGAGACAGCCAGAGACAGGGGAGGAGGG 

AAGAAGGATACTGTGGAAAGGGATGGCGGGGCAAACATTTANAGCTAGAAGCCACTACTGGGCC 

AATGCTAAAGTTTCTGTCTCTAAGCCTAAAAAAGCCAGTGTAGTAGGGCCCTTATCACT 

TGCTAGGrn'CCCCTCTGAAATAATGAGCAGATTTAGCAAGGCTAGCAAAAAGGAANAGG 

GCTGTGCAGGAGTTAACANAATOTGATTCTTGCTCTATGGNCGG 

SEQ ro NO: 3989 ACnTTAAGGGAACTACCCAACTATGTTGTGATAGAAGAAAGAGAAACCTTCA 
CTTTGGCATTTTTTTTAATCACTGTTTATT^ 

GCAGATATGCTTTGCATATGGATTGTTATGTTTrrATTTGGGCAAGTTTAAT^^^ 

AAAGAAGGGGGGAAATGGTCAGTTTAAGCCAAAAGAAACTTTCTAAACAATGTATAGGTACACAC 

CTTTGTCCACTGGGTAAATTATATTCATTATGCCCACTGCTGCAGCACGCATAAACCAA 

GCATGGCTGAACAGGGCCTAATCTAAGACTGATGGGAGAAAGGCTTGCAAACCAAAATCAANGG 

GNTTCTNCCTAATACTGCTACCAAGCTGATCCCTACAAAAATGCNTTTTAA^ 

TTCTGTGTGCAAGAAAAACCCAGACCTTGGTAAAAAAIWCTCTCCTTANCAT^^ 

AGCTTTAAAAAAAACCACCCCCAACCACCCTTGGGTNGCCCCrCTTAATNCACAAACOT 

G^^^T^^AANGGAAAAGAANTTTGGGGGGAAAATCCCT^ANGGNGGANAA^ 

GCCCGNGCCTTTNTrrCTTTTCCAAACCTCTTO 

SEQ ID NO: 3990 GGTAC rini ' lU " iU - ll 'lU-ll"ril"i l l 'lCGGTATCATAAAGAGNGNTGAAGTTTAT 
rrATTATAGCACCATTGAGACATmGAAATTGGAATTGGTAAAAAAATAAAACAAAAA 
GAATTGTATTTGGNGGAACAGCAAAAAAAGAGAAGTATCATTTTTCTTTGGCAAArrAT^ 
CCAAACATTTTGGAAATAAATAACTGGAATmGTCGGCACTTGCACTGGTTGACANGAT^^ 
AANAGGAACACATNTGGAGTTAAATTTTTTTTGNTGGGATTNCAGATAj^ 
AAGCAAACAGGGCNACNGTNCACACCAAATT^^TGATCAGGACNCCAATT^ 
TTTTNCAATAGGGAGTCTCACANCCITGCCTNGTCCAATNTTCA^ 
CANNGNGTTTGGGNNAACCATTCTTTTAAAACTGGGGAAGGNGAATTTNGGTAT^ 
NGGATrTACCTTTATTTGGACCCATTAGCTrrAAGGGCTT^^ 

SEQ ID NO: 399 1 GGTACATTTTTCTCAAGTTAATGTATAAAGAAACTGCATTGATGTAGATA^ 
TCCCAGAACTCTCAGTTCTTTAGGGATGGACTAAATAGCTGGTATC ATCAC TTACA 
GAATGCTATAGAGCTAATGnCAAAAATATAGmATAmXTTATCrmAACCAT^^ 
ACTTITTGAAGTCTCCriTGAATATGTACCCATTAAACrGCTAA/^^ 

ACAACAGAAGTCCAATTTAGATTCTGAGTGTTGTCACCATGTGAri'ACAATCACACAGACACTTCC 
AAGCTTATAGCTGGAGCTCCTGGAAGCTATrrCATACTCTGGTGCAAGGGGCAAAAAAAAACA^ 
ACNCAAGAAGGGAATAAGGTACACACANACACACACACACNNCCCNCACAGGCCTCTCTCTN 
TCThm-CTCTNTCTrGGTCAAGCCATCCAACACCCCTTCAAGGATGATA^ 

AG<m'ACCA^^r^^^AATNCAAATCAAAA^m^^ccGa^^>^^ 

. AANATTTANCCCTT 

SEQ ID NO: 3992 GGTACAGGTGCCCTCTGTGCCTATTCAGCAATTCCCTACTGAAGACTGGAGCG 
CTCAGCCTGCCACGGAAGACTGGTCTGCAGCTCCCACTGCTCAGGCCACTGAATGGGTAGGAGCA 
ACCACTGACTGGTCTTAAGCTGTTCTTGCATAGGCTCITAAGCAGCATGGAAAAATGG^^ 
AAATAAACATCAGTTTCTAAAAGTCAAAAAAAAAAAAAAAAAAAAAGTACTTT^ 
TTTTmACTXjANACAGGGCTTAACTCCTGTCACTCAAGCTGGAGTGCAGTGGC^ 
NACTGNAACCTCCGCCTTTCGGGCTCAANANATT^m'CTGCT^ 
ACAGGCCCOTGACAGGATGCCCAGTAATTGGATTTTTGNAGAAATNGGGGCTT^ 
AAGCTGGGCTNAAACTTCTGACTTCAAGNGATCCCCTTGCCT^ 
TCAGGCTTTAACCNCCCNCCCCGGCCCAANNAAAATTTTNNm 

SEQ ID NO: 3993 acgaaaacagaaccaatctaaaaatggctgatgttactttaggagcctgaaa 

AAAACAGGAGATCCTTGAAGACCCAGCCACCCCrrCTAGAATGTTCAATAAGGGCACCTITCCA^ 

AGCTACTAAGCAGGCACTTGGCATTrrAGGAGTTTGTCTTATGGTTGCATAAAAGTATCCC 

CCCAGACCTGGCACCCnTrATGGTTCAAAGNTAANAACGGGAAGAATGGGTGGCAAGGTGGCTCC 

TGGNAGAGCTCACCCAGCACAGCATGCCCTGAGCTCGGGGCCTTGGTTTNTGTCCCTNGGGATATT 

TATATTTAATNAATTTITATTAAATNCACNGNGAAATNAAAATATTAAA 

AAAAACGTGNGGNACTCAGTANGAAGCCNAAACTNGAGANGTCNTCAAGGCCAATTTGACTT^ 
NTTACCCTTTGGNTTTTG^OmTTCTCCTT^GNTT^ 

GGGGbmrCCNGGTACTTCTGCCGCNACACNCCTNAGGGGAATTTCAACC^^ 
CTAGTGGATCCNNNNTNGGATCNANTTNGGGANNAANGGCNTAKTCGOT^ 
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TCNCN^^^^^^AATTTCNACAAATTTANCCGAATC 

SEQ ID NO: 3994 ACATGGCAArrAGAAGlTGTCATGGCAAAAGAAAACCACAGCTGGCCTGCCA 
CAGCCAACACAAGAACCAGAAAATGGTAGATGAAATGAAGGAATAAAGGTGGGGrrTATTCCTTA 
TTATAAAAGAAAAAAAATAATTCTTCAGCAGTCTTAACAAAGACATCAAGATACAAAAT^ 
TGTTITGACTCCAGCCCTGTCCCCATCTCCTCCAAGAGCAGAGGTAGGAGACAGTTGAAGCAAAC 
AAGCAATTCTGTAAAAATTACCTAGAAACCCTACAAATTTGATTAAAATCTAAACTO 
TGCTTTTTAAAAAATTTAATATCAAAAGGCCTGCTTTAGTGACATGCT^ 

CCCCAATACCCCCTCTGTCAGNAACATGCTCAAGTTGACCAAGCCCACTNTTATCTNTTAACC^^ 

GTGGACAAGTGGGNCTTTAAACCAAAGC<>ICNGAGATCAAGTGACTGmrrACX; 

ACTAATCCAGNGCNCCTThrrCCNANGGNGGGCCGGGCAAAACNCAAANGGTTTTGAGGG 

CAAAATACCTOTTAAAACNCTTCCrrGTTTTTTTTAA 

TAAATTGGGCCCGGNNCGAAGGGTTTCTTTACCAAACNN 

SEQ ID NO: 3995 ACGGGCTCATGAAAGTGTTGTGAAGAGCGAAGACTTTTCGCTCCCAGCTTAT 
ATGGATCGGCGTGACCACCCCTTGCCGGAGGTGGCCCATGTCAAGCACCTGTCTGCCAGCCAGAA 
GGCACTGAAGGAGAAGGAGAAGACCTCCTGGAGCAGCCTCTNCATGGATGAGAAAGTCGAGTTG 
TATCGCATTAAGTTCAAGGAOAGCTTTGCTGAGATGAACAGGGGCTCGAACGAGTGGAAGACGGT 
TGTGGGCGGTGCCATGTTCTTCATCGGNTTCACCCGCGCTCGTTATCATGTGGCAGAAGCACTATG 
TGTACACTGAGAAAAATGTTACTGCTTCAAAACAACCAAAAATGGGAAAATAACTGAGGCT 
ACAAATTTCTCTTTCTAAATTCCAGCGGGCTCGNCAGCAGTrCNTATTAA^ 
TCAACTCTAGTTGGCCACAGTTCACCCAATGCNNGATX>rTTTAGAC^ 
GGTCCCCCCAAAATTCCTTAAAOTGGTCCCAATACNrrGCAATCAAAAATATAAT^ 
TTATITCTAATTANGCAAAAa^CNCCCCNTGATTGNCAACAAAATGNTAATTGGT 
ATACTGCCTCCAGTTTNTTTTCCACNGGGGTGCT 

SEQ ID NO : 3 996 GCCGTGGGCGCCGNCCGAGGNCCAGACATTTTCAAAGrrGCCAGTGTTACTT 
TAATTGGACTGCCTTCGTAATTCATTGCCTCTGCTTCAACAATGTGCAACTCATCCm 
CCCTAAACTGACCGTTCTTAAAGATAACTGGGCTCATTTTCATCATTATCCACCTTAAAGTGAT^ 
TCTTTGTCGGCCTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTCAGGGGGCT^ 
TCCATCGAATCTTCCATCGGGTGGCGGCACGCACTTAGGTAGGAGAOAAGGCGGACGGAGATAAA 
AGAACGCTGCTCCAGAGAACAACCCGCGCAGGACGGAATCACACCAGGGACCCCGCGTACTGNA 
AAGCTCTTGCTATAGCCCATATATTCAGTTrGNTCAACAATCTCACTCATGACAGAAAAAGT^^ 
AAATAAGACTTGGGCAAAATTATTrrCTATCAGCTCACATOGAAAGACCACAGCTTTAATGAAT^^ 
TACAATCCTTAAAATAATCCTTGGCAGGATAGAATTNCGAATCAAATTAGCATGAACTGGT^^ 
ATANC™TCTOCNCTTAGAANNCTAACATmCTTTCNAGATGm 
GAAGTTTATNTTGCTTTANGGGTT^^mAAAGCTTTTACTA^ 

SEQ ID NO: 3997 rmTrrnriTiTTTTT^^ 

CAACATGACACCAACAGAAGnGATCAAAGTGTGTCGTGAAATTGCGAGTCTTCATTCCrrm 
GGNTCTTTGTATTCAAAGATGCTATTTAGNGGGTTCTGGCTTCATCTTCT^ 

GGTCTCTTATTTGACCCTTTCTTGCCAGGCCTCrrGAGGCCTCTGCCATGAGCTGCTTGGCCCGGM 

GCTANATACATCATCATAGCTCTCCCGAGTGTTCCACTTTGAGCCTTTCTT^r^ 

AACTITGGGTTTTTATACCGCGTTAACACTGGGCCCCTTCCTCAT^ 

TmTGCCTGGGCCANAAGNTTNTGACTTCCTCAANGAAAACNAGTTATTAAAAAA^ 

ATTCTTAANAACATCATNAAAGGGCTTrCTCCGCTGCCTTTTTGAANAACCT^ 

CGANTTCTAAAGCTCCCAAGTGCTAACTTTTTAAACCTTTCATGGGAC 

CAAATTrrrNTTrGGANAAAAAATTGGCCriTCNCAAANAA^ 

GAGGG 

SEQ ID NO: 3998 ACTTTNTiTmrnrmrnTiTrrm 

GAACCCAATCACCGGGTGCCCAACTATGACTCTGCTATCTTCGAAATGATGANATGGGGCACACT 

CCCCTAACAGTATTTAGCTAGACCAACAAGGATCCTGGCTCCTCCCAGGGCTGTAGCANAGGGAC 

NAGGCCCTTCGAAGCTTNTAGTGGGCACAGGGTATTTGAAATCCATCCTTmAAAC^ 

ACTTGGCCTACTTGATTTATTGCTGGTAGCAAGATCATTAGAATTTTAGTTAATTAATAT^^ 

CTCTGCACCGGNGAGGACTCTGGAGTCTCTTGTTACTGCAAGCATAAAAAGTGCANCCCACA^ 

CTTNCTGGCCATGGAAAGGACCAAATGCNCCTGCAGGNGGAAACAAACATGACCTCGGGCCACTG 

NGCAACCCCTTCCATGTCCCCCTGGAACCGACACGCCTNACATTAAAAAGGGCCCCCCACTTGNA 

GNTNGGCAAGACAGCTTTITGACNCNAAACTTTTGAAAGAACaSfCC^^ 

AAAAGGACCTGCTTCGGGNAAmAAANAACCTTTTNGGCAAGAACCGGGGGAm 

AAAAAAAGAATGTNGGGGACCAAGGGGAAATTTTCnTGAGCGCTGGGGGGGTNG 
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SEQ ID NO: 3999 CGAGGTACTTOTnTITnTm^^ 

GAAAGAGTGGCCACCACATCTTTATTGCATACTCAGGTGAATAACTTATTATACAATGAA(^ 

TCCATTAGGAGACCATGCCCACTTACAGAATGCAGCCGTAAATGCGGTAAATm'ATTTACAGA 

TTGGGGTGCAAGATGAQAGAAGTATCAGCCCCAGGAATTTGAAGTGAGAATGATCTACAAATTOT 

CCTGACAAGGAGCAACCGGGCTTGTGCTAGTGAGGTNTGAAAGAATTNCTGGCANAGCGTAGGGG 

GAGATTAGATCTCGGAATTGACAGCAAGTTTGGGGACAGGGCAAAAAAAANAGGGGTGACCTGG 

GAATTGGTGCTNGGGAGCTGNTGAGCCCAATGNGAGGCAGCACTNNANAGATGAGGTAAATT^ 

NGGGNATmrrAACCTmCTAACNCAGNCAAAAAGGTlNGGGANCNGG^^ 

GCTTCAOTGTAATTNANGNCNATGGGCTTANANAAGCTTTTGGCNTTTTTCT^ 

TTTTTGGCNCAAANlOTTAAAACCCTGGGAGlTAAAGAa^G 

CCAAAAACCCANNTmCCTNGGAAAAANCNANATTANGGC^^ 

SEQ ID NO: 4000 ACGCGGGGAATTCATGGACGACACGAGCCGATCCATCATCCGCAATGTAAAA 
GGCCCCGTGCGCGAGGGCGACGTGCTCACCCTTTTGGAGTCAGAGCGAGAAGCCCGGAGGTTGCG 
CTGAGCTTGGCTGCTCGCTGGGTCTTGGATGTaJGGTTCGACCACTTGGCCGATGGGAAT^ 
TCACAGTCTGCTCCTTrnriTGTCCGCCACACGTAACTGAGATGCTCCTrTA 
GTTTCANGTTTA 

SEQ ID NO: 400 1 CGAGGTACACACCmGTCCACTGGGTAAATTATATTCATTATGCCCACTGCT 
GCAGCACGCATAAACCAACACCCCTGCATGGCTGAACAGGGCCTAATCTAGGACTGATGGGAGAA 
GGGCTTGCAAACCAAGATCAAGGTGTTTCTCCGCTAATACTGTCTACCAAGCTGATCCCTACAAA^ 
ATGCATATAAAAGCAGGCAAGTTTAGCTACTGTGTTCCAAGAGAAACCAGGACCT^ 
TTCTCTCCATTACCATTTATTCTCTCAAGGGAAGCTTAAAAAAAAACAACAACAA^ 
GGTCTGGCCACCTCATGAATCCAACAAGCATTAGTGTGGCATITCAGTGGAGAAGGAAACTTGGG 
GGGAAAAATCCCATCAANGGTGTAAGAAAGGTTCCAATTTAACTGGCCTGNCCTATTTATNC^^ 
ATCCAGACCATCATTATTTAGACCCTCTGATCAATAAAAGGG>mCCAGCNTTCAGAG<^ 
GNGAACCCAAAGACATNAAGAAACCGNTTGCTTGANAAAAGCAGCGATTmrCT^ 
CTTGGGTAAAAAATGCCAAAAAWTGGrrGGGNTTAAAACrrGT^^ 
NACTTAAAAGCCATAmGNTTAATrTTTTTCANGGTTATAAAGGGGTrrr 

SEQ ED NO: 4002 GGTACGCGGGGGCTGACTCTCTTTTCAGACTCAGCCCACTTGCACCCAAGTO 
AArrAACAGCCTTGTTGCTCACACAAAGCCTGTTTAGGTGGTCTTCTATACGGACATGC^ 
rrGGTGCCAAAATCTGGGCCAGGGGGACTCCTTCGTGAGACCGGCCCCCTGTCCTGGCCCTCATTC 
CGTGAAGAGATCCACCTGCGACCTCGGGTCCTCAGACCAGCCCAAGGAACATCTCACCAATTTCA 
AATCGGATCTCCTCGGCTTAGTGGCTGAAGACTGATGCTGCCCGATCGCCTCAGAAGCCCOT 
CCATACAGATGCCGAGCITCGGGTAACrCTTACNGTGGAGGATTCCNAACCATATGAAGACArc 
TACTGGACGATCAAGNCTTGGCAAAAGTCTTGACCCCTTAAACTCTACAGCTTAATGGACAAAAC 
CTACCCGGNATTTATTAGACj^CCACTGGCGNCATTGNAGGANCCriTCATrGGGTTA 
ATAAAGCCTGCCTTAAACAGCCANTrGGATrrTCNTrrCTTCTGGAAG^ 
CCGATANAAAACAACTTCAANCCITAAAATTCTGGAAGGCCANCCAGGCATGTT^ 
TTTTCAAAGGCTTCCAATTGTTCAAAANGr^mC^nrTT 

SEQ ID NO: 4003 GGTACGCGGGGGAAACGACAGGGGAAAGGAGGTCTCACTGAGCACCGTCCC 
AGCATCCGGACACCACAGCGGCCCTTCGCTCCACGCAGAAAACCACACTTCTCAAACCTTCACTC 
AACACTTCCrrCCCCAAAACC^GAAGATGCACAAGGAGGAACATGAGGTGGCTGTGCTGGGGGCA 
CCCCCCAGCACCATCCTTCCAAGGTCCACCGTGATCAACATCCACAGCGAGACCTCCGTGCCCGAC 
CATGTCGTCTGGTCCCTGTTCAACACCCTCrrCrrGAACTGGTGCTGTCTGGGCTTCATAGCA^^ 
CCTACTCCGTGAAGTCTANGGACAGGAAAGATGGGTGGCGACGTGACCCGGGCCCAAGCCTATCC 
TTCACCCGGCAAGTGCCTGAACATCTGGGCCCTGATTCTNGGCATCCTCATGACCATTGGATTCAT 
CCTGGTACTGGGATTCGGCTCTCNGAAAGCTACATATTATGTTACAAAATAATACAAGGAAA^ 
GGTTCrAGTAACCGCCATANCTTGAANOTGNACTCAATTGGCAATGCTGGCCW 
TGTGNCCTGNCCCCTGGGCTNCCCTAANACAACAAGTTATACCCCCCCCTGGCTACAGGGATTCA^ 
TAAANGCCCGGCTTCCAAAAAAAAAAAAAAAAANACCTGCCCGGGGC 

SEQ ID NO: 4004 ACGCGGGGAAGAACCCCCCTATCAACACCAAGAGTCAGGCAGTGAAGGACC 
GGGCAGGCAGCATTGTCITGAAGGTGCTCATCTCTTrrAAAGCrA^^ 

AATCTCTGGACAAGAATGGTGTGGATCTCCTAATGAAGTATATITATAAAGGATTTGAGAGCCCGT 

CTGACAATAGCAGTGCTATGTTACTGCAATGGCATGAAAAGGCACTTGCTGCTGGAGGAGTAGGG 

TCCATTGTTCGTGTCTTGACTGCAAGAAAAACTGTGTAGTCTGGCAGGAAGTGGATTATCT^^ 

GGGAGTGGGAATTGCTGGTCCAAAATTCArrCAAGAAGAAAATAGATACCACCTGACAACATGGC 

AAAATCCCATC^TrCCAAACATCAAAAAAAAAAAATTGGTCCGGCATGGGGCTT^ 

CCGCTAATTTTTGATTTTTAGTAGAAACANGGTTTCACCTATT^^ 
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ACTrGNGATCCANCCGCCTCGGCTTCNNAAGNOTTGGAATACCAGGCGTAACC^ 

CCAAAATTTAAAAATTTTTAAAOTGGTTTAATTGGTCCCTCANA^ 

rrTTTCCCTTTTGGNCCTTNCCCANCCCCCATTGGNGGGGGAAGGTGCC 

SEQ ID NO: 4005 ACTGCTATGAAGCATCCCTTCCACATCAGATCAAAGACATCTTAAAGCCAGA 
AATAATGGAGGAGATTGTGATGGAAACACGCCAGAGGCTTTTGGAACAGGAGGGATAAGGAGGT 
GCTCCAGAAGCACGGGACTGTGGACCTTGCAGGAGTGAAGACTGTGATGTGTGGTCCCCATATGT 
GGCTCAGCAAAGACTCGAGAGATCATCCCTTTGTCTGCATTGACGGCCCTGTGAC GGCC TTCAGCC 
CACAGGCCTGCTTTCTCCTGTCTAACACCAAACCTGGGTGGCAGATGAACAAGTGCTTTCT^^ 
TGNAAOTAATTNCCGNATTAAGGAATAGmAACTNTTTCAAAGTGCAC^^ 
GGCTTCCATCCTGGAATAATATGGAAAATCCTTGGCTTCCCTCCTGGCATTACAAGGACAGGCCC^ 
ATGAAATTGCCCCATANAAAAATGGCATNCCTACCTCCAAANGGGTGGTTGGGCTCCATCCTTAT^ 
CTTAAANACTGAAAAGCACTGGNAAAOTGGAAAATAAAGTACCACCAAGGGGAACCCCTTT^^ 
TAAAAACAAAAATAACCKGGGGGGGGGGGCCCTTGNNCCNCTTTTTGGOT 
TTNACCTGGNNGGGGGGTGGGGG 

SEQ ID NO: 4006 ACTTTTrrTTrmTnTTT^ 

GGCAAGTTCATGAAACCTGGGAAGGTGGTGCITGTCCTGGCTGGACGCTACTCCGGACGCAAAGC 
TGTCATCGTGAAGAACATTGATGATGGCACCTCAGATCGCCCCTACAGCCATGCTCTGGTGGCTGG 
AATTGACCGCTACCCCCGCAAAGTGACAGCTGCCATGGGCAAGAAGAAGATCGCCAAGAGATCA 
AAGATAAAATCrriTGTGAAAGTGTATAACTACAATCACCTAATGCCCACAAGGNACC 

SEQ ID NO: 4007 ACGCGGGGGAAACGACAGGGGAAAGGAGGTCTCACTGAGCACCGTCCCAGC 
ATCCGGACACCACAGCGGCCCTTCGCTCCACGCAGAAAACCACACTTCTCAAACCTTCACTCAAC 
ACTTCCTTCCCCAAAGCCAGAAGATGCACAAGGAGGAACATGAGGTGGCTGTGCTGGGGGCACCC 
CCCAGCACCATCCTTCCAAGGTCCACCGTGATCAACATCCACAGCGAGACCTCCGTGCCCGACCAT 
GTCGTCTGGTCCCTGTTCAACACCCTCTTCTTGAACTGGTGCTGTCTGGGCrTCATAGCATTCG 
ACTCCGTGAAGTCTANGGACAGGAAGATGGTTGGCGACGTGACCGGGCCCAAGCCTATGCCTNCA 
CCGCCAAGTGCCTGAACATCTGGGCCCTGATTCTGGGCATCCTCATGACCATTGGATTCATCCT^ 
ACTGGNATTCCG>rrrrGGGACAGGCTACCATATTATGTTACAGATAATACANGAAAAACGGGGT^ 
CTAGAACCCGCCATAGCTGGAACCTrrGCACTCACTGGGCATGCTGGCCCTGA(>fTTG<^ 
CCTGCCCCTTGGCCTGCCTAANCAGAAGTTATACCCCCCCCTGGCTAAAGGGCATTAANy^ 
CGGCTTGGAAAAAAAAAAAAAAAAANACTTNGCCG 

SEQ ID NO: 4008 ACTrrmrrnTiTnTiT^^ 

TCACATTTCCCAATACAAAGGAAAACTGCATCTTTmGTCCCACTTCTCCCCTCCAAAACT 

CmCATAGGACAGGGGAGCAAGTCTTCCTTATGCTGTTTANAAAACTCAGT^^ 

ATCTCCTGGTGAAGCAGAACAGGTAATATAAAACCGATACAATAAGGCCTCCCCTCTATCCTTATC 

TGTCTGGTCGAGTCATTCCGGGCCGAGTGGGCACCATCATGGGACGGGCAGGAGGTCTNATCATT 

GGGGGCCCAGGCATCATTGGCATATGGCCTCCCATGGGCGGCCTCATTCCAGGAGCAGGTCCCAC 

TGACATCATCCCAGGAGGAGGAGGGCCCATCATTGGCATCATGGGAGGGCCCCCCATATGGGGTG 

CTGGCATCATACCAAGGCGAGGAGGACCNCGAAAGCTGGGGGGAGGTGGTATTATCGCCCNTTCA 

GGAGGAGGAGCAAAAAATGGGAGTAGGAGGNATTTTTCNTTGTTNAATGCCACC^ 

NAATCAAGGCTCTGAACCTGCTTTTCATTCATITITGATAAANAGN 

CTTCNCTGNAAGGGGNTTTrmACCAAATOGANANNTATGGGGGAGGATGT^ 

TNAAANTTGGGCTTCTGTTTNAAGGCGTTGGNNNTN 

SEQ ID NO: 4009 ACCATCCTTTAATAGATCTCATACACCAGAATrCAGATCATGAATGACTGACA 
GAATATTTTGTTGGGCAGTCCTGATTTAAAACTAAGACTGGCTTGTGGTTAAATGAATAT 
rrCTTGAATTTTAATAGTAACTCCAATrCAGTAAATGGTATCACTGTTTACCCC^^ 
ATTAGACrrCGTTAGTAATGTTCAACTTTTCACAAAGATGGTGAGT^ 
AGATTGGTTTTATATTTAGATTTATATAACTGGTTATGTGAATATATTTAAATACT^ 
TTCACTGCTTAGAACCAAGCAAGATTCACCCGTGTTTGNGTTCATGTCATTTGCCTOT 
ANGGTTGAAGATAAATAAAGNAGCAATGCTATAGTTTTGGCCTTACTATGCCAAOTAAT^^ 
CTG>rmAAAAGGGTCTTTACTTATTGAANGCATlTAANGNGGGTATGTG^ 
AACCCTTNCCATCTTACAAACTATAAGGCNCTTCTTTOA^ 
AATCTTTACAANTAAGGCAACTAAGrrrAATTNGGATGGTTTACCGCrrCC^ 
GGITTTGNTNGGAANGGN 

SEQ ID NO: 401 0 acactgccx:aggcaaagcgtccgggcagcgtaggcgggcgactcagatccc 

AGCCAGTGGACTTAGCCCCTGTTTGCTCCTCCGATAACTGGGGTGACCTTGGTTAATA 
CAGCCTCCCCCGrrGCCCCTCTGGATCCACTGCTTAAATACGOACGAGGACAGGGCCCTGTCTCCT 
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cagotcaggcaccaccactgacctgggacagtgaatcgacaatgccgtcntctgtctcgtggggc 

atcctcctgctggcaggcctgtgctgcctggccctgtctcctggctgaggatcccaaggagat^ 

gcccaaaaagacagatacatcncaccatgatcaggatcacccaaccttcaacaaagaatcacccc 

aacttgctgaattcgcrrraagcctattaccggcagctgggacaaccaagtccaacagcac^ 

nttttttitcccaangaacaattgntrcaagcttrrg 

nacacttacgangaaaatcctngaagggctogatttcnaaccttanggaaaato 

aaatcantaagggttcaagaanttcttntnaccttnaacaagcaaa^ 

cgggaanggctnttcttaagangggctganctnagggnaaarntggggaag™ 

seq id no: 40 1 1 ggtacgcggggctccagaagtttcccccttgggcggtggtggaggtggtaac 
cgtgatagtagcagctccggcggcagcaacatcgactacgagggatggcggcggctgcagcagg 
aactgcaacatcccagaggtttttccagagcttctcggatgccct 
cggcgttagaggagctgactaaggctttggaacagaaaccagAtgatgcacagtattattgt^ 
agagcttattgcacattcttcttgggaattactgtgttgctgttgctgatgcaaagaag 
aactcaatccaaataattccactgctatgctgagaaaaggaatattgtgaatcccatgaaaaaaa 
ctatgctgctggcctagaaaccmacanaaagacaaaaaattgataagacggggm 

GGCCNAGCTNGCNTTNAACTNTTGACTCNAGGGAACCACTTGC^ 

anaagtgccaaatgcitatitnnatgglsrrgggntnaaaagnggna^ 
aaattgangggggggactncttngtcnaaattaagttnactgggnttaaot 

KTCCnrmrCAAAAATTCCAAAAATOn: 

seq id no: 40 1 2 cgaggtactttttttttttttttt^^ 

gaggaggaggaagagtganaanattggtgggaacaggaggagggaagaagaggggaagaggaa 

agagaaaaaaatattrratgtaagggccaaataggtgtcagccgaactagacctaatctgcctgg 

aactaaatccagatctaggctctacttttatttattccattctt 

tttgtaaaaaattgagctatagtttacataccatcaaattcccccttt^ 

ggnttttaagtatattcacaaagm'gggctagcatcaccactgncttattrrg 

ccccaccatgaaanaccatgtccctggcaaccnttcccatnacctttccttgggaggtctccnm 

tactttggtaaaactgatgaaattgcctttcttgganaot 

CCNAAAAACATNAACCGGNTTTACTNGGGCAAACNTTANAAGNGTNGGGNNT^ 

TGTTGCTGGAACCCCAGGANGGAATCTA^^^^CTGGATTCCAGGAATCCTCNTNCCGGGG 

TTTGAAAANNANTAACCTAAGGCTTGTNGAANNTANAAGGGANT 

SEQ ID NO: 401 3 GGTACAAAAGCTTTTCTGATGCrmCATTATCACAGAACACACCAOT 

TGTGTGCAAAAAGGAAATGAGGGGTGGAAGGAGAGGAAGTCTAATTGGGAAAGGCTGGATGGTC 
ACITCATTTCTTCTCTTTCTTCTTATCCT^ 

CAGTAATGAAATCTAAGAAGAGATCAATGCAGTGCCAGATATAATCTTGATCTCCATGTTCGGCCT 

TTTCAATAATGAGTTGAGTATCAAAAAGGACGAAGCCCACATGACCACCAGTCCCACATACAGGT 

TTGCCTGGAAAAGCCAAATGGArcCAAAGAAAACArrCCCCAGGGAAGACAAAAGCAACAAGCr 

CAGGGCTGACATCAAGATACaTCCAGAAAGAGGTACTACGGCCCTGGCATAAAGTGCACTTANG 

GTGGAGCAGGTAAAGATCATTGCXGTGCCCATGAAANCAATGGGAANGATGCTGGGGTTGACACA 

ATACAAACrrCANGGCANGGCCCAGCCCACTTCTGTAAGGATGCAAATCNACCAGAAGCCCAGCT 

TTTTTGGTCAATTCTTGmrrGAGGGGTGCCCTCACCCAATC^ 

NAGGCCCCCTGATGAAAGAGNGCNTNTGGCCTAGGCCCTCCNCCCCCNA 

SEQ ID NO: 4014 ACGTAAAGTTAACCTTCCAATTGTCTGAGCTGTCGTCACTGACTTCATGACAG 
TCTGGCCCTCCAGACAAGAGCAGCGCTGGCATCGGGCAGGTGATTCCTGACACCTGCTGCCTGCA 
GGCATTCACTGACCAGGCCTrTCCTGGAGGAAACACCCAGGGCCGGGCGGCTGCTGm 
GTGGACTCGGATCTGCTGTGACACCGTCAGCCCGACAGTCTCTCCATATGCAGCCTTTCCTCTGTA 
CGTGCACCTCCTGTITCAAAAACGCAAGCAGAAAATAGGTCTGGACAAGGGCAAGCT^ 
ATirCTGCTGCAACCCTAGCTCAGGTAAGTTATGTTTA^rrCCTTCACTTTGCT^ 
GTCTGTrGATGCCTTCTTCCTGCTGTTCTAGGGCATCCAGGACAATGCCCATCGGTGTCCTOT 
ACCTATTCCTTCCATTrTATrCTCAATTrGTTCTTGAGGTTCCGAAGTOT 
ACATCAACAGCAAAGGAAAAAGTAATGGCCAACACAAAGAACATTGACTrCTTCN^ 
AAAAGGGAANNCTCCNCCACCTGACCNCCTA 

SEQ ID NO: 4015 ACGCGGGGAAACGACAGGGGAAAGGAGGTCTCACTGAGCACCGTCCCAGCA 
TCCGGACACCACAGCGGCCCTTCGCTCCACGCAGAAAACCACACTTCTCAAACCTTCACTCAAC^^ 
rrCCTrCCCCAAAGCCAGAAGATGCACAAGGAGGAACATGGGGTGGCTGTGCTGGGGGCACCCCC 
CAGCACCATCCTTCCAAGGTCCACCGNGATCAACATNCACAGCGAGACCTCCGTGCCCGACCATG 
TCGNOTGGTCCCTGTTCAACACCCTCTTCTTGAACTGGTGCTGTCTGGGC^^ 
NCGNGAAGTOTANGGACAGGAANATGGTTGGCGACOTGANCGGGGCCCANGCCTATGCC^ 
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CGNCNAGTGCCTTGAACATTTTGGGCCTGATTNTGGGCATTCTATGi^^ 
GGNATT^mGG^IT^^•GGACA^OTACCATATTATGTACAAANANTC^^ 
NCGNCCATANNNT^mAACTTTGNTmCACTTOGCAAN 
TNCNCCTNGGNNTNNCCCTTNATC 

SEQ ID NO- 40 1 6 GGTACCATTCCGACGTTACAATGGTGGAQITGGCAGGTGTGCGCAGGCCAAG 
CAATGGGGCTGGACACAAGGTCGGTGGCCCAAAAAGAGTGCTGAATTrrTGCTGC^ 

aaacgcagagagtaatgctgaacttaagggtttagatgtagattctctggtcattgagcatatc 

agtgaacaaagcacctaagatgcgccgccggacctacagagctcatggtcggattaacccataca 

tgagctctccctgccacattgagatgatccrracggaaaaggaacagattgttcctaaaccaga^ 

GAGGAGGTTGCCCAGAAGAAAAAGATATCCCAAAAAAAACTGAAGAAACAAAAACT^^ 
GGAGTAAATTCAGCATTAAAATAAATGTAATTTNTTGGGAAAAAA 

SEO ID NO: 4017 ACGCAGCCCCTCACACACTGCTTCATTITGTGArrTCTGCAriTCCAGCTGC^ 
GTAGCAACTTTTGTAGTGTTGCAATATTCTCCAAAGACAAACAAAAGGCATCTTCGTCTTCACACA 
CCACATCTCTTTCAAAGCTTGTGTCTGGGGTGTGGTCTAAITCTTCCAT^^ 
CTTTATACTGACAAACTCCTCACGCCTAGAAGCCrrTGTTTCCCTCAAAGTTGCACAT^ 
ACTGGGTCAACTCTTCTAANCTGGGCACTGANGCACTGCAATATCATAATGG GGCAT GCAAAAAA 
TTTCGCACAGGTCrrGATCTTGCTCTTGAAGAACTTCAAGNCCTGGTTCT 
ATCAATTCCACrrGGGNGCCAAATCTTTTTTAATTGCAANAAGG^ 
TTAACATGTAACTCGCTGCCANANNGGTCAACTNTTrrTGACANA^ 
TTTCTTTAGGTTTCTTTTTAACAATATTAAATC^^ 

TGGAACCGTGGNCTTNGGAAACCAATANCTCCATTTTNCNAAGGGATTANGNTm 
ATGGATCTCCCCCNCC 

SEO ID NO: 4018 GGTACTTTTTTTTTTTT^^ 

TAAATCACAGAAACTTTAGTGCAAAACAAAAATCACGAAGTCCATTTAATAGC^ 

GCTGGCmGCTTGCTQTCTCCTGGCAACCANAAGTGGACAGAAGCGTGGGTGCCCAAGTGGGCC 

ACANACAGCTTCCAACCCCCACACCCCAGCAT(XAATCCACACCCAGCANACCCTTCGGCATGCC 

GCCCXm'ACCAGGAAGCCAGAGGCCTAGGAGCTCGCCATCCATATTTATTTGAAA^ 

GAGCATNTATGANACAAGGGAGGGGTGCAGGCTGAAGCAGCGCCTNAACAGCCAGGGACATGTA 

GGCAACACGAGCAGGCACAGCGCGGCCACCACTGTCCACACGCTCACACAAGCCAGGCCCGCAG 

GGCCTTCGGAGAGCTAGCAGGTTACATTCAGGCAGATGGNCCThm'CCACCCAAACCCACAGAAC 

CCCAAACAANGGGTNACCAGGAAAGACACNGGAAACCCAATTACAATTTGAACNNGGGNAGANA 

AACCTTGGGCCCCTTGNTGTrCCAAGCACCAAAAGTTGTTTCA 

SEO ID NO: 4019 acaatatcatatatctggtatacaaaaaaaatcgctcattattttagccctac 

CCTCCATTTGTCTCTCTTCTITCTGGGACTCCTATTGATTGCATGACOT 
TCTTACCTTlTmCATAACTTCTCTTAATTTI^ 

AGGCTGGAGTGCAATGGCGTGATCTCAGCTCACTGCAACCTCTGCCTCCTGGGTTCAGGTGAT^ 

CGTGTCTCAGCCTCCTGAGTGGCTGANACTACAGGTGTGCACCAGTGTTCCCAGCTGATTTTTGTA 

TTTTATGTANAGATGGGGTTATGCCATTTTGGCCGGGCTAATCTCGAACTCC^^ 

ACACACACCTCAGCAAANCTTTTAAAATTATACANTCNGGNGATATTTCOT 

ANCCNCTTGNNTNNGANTATTITNCATTGNGGANAATGGNGGGGT^^ 

ATGGAAAAATTCCAACCTTNC 

SEO ID NO' 4020 ACl-l" i T n"rrn i"i'l''i'i' i ^ r i i ri l fTTTTNGGCCATATCATTCATGACCAAAAAA 
AACCAAAAAGCAAAAAACAAAAACAAGTCCCATGTCCTACTGACTCTAGTTCTTCCAGTC^ 
CTCTGCTGTCTCTCTGGCATCCTATGAATCAAATCAAGCCCATGCTGTTGTTGGTT^ 
TTTCAATAATAGGCTAAGGAAANACATGTTTTTCTCTTTTAAATCTCTGC 
CCTTCGAACGTATTTAATTTGGCCrmCAGCTTTCTTCTTGCC 

TGTGTGCACATGTTGGTTTCACTCACCCAAAGTAATTTTGNGAGCATGCNTGGGCTTm 
GCTTGAATGTTTGNCCTGCTNGNGTTGCAGCITCTAAAGANATTGC 

SEQ ED NO- 402 1 ACGGAGGAGTTCCTGGAAGCCTTCATGGATCTGAGCCTCCGGAATCTCCGTG 
AGGTTGAAATTCAGGCCCTCCAGGATTTCATCGTGAGTGTCAGCCTTGGTCCCCAGGGAGAGCArr 
GCAAAGGCTGTAGCGATGCTCACTGGGGAGAAGAAGATATTGGTGCTGTTGGACTGGTGTGCCAG 
CTGGCGGTATAGGCTGAAGGCGAACTCAGCCAGGTTGGGGGTGATCTTGTTGAAGGTTGGGTGAT 
CCTGATCATGGTGGGATGTATCTTGTCTTCITGGGCANNATCTNCKrGGGG^ 
ANACATGNACCAANTAGNCTNNGCCTGCCTACTAAiNAGGATGCCCNCCAGANNGATATCGGNm 

GTNGTTNTCTTTCTCA 
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SEQ ID NO: 4022 ACTGATTCATCACTGTCAGATTCTGTTCCAGACCCTGTCTCAGCCTGGGGCTG 
CGGCAACTCCTGCTCTGTAGCAGGGACGGmCTGTGGCTTCGCCGGGCATTTTGTGCAG 
CGGAACCAAGATGGCGGCCCCCGCGTACTCTGATTTAAAACTTTGGACATCCTGTGATCTGT^ 
AAGTTGGGGGGTGGGAAATTNAGCTGACTAGGGACAAACATGNAACCTATNTTCCTAT^ 
GTNTTAAATGTCX:CNCTTGAATANCANTAATTCTrCATAGNTTT^^ 
ANCTAATTATTTGTAATGAATTATlOT'ATAACANTrCITAGC^ 

SEQ ID NO: 4023 ACAGACATGGCGGCGGCTTTTCGGAAGGCGGCTAANTCCCGGCAGCGGGAA 
CACAGAGAGCGAAGCCAGCCTGGCnrrCGAAAACATCTCGGCCTGCTGGAGAA^ 
ACAAACTTCGTGCAGATGACTACCGTAAAAAACAAGANTACCTCAAAGCTCTTCGGAAGAAG^ 
CTTGAAAAAAATCCAGATGAArrCTACTACAAAATGACTCGGGTTAAACTCCAGGATGGAGT 

SEQ ED NO: 4024 ACGCGGGGGAGTCACTGCTGCTCTTTGAGGCAATGCGCAAGGGCAAGTTTTC 
AGAGGGCGAGGCCACACTACGGATGAAGCTGGTGATGGAGGATGGCAAGATGGACCCTGTAGCC 
TATCGAGTCAAGTATACACCACACCACCGCACAGGGGACAAATGGTGCATCTATCCCACCTACGA 
CTACACACACrGCCTCTGTGACTCCATCGAGCACATCACTCACTCACTCTGCACCAAGGAATTCCA 
GGCCCGACGCTCTTCCTACTTCTGGOTTGCAATGCACTGGACGTCTATTGCCCT^ 
GTATGGCCGCCTCAACCTGCACTATGCrGTTGTCTCTAAGANGAAGATCCTCCANCTTGTA^ 
TGGTGCTGTGCGGGACTGGGGATGACCCACGGCTITTTAACTCACGGCCCTGCGACGGCGGGGCTT 
CCACCTGATGGCCATCAACACTTOTGTGCCOjGGTGGGAAGTGANTGTGGCACAAACCAAC^ 
GANCCCCATCTTCTNAAAAGCCTGTGTGCGTTANTGTGCTAAATACACANCCCCCCNACCAT^ 
TGGCTGGAAGTAATTACGGGTAATTNATACCAANTTTCTGTTGCCAANTCCTTGGGAAT^^ 

SEQ ID NO: 4025 ACAGGCAGAGCAACCATCCTCTTCGTCACCCAGACGCAAGACAGTAAAGGA 
AAAGGATGACACTGGCCCTAAGAAGCACAGCAGCAAGAACTCAGAGAGAGCTCAGAAGTCAGAG 
CCCAGGGAGQGGCANAAGCTCCCCAAATCCAGGACGGCCTACTCTGGTGGAGCATAGGACCTAG 
AGAGGGAGCTGAAGAAGGAG.\AACCCAATCACGAGCACAAGTCCTCAAGCAGGAGGGAGGCAA 
GAGAAGAAAAGACCAAGGATTAGGGACAGAGGGCGGAGCTCAGATGCACATTCTANCTGGTATA 
ATGGGCGTTTTAAGGGCGTATTTTANAAGTAGAAGTAGGAGCCGAATAAATCCCATANGCATAAA 
AGGGCCCGAO^CTCCCGGGAGCGGGAGTCTTCGAATCNCAGTGACCGTTGGCCGTCACT 

seq id no: 4026 acaagtatttatatcaatgaaaatttccattggtgattittrggcagaatat^ 
ggtcttgactctgtggaataaatgacgacgtaaacgtagctgcacaggggtgttcctgtataatgc 
ttgaatcaattgtgtgtgaaagcatcatgcaaatggctaattaaattgggtgatgactgy^ 

ATAAATCCTTCATTCCAGCTCCACGAGCAGATCCCCTTCTCCAACTGTGTCTCCAGCT^ 

acagatttcacccgtgccagttttcccagctgtcaatactattctngcat^^ 
cacaaattncttgaccttctgctaccc 

seq id no: 4027 acctaaataatgtccitcarmgtcitagcitgcaaagcct^ 
acctgcccttttcaaaaaaaaaaaaaaaaaaaaaanaaaggcrrgtccacctct^ 
aactcaagggtctatgtctgtcctgtttgctatttcatctgcattgccaaatgcagto^ 
ggtaggcaacctgaggtggaatcaaattatgaatgcatatgttatctcatgtctataaaataaatg 
agntaaaatttataaataaagggataggaaaaatgctgattattctgngtgncngngct^^ 
ataaggccanttcnnacatgc 

seq id no: 4028 gggtactctggtgagtcaccacttcagggctttactccgtaacagattttgt^ 
ggcatagctctggggtoggcagtitmgaaaatgggctcaaccagaaaagcccaagttc^^ 
gctgtggcagagttacagttctgtggtttcatgttagttaccttatagttactgtgtaatta 
acttaatgtatgttaccaaaaataaatatatctaccccagactagatgtangtattt^^ 
ttggatttcctaatactgtcatcctcaaaoaaagtgtatnggttttttaa/^^ 
ggaaataaaagtcatgatggaaaaatrcatttntttaaattccccggttt^ 
anaagatgggccatatnnccccctttttggccccatggatttaa 

SEQ ID NO: 4029 AC i " rrriUU " lUTO U''ri'in" i lU"lU'lTmGCAAGCCATATCTGAAAAGCCATCT 
CCCTTCCCCTGCCTGCATCCCCCANACCATGAGCCGGGCACCGGCCAATGCATTNAGGGNGGGGA 
CNCATTTCCTGCCGTGGCTGTCCTTTGANAGACCCACATGTTATTTCATGGAANACGGAGTTG^ 
ACrraAGGAAACTCCTTNTGGANAACATGGGANGCNCCTTNATAAN^ 

SEQ ID NO: 4030 ACGCGGGGGCGTCTTGTTCTTGCCTGGTGTCGGTGGTTAGTTTCTGCGACTTG 

tgttgggactgctgataggaagatgtcttcaggaaatgctaaaattgggcaccctgcccccaactt 
caaagccacagctgttatgccagatggtcagtttaaagatatcagcctgtctgactacaaaggaa 
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AATATGTTGTGTTCTTCTTTTACCCTcm'GACTTCACCTTTGTGTC 

AGTGATAGGGCAGAAGAATTTAAGAAACTCAACrGCCAAAGTGATTGGTGCTTCTGTGGA^ 
ACTTCTTGCATCTAGCATGGGTCAATACACCTAAGAAACAAGGAGGACTGGGACCCATGAACATT 
CCTTrGGTATCAAACCCGAAGCGCACCATTGCTCATGATTATTGGGGTCTTAAAAGGCT^ 
GCATCTCGTTCAGGGGCCTTnTTATCATTGNATGATAAA 

SEQ ID NO; 403 1 ACCCTCCAGAAATTGGTGACTTTGCirTTGTGACTGACAACAC^^ 
CACCAAATCAGACAGATGGAAATGAAGATTCTAAGAGCTTTAAA(nTrGGTCTGGGTCG^ 
ACCTTTGCACTTCCnTaSGAGAGCATCTAAGATTGGAOAGGTTGATGTCGAGCAACATACTT^ 
CAAATACCTGATGGAACTAACTATGTTGGACTATGACATGGTGCACTTTCCTCCTTCTC^^ 
AANCANGAGCTTTTTGCTTAGCACTGAAAATTCTGGATAATGGNGAATGGAC^^ 
CATTACCTGTCATATACTGAAGAATCTTrm'Cm:CAG^^ 

CATGGTAAATNAAGGACrrACAAAGGCACATGANTGGCAANACAAGTATGNCCACATCCGAAGC 

ATGCrAANATCANCTCIKITACCACTAGCTGAATTNrrGGCACT^^ 

GGGCAAATGNGTAACTTGTAANCTTGAGTTGGA 

SEQ ID NO: 4032 ACACAATGGTTTATTAAAGGAATGTATGGCCCACATCAACCTAGCAAGGATT 
CTACTGGTAAACCTTCCCATGGCCAAAGGAAAAACAAGCAGGAGTTGAGTGGCTGGGGTGGGGTG 
CAGGCAATGGAGAGAGGGCAGAAGGGTGTANAAGCTGAAGGGGGCTAG ANGC TTACTCCTGAGT 
TTCTTCCTTCTGNCTTNAAATCTTTACTTCTTATGGCCAAAAACCCA 
ATGCACTmTCTATACTGCTNNAGACAGCCAGAAACAGGGGANGAGGGAAGATTG 

SEQ ID NO: 4033 ACTGCAGCTAAACCAGCGGCITCAATAACAAGTAAGCCTGCTACACTTACA^ 
CVVACTAGTGCAACCAGTAAGTTGATCCATCCAGATGAGGATATATCCCTGGAAGAGAGAAGGGCA 
CAGTTACCTAAGTATCAACGTAATCTTCCTCGGCCAGGACAGGCCCCCATCGGTAATCCACCAGTT 
GGACCAATTGGAGGTATGATGCCACCACAGCCAGGCATCCCACAGCAACAAGGAATGAGACCCC 
CAATGCCACCTCATGGTCAGTATGGTGGTCATCATCAAGGCATGCCAGGATACCTTCCTGGTGCTA 
TOCCCCCGTTGGGCAGGGACCGCAATGGTGCCCCCTTACCAGGGTGGGCCTNCTCGACCTCCGAT 
GGGAATGAGACCTCCTGTNATGTCGCAAGGTGGGCCGOTACTGGATCTTACITCATCCAGTCT^ 
AGGmTTGGAGATTAAACCNTTTCTrAAOTGTGCTGTTATATAGCCA^ 

SEQ ID NO: 4034 ACCTGGGGGGAGTTGTAACACTCCANAAGGTCCAACTCCTCTCTTGGCATGG 
CCAAGGCTGAGATCCACCGAAGCrmCACTTTGAGTCTCCGTGCGGAAGAGGAACTCGG^ 
GCGCCCTGAGTGTTCTGCCGCANAAAGAGTCGGAACAGGTTrrTGTAAGGTCCATGTANCT^^ 
TCACACTTTCCCCCCAAATGGANGANAAGGGAGCm'GGTCAAATTACNANGAATCGNNTACCCT^ 
TCNGGGCCGGATACAATTACNATACAAGTCATTNAAANAGGT 

SEQ ID NO: 4035 ACGCGGGGGAGCCAGCGCGGAGCACCTGCGCCCGCGGCTGACACCTTCGCTC 
GCAGTTTGTTCGCAGTrTACTCGCACACCAmrrCCCCCACCGCGCTTTGG^^ 
CTCAAGGCAAAGGTGGGATATCATGGCATCTATCTGGGTTGGACACCGAGGAACAGTAAGAGATT 
ATCCAGACITTAGCCCATCAGTGGATGCTGAAGCTATTCAGAAAGCAATCATGAGGAATTGGAAC 
TGATNAGAAAATGCTCATCAGCATTCTGACTGAGAGGTCAAATGCACANTCGGCTGNTGATTGTT 
NAGGAATATCAAGCAGCATTATGGAAAGGAGCTTGNAAAGATGACTTGAAGGGGTGATCTCTCTG 
GCCACirTGAGCATCTCATGGGTGGCCCTANTGACTCCNCATCAGNCTTTGATGCAAAGC^ 
AAGAAATTCATTGAAGGGCCNCNGGGAACAAACTGNAAGATGCCTTGATTGAAATOT 
TGGACAAGNCAAGCTNATCAAANGATNTTTTITCANCCTTAT^^ 

SEQ ID NO: 4036 CCGGGGGTAGACGGAACTTCGCCTTTCTCTCGGCCTTAGCGCCATTTT^ 
AAACCTCTGCGCCATGAGAGCCAAGGTGGGCGGrrCCTGGTAGTAAGCTTGGGAGGTAGGAGTTG 
GCGAGTAGTAGCGAGGAGACGAAGTGGAGGAAGAAGCGAATGCGCAGGCTGAAGCGCAAAAGA 
AGAAAGATGAGGCAGAGGTCCAAGTAAACCGCTAGCTTGTTGCACCCGTGGAGGCCACAGGAGC 
AGAAACATGGAATGCCAAACGCTGGGGATGCTGGTACGCGGGGCCTTTTCGCTGNTGCGGCCGCA 
TCCATGATTATGCTCATGGCTTCAGAAGAGGCTTCGCCTCTAGGTGTCCTCCGCTGTGGCAAGAAT 
AAGGTCTGGTTNGACCCCATGAACCAATGAAATTNNCAATGCCACTCCCCGTNAACAA^ 
AANCITATCAAANATGGGCTGNTCTTCCTCAAGCCTT 

SEQ ID NO: 4037 ACTACTGCTGTTTTCTGAAGACGCGAGGGCAAGTGCAGCCAGCCGTTTCT^ 
CCTTCmAAGCGTTTCTTCTCCTGTTTCTCCAGCTTCCTAGTAATCT^ 
TGAACCATTGCTTCCTTCATGACATCCAGATTCTITCGTGGTATCT^ 
GTCGCTCTTCAACTTGTTCrCGAAGCrrCrCCCCGAATACACTCGTGG 
CGATTCGTGAGGCAATACTGCATTTGTTTGCCAGGTATCGGGAGATGCGGCCTTTTG^ 
GCTGCTCGGCCAATGAAGGTGGAAGTGGAAAATGAGTCCATATTTTGGGGTGGTTACCCCT^ 
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TNTNAGGGGCTCTGAACAGGGCCmrCANCCCCAAGGATCTGCACTGGGGA 
CANGTNTGTGNAGNTNGCAACATTGTGCAATGAGANATG 

SEQ ID NO: 4038 ACCTAGAAGAGAGGCGGGTCAAAGAAGTAGTGAAGAAGCATTCTCAGTTCAT 
AGGCTATCCCATCACCCTTTATTTGGAGAAGGAACGAGAGAAGGAAATTAGTGATGATGAGGCAG 
AGGAAGAGAAAGGTGAOAAAOAAGAGGAAGATAAAGATGATGAAGAAAAACCCAAGATCGAAG 
ATGTGGATTCAGATGAGGAGGATGACAGCGGTAAGGATAAGAAGAAGAAAACTAAGAAGATCAA 
AGAGAAATACATTGATCAGGAAGAACTAAACAAGACCAAGCCTATITGGACCAGAAACCCTGATG 
ACATCACCCAAGAGGAGTATGGAGAATTCTACAAGAGCCTCCTAATGACTGGGGAAGACCAOT 
GCAGTCAAGCACTTTTCTGTAGAAGGTCAmTGGG>rmCANGGGCA^^ 
GGGCTCCrrrOACCTTTTTGAGAACAAGGAANAATAAGAAC 

SEQ ED NO: 4039 acacatgagagaaaaatctttccaatgtaatgagagtggcaaagcctttaat 

TATAGCTCAGTCTTAAGGAAACATCAGATAATCCATTTAGGAGCGAAACAATATAAATGTGATGT 
GTGTGGCAAGGTCTTTAATCAAAAGCGATATCTTGCATGTCATCGTAGATGTCACACTGGCAAG^ 

accttacaagtgtaatgattgtggcaagaccttcagtcaggagttaacccttacatgccatcatag 
acttcatactggagaoaaacattacaagtgcagtgagtgtggcaagaccttcagtcgaaattcag 
cccttgtaattcataaggcaattcatactggagagaaatcttacaagtngtaatgaatgtggcaa 

GACCTTCAGTCAAACGGTCNTACCTTGGGT 

SEQ ID NO: 4040 ACGCGGGGTCCGCCGCTAGTCTCCAGCTCCAAAATGGCGGCTGCCACTGTGG 
GGCTTCTGCCGGCCGGTAGTCCCTGGCGCTGCTGACCCAGCATCGGCTrrTCTACGTC^ 

ggattcgcctaggggttgggaagggcttgtggacggcgttgggggaggcctgacgagattaataa 
agaactcttcagaattcctggtgtttcatcatatatacgactaagatatcaactcttctaqctt 

GTTTCTGGACCAAAAAAAATGACGTCTATTATCAAATTAACTACCCTTT^ 

TCTGCCCTTTGCTATCTTCTCCAAGTNGATGANTITAGATTTTTTAT^^ 

CACTTTNCTATGGATATTATTGATTCCNTGA 

SEQ ID NO: 4041 ACTTGCAGCCCTCGGCCAAACGGCCAGACGCCGACGTCGACCAGCAGAGACT 

agtaagaagtttgatagctgtaggactgggtgrrgcagctcttgcatttgcagg 

cggatctggaaacctctagaacaagttatcacagaaactgcaaagaagatttcaactcctagc^ 

ttcatcctactataaaggaggatttgaacagaaaatgagtaggcgagaagctggtcttat^ 

gtgtaagcccatctgctggcaaggctaagattanaacagctcataggagagtcatgattm 

tcacccaaataaaggtggatctccttacgtaaccaccaaaataaatganncaaaagacttgc^ 

TAAACAACCACCAAACATTGATGCTTAAGGACCACACTGGAAGGAAAAAAAAAGAGGGGAC™ 

GAAAAATAAAANAAAAAAAAAAAGTCCrrTTTOTCTCAT^ 

GGGAAATTGrronrNNGTGGCAGGGGGTTTGAhfNAAGTCC^^ 

SEQ ID NO: 4042 ACATGACAAGGTGCGGCTCCCTAGGCCCCTCCCCTCTTCAAGGGGTCTACAT 
GGCAACTGTGAGGAGGGGAGATTCAGTGTGGTGGGGGACTGAGTGTGGCAGGGACTCCCCAGCA 
GTGAGGGTCTCTCTCnTCCTCTTGTGCTCTTGCTGGGGCTGGTGGTCCAGGGGTOT 
GGCCATGTGGGCCATGAGGTCCACCACCXn'GTTGCTGTAGCCAAATTCGTTGTCATACCAGGAAAT 
GAGCTTGACAAAGTGGTCGTTGAGGGCAATGCCAGCCCCAGCGTCAAAGGTGGAGGAGTGGGTGT 
CGCTGTTGAAGTCAGAGGANACCACCTGGTGCTCAGTGTACCCAAGATGCCCTTGANGGGGCCCT 
CNACGCCTGCTTACCAACCrrCTTGATGTCATCATATTTGGCAAGGTT^ 
NGTCCACCCACTGACACATTTGGCATTGGGOACACGGAANGCCATGCCANTGAACTTNCCCGm 
AACTNAGGGATNACCTTGCCCACACCTTGGCANC 

SEQ ID NO: 4043 ACNCGGGAGAAGAAGAGTAAGAAGGACAAGAAGGCCAAAGCTGGTCTGGAG 
AGCGGGGCCGAGCCTGGAGATGGGGACAOTGATACCACCAAGAAGAAGAAGAAGAAGAAAGCA 
AAAGAGGTAGAArrGGTTTCTGAGTAGTGAAGGCCACTTGAAGCTGGAGGAGAAACTAAAGCCTT 
ATTGAGAAAACATGTTATAGATCCTTTTGrrGCTGAGAGAGTGGAACATAGGTCCTAGAC 
GAANAGTTCTGGCACATTTTAGCTGCTCm'TGAGACCTCGGTGATGTTACCT^ 
ATCTTGTCCTGTTTTAAGGATTTGGGCNGGGNATANATGAAAGAGGC 

SEQ ID NO: 4044 GTTTTTNTT^mT^Tr^GGCCCAAATTT^ 

AGGATCCTATCAAAAGANAACAAAAAGAGAAACNAGCCANTTGTTGCAGAGGAAAAGCATGCAC 

ANATCTGCnTNAAGTTTACTCTGAACCACGGACNCTCCGGGAATTCTNCACACCAGGAGCT^ 

ACTCCAGGAAGTGGGCACAACCCACCATCCACACAATGGAAACAAGAATGGCCAGGAAATGTCA 

CCGAAGTGAGACTTCCATATGCAGGATCAGCGTTCACAGTAGCACAATTCTAACCAAGTTCAAGG 

GAAAACAAACTTNrrGTCTCACACAAACATGGGAATCAAAATTGTTGACCCTAW^ 

TAACATAATTTGCTTCAAAAAGACAAACGAAATTTANTCCGGACr^ 



628 



p 



wo 02/29086 PCT/USO 1/30732 



acccgatrccaggggcagcttagtgtgcrtmcattacaaaaaat^^ 

acaaattacctggaggaanaagagtgttggaccacncattgtcttggttgnaacaagggi^ 

gggcnttggtggcccattaaatggaggcttcagcattggaaggacctgccnggcgggcnt™ 

SEQ ID NO: 4045 accaaacgggcaaggacatctctacaaattactatgcgagtcagaagaaaac 
atttgaaattaatcccagacacccgctgatcagagacatgcttcgacgaattaaggaagatgaag 
atgataaaacagttttggatcttgctgtggtmgtttgaaacagcaacgot 
ttttatcagacactaaagcatatggagatagaatagaaagaatgcitcgcctcagm 
accctgatgcaaaggtggaagaagagcctgaaoaagaacctgaagagacagcagaagacacaac 
agaagacacagagcaagacgaagatgaagaaatggatgtgggaacagatgaagaagaagaaac 
agcaaaggaatcaacagctgaaaaagatgaattgtaaattatactctcacx;atttggatc 
tggagagggaatgtgaaatttacatcatttctttttgggagagacttgttttggatgcccc^^ 

CCCCITCTCCCCTGCCTGTAAAATGTGGGATTATGGGTCACAGGAAAAAGTGGGGTTTTTAGT^ 
ATTTTTTTTAACATTCCTCATGAATGTAAATTTGT 

SEQ ID NO: 4046 ACTTTTTTm"lTllTn'TTTTNGCATC 

GGCTTGTTAGGATAGTTAANAAAGCTGCCTATTGGCTGGAGGGAGAGGCTTAGGCAGAAGCCCTA 

TTACTTTGCAAGGGGCCCrrCANAAGTCGCTGGGCTCANAAGGCTCITAGTCGTGC^ 

GCCTTTCNAANAGATACTCGCCCAGCCCAGCCTCCGGGCCACCCAGCCTGTGGAGGrTGGTCA^ 

TGATCACCCATCTTCTTGATAAGCTTCACTTCCTCATCTAGGAAGTGAGTCTCCAGGAAGTCACAG 

AGATGGGGGTCCGTGCGGGCAGAACCCAGGGCATGAAGATCCAAAAGGGCCTGGrrCAGCTI^ 

CTCCAGGGCCATGGCAGCTTTCATGGCGTCTGGGGTTTTACCCCACTCATCrrCAGCT^ 

ATGTCCTGGAAGAGAGCGCGGCCGCCACGCTGGTITTGCATCrrCAGGAGACGCTCGTANCCCTC 

GCGCTrrCCTCGGCCAGTTCGCGGAANAAGTGGCTCACGCCTTCCANAGCCACATCATCGCGGTCG 

AAAATAAAACCCCAAAAAAGAGGTAGGTGTAGGAGGCCTGCAGGTACAACACANCCAAAGACTG 

TNAA 

SEQ ID NO: 4047 ACTTTTTTTTTTnrr^ 

GCnrrCTCCAAGGGCGACAAAGTGAACTGAAGGTCANAAGGAAGCrGGGTGCGGGCT^ 

GCTCTTGCTCCAAAACCTGGAAGTGAGGANAGGGCNCTCCGGAGCTCTGGGGAAGGTTGGTGCAC 

ACAGGGGTTCCGTTGGTGGGGGANAANAGCCGCCAGCCCACACACGGTCACTGGATTGGTGTGAG 

TGGGTTCCAAGCGACTGCCATGTGCTAGTCCACTGACATGATTGACATTAACATTCnTGGGGGGC^ 

TTAAATTAAGGAATGACACAGGGAGCCAAGAGAGTGGCTTATTCGGTTGGATTCTGAATCACAAT 

CAGGAAATAGTCTTTATCTGGTGCAACCATAATTTCATTTTTCTTGGAG^^ 

GAGATCGTTCTGGGGGTCGATGTCACGCACGGTGCTCCGTGCCrTCAGGATGAAGCTGTGCATGA 

GGCTGGCATACTGGGTGGTGGTGGGGTTGTNCATGGTGCTCTTGATGGNAATGCCTTCTGTGTT^ 

CNACNATNAATCCCTGCACTCCCnTCTGGCTCTGCANTCNCTTCANTGTT^ 

TTNCA 

SEQ ID NO: 4048 ACTCTGCCACAAACTGATCACACTGCTTCTGGTAAGGGTCTGGCAGGAAGCT 
GCAGCCTTTCTCAAGAGCAGCCAGGATCTCCTGCTTGGTGCTGTTTT^ 
ATAACCCACCAGCTTCTTGCACACnCGCAGAAGCCACCGTCCTTTGGCTGAGTCACG 
CAGTGCAGGCAGCCGCGTGCCAGAGCAGAGGTGCAGCATGCTGCACACCAGCTCAGGGCTGACCT 
CCTCCAGCAGGATGGACAGGATGGAGCTGCCGTACTTT^mT^Tm 
GGGCAANATATITArrGTGTTAACATGTGAAACATACAATTTGCTCAGTAAAAAT^^ 
AAAATATTATAAGCTTATATTCATAAAGAAATGGGTATGTTATTACCTCTTTrrOT 
GACTATTAATTTGACAAGGTTGGAATGTGCACAGCACAGCTGAGACACCACCATTTTAACACT^ 
ATCACTATACCATTGAACTGGCANAACCCTGCm'GAAGGATGAAAAACTCATACCCA^ 
AATCACACAGCAGCATGOAGGGGGAAAATGAACTATbTTGATGCTAACCGCATTTAAm 
GGGGGGGA 

SEQ ID NO: 4049 ACCGACCATAGAGCAAGAATCAAGATTCTGCTAACTCCTGCACAGCCCCGTC 
CTCTTCCTTTCTGCTAGCCTGGCTAAATCTGCTCATTATTTCA 
AGTGATAAGGGCXICTACTACACTGGCTTTTTTAGGCTTAGAGA 
TAGTGGCTTCTAGCTCTAAATGTTTGCCCCGCCATCCCTTTCCACAGTATCCn' 
CTGTCTCTGGCTGTCTCGAGCAGTCTAGAAGAGTGCATCTCCAGCCTATGAAACAGCTGGGTCm 
GGCCATAAGAAGTAAAGATTTGAAGACAGAAGGAAGAAACTCAGGAGTAAGCTTCTAGCCCCCTT 
CAGCTTCTACACCCTTCTGCCCTCTCTCCATTGCCTGCACCCCACCCCAGCCACTCAA 
GTTTTTCCTTTGGCCATGGGAAGGTTTACCAGTAGAATCCTTGCTAGGTTGATGTGGGC^^ 
TCCTTTAATAAACCATTGTGT 
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SEO ID NO* 4050 ACTGTTGTCCATTTCATGAGAGTAGGCTTGAGGAC ACCA TGGGCAAGGATCT 
GATGGTTGCCAGCCTAAGCGTTTTAGACnTITGACCCAGAGATTT^ 

ATTTrAGAGGATAGGGTCTCAAGATATAATCCTITITATAGGCGGCAGGTCTTAAGAGATGAGGGT 

CCAGAGGAACGGATGAAGCCTGCTTGGAAGCATGCTGGGATGGCCATTTGGAAGGAGTGTTGCAA 

GGAAGCATGGCCTTGGCTGGGCGCTGCCAGGAGCITAAGGGTTGTAGGTTGTTTGTCTGATCGGCT 

CTCGGCCrAATCTTGTGGCTTCCAGGAGGAAGAGGAGAGATCAAGCTGGCTGTCTGATGGGCATG 

GCmATTCTGGAATGGTGAACCCAATGGANAGGGTCCTGCAGATGGACGGCAGTTGGTGTGCTA 

TAAATGACCGGGTAGGGTCTGGTCCATTGAGGCTGTANAGTrrGAGGGGTCANACTTITGAC^^ 

GACTGATCGTCCANCTAGGGTGTCTTCATATGGCTGGGAATCCTCCGCCGTAAGATTACCCN^^ 

CGGCATNTGTGATGGTCCAAGGGGGCTCTGAGGCAACGGCCANCATNAOTCTTNACCNCTA^^ 

NGANATCCATT 

SEO ID NO- 405 1 ACTGAGGACAAATCAGTTCTCTGTGACCAGACATGAGAAGGTTGCCAATGGG 
CTGTTGGGCGACCAAGGCCITCCCGGAGTCTrCGTCCTCTATGAGCTCTCGCCC^ 

ctgacggagaagcacaggtccttcacccacttcctgacaggtgtgtgcgccatcattgggggcat 

GrrCACAGTGGCTGGACTCATCGATTCGCTCATCTACCACTCAGCACGAGCCATCCAGAAGAAAA 

ttgatctagggaagacaacgtagtcaccctcggtgcttcctctgtctcctcttt 

GGTTGTCCCCCAGCCTCTGCCACCCTCCACCTCCTCGGCCAGCCCCAGCCCCAGGTTGATAAATCT 
ATTGATTGATTGTGATAGAAAAAAAAAAAA 

SEO ID NO- 4052 ACTTCCCACTrTTCATAACGAGTNGGAGCCTAGAGTTGATCGACTCCAGCGAC 
TTmCGTCTTCTTTGCGGCCACCATCTTCCTGCOT 

GCCGCTAANATGGCCGGGGAACGAGAAAGGAAAGACTTCCCACAOTGCAAAGCTCTTCACGGCC 
CCCGCGT 

SEO ID NO: 4053 ACCAGAAGTATAAGTTrATGGAACTCAACCTTGCTCAAAAGAAAAGAAGGCT 
AAAAGGTCAGATTCCTGAAATTAAACAGACmOGAAATTCTAAAATACATGCAG^ 
AGTCCACCAACTCAATGGAGACCAGATTCTTGCTGGCAGATAACCTGTATTGCAAAGCTTCAGTTC 
CTCCTACCGATAAAGTGTGTCTGTGGTTGGGGGCTAATGTAATGCTTGAATATGATATTGATGj^^ 
CrCAGGCATTGTTGGAAAAGAATTTATCGACTGCCACAAAGAATCTTGATTCCCTGGAGGAAGAC 
CTTGACTTTCTTCGAGATCAATTTACTACCACAGAAGTCAATATGGCCAG^ 
GTAAAAAGAAGAAACAAGGATGACTCTACCAAGAACAAAGCATAATGCTGGCAATTAAAAA 
GGTTTAGTTITCCAAACATGrrATCTTAAATACCCCTTTATCCTTACAGGTTGACATA^^ 
GTTITAACAGCAAGAATTTTAAGAAAAGATAAACACCATTTTATTTAm 
TrrCAAATATirrrGACATTGTGATITITim 
TTTTT 

SEO ID NO- 4054 ACGCGGGGAAGGGGAGAGTTTAAAAACCCAAACCGrrGTGGTTTrAAGG^ 
CTCATTTITAAAAGGGAGAGAGAATCTATTTAAAGCTAm 
GTCCAATGTATTCCrrGTTCTTTAAAAAAATTITmTAGAGG^ 
ACTAACTCTTCTGGTCACnTGTATTTATTTATTCATTCATTCATCAGATATTTGT^^ 
GAACTGGCCCAGTGGGTCTGAAAGCTCGCTTGAGAATAGGAAACTTGAGACCTGCCCCCTGTGGG 
TAGGAGAACAAGGACCACCTGGGTTCTCCAGTCTTGAACGAGAATCTCACTCTTATCAG^^ 
TTCTTAACCTCAGCGTATGATGAGGAAATTTACTTATCTCTAGCTAGGATTTGACAAATTCC^^ 
TCAAATGATCAAAACATTTGCCACTGAGGCrrCACTGGTGAGATCCGTTCTCCGTCCTCGGGTGCA 
GTCCCTTGGGGGCTGCTCCTCGGACTGCGCCCCGCACACCTGTTATCGAGGGTGTGAGAAGCGCCT 
AACCTGGTGACATGTGATCTGGGACGCCTTCATITCTCGGGCCAGGAGTACCANCTGCTANGACA 

GCAN 

SEO ID NO- 4055 ACAAAAAAATTAAATTrGCTTTAGTTATAAAAGAGCTCTGTCAATATACAC^ 
AACTATATACTTCAGACATTCACAAAAATGTGAGCAGAAGGCTTATCAAAAGACAT^ 
TTAGTITrCAACAACCCCrrGGTGGTCCACATCTACAAAGATATCCAGCCCAA^^ 

ccaaatcccacccccacagaaaagcacatacttaccagaatititagcaagtatgg™ 

TITGTGGTnTTGITmTAAAAAAAGGCCCCCAGGGCAAGTTATTTACAGm 

aactgatctggaccttgatcgggaccgggacctctggcgatccacagatgctggagacttagatc 

TAtnTGAAGAACCACGTTTCTGGCTCTTCTCAGGCACGGGAGACCTACTAACAGAACGGGAC^ 

tccggctccggctcctgctcctgcttcttgaccggctgtaagatttgcgactacgggaacgggatc 

GGCTCGAGACCTAGAGGAACTTCTGGTCCGGGATCGAGACCTGCTTCTTGACCTACCCCGCGC 

SEO ID NO- 4056 ACCCCTGATGCCGTTGACAAGTATCTCGAGACACCTGGGGATGAGAATGAAC 
ATGCCCATrrCCAGAAAGCCAAAGAGAGGCTTGAGGCCAAGCACCGAGAGAGAATGTC^^^ 
CATGAGAGAATGGGAAGAGGCAGAACGTCAAGCAAAGAACTTGCCTAAAGCTGATAAGAAGGCA 
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GTTATCCAGCATTTCCAGGAGAAAGTGGAATCTTTGGAACAGGAAGCAGCCAACGAGAGACAGCA 

GCTGGTGGAGACACACATGGCCAGAGTGGAAGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGG 

AGAACTACATCACCGCTCTGCAGGCTGTTCCTCCTCGGCCTCGTCACGTGTTCAATATGCTA^ 

AGTATGTCCGCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTTCGAGCATGTGCGCATG 

GTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCCAGGTTATGACACACCTCCGTGTGATTTATGAG 

CGCATGAATCAGTCTCTCTCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAG 

TTGATGAGCTGCrrCAGAAAGAGCAAAACTATTCAGATGACGTCTTGGCCAACATGATTAGTGAA 

CCAAGGATCAGTT 

SEQ ID NO: 4057 ACAGCCAACGGTrrCCCTTGGGGGCTTTGAAATAACACCACCAGTGGTCTTA 
AGGTTGAAGTGTGGTTCAGGGCCAGTGCATATTAGTGGACAGCACTTAGTAGCTGTGGAGGAAGA 
TGCAGAGTCAGAAGATGAAGAGGAGGAGGATGTGAAACTCTTAAGTATATCTGGAAAGCGGTCTG 
CCCCTGGAGGTGGTAGCAAGGTTCCACAGAAAAAAGTAAAACTTGCTGCTGATGAAGATGATGAC 
GATGATGATGAAGAGGATGATGATGAAGATGATGATGATGATGATTTTGATGATGAGGAAGCTGA 
AGAAAAAGCGCCAGTGAAGAAATCTATACGAGATACTCCAGCCAAAAATGCACAAAAGTCAAAT 
CAGAATGGAAAAGACTCAAAACCATCATCAACACCAAGATCAAAAGGACAAGAATCCTTCAAGA 
AACAGGAAAAAACTCCTAAAACACCAAAAGGACCTAGrrCTGTANAAGACATTAAAGCAAAA^ 
GCAAGCAAGTATAGAAAAAGGTGGTTCTCTTCCCAAAGTGGAACCAAATTCATCAATTATGTGAA 
GAATTGCTTCCGGATGACTGACCAAGAGGCTATTCAAGATCTCrrGGCAGTGGAGGAAGTCT^ 
TAAGAAAATAGTTTAACA 

SEQ ID NO: 4058 acggtitggcctgcaatgctaatggtctcacaagcagtcaaggtattttgcta 

CAGTGGTTGTGCTTCATCTTCTGrrGCCTGGTGTGTCTGGTGATATGAAATC 

TGCTTrGGCTrCGATCTACGTTATAAAGGATCTAATGACATrCATGTCT^ 

CTGCCCATTCAGCTCTGTGTGGCmCTITAAAATCATCAGGTC^^ 

GAAAACCAAAGGAAACATCTCTTGTATCTATTGGATTTTTA^ 

TCAAATTTATAAATCTAAAGAAAAACAAGTGAGTTTCCCTTmOTTT 

CCATATATTACTGCATTAAAAATAAATAAACATCTATATrTTTTTAA^^ 

ACCCAAAGTGTTACAGACTGCTAACAGGAAAAAAAAAAAGCTCACArrATT^ 

GCCATATGCTGATTCTTGAACATTTAGGTITCTTTTAATATATTTT 

TACAGTAACCACGGTTGCCmGTGCTCAAAATCTTTGGCCATACCANAGTCT^ 

SEQ ID NO: 4059 ACAGAACATGATCAAGGGTGTTACACTGGGCTTCCGTTACAAGATGAGGTCT 
GTGTATGCTCACTTCCCCATCAACGrrGTTATCCAGGAGAATGGGTCTCrrGTO 
rrcrrGGGTGAAAAATACATCCGCAGGGTTCGGATGAGACCAGGTGTTGCTTGTrCAGTATCTCAA 
GCCCAGAAAGATGAATTAATCCTTGAAGGAAATGACArrGAGCTTGTTTCAAATTCAGCGGC^ 
ATTCAGCAAGCCACAACAGTTAAAAACAAGGATATCAGGAAATTTTTGGATGGTATC^ 
GAAAAAGGAACTGTTCAGCAGGCTGATGAATAAGATCTAAGAGTTACCTGGCTACAGAAAGAAG 
ATGCCAGATGACACTTAAGACCTACTTGTGATATTTAAATGATGCAATAAAAGACCTATTGAT^ 
GAAAAAAAAAAAAAAAAAAAAAAAGT 

SEQ ID NO: 4060 ACCACAAAGAATAGTCTCATTTGCACGAGAGACTCAAATATTGGACTCTAGC 
ACAGGAAATGTCTCITGGGGCrTAGTAATATACAAACTGATAGAAAAGAAAATTGCCTT^ 
AATTTCACAATTGACTAAACATCTGAAACATITITATCACAGCTrTCA^ 

CCAGAGATGAAGCTTATTGATGAAATAAAACCTAGTCACTATCATCTGACCCATGTTCTGAGCCT^ 

TATCTGAGGCCTCATTGTTGGATCCCTCTGGTTCCGATTCATTTCCAGAGCCrrGGCCAG 

ATTATCAGATCCTCTCTCCGATTCGTGATCTGATTCGGCACTTGGAGAAGCTGGGCGAGAGTCGT^ 

CTCAGAAACTCCTGAGTGGCnCTCCCTGACTGCACAGATTCATTGTCAGACTGCnrA 

GGGCCTTCTCTTTCTGGATGGCTGGTCACTGTCTGAGTCCTGATCTGACCGCTGTCT 

CGGGGACTGCCGGCCTCGCTGCCANACrrGTTCTGGTrCTCATCGGAATCACTCTCTGATGA 

ATTTCTTTCGTTGTTCGTCCTCGTCTGANTCACTGGTGCTGm'GCT 

CCA 

SEQ ID NO: 4061 ACrTrr iU 'i n i"lH l - ri - L T l -i-ri " i-il-l-l i C CAAGCAGTATGTCTCAATAGTGGCCT 
TAAAATATTCAGCAAAATATreCGTAAAAATATTCAGTAAATCTAAATGACANATGTGCTGTC^ 
TTGGCmGTTGTTCCATTTCTANAGTGCCCCGCGTACCACCACACCCAACTAACTTTTGTATT^ 
AGTAAANATGAGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGGCCTCAAGTGATCTACC 
TGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCOTGAGCCACCACACCCAGCCTACTCAAACTTT 
TATGTTGAAAAAAAAAATCATAA' n 'l'rriTl-l'i'l ! 1 AAAGGAAATGAACGTGGAGGACTGGGGTGA 
AGGGCCACCCTGGGTAGTTNAATCTTTTTX}GGAANACATGACTITAAG 
ACAGGTTGCTCCATGCTGTNTTGGGGACAAGGGCCTGT 
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SEQ ID NO; 4062 ACACACACGGGAAAGAAACTCTGCGAGTCTGATGTGTGTCAGAGTTCCAGTC 
TTACAGGACATAAGAAAGTCCTCTCTAGAGAGAAAGGTCATCAGTGTCATGAGTGTGGGAAAGCC 
TTTCAGAGGAGTTCACACCTCGTCAGACATCAGAAAATCCATCTTGGTGAGAAGCCTTATCAGTGC 
AATGAGTGTGGCAAAGTCTrrAGCCAGAATGCAGGCCTTTTGGAACATCTCAGAATO 
GAGAAACCTTATCTATGTATCCATTGTGGAAAAAATTTTAGGCGCAGCTCTCAC CTTA^ ^ 
CAGAGAArrCACAGTCAGGAGGAGrcCTGTGAGTGCAAGGAGTGTGGAAAAACCriTAGTCAGGC 
CTTACTCCTCACCCACCATCAGAGAATCCATAGTCACTCCAAAAGCCATCAATGTAACGAGTGTGG 
AAAAGCTTTCAGTTTGACCTCAGACCTTATTCGACACCACAGAATTCATACTGGAGAAAAACi^^ 
CAAGTGTAACATATGCCAGAAAGCCTTCCGACTAAACTCACACCTTGCTCAACATGTAAGAATC^ 
ACAATGAAGAAAAACCCTATCAGTGTATGAATGTGGAGAAGCCTCAGGCAAAGGTCAGGTCrm 
TCAACAT 

SEQ ID NO: 4063 acccaaagcctccccaaggccacagtagtcatgctcccgggcagtatctgcc 
atcccagccacttacattcccatcgtggtttgggcaagagagaggctttcccgcagccacagcagt 
gactgtctccaggatggaagactcctcagcgttgctgcctggagttcgctgcagcacgtccatcag 
gtttgtgatgaagaaattgttctggagcgcggcca<xcctrrctcgggcaggatggaggtctgg 

GCACACTGGGCAGGAGAGGGTTAAACTGTGGGCAGGAATGTAGTTCTGCAGGCACCrCTCGCAGA 

AAGTGTGCAGACAGGGGAGAACCTTGGGATTCTTGTACCAGGCAGTGACAGCCACCCT^ 

AAGAGGAAAGAGAAAGCCAAGATCCACTACCGGAAGAAGAAACAGCTCATGAGGCTACGGAAAC 

AGGCCGAGAAGAACGTGGAGAAGAAAArrGACAAATACACAGAGGTCCTCAAGACCCACGGACT 

CCTGGTCTGAGCCCAATAAAGACTGTTAATTCCCCAAAAAAAAAAAAAAAAAAA^ 

SEQ ID NO: 4064 ACAGTTTGCAGAATATATTCAGAAAAACGTGCAACITTATAAGATGCGAAAT 
GGATATGAATTGTCTCCCACGGCAGCAGCTAACTTCACACGCCGAAACCTGGCTGACTGTCTTCGG 
AGTCGGACCCCATATCATGTGAACCTCCTCCTGGCTGGCTATGATGAGCATGAAGGGCCAGCGCT 
GTATTACATGGACTACCTGGCAGCCTTGGCCAAGGCCCCrmGCAGCCCACGGCTATGGTO 
CCTGACTCTCAGTATCCTCGACCGATACTACACACCGACTATCTCACGTGAGAGGGCAGTGGAACT 
CCn'AGGAAATGTCTGGAGGAGCTCCAGAAACGCTTCATCCTGAATCTGCC^ 
AATCATTGACAAAAATGGCATCCATGACCTGGATAACATTTCCrrCCCCAAACAGGGCTCCTAACA 

TCATGTCCTCCCTCCCACTTGCCAGGGAACTTTrmGATGGGCTCC^ 
CAGGCGCACTCTTGATAAATGGTTAATTCAGAATAAAGGTGACTATGGATATAATTCA 

SEQ ID NO: 4065 acgacctcattgccgtgtccaatcattatggagccatgggggttggccactac 

ACTGCATATGCGAAGAACAAACTGAATGGTAAATGGTATTACTTTGATGATAGCAACGTGTC^^ 

ggcctctgaggatcagatagtgactaaagcagcttatgtgctattttaccaacgtcgagatgatoa 

atttrataagacaccttcacttagcagrrctggttcctctgatggagggacacgaccaagcagctc 

tcagcagggctrrggggatgatgaggcrrgcagcatggacaccaactaatgctgactccacgatc 

ctgccaccctgtagcgccagtgtaatcccccaggagaacatcmgacactctgcagactgctagt 

gttctgtctaaaaaccagacaaggaaatacccttmttatgagcagaagga 

aagaagaccgtttacctagaagaagctatgtcaagaggctgaattattm 

gtgagaaat1tctgtgaaacctgtgaagctgaaaagggggtgggatgggggt 

SEQ ID NO: 4066 ACAGGTH'CACTATTACAAATATATGATGTTAAACTAACAAACTCATGACCTT 
CAAAGATGTCTTCGTCCCACGCACACACATTTGTAATTTGTGTCCATTTGa^^ 
TAATCTTCAAATTATATAGTTATACATTGAGTTCCCTATGCATCTCACCCATCTCCm 
CrrCTCATACTTrGCCATrCTCTrarrCTGGAAATAACCA^^ 

ATCACCACAACCACAATAACAGCAATAACACCAGCTTTTAGACCCTGCATTGAGAATTCA^^ 

TTTTTCATCAACATAATAAATTAAAGTTTGACCAGGATCCAGATCCAGTTGT^ 

AGGTCCATTTTCTTAGAATGAAACAAGGATTCACCTTTAACATCTTTT^ 

TCAGCTATGTCCACATCATTCTGAGTrTTTrTGAGAAGAATTTTGAACCAGATC^^ 

ATTATTCTCATACAAAATACTCGTGATAAATTTTGGATCCAGTTGATAACGCGrrGTO^ 

TGAATGCAGTCCGCAAACTTTTACTATCATAANGTITrTCTOT 

SEQ ID NO: 4067 ACAGCCTACCAAGAGGCCAGAAGGCAGAACTTATGCTGACTACGAATCTGTG 
AATGAATGCATGGAAGGTGTITGTAAAATGTATGAAGAACATCTGAAAAGAATGAATCCCAACAG 
TCCCTCTATCACATATGACATCAGTCAGTTGmGATITCATCGATGATCTGGCAGACCTCAGCT^ 
CTGGTTTACCGAGCTGATACCCAGACT^TACCAGCCTTATAACAAAGACTGGATTAAAGAG/^ 
CTACGTGCTCCTTCGTCGGCAGGCCCAACAGGCTGGGAAATAATTGTGTTGGAAGCACTGGGGGG 
GTTGGGGTGGGCTTGGAACACAGGTGTGTACrrCTCTGCAAGTAAAT^ 
TTCTGATTAAATAAATGATTACCCTAATCAGCTATTATTTATTATm 

AAGTGCCAGAGACCTGCCTATCTCGCATAAAGTTTGGTCAGATATATGCAAATGCAGTGTTGGTC 
AAGATCATCTGAATAATGATCACTTTCCATGCCATATTANTTATATTTANTATCACTGCT^ 
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CTTCCTCTCTTTAATAAAGTTTAATCAAGCAATTAAAANCTGTGTO 
CTGTAAT 

SEQ ID NO: 4068 ACAGGTGCATCGTGCACATAAGCCTCGTTATCCCATGTGTCGAAGAAGATAG 
GCTCTTCTTCAAAGATGAAGGGAAAGAAGGTGCTCAGGTCATTCTCCTGGAAGGTCTGT^ 
GGAACCAGCATCrrCCTCAGCTOJTCCATGGCCACAACAACTGACGCGGCCTGCCTGAAGCC 
CTGTAGTGGTGGTCGGAGATTCGTAGCTGGATGCCGCCATCCAGAGGGCAGAGGTCCAGGTCCTG 
GAAGGAGCACITCATCTGTTTAGGGCCATCAGCrrCAAAGAACAAGTCATCCTCATTGCC^ 
ATAAGCCATCATTrCACTGGCNAGCTCAGGT 

SEQ ID NO: 4069 ACATTTACTGGATTGTTTTTCAGTTTGCAGCTGCCCTCTTGACATCC^ 

AGGAATTCCCGTTAAGTAAAACCTGCAGTGCAATTCACTCAGTGGTTAGTTTCAGGCCAAGCCTC^ 

rcCTACCArrATGCTTGTTCITCCCCACCGCCCCCAAATCCTGCAGTGGCACCATAT^^ 

CACTAAGTACCTGTTACTGCTGCTACTTCCTCGmGACACCTTCCTGGAATCTCTCT^ 

AGGAAATACCTAGTAACAAACATOACTGAGTTCTGGCTTATATCTGCTCCTGGGGAGAAAA 

TCAGCAAACATGGGAGAAATTGCATGCGGCAACnTCAAAGAACAATAATCTTGCTGTCACTTCCA 

AGTTCAATATTCCTCACTTAAAGGTTGGCACGTTGGATGTCTTGGTTGGCT^^ 

CTAAACTGGATGCATTTGTAGAAGGAGTGGrTAAGAAAGTAGCTCAATACATGGCTGATGTAT^^ 

GAAGATAGCAAAGACAAAGTTCAAGAGAATCTGTTGGCTAATGGAGTGGACTTGGTTACTTATAT 

AACAAGGTTCCAGTGGGACATGGCCAAATATCCAATCAAGCAGTCCCTGAAAAATATTTCTGAAA 

TAATTGCCAA 

SEQ ID NO: 4070 ACTTTTTTTTTTTTITI^^ 

TTTAATAGAAAATATGATCAAAACTCGATTACAAGAGTTCAAAAAGACATAGAAAACCAGTGAGT 

TTCAATTTTATTACAAGTTTTCAAATCTGGGACTAGTTrCTni^^ 

TCAGCCCAGGGTTTTTTCACAACCAAACTAAAAATGACTTACTACATGG^ 

AGTAGAATTTGTAAACTCAAGCCACAAACTTAGTTAATAATCATGGTTAAGGGACATTGCCAAAG 

AGCAATTGATGCCTCAGTGAAGTTTGAAANAAACTCTGCTTTCTGTGACGGCAGAGAAGAAATA^ 

GCAAGCAATTCTGCTTCAAAGAAATTIXjCATAGAAATGGAAAAATGCCAGAGCC^ 

TGAAATTGCAAAGCCTCAACACGTTCAACTCAATCCACAGAGCACCAAATGTTTAATGGGAGCCA 

AGGTAGGACTCAGCATTOAACITCCAGCTATGCAACTCGCAGGGCACAATTTCA^ 

CATCTGTAGGCAAGCTCTTTTAAAAACATGAATn'ANACACCGTAAATTCTAATGCAACAC^^ 

CATTTCT 

SEQ ID NO: 4071 ACATTGAAGCTCGGGTGACAAAAGGTGAGACACTCACCTAGAACAGTGCCGT 
GCTGCTGCTGGGAAGTTGCTTTACACAACACAGGCCACATGGGAAAGGCCCCAGCAGCCTrCAGC 
TCCrrCCTTTCTCCTTAAAGAGCAACAGGGCTTATTCTTGTTm 

GGGCTCTGCCATCTGGGGTGTGGTGTGGTATGTGGGAAGAAGTTCAGAGGAACCGTTGGAAACGA 
CGTTAGGCATTTTACCTTTTCAGTAACATTTTATACATCTACTT 

AGCCAAAAGCCTGGGACTCTTTGTGAAGGTCCTCCTCACCTCTATCTTTCTTTCTCTCT 
CTTTCCTTAAAGTTCTCATTGCCTTTGCACTGCTTCTGTGAACAGTCmG 

GTGGGAAGTGCGGGGCAGTCCTGGTCAAGACACTCATGCCCTGCAATGTGGCTGCCANANAATGT 

TGTTGCTAACCCACCANTTTCTTGNTGAriTGGAGAGGTCAAGGCCAGGCCCCACTTGGCTTGA^ 

GGACATTTTCANACTITrrTTrCTGC^ 

C 

SEQ ID NO: 4072 ACGCGGGAGCCAAGAGGCGGATAAGTGCCCCACCTTAGAACAGTATGCCAT 
GAGAGCGTTTGCCGACGCACTGGAGGTCATCCCCATGGCCCTCTCTGAAAACAGTGGCATGAATC 
CCATCCAGACTATGACCGAAGTCCGAGCCAGACAGGTGAAGGAGATGAACCCTGCTCTTGGCATC 
GACTGriTGCACAAGGGGACAAATGATATGAAGCAACAGCATGTCATAGAAACCTTGATTGGCM 
AAAGCAACAGATATCTCITGCAACACAAATGGTTAGAATGATTTTGAAGATTGATGACATTCGTA 
AGCCTGGAGAATCTGAAGAATGAAGACATTGAGAAAACTATGTAGCAAGATCCACTTCTGTGATT 
AAGTAAATGGATGTCTCGTGATGCGTCTACAGTTATTTATTGTTACATCCTTTTCCAGACA 
ATGCTATAATAAAAATAGCTGTTTGGTAACCATAGTTTCACTTGTTCAAAGCTG 
GGTACCGATTATCCCAGCAAAACAAAAATAAGCnTTTATriTATrAAC;^ 
CCAATCAAATCrmAGGAACAAACTGCAAGAAAAGCTAAGAATGTrrrAA^ 
CAGACATTGCT 

SEQ ID NO: 4073 ACTTTITrTTTTTTT^^ 

GCTANAAAGACACnXjrrmANCCAAAATCGGCAATGACNCTAAAN^^ 

TATGCAAATATNTTTNTTCCAANAGTTGCCCTGGNGTGACTTCAANAGTTCATGTT^ 

TGGAAACTTCCrrrrCTTAGTNGTTGTATTCTTGAAAAGCCTGGGCCATO 
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TTrGGGCAGTGAACTCCTTGATGTTCTGGCAGTAAGTGTTTTATCTGGCCTGCA^^ 
TCCATCCTGGCAGGCGGTTGTGGTGGTTTNAAAAGTITGGACAGGTCCTCCTNAGGGANCGGGGG 
TTCTCCTNGGNTCTGGCCCTGCAT^^IT^^^CCTQCTGGCAACGCTGCTGATACT^ 
TGTTTGTTNACTAATmrClWATGTATmGTTGANTTj^ 

TCCATAACNAAACTGGANATTNTTCCCCAAANGATNGCTGCTGGCAAGCTNArc 
TTTNAAANACTGACTTTTTTCAATTTCCCCAT^ 

SEQ ID NO: 4074 ACGCGGGGGGGCGACTGAGCGGACAAACGGAAGTGTAGGTTACGGTCTGAG 
ACATCACCGCCAAGCTGGGCATCGGGGAGATGGCCGAGACTGACCCCAAGACCGTGCAGGACCTC 
ACCTCGGTGGTGCAGACACTCCTGCAGCAGATGCAAGATAAATTTCAGACCATGTCTGACCAGAT 
CATTGGGAGAATTGATGATATGAGTAGTCGCATTGATGATCTGGAAAAGAATATCGCGGACCTCA 
TGACACAGGCTGGGGTGGAAGAACTGGAAAGTGAAAACAAGATACCTGCCACGCAAAAGAGTTG 
AAGGTTGCTAATAATTTATACTGGAATCTGGCATTmCCAAGCCAAGAGAAGATCG/^ 
TTGCAGCTAACTACTATGTGTAGACAGGmTATATTATAAAGTATGCATTCTTATCACCTAGTATA 
TAGrrAGTTTGTAGAGTGATTTCCCCCCAGTTTCTTGAACATGGTATCTTCACATC^^ 
TCAGTTGTGCTATTCATTATTAAACACTAAAACTTrGGCGGTrCTTGCATAACAT^ 
TANTGTATTTCTGTGAAGTCATTTTTTTTCITGCATrCC^^ 
GTTGATG 

SEQ ID NO: 4075 ACAAATCATATGTTAGGTCACAGAACAGGTCTTACTAAArmTAAAAATCA 
AAATAATATCAAGCATCATTTCTGATTACAATGGAAAAACCTAAAAATCAATAAC^^ 
rrGGAAACTGTACTTTTTTTTTTT^^ 

GTCACTTTCACTCTCACTCTCTCGCTTCCGCrrCGCAGTTTTTGCTGCTTCGGCAACTAC^^ 
GCTTTTACCACTTCAGCAACCACCrmTTTTrrGGCAN 

AGGTGGCAGGTGTGCTGTCTGTGGGCTTCCCATGOTGTCCAAAAGGCCCTGCTTGATCATCAGCT. 
TCTTCTGACTTGCCnTrGGACCTAAACCCCACTTCCGAGGGTAAGTGTCTCT 
CTTGATCTTGGCTACTATACCATGGTCCGCAGGTANAGATGACCGCTGTGGTCATTAATGCAATAG 
CC^TGCANATTGCTTrrCCrTTGGTGGNGATAACCCAATCTC^^ 

ATCGAAAAACACCTGGAANCATAATCrrGGCCCATANCANATGGNATlTACTGCCCTGC^^ 

SEQ ID NO: 4076 CCCTAATGATAACCATTmAGAATTCAATCATCACTGTAGAATCAGAGTCTG 
TAATTCTTTTCTTGATTAGAGTGGTAGGACACTGTAATACTGTTCCTCCATCm 
TGTTATGTGTTACTCTACACTGTAAATGCAGTATTCAAATTCACTTGAGCCGTGGGCCTGGAAGT^ 
AGAGACTAGCTTTTACCrrATTACTTrCAATGATTTrATCTGAGT^ 
GCTGAATCCAAATTCTTCAAGTATCTCCTCAGTGTGrrCTCCTATGAAAGGATCCCTT^ 
AGGGATGGCTGGGGTGTTTAACAGCAGAGGTGCAGGGCGGGGGCTCACGTCCTGCTCCTCACTGG 
TGATAAACGAGCCCCGTTCCTTGTTGTGATCATGATGAACAACCTCCTCAAAAGTCAGAACCCGTG 
TACGATATH'GATAArrATATTTATATTTCACCACCTAAATGTAATGTTGA TTCCT ^ 
TGAAGGCCTACATTGAAATATGTTTTGTATAAATTGNCATGTTGAACAGCATm 
TCCCTTANCTATATGAATTTTTGGCATGTTTCAAAAGAGATC 

SEQ ID NO: 4077 ACTTmTTTTTTTTTTTTm 

AACCAAAAAGCAAAAAACAAAAACAAGTCCCATGTCCTACTGACTCTAGTTCTTC^^^ 
CTCTGCTGTCTCTCTGGCATCCTATGAATCANATCAAGCCCATGCTGTTGTTGGTTTTrAAG^ 
TTTCAATAATAGGCTAAGGAAAGACATGTTTTTCTCTTTTAAA 
CCrrCGAACGTATTTAATTTGGCCTTTTCAGCTTTCTTOT 

TGTGTGCACATGTTGGTTTCACTCACCCAGAGTAATmGTGAGCATGCATGTGCTlTIT^ 

GCTTGAATGTTTGCCTGCTGTGTTGCAGCTTCTAAAGACATTGTCACCGAATGTGTGTGACT^ 

GATCAAGAGTGAGGCTAAACTGCGACCCCAACATCACTTCTTCATGGAAAGGTGrrGGCGCTNAT 

GTNCCTCNTGGAAGGATCANACTGGGAACCACAGAOGAAGGKTNAGGTGGGATGTGGGAGNTCA 

AANCA^T^^mCTAAAGTNCCTTAGGCTTTTGNAACACTANATTC^ 

C 

SEQ ID NO: 4078 ACi'i"ril'ri'ril'i'l"l'l T Tr iUlTn TrCTCAAGCACGTGCACTTTATTGAATGA 
ACTGTANACAGGTGTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGGGCA 
ACAGCCCCAG(XTGCAGGGCCAGCATTGCACAGTGGAGTGCAAAGGTTGCAGGCTATGGGCGGNT 
ACTANTAACCCCGTTmCCTGTATTATCTGTAACATAATATGGTANACTGTCACANAGCC^ 
CCAGTAACAGGATGAATCCAATGGTCATGAGGATGCCCAAAATCAGGGCCCAAATGrrCAGGCAC 
TTGGCGGTGGAGGCATAGGCCTGGGCCCCGGTCACNTCGCCAACCATOTCCTGTCCCTAGACTTC 
ACGGAGTAGGCAAATGCTATGAAGC(XAAACAGCACCAGTTCAAGAAGAGGGTGTTGAACAGGG 
ACCAAA(>IACATGGTCGGCACGGAGGTCTCGCTTGTGGATGTTGATCACCGGTGGACCTTGGAAG 
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GATGGNGCT 

SEQ ID NO: 4079 ACAAATGTTITTTATTCAAAAANGCAAAATAAATTATCTGTAGGCATGGACA 
ATGACAGCAGTAAACCAT^ATATAT^m'GTCAACTGAAACCAGTNACTGATGGr^ATAGTC^ 
CAGCCAGCCTTTTTCTTCATTTTCTCCAACTGAOTCTCTGA^ 
GGGCTTCCTGTCACA^^ITCATTA^rrAAAGGTTAAGCACTAGTCTAGGAGT^ 
CCATACCACCTACCATTCCACCCATTGNACCCATTNCANGGNCCTTCTTTTNm 
GACTACAACT . 

SEQ ID NO: 4080 ACACAATGGTTTATTAAAGGAATGTATGGCCCACATCAACCTAGCAAGGATT 
CTACTGGTAAACCTTCCCATGGCCAAAGGAAAAACAAGCAGGAGTTGAGTGGCTGGGGTGGGGTG 
CAGGCAATGGAGAGAGGGCAGAAGGATGTAGAAGCTGAAGGGGGCTAGAAGCTTACTCCTGAGT 
TTCTTCCTrCTGTCTTCAAATCTTTACTrCTTATGGCCAAAGACCC^^ 

TGCACTCTTCTAGACTGCTCGAGACAGCCAGAGACAGGGGAGGAGGGAAGAAGGATACTGTGGA 

AAGGGATGGCGGGGCAAACATTTAGAGCTAGAAGCCACTACTGGGCCAATGCTAAAGTTTCTGTC 

TCTAAGCCTAAAAAAGCCAGTGTAGTAGGGCCCTTATCACTCTTAGTTTGCT^^ 

AAATAATGAGCAGATTTAGCCAGGCTAGCANAAAGGAANANGACNGGGCTGTGCAGGAA 

AAAATCTTGATTCTTGCTCTATGGTCGGT 

SEQ ID NO: 408 1 ACTTGCAATGGGGCCACCATGTTTTCTCCCATTAGCCAGCCCCATTCATCATG 
GATGCTATGAGTCAGCCAGGGGGCAGGCTTGCCATGGGTnTGTGACACCCCCATCCAAAGCTCA 
CCATOTTGCATCCCGCCCATTGTCTGTGGGACCCCAAOTTTCTAGCCATGTCCAGTTCTTC^^ 
AGCTGGATGCACATGCCAAGGCAAGCCATCX:ACAGCTGCrrGCTGGAAGGGTGGTGCAGATCTAA^ 
AGTTGGAGACATTGGCCACCTCAGCATAGGTGTGAGCCCAGTCCACAATGTTGTTGGAGCATGCC 
AACCTGTGGCTGAGCAAATA.\CTCCCAAGAATTrGGCAGACAATTCCGGCCOT 
TATTGATGGCCCAACTGCACACTGCAAATGCTGTCACAAGAGGGGCACCACCACTTCT 
ATCCTGATGACTACACCAATTATCAGGTTCAAGCCCCAGCTGAGGTCTGAGGAGAGTGGGTTGAT 
GAAGGGGCAGGGAGCTGGAAGAACACTTGGGAGACAGCAGGTAGATGAGAeACGGCTTTATTCA 
ANAACCCCCGCGTACCTTG 

SEQ ID NO: 4082 ACTAGAGCGCAGAGTITCAGACTrGGATTTATAAATGCTTTCAACGTGTGGTG 
TTTGGAAAAGGAGAAGACATCATCTGATmCAAAACCTGAAGTrmCTCAGGACT 
ATCGTAACTGCCACAGAGGGAAAAGGGAAGCrmCCCCTTAATTGTTCATCTC^^ 
CAGTATCCCTGAGATAGGAACCACTGTAATTAGAAGATTGGAATGAACAGGTTrCTCCCAAAGGA 
AGATrGTTTGTTGCTGAATrATGCCTACTGCCCTGATCATCAGGTATAAACTTTGCAT^^ 
TAATGCTTGGCAGCACCAAGACGTTTCAGTAATGGTTCATCATAAACCCAGCATTCTCCATOT 
TGAAATAACCTTGTCTCCCACTGAATTTCCTTCTCCTTCCTTTCTCGGGOT 
TTCAAGCCTGTGCTTTG(m'CAGTTGCTGCATCAATGTCTCTGArrm 
TTCCAAAGGCTGCGGGATTCATACTCGTTCTGATCrrCCAACTTCCTCACT^ 
GCAAOTCTTGGTATCTACAAAGACTGTATTTTCCCCrGTTGCATAT^ 

SEQ ID NO: 4083 ACTTTTTTTTTTTn^^ 

TACACATATATACAATGTArmAAAAATGGGCTTTACAATATGTAGTTTGATC^^ 

ACTAAATATATTGNGAACATTTTGTCTrCTACAACAGrrAAAANAArrGAATAGCIT^ 

ACAATTTATTAAGCAATCrTGTTGGGGACATTGAGGTATAATrnr^ 

TrrTATAATGCCnrrGGGAAAAAAAGGGGAGTrCTTGNCTTATATAGCTT^ 

ACTTGCCCTTCCATTTAGCCTTmACTTGCITNT 

ATTTTCTITITCAACCTCTCrnriTNTATTrGCl^ 

NATGGGACTTCCATTCCTTCAGCACTCTGGGTTCCTCCCCTNAAAGATGrmC 

NACTAriTNAATTGATTTGATNAACTCTAC>n^AAAACrGTATTOT 

NTCAAm'AATCTGAACATATANTTNATTC^T^AAAAANAACAT^mT^ 

SEQ ID NO: 4084 acgcgggcggggaggctttggagggcgaggaggcttccgaggaggcagagg 

AGGAGGAGGTGACCACAAGCCACAAGGAAAGAAGACGAAGTTrGAATAGCTTCTGTCCCTCTGCr 
TTCCCTTTTCCATTTGAAAGAAAGGACTCTGGGGTTm 

CTGAGGACATTCCAAGACAGTATACAGTCCTGTGGTCTCCTTGGAAATCCGTCTAGTTAACATT^ 

AAGGGCAATACCGTGTTGGTTTTGACTGGATATTCATATAAACTTTT^ 

GCTAACCCTTATCTGTAAGTTTTGAATTTATATTGTTTCATCCCATG^ 

SEQ ID NO: 4085 ACAATGAACTGCTTTTCCTCAAGCAATAATTGTTTCCAACTTGTCTGGGAATT 
GTGTGTCTGGTAACTGGAAGGCCTTCCACTGTGGCAAATGGAGGCTTTTCACTGCCTGT^^ 
ATACAGTAAGCATAGTTAAGGGGTGGGTCAGAACATGTTAAGATAACrTACTGTATATGTATTCCC 
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TTGTATTTTGTTAAAGCTGGAACATITGATATTTTTCCATTTATTTATGAAA^^ 

TTCAmGTACAAAACirrGGATGAAAAGATACGTCAAATTTCATTTAATCAOT 

CACCAACTCCATCAATACCACCCAAAGTGTTTTANGCAGTGAATAAAATCAAAATAATGCATOT 

ATAAATTCCAGCTGTTAAAAGAACAAACTTAGCAATATATAACAGriTGCTAAC^ 

TATTCACTTTGGGAGTTATTTTTAAAAATNCACTT^^ 

SEQ ID NO: 4086 ACGGGGGGCGCGTCTTGTTCTTGCCTGGTGTCGGTGGTTAGTTTCTGCGACTT 
GTGTTGGGACTGCTGATAGGAAGATGTCrrCAGGAAATGCTAAAATTGGGCACCCTGCCCCCAA^ 
TTCAAAGCCACAGCTCTTATGCCAGATGGTCAGTTTAAAGATATCAOCCTGTCTGACT^ 

aaatatgttgtgttcttcttttaccctcttgacttcacc^ 

cagtgatagggcagaagaatttaaoaaactcaactgccaagtgattggtgcttct^ 

acttctgtcatctagcatgggtcaatacacctaagaaacaaggaggactgggacccatgaacatt 

cctttggtatcagacccgaagcgcaccattgctcaggattatggggtcttaaaggctgatgaagg 

catctcgttcaggggcctttrtatcattgatgataagggtattcttcggcagatcacto^ 

acctccctgttggccgctctgtggatganactttgaaactagttcaggccttccagto 

aaacatgggggaantgtcccancctggctggaaacax3ccagtgataccatcaagcccttgatgt 

CCAAAAAAAA 

SEQ ID NO: 4087 ACAAATGATGAAACGGAAAGACGAAGGAAATTTTCCATTTTTGAAG/^^ 

gtgttcagtgttaatggagccaggggaaaccacatggactttggtcagctctatcagttot 

accaaaggatgtggggatgttitccagatgttctttggtgtagaaggacaatgacatcannagta 

gttgaaagtatcttgccactgttggtctttcgattttt^^ 

ATTOTATTTTAGTTCCATTCTAAAATGTTGGGGAGTGGGGCACAANAAAA^ 

aatgcatctgtnaaaaatgncatgarrgaaagcagaactgagtttcaaattacaacot 
gttgttagatatttcttcacatatcagctgcccattttgaaaaagaaattatc^^^ 

GTNGGTGCTCCAATTrGCCAGCCATrCCCAACCCCCTTCTCCCTTACCTGCCTTCA 

AGAAAAGCTAATTTGCTCCCCCTTTCACCCTNTTGrrGCAACTAAC/^ 

ACACANCTTTTOGCCTTGGGAAATTTTGGGAAAACT^ 

TANGGCCACCT 

SEQ ID NO: 4088 ACTCTTTtAATGTGTTTATATTGGAGGTGAGGGTAGAATGTTTTATAAA^ 
AAATGGGGATTTGCTTTTGATTGTAGGTCTITGCAGATCCGCrGGTTCCT^ 
TGCTGGCTGTCAGGAGACAGTCTCCTTGTTTATTAGCAATGACCAGCTTGGATGACAGATC^ 
TGCTGGATGGCAAGTGGGAGATGGGGAAATCGGATCTTGGAAGACAAGAAAAAATTAAAAAAAA 
AAAATTTTTGCTTTAACAGCCTTCTCTTCATGATTATTm 

TTTACTTTGCACAGCCTGGTAAAGCTCrrCTGGACAGAAAAATAAAAACANGAAAT^^ 

CCACACACTTTGTGATTAAACCCCTCCCCACAANAATAACAAAAAACACCCTCAAAGGCGGTCC^ 

NGrmAAAACTANTCTGGATAAACTGGAATTTACCCTAAAATTO 

CGCTGCAATAATCACATTACCTTTCATACCATTCTNATNACCTTATTTACAAAG 

CNATCCCTTTCCAATTCATAAANTGGTTrACNTTGAAATGGOTCCCCAATTGG 

ACAGAGAATNCCTT 

SEQ ID NO: 4089 ACGCGGGGACTGCNCAGGCGCTTACAGTGCACCAAGATGGCCGCCCCCGTOG 
ATCTAGAGCTOAAGAAGGCCnCACAGAGCnTCAAGCCAAAGTTATTGACACTC^ 

aagctcgcagacatacagattgaacagctaaacagaacgaaaaagcatgcacatcttacagatac 

AGAGATCATGACTTTGGTAGATGAGACTAACATGTATGAAGGTGTAGGAAGAATGTrrATTOT 
GTCX;AAGGAAGCAATTCACAGTCAGCTGrrAGAGAAGCAGAAAATAGCAGAAGAAAAAATTAA^ 
GAACTAGAACAGAAAAAGTCCTACCTGGAGCGAACGTTAAGGAAGCTGAGGACXACATCCGGGA 
GATGCTGATGGCACGAAGGGCCCAGTAGGGAGCCTCTCTGGGAAGCTCTTCTCTGCCCCTCCATTC 

CTGGTGGGGGCN 

SEQ ID NO: 4090 ACAAAATCAACCAGGTCTGAACTGATTGGTGATAAGAGCACACAGATCAGTC 
OTCCTTAAGGAAACAGTrrCCTCCCTGATGCCCTC^ 

CCAGTGTACTAGAACAGGTTGACAACAGGAGTTGGTCTCAGCAGGAGCCCTGCnTTCTGCTCCT^ 

ACCTCCAGACTAAGCTCCITCATGGCAAGATGGAGATAGAGCAAATCAAAGGCCAGATGTTTGGA 

ATATCTGCTGAATGATCTTATGATAACCTAACITCCAGTGTGTCCGGAATTGGTGGGTCOT 

CACTGACTrCAAGAATGAAGCTGGGGACCCTCGCGGTGACTGTTACAGTTCITAAAGC^ 

TCTCGAGTTTGTTCCTTCTGAAGTTCGGACGTATTCGGAGTnCTGCm 

CTCCCTGCTCAGGAGTGAAGCTCAGACCTTCCAGTGAGTGTTACAGCTCrrAAGGCAGTGG 

GAGTTGTTCnrrCCTCCGGTGGGCrCGTGGTCTCGCTGCTTAANAATG 

TGAGTGTTACAGCTCATAAAGGAAGTGTGGACCCAAAGAATGAGCAm^WCAA^ 
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AGGGA 

SEQ ID NO: 409 1 ACTGCTATCTCGCATTCrrCTCAGGTTGCCAGGAAATGTTTCrCTAACCC^^ 
AAAAGTGTTTGTAAACCAAGTTCAAAGGAAAATAGACAGAATGATTTCCAAAAT^^ 
CCATGACa:ATATACGCCAAATTCCCTCGTTATGCCACGACCAGATAAGAATCACCAGTGGGTATT 
CAATAAGAACTGTTTCCCTOTGTGGATGTAGTGATTGATCCrrACCTTGTATATCATCTO 
CATCAGAAAGAAGGAATCATATTCCTTTATGAATGTOTAATGGGAATGAGAATGAATGGCAGATG 
TGGAGCTATTCTTGCTGATGAAATGGGTrrAGGGAAGACATTGCAATGTATTTCGCTCATCT^ 
CCTGCAGTGTCAGGGACCCTATGGAGGCAAGCCAGTAATAAAGAAGACACTAATTGTCACACCTG 
GAAGCTTGGTGAATAATTGGAAGAAAGAAXn'CAAAAATGGCTAGGAAGTGAAAGGATCAA 
ATTTACTGTTGATCAGGACCACAAAGTTGAAGAATTCATCAAGTCTATATmATTCT^ 
ATCAGrrATGAAATGTTACTTCGTTCCCTGGATCAAATTAAGAATATAAAATTTGATC^ 
GNGACGANGGG 

SEQ ID NO: 4092 ACAGCACTTCCGAAGAGTrTAGTTGG<XCTTTGCTGGrrGOGCTGAGm 
ATTTTTAAGTGTTTGTTTTTCAGTGCAATAATTTTTGTGTGTGTG^ 

TTGTTTTCTGCCTACACCGTTCATCAGCCCCATAACCCAGGAAGGAACAGGCATTGTTAGCATCAG 

ATTATACCTCATTATTAAAAGGAGGCATGGCCACACATGAAGAAATGGTCArTCTACTTCAAAGA 

AATTGAGCCAGCACTATCTGTACGAGTCTGAGGCGGAGGGAGTAATGGCAGGACAAGCGTrTAGA 

AAGTTTCTTCCACTCTTTOACCGAGTATTGGTTGAAAGGAGTGCTGCTGA^ 

GGCATTATGCITCCAGAAAAATCTCAAGGAAAAGTATTGCAAGCAACAGTAGTCGCT 

GGGTTCTAAAGGAAAGGGTGGAGAGArrCAACCAGTTAGCGTGAAAGTTGGAGATAAAAGTTCTT 

CTCCCAGAATATGGAGGCACCAAAAGTAGTTCTAGATGACAAGGATTATTTCCTAm 

GGTGACArrCTTGGAAAGTACCTCGGNCCGCGAACACCNCTAAGGGCNAATrCCANCACACTGGC 

NGGCCGTACTAG 

SEQ ID NO: 4093 ACGCGGGGTTGCmATTTTCCATCAAAGCCCTCTGAGAAGTGAGACCT^ 
AATTCCGGGAGCCACATAGAGACAGACTTGGCAAGGGACCCCCTGGTTCTGAGCCAGTAGCTGCC 
ATCTGGAAATTCCrCTTTTAGCCTCTCCTTAGAGGTGAATGTGAATGAAGCCTCCCANGCA 
TGAATTTCTGAGGCCTTGCTTAAAGCTCAGAAGTGGTTTAGGCATTTGGAAAATCTC 
ATAAAGAACrrGATTTGAAATGTmCTATAGAAACAAGTGCTAAGTOT 

SEQ ID NO: 4094 ACATrCGTGTTCATGTTTACAAN>rrCTCrrC(>TATATTTACAAATTTCACGAT 
AAAATTGTGGGGGAACAAAATrACTAGAAAAAAATAATGAGAAAATATACACCACATATrATTCA 
AAAACAAATGTACATTGGTGATTCTGAAGCTTATATCGGAGCAGACATTAAAGACAAATTAAAAT 
GTTATGACTTTGATGTGCATACAATGAAGACACTAAAAAACATTATTTCACCTCCGTGGGA^ 
GGGAATTTGAAGTAGAAAAACAGACTGCAGAAGAAACGGGGCTTACGCCATTGGAAACCTCAAG 
GAAAACTCCAGATTCCAGACCTTCCITGGAAGAAACCmGAAA^^ 

TGATGTTAGAGACATCTATGTCAGACCACAGCACGTGACTCCAGTCAGTGGTCCTGGTCCCACTGT 
CCCAhrrGTANGTTAGTATTCCTTCACATCCTCTCCATGGCTTAAGAATGTCCCAOT 
ACTCCAACTGCATCTCTACNTTTAGGAACAGAGACCCGCCTTAAGAGACTGGATCGCACACmTrG 
CAACANAT 

SEQ ED NO: 4095 acgcgggggtcgcgcgcgtggatctgccgccgggttgctgtgcgactattct 

CCGGGAGCCGTCCGTGTCACCGCCGGAACCTGGCGCAGGTTAATTATAGAAAATGCCAAGTAGGA 
AATTTGCCGGTGGTGAAGTGGTAAGAGGTCGATGGCCTGGGAGTTCACrrrATTATGAAGTAG 
ATTCTGAGCCACGACAGCACCTCCCAGCTTTACACTGTAAAGTATAAAGATGGAACAGAGCTTG 
ATTGAAAGAGAATGATATTAAGCCnTTAACn'CCTTTAGGCAAAGG^^ 

GrrCCCCTTCCAGACGCCGAGGGAGTCGATNAAGGTCACGCTCCCNATCCCCCGGTCGACCACCT 
AAAAGTGCCCCCGATCTGCTTCTGCTTCOCACCAGGCCGACATTAAGGAAGCA 

SEQ ID NO: 4096 ACAGGAGTTGTTGCATATTCCATGAGGCTGGTGTCGGGAAGCAGGGACCCAC 
AGTTGCCAGGTTGTCCATCTCTGAGCCAATTrCCCTCCACAACCCAGGGGTTTCAGTCTCATATCA 
ACTATCATGTTTGAAACAGAAAACAGGCAAAATGTTTGGCTAAAATAAAATGAAAACACT^ 
GAAGAGAACTGAGTGTGCTGGTGGACAGGAGCCCTGCTCACCTGTGGGAAGGGCAGGGCCAGCA 
AGGGCAGCAGAGCTCCCTGGGGGCAGCTAGGCTGTGTGTGCATGTGGCCCTACAGCTGGTCCCAG 
GGGAGATGCGGGGACAGGGGACAGTCCAGGCAGACAGGTACTTTTTTTT^ 
TTTCCACACCTGCCCTTTATTGGTCTCTTCTANCAAAGTGGCTCCATC^ 

ACCACCCATGAGGGTTTAGGAAGGTGCCATCATTCTGTGAAGGCCCANAACTTACCCAAGTOT 
GANCCCAAGTTGAATCACCAACCA 
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SEQ ID NO: 4097 ACTTTTTTTTITnTI^^ 

CCCTTAAATAGAAATTCCACTACAAAAATACAGAGGAGATAGGGTGTTTCCTGTATCCG^ 

CCCATAGAAAACTATAAGGGAAGAAATAGAACTTGGAATTAAAGCAGCAGCAAGGCGAGGTGAG 

AATGCGATTTCTAGGCCATCTTGrrGGGACTGATGAACAGCATCTCTGATCTCATGAm 

TGGTTATCCANAAGGGATGGGATTGGCCTAAAAAAACCGATCAATTTCTGGATTGTT^ 

TAATCCTTCAAATGACAAGAAGCAAAAATATTTGTAGAAACTAAGCAAGACAAGAGTTGGAGCAG 

TTAACACCACTATCTGCAGCAGAAACTCAGACCTTCTTAGTGAACrrAAGGGCT^ 

GCTGAGGCTGCAAGTGCANATCCNAAAAAAAAAAAAAAAGATTAGACAGAACCCTCCT^ 

CANATCCrcCTNTGCCCTGCCTITGAAAATGCCN^ 

GGATCTCTTTTNTTGGGANTGATCCrCA>n^GAAATATTCT^^ 

CXn^fAAAANGGGA 

SEQ ID NO: 4098 ACTTGAGGTAACAGTCATGGAGATCGAGATAACGACCATATCCCTCTTCATCT 
GTGAACTCCACCAAGTTTTGTGCCTCTTCACTTGGATTCTCTCGAGCCTTC^ 
CCACTGACATTGGCACACAGATCTCATTTGGGTGCTTCCGGTGGAATTCCTTTAm 
ATTATAGAATTCAGCAAACTCATTGGGTCCTGAAATGGCATrGAGCTCCTCCTTTCGTAATCCA^ 
CnATCATCATACAAATCCCTCAGGTTCCCACTGACCTCCATATACCTATCTTGCATGGCCCGAGT 
GCGGTGATCAGAATTGATCTGGTCCCGGAGCGTGGACTTCTTGGTGAGCATCTCTTTAGCCACGAC 
GTCCATGAGCCGTTCCTTCTCCTCATGATAGCGCCGCTGCTGCTCCAGTATTGTCTCCATOT 
TAGTCGCGGCTTCTCAATTCAGACCACCAACACGGCCGGAACAACTCCTGCCGCCANCGCGCCTG 
AATCCCCACGTTGCGCCCCOGCGT 

SEQ ID NO: 4099 ACATGATCAGCTCTTTCTATTTTTACTCGTAAAAATTATGGAAATGAAT/^^ 
TTGCTAACAACrTTGAAATITCAAACTTCrrGGAAAATATGAAA^ 
AAATTGTAAGGTATGAATGTGATTTGTCTGT 

SEQ ID NO: 4100 ACTGCGAGCAGTTCTTCCAATTCGATGAATATAATCCTCTGAGGAGTTAGGGT 
AGTCATAATTGATGACAAATTTCACATCTTCCACATCTAGCCCTCTGGAGGCCACATCTGTAGCy^ 
TCAGAATAGGAGCrmCCATGTTTGAATTCATITAGAACX:CAGTCACGCTCTrGT^ 
ACCATGGATACCCATGGCAGGCCACCCATCTCTCCTCATTTTTCTGGTAAGCT^^ 
TTGGTTTCCACAAAAACAATGGTTTTATrCTCCrrCTCACTCATGATCTCT^ 
GTTTTTCATCCTTTTCTACGTCATGACACACATCCACAATCTGAAG/^ 
TTCAAGTGCACCAATX/TTTATATGAATATAGTCTTTCAGGAAATCTTCAGC^ 
TTTGGCCAAGTCGCACTCCACATrAGAAGTTTGCCTATCAGGTCrTATTTGATCCAC^ 
ATITGGGGTTCAAAGCCCATATCAAGCATTCTATCTGCTTCATCAAGGACAAGG^ 
CTCAAGArrGGTTTTTCCACACICrAAAAAGTCAATCAAGTCTTC^ 
TC 

SEQ ID NO: 4101 ACTGGCCmATGTGTCTATTCATAGCAGTGGACTGGGCTGCTCGCACCAGAC 
eCTCCAATTCAGCACCACTGAAATTCTTGGTCTCCACGGCCAGTTC^ 
AGAGTAACTGATGCCCTCTCATTCTTGCTGTGTGGATGTGAAGAATCTGTAGTCXiGCCm 
TGGCAAGCCTATCTCCATTITAACTTCCAGTCTTCCAGGTCTAAGAAGAGCCTC^^ 
GGTCTATTGGTCATTCCAATGACTAGGATGTTGTTTAGCTGCTCCACGCCATCAATm 
AACTGGTTGACAACAGTGTCATGAACTCCTGTGCTACCAGCCATGCTCCCTCTCTGCT^ 
GCATCAATTrCATCAAAGATGATGATGTGCAAACCACTGTrAGCACCAAGCCTCCTTTGCT^ 
TCAGCATCAGCAAAAAGTTTGCGAATGTTAGCCTCTGATTCTCCCACATATTTGTTAAGGA 
GGCCCATTCACCACirrGGGCTCTCTTGCATTCAACATCrrGCCAAT^ 
TTACCACAACCTGGGGGGTCCATATAACAAGGATGCCTTTAACATGTTTACAACCCATCTG 
NAATCT 

SEQ ID NO: 4102 acctataaatgttgttgagacactgagaacacgtggggcccccacccggata 

GTGAGAAAAGTAGCCCGGAACCTGGGCAAGGCCACTTCAGGTGTCCTCGTTGTGCTGGATGTAGT 
CAACCTTGTGCAAGACTCACTGGACTTGCACAAGGGGGAAAAATCCGAGTCTGCTGA^^ 

ggcagtgggctcaggagcrggaggagaatctcaatgagctcacccatatccatcaagagtctaaa 

agcaggctaggcccaattgttgcgggaagtcagggaccccaaacggagqgactggctga agcca 

tggcagaagaacgtggattgtgaagatttcatggacatttattagttccccaaarraatacttt^ 

taattnnctatgcctgtctttaccggaatctctaaacacaaattgtg;^ 

tcacttccccaatcaatacccttgtgatttcttatgcctgtcmact^ 

nctgangaggatgtttgtcacctcaggaccatgtgataamn^ttaactc^ 

catgtgtgtttnaacaatatgaaatctgggotcctngaaaaaaagaacaggatacagcatri 

anggaataaaaaa 



638 



wo 02/29086 PCT/USO 1/30732 



SEQ ID NO: 4 103 ACAACAAAAAGATTAGGAAGCAAAATGTGAATAAGCITGTATTCAGAATATA 
CCTATATGTGTGTGCAAAGACAATACACTCATCATATACCTCACTTAGGTTCCTTTAT^^ 
ATTCCTTAATCATTCATTTGTGAAACAAGAATGAATATTAAGAAACATGAAATCCAAAT^ 
ATTTTCACAATATCAATGCTAAACTCAAAATAGCAACTTCATTGACTCTC/^ 
CATGGAAAGCAATCTTAATTTTTITAACCCmGACTAGGGTCT^^ 

ACTGTACAGCATTCTGGAATAAAGCAAGAGTGTTCATTCACACACACAGTAGCTTCAAAACTGTT^ 

GATCTGTTTGTTCCCATGTAGTTTTCTAAAGATGGAAAAAAAGGACTTTTGGTCATC^ 

GTGGCCATATTAGATTACTGGAACATCTAAGCATCAGTGTGTGACCATGCGAACAAAAGACTTCG 

GGGAGTGTCTATTTTTAAAAAGGTTTATGTGTGTCGAGGCAGTTGTAAAAGATTTA 

AAGCCC(nTTTANGCTTAGGACCAGGTTCTAACTATCTAAAAATATTGACTGAT^ 

TCTTAAATGT 

SEQ ID NO: 4 1 04 ACGCGGGOTAAGAGCTTTCGAGTATACTGTATTATCCTTGTAAAACCCAAAG 
ATGTGAGTCTTTGGGCTGCAGTAAAGGAGACTTGGACCAAACACTGTGACAAAGCAGAGTTOT 
AGTTCTGAAAATGTTAAAGTGTTTGAGTCAATTAATATGGACACAAATGACATGTGGTTAATGATG 
AGAAAAGCTTACAAATACGCCirrGATAAGTATAGAGACCAATACAACTGGTTCTTCCT^ 
CCCACTACGTTTGCTATCATTGAAAACCTAAAGTATTITTTGTTAAAA^ 

TTCTATCTAGGCCACACTATAAAATCTGGAGACCnTGAATATGTGGGTATGGAAGGAGGAAT^^ 

TTAAGTGTAGAATCAATGAAAAGACITAACANCCTTCrCAATATCCCAGAAAAGTGTCC^ 

GGGGAGGGATGATTTGGAAGATATCTGAAANATAAACAGCTAGCAGTTTGCCTGAAATATGCT^ 

AAGTTTTTGCAGAAAATGCCAAAANATGCTTGATGGAAAANAATNTATT^ 

GGGCTTTCTTTNAAAAAAGCAATGAOTATTNCXCCCAACCANGT^^ 

AAATATGGNTN 

SEQ ID NO: 4 1 05 ACCTGGCCATCCTGGGCAGTGTGACGTTTCTGGCTGGCAATCGGATGCTGGCC 
CAGCAGGCAGTCAAGAGAACAGCACATTAGTTCCAGAAGAAAGATGGAAATTCTGAAAACTGAA 
TGTCAAGAAAAGGAGTCAAGAACAATTCACAGTATGAGAAGAAAAATGGAAAAAAAACTTTATT 
TAAAAAAGAAAAAAGTCCAGATTGTAGTTATACrmGCTTGTTTTT^^ 

CAGATACCTGGTGAGCTCAGATAGTCTCmCTCTGACACTGTGTAAGAAGCTGTGAATATTCCT^ 

ACTTACCCAGATGTTGCTTTTGAAAAGTTGAAATGTGTAATTGTTTTGGAATAAAG 

TAGTNNAAAAAAAAAAANAAAAAAAAAAAAAAGGTCCCTGCCTATm 

TTTCATTGTAANAAACTAGTCCATTATTTAAGTGTCCCAGTATTTTTCATTTCAGTGG^ 

G(>IAAGGTTTCCAGACACAATCTTGGTCTCTAATACTGCTCCAGGGTGGGATATCAATTCTGTCA 

CATGATTTGCAATGATGATAACCGTTCCCTTTAATGAAACATTTTTTCC 

NAAACTGNGGAG 

SEQ ID NO: 4 1 06 ACAAACATGACATTACAGAGTATCTTATAAAATACAAAGACAAATATAAAAG 
GACTATGATGCTTTAAGTCTGAAAACTATTGGCCAAATATrrAGGTTTAAATTTAC^^ 
TATGAGAATCATATTACTATATACATCTCCCAAACCAGTAGGTAGTATmCCAATTAACCATGTG 
TGGTATCATCITCTACAAAGTCnTTGGCCATCTCTGCTGTGATCACATCAATATQACT^ 
TCTGAACTTTACACCATAGAATTTGTCAGCTGACTCAAGCAGTTCAGGCCTAAAAGTAGTTGTAAT 
AAACTGAGCATGT 

SEQ ID NO: 4 107 ACGCGGGGACGGTTCGTITITGCmAGTCAGGAAGGACGTTGGTGTTGAGGT 
TAGCATACGTATCAAGGACAGTAACTACCATGGCTCCTGAAGTTTTGCCAAAACCTCGGATGCGTG 
GCCTTCTGGCCAGGCGTCTG(>rAAATCATATGGCrGTAGCArrCGTGCTATCCCTGGGGGTTGCAN 
GCirrGTATAAAGTTTCTTGTGGCTGATCAAAANAAANAAGGCATACT 
ACAATTGTTATNGAAAGATTTTGANNGANATNAGGAAGGCTGGTATCTTTNAN 
ATCrrrGGAATATAAAAGAATTNCTCCANGTrGANTAANCTAAAA^ 

SEQ ID NO: 4 1 08 ACTGGCCTGCTGCnXjGCCCGCAGGCrrCTCAATAGGTTTGGCATGGAC^^ 
TCTATGAAGGCCAAGTGGAGGTGACTGGTGATGAATACAATGTGGAAAGCATTGATG GTCAG CCA 
GGTGCCrrCACCTGCTATTTGGATGCAGGCCTTGCCAGAACTACCACTGGCAATAAAGTT^ 
GCCCTGAAGGGAGCTGTGGATGGAGGCTTGTCTATCCCTCACAGTACATCCAAAACCATAAGGAA 
ATATTCTGATGCCCAGATOATGAAGACTGGGGTGAATTAAGTCCACACATTTATTTCAAGTTGTTA 
AAGAGTTTGTGGGCCACGCAATGGTCCCTCGCATGCAAGAAGTCAAAGAGCTCCTCCXjTGCAATC 
CTCTTCTGTATGTGATCGAGAGGATACACGCTCATCACAGAGCTCTAGCCGCTCCCGGGCCTTTAC 
ACATITCTCCAACTGCTCGCATTGCTCrrCTCACrGTrGTTAGGGGATCCACT^ 
TCTTCCTCCTCCTNAGGATCTCCGGATTCGGTAAGCATCTTTTGCTCGTCCTCATCCCATG^ 
CTACGGTTCTAGATrCAACACGAANCAGCAACAGCGGCACCTACCCAhrn'CANGATCAAAAAAG 
ACTTGTAAGGGTC 
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SEQ ID NO: 4109 ACGCGGGGGCGGCCAACATGGCGGAACGCAGGAGACACAAGAAGCGGATCC 
AGGAAGTTGGTGAACCATCTAAAGAAGAGAAGGCTGTGGCCAAGTATCTTCGATTCAACTGTCCA 
ACAAAGTCCACCAATATGATGGGTCACCGGGTTGATTATTTTATTGCITCAAAAGC^^ 
CTTTTGGATTCAAAGTGGGCAAAGGCX^AAGAAAGGAGAGGAAGCTTTATTTACAAC^ 
TGTGGTTGACTACTGCAACAGGCTTTTAAAGAAGCAGrmTrCACCGAGCCCT 
AATGAAATATGATAAAGACATAAAGAAAGAAAAAGATAAAGGAAAAGCTGAAAGTGGAAAAGA 
AGAAGATAAAAAGAGCAAGAAAGAAAATATAAAGGACGA 

SEQ ID NO: 4110 ACAGTAAGGCAGTAArrCCATTAGCATCCTGCTGTGGTTITCTTTTCAAGTTA 
OTCTTTAGGGAATTTTITmAACTAAGGAAAGACTAAGGTGTAT^^ 

AAATGAAGGAGGAAAACATCCTAAAGGGATTCACATGAAATTCATTTGCCCCTTGTGTAAGAA 

TTmATTTCCAAACTATATTTTATAGAAGCirrAGAGTAGAAm 

ACATATCTAAACrArrrrAAGAAAAAATTGGAGGCATrATTTCTTACrrGCATATCAG^ 

CAGTAAGGTATAGGAACTTATGTAAACAATTrGTrCTGGAATGCATTCTTTCm 

TTGGGTAATCCTAATAAAATGGACGAGQAGTGTAAAGGTTATGCAGTTGTTTTTCAAAAGC 

TCATGGCATTTTCCTATGTGOAAAATTITCTTTTGTCATCT^^ 

CATITCCGTCTGTCCCATGCCAAATCATGAAAGCCTTTTCCAGAGGATATACC TATT GGGCAm 

CCCTCAAANAATACTAAGCCNGTGGTGCCrTTTATTTCATTGAACACAT^ 

CAGC 

SEQ ID NO: 41 1 1 ACGCGGGGAGCCGGGTGCTGATGCGAGTCGGTGGCAGCGAGGACATTTTCTG 
ACTCCCTGGCCCCTGACACGGCTGCACTTTCCATCCCGTCGCGGGGCCGGCCGCTACTCCGGCCCC 
AGGATGCAGAATGTGATTAATACTGTGAAGGGAAAGGCACTGGAAGTGGCTGAGTACAAAATCCC 
CTTTGTTGAAAAATAAGGGGCITrCTAAACTAATAAAAAAGGAAGTT^^ 
TAAAACAACrrrTTTGGCAAACAAAGTTACTTCAGGTGAGGAAATm 
CAAATGAATCTTTTCrrAAAACTTmAAAAAATTATGTGCCAGTGTATACT^^ 
TGTCTTAGAAGTTTTTAAAGCATTCTGTTAAATGCCCACTGAAACAATGGGACTCCAAAAATAT^ 
TCAATAATCATGATAAAAAATTATAATATGATTATCAAGTGAAGCANGTATTGAAGAAATAAAAA 
TTCTCACTTGCTCACTGGCAATTTCrmCTAACAGATATTATTGGAGAAGGCCTGAAGT 

SEQ ID NO: 41 12 ACCATACCACTTATAAAGTGGAAACTCTTGGACCAAGATrTGGATTAATITGT 
TTTTGAAGTTTTTrGTATATAAATATGTAAATACATGCTTTAAm 

AAATAAGTTAGACATTTAAAAGAAATGATTGTTACCATAAATTAGTGCTAATGCTGAGGAGAA^ 

ACAGTTTTTCTTTTGAATTTAGTATTTGAGATGAGTTGTTGGGACATGCAAATA/^ 

AAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 4113 acgcggggaggcattgaggcagccagcgcaggggcttctgctgagggggca 

GGCGGAGCTTGAGGAAACCGCAGATAAGTITITTTCrCrm 

TTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTTAGCGTTAAGTTmAACGTAAT^ 

AGCTTAAGATTTTAAGAGAAAATATGAAGACTTAGAAGAGTAGCATGAGGAAGGAAAAGATAAA 

AGGTTTCTAAAACATGACGGAGGTTGAGATGAAQCTTOTCATGGAGTAAAAAATGTAm 

GAAAATTGAGAGAAAGGACTACAGAGCCCCGAATTAATACCAATAGAAGGGCAATGCTTT^ 

TAAAATGAAGGTGACTTAAACAGCTTAAAGTITAGTITAAAAGTTGTAGGTGArrAAA^ 

AAGGCGATCTmAAAAAGAGATTAAACCCGAANGTGATTAAAAGACCTTGAAATCCATGA 

GGGAGAATTGCCGTCATTTAAAGCCrAGTTAACGCATTTTCTAAACGCCAGACGAA^ 

AGATTAATTTGGGAGTGGNNAGGATGAAACAAATmGGNNAA^mATNAGAAAT^ 

AAANACTGGGAAGAAC 

SEQ ID NO: 41 14 ACGCGGGGGCTGTCGCTCACTCAGATrGTCCGTTTGCTATGCCGAATGCAGCC 
AAAATTCCTTTTTACAATTTGTGATGCCTTACCGATTTGATCTTAATCCTGTAm 
ACACTCCCTTATACTGTGTTrcTCTTTTTGGGGGAGCTTAACTGCT^ 

CCATAGTAAATGCCACAAGGGTAGTCQAACACCTCTCTGGCCCCTAGACCTATCTGGGGACAGGC 

TGGCTCAGCCTGTCTCCAGGGCTGCTGCGGCCCAGCCCCGAGCCTGCCTCCCTCTTGGCCTCTCAT 

CCATTGGCTCTGCAGGGCAGGGGTGAGGCAGGTTTCTGCTCATAAGTGCTTTTGGAAGTCACCTAC 

CTTTTTAACACAGCCGAACrAGTCCCAACGCGTTTGCAAATATTCCCCTGGTAGCCTAt^ 

CCCCGAATATTGGTAAAGATOJANCAATGGCn'CAGGACATGGGTTCTCTrcrCCTGTC^ 

AAGTGCTTACTGCATGAAAGACTGGCTTGTCTCAAGTGTTTCAACCCTCACCAGGGGCTGGCTNTT 

GGGTCCAAACCTTNGGTTCNCTGGTTAANTGCCGTmTGAACANCCCCCAATCAAAAOT 

TGGNCCC 

SEQ ID NO: 4115 ACTAAOAGAAAAGCACGAAGCTGTGGATCATAGTTCCCAGCATGAGGAAAA 
TGAAGAAAGGGTGTCAGCCCAGAAGGAGAACTCACTTCAGCAGAATGATGATGATGAAAACAAA 
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ATAGCAGAGAAACCTGACrGGGAGGCAGAAAAGACCACTGAATCTAGAAATGAGAGACATCTGA 

ATGGGACAGATACTTCTrrCTCTCTGGAAGACTTATTCCAGTTGCTlTCATCACAGCCTGA^ 

ACTGGAGGGCATCTCATTGGGAGATATTCCTCTTCCAGGCAGTATCAGTGATGGCATGAATTCTTC 

AGCACATTATCATGTAAACnTCAGCCAGGCTATAAGTCAGGATGTGAATCTTCATGAGGCCATCT^ 

GCTTTGTCCCAACAATACATTTAGAAGAGATCCAACAGCAAGGACTTCACAGTCACAAG^^ 

TTCTGCAGTTAAATTCTCATACCACCAATCCTGAGCAAACCCTTCCTGGAACTAATTTGACA 

TTCTITCACCGGTTGACAATCATATGAGGAATCTAACAAGCCAAGACCTACTGTATGACOT 

TAAATATATTrOATGAGATAAACTTAATGTCATmGGCCACAGAAGACACTTGATCC^ 

TTTTNTCAGCTT 

SEQ ID NO: 4116 ACTTTITrTTTTTTTT^^ 

ACACTGTANACAGGTGTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGGGC 

AACAGCCCCAGCGTGCAGGGCCAGCATTGCACAGTGGAGTGCAAAGGTTGCAGGCTATGGGCGG 

CTACTANTAACCCCGTTTTTCCTOTATTATCTGTAACATAATATGGTAGACTGT^ 

TACCAGTAACAGGATGAATCCAATGGTCATGAGGATGCCCANAATCAGGGCCCAAATGTTCAGGC 

ACTTGGCGGTOGAGGCATAGGCCTGGGCCCCGGTCACGTCGCCAACCATNTTCCTOTCCCTAGACT 

TCACGGAGTAGGCGAATGCTATGAANCCCANACAGCACCAOrrCAAGAAGAGGGTGTTGAACAG 

GGA(XAAACGACATGGTCGGGCACGGAGGTCTCGCrGTGGATGTTGATCACGGTGGACCTTGGAA 

NGATGGGGCTTGGGGGG 

SEQ ID NO: 4117 ACGCGGGTATAAAACTATGGAGAAAACTGCTAAAGGGTATCCCTGACCTTTA 
TGATGATGCAGCTATTrrCGAGGCCAAAAAATCATmACTGGGCAAGAAAAACA^ 
TGTCGTGAATATCOTGCTCAGGCTCTTTATGAATrATTTTCTGCCACAGA 
CTAAGAAAAGCCTGTTTTCTTTATrrCAAACTTGGTGGCGAATGTGTT^ 
CTTTCTGTATTGTCTCCTAACCCTCTAGTTrrAATrGGACACTTC^ 
GTATTTTrGCTTTAAGTCAGAACCrrGGAITACAAAACCTCGAGCCOT 
ATTGT 

SEQ ID NO: 4 1 1 8 ACAAAGACAAACACCTAAAACACACTGCGTCAAGTGCAGCACCGACriTGGT 
CAAATAAAAAAAATAAAAGAAAAATrAATAArrGCTAAGCrmCTA 

TGCATATmCATATAXCTGGGGGAAAATAAAGGAAGACTCCAAATAAATTGTAAAATGCAGCAA 
CATCCAAAATACTGATATTCTAAGCATCTACAGATCTCAGAATAGCACTGCCACCGACCGTACITT 
GTGATGGATCTCCTGAAGGGCCTGCCAGGCCTGCCAGGGCTCTTCATTGCCTGCCTCrrC^ 
TCrcTCAGCACTATATCCTCTGCTTTTAATTCATTGGCAACT^ 

CTTGGTTCCCTGAGTTCTCTGAAGCCCGGGCCATCATGCriTCCAGAGGCCTTGCCTTTGGCT^ 

GCTGCTTTGTCTAGGAATGGCCrATATTTCCTCCCAGATGGGACCTGTGCTO^^ 

CATCrrrGGCATGGrrGGGGGACCGCTGCTGGGACTCrrCTdCCrTGGAATGrrC^ 

ACCCTCCTGGTGCTGTTGTGGGCCTGTTGGCITGGGCTCGTCATGGCCTTNrrGGAT^ 

AGCATCNT 

SEQ ID NO: 4 1 1 9 ACTTGCAGCCCTCGGCCAAACGGCCAGACGCCGACGTCGACCAGCAGAGACT 
AGTAAGAAGTrrGATAGCTGTAGGACTGGGTGTTGCAGCTCTTGCAmGCAGGTCQCTACCAm 
CGGATCTGGAAACCTCTAGAACAAGTTATCACAGAAACTGCAAAGAAGATTTCAACTC 
TTCATCCTACTATAAAGGAGGATTTGAACAGAAAATGAGTAGGCGAGAAGCTGGTCT TATT T^ 
GTGTAAGCCCATCTGCTGGCAAGGCTAAGATTAGAACAGCTCATAGGAGAGTCATGATT^ 
CACCCAGATAAAGGTGGATCTCCrTACGTAGCAGCCAAAATAAATGAAGCAAAAGACTTGCTAGA 
AACAACCACCAAACATTGATGCTTAAGGACCACACTGAAGGAAAAAAAAAGAGGGGACrmG 
AAAAAAAAAAAAAAAAAAAAGTACTTTGTICTCAmTAAAGAAGATGAANAAGT^^ 
GATTGNCTACTGTGGGCAGGTGTTTGAAAANTCCCCCCTGCGGGNGAANAACTTCCGGGATCTGG 
CTTGCGCTT^GACTTCCCGGGAACNGGCACCCAAAANNTNCT^ 

SEQ ID NO: 4120 ACTrGATATGGAAAGAAGCTrrrTTTCTTCTCAAGTTGGATGCAGAACTGAGT 
TTAAAAGATACAAGGAAAGCAGGGTTTCATTAGTGGGAGCTGATTTAGGGTTCAGTTCTTTAT^^ 
TGCAATCTTAGGCAGGCCCCTTAACTCCTAGTCTCTATAAAATAAGGAAGGGGGTTGTCCCAGm 
TCAGTCTAATGTCCCCATTCTATGACAGACCCATCTCTTCTTTATGTAGACCAAGGAAGAGGA^ 
TTAGCAAAACTACCTGATTATGCCCATGriTCCCAAATACCTTGATTrrTCTTATA^^ 
TCTGCACATGTCTGGGTGATAATTCATTAATCAATGTAAAACGTAGGGACCCTITATTAGCTC 
AATGAAAAAAATCCCTCCACCAGCCAAGTGCTGGGACOCAOGTGGCCACCATTGTTATTGACAGO 
TAAGGAAGCACTGGAAATGCACCATCCTCTCCATCCCCACACTTTCCAGAGTAAAGCAAGGATGA 
GGACACCCAGGCATTGTGCCAAGGCTGCCATTGCAGGAGGGCACANAGAGAGGATGGGATAAGA 
AAAAAGCATGGATTAAGCCAAGANGCCNCTGAAAAC^fCAAAGGCTAAACTGCAAACT^ 
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GGGTT 

SEQ ID NO: 4121 ACCTATATGGCTCCCCCAA7TAATCACAATTCAGCAGCATCCACTCCCCATAT 
GTGGCTGAGATAACTTACTTTGACTGTCTAmGGTrATCTCTCAGACAAA^ 
AATGGACAGGTTTGTGGACTGTGTAGAGCAGCGGAAAAGriTCCACATTTTCrrGGACT^ 
CTrCrGT<>GCGGCTCCAAGGTGTCTGCTCCAGGCATGTCTACACACCACTCTC^ 
TTGTACAGACTCTCATTTTATGATGTATCCTACTGCATCAGGACATTTGTGTCAATGTCAG^^ 
AGGGGAAATGAAAGTGATGAGACGATGAGAGGAGTGAAATACCAAGGACGCGATACTAGGAAAC 
CCAGGTCTATTTGTTATCAGAGTAAGGATCAAGCCAGATAGCCTGTTATGTAATTrCTCCGATAAA 
AGATTTTGAAAGCAGGTGCTGTGGGCATCTGTATGGGGAATCGCACTCATAGAATTATm 
GGTAAATATTTGGGTATCAGGCCCAAGCAAGGGAAAGAANCrrTACTGNATTTACCNTCmOT 
GAAAANAATTGGATlTITCCTCTCTrCCCTTAAGGGGATATGAGGTAT^^ 
TAAGC 

SEQ ID NO: 4122 ACTTTTTTTTTTTITrrmTT^ 

GTTGGAGTTCAATGGCACGATCTCGGCTCACTCCANCCTGGTGACANANCAAGACTCTGTCTCAAA 

AATAAAATAAAATAAATAAATAAATAAATGTOTTAAATCCATTGCrrrAACAT^ 

TTGGTTTCTGTTCAACAAAACCAAACCACATTGGGATGGCATTTACAGAAGCTCANAA;^ 

ATGCACTGAAAAATACTTOTCATTANACCACAACAAGCTTTAAAAAAATAAAATTAAGT 

AGTTAAGTTACAAAAAGCTGAAAGTATAAAACATTGTGGAAACAGTGACCATCTATTAACTGGGA 

AAACWCAAANGAAGAGAAAOSrATTTTCAACCATTAATCATTTNTCAAACNTG 

TAAANGAATATNTTATCAAANGGTGAGGGGCTGGGCITACTGGCANTTTT 

NTCAAACCCCAAACCCTTCACAGNAAAAAACNarmAAGTTGTACCTNGNCC^^ 

TTTAGGGGCGNAANTTNCCAGCCNCACCTGGNNGGCCCGNTTANTNAATNGGGAATNCC^^ 

NNGGTACCCAAA 

SEQ ID NO: 4123 ACTTTTTTTTTITITTTTTT^^ 

AGCCCAAATAGATGTTCCCTGTGGAGGAGGACTTAAGGACACTAGGGGAGGAGAATGGGACACC 

TGGGAAGAGAATCACCACAGAGACCAATCTTCACAAAAAGGGTCCAATATTGATTTCTAGGGAGG 

AGCAGGGCATGGTCAGCTCAAATTTGGTGATAACGTCAGGATGAAGGACCCCAAGCTTCCCGACG 

CnrrGACCCCTGGCAAAGATCTCTGCACATCGCCCGGGGAAGAAAGCAGGCCCTTCTGATGCm 

ATC^CATATCCCCCCTTGTCTTCACCAGGAGGCACATCGAGCAACTGCATAATTCTGTC 

CCATGAATGATCTCAAACCCAGGArrCTTGTTGTAATAAACAGCACAAAAATGTCTGTANT^^ 

GCACCTACATCTGTATTANAATCTTTTATTACAATGTCANANATTTCAAACAGTT^ 

GGCATCTTACGAlTTGCTGCTATGGTCrrCANGAGGCCANGAAAAANGGTAGTGCNTGCCCCT^ 

AAATTAAC(nX3rrrTAGGArrACTTATGTGGACTGCCmGTTGCAAAAA^ 

TCAC 

SEQ ID NO: 4124 ACGCGGGATCTGAAGAAAACCAAGAAGAACTAAGCATTGATAACTGCATrGT 
AACTTGGCCAGATGCTCCAGCATACGCACGTTCACTGCAAAGCACCCTACTGGTTTTGAAAATCTG 
ACCTTGTCATTTCAATAGTTATTAACATGACTAAATATTATCITAATTA^ 
GCITITAGGGGTrrCTGACATATATTCTGGATACTAT(XGAGGTAAT^ 

ctcatatcaaatgaatatagaactaatattgtcgggaacacctaatagaaaggaatactattata 
gcaaatcacagaatgatagactcaagcataaaacttggcagttttatctgcttcaa/^ 
atcattattcctgtattttctctgaaactgattataaaaaccaa.tgtcx:agctctc^^ 
acacttgaagaaatggagatcgatttgatttgtttataagcagaccactgcaattracaa^ 

CmACGGTTTTATAAAArrATCITCCA>rrTTOTACATIT^ 

ATAACACACCACCCArTTTTGANGGACCTCCTAACCAGTTAGNTTCCAAAAGCArn^AT^^ 
TAATT 

SEQ ID NO: 4125 ACAAGTATTTATATCAATGAAAATTTCCATrGGTGATTTmGGCAGAATATT 
GGTCTTGACTCTGTGGAATAAATGACGACGTAAACGTAGCTGCACAGGGGTGTTCCTGTATAATGC 
TTGAATCAATTGTGTGTGAAAGCATCATGCAAATGGCTAATTAAATTGGGTGATGACTGA^ 
ATAAATCCTTCATTCCAGCTCCACGAGCAGATCCCCITCTCCAACTGTGTCTCCAGCnTGAC^^ 
CACAGATrrCACCGTGCCAGTmCCCAGCTGTCATACTATTCTGCATTTTCATGGOT 
CAAATTTCTTGACCTTCTGCTACCGCOT 

SEQ ID NO: 4 1 26 ACTGGTGTCCCTGGGTGCCAGTGAGAAACTATCCTTGCTCTCTCTGGGGAATC 
AGTCACTGCCACACAGCAGTCCTAGGCCTGCCTCTGCCAAACACTGCAGGAAACTCATTCACCTC^ 
TGAGGCCAGCCCATAGCATGTGATTCCAGATTCCTGCGGTCCAGCCTCCAACTTTGGTTGCCAGCT 
CTTTCTTATTCTACTACACAAGCCGCCAACTCAACTGAGAGCTAAAGAGACTAGA^ 
GCTGCCAACTCAACTGAGAACAAGAAACTAGAAGAGATTTATATATAAAGOT 
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GATGCL^GGATGTTTTCAACCAGTAAATTTTATrGCTGTTGGTGCCAGAGAAGAGTCC^^ 

ACATCCAGGGGCCrmCTCCAATAATGTGCCmAACTCTAGGGACCTGCCTCACGGA 

GAAAAACCTCAACCTGAAAGATCTCirCCTTTCTGGACTCCTTT^ 

TTAAACGTGCTGGCCCCANGACAGTGATGAANACAGAACCTGTCTCAACCTCTAGGCTGGGGGGG 

GATCAATGCCATTCAGTCCCTGTTATTGAGGGGATTATCCCCTTAACCAAACATTCCTATCn^ 

GTG 

SEQ ID NO: 4127 ACCCATGATCACAGGCCTTTGGAGCACTTTrACTCTCTGAGAAGAACTGGAG 
CTAGAGATGTAAAATGGACAGTCrTGATGGGGTTGAGAACCTTCTGGGGAGCCAGATOACCCTCT 
CnrrGCACAATAGATAAAAGTCTTTATATGAATATATATAAATTTATT^ 

GATTTCTGGAGAATGAGAATTGTCCAAATGCTCAGTCTACCTGAGATAGTAAATTCATGGCrrATG 

CTTCTGGTCCTTAGAAAAAAAAAANAAAATAAAAAAAAAANANGTNCAGT^ 

TTAAAAGACCATGrrCCTAAGCTTCCAAGAAGGriTNGGATACTANAAGTATrAATOT 

TCTCCCAGTAAAACCATAGGCCTGANGTTCACATTGGGTCTTTAAATCrrmANATATATAOT 

NCATTTCANAAAAATTCTCATAGNGGNATTGGariTATATTTAACTm^^ 

NAAACAAAGCCANCCT 

SEQ ID NO: 4128 acagtgtgatgctcgagcaattggctctgcttcagagggtgcccagagctcc 
ttgcaagaagtrraccacaagtctatgactttgaaagaagccgtcaagtcttcactcatcatcct^ 
aaacaagtaatggaggagaagctgaatgcaacaaacattgagctagccacagtgcagcctggcc 
agantttccacntgttcacaaaggaaaancotgaaaaggtt>rmaaggac^^ 
tccttaaaaotctotgggacaamcanttctaataatggccntaaatm 
ccttggaaaatctccattgngtgtgcctitmaaatgatgtctggcaaan 
agtggtgtgtanacatgcctggagcanacaccttggagccctgacanaangtgaagcagtccaag 
aaaatgtggaaacttttccgctoctctacacagtccacaaacctgtccatm 
ctttgtctgagagataaccaaattngaccggtcaaagtnaagttttot 
gaagtgggatgccttgcttnaaattggngaattaamggggggaacccattatnnggt^^ 
gccccggggcggg 

seq id no: 4129 actatcatgttgttgaaactgggtccatgggagcaagattagtggctgctaa 
acttgaaccgaaaagcttcaaacatacccantotagataaacnaaactgctcagggc^^ 
gacataagtaacaaggcttctgggganataaaaattgcctatacttactctgttanctt 
anatgataanatcagatgggcgtctagatgggactatattctggagtctatgcctcatacccacat 
tcagtggtttagcattatgaattccctggtcattgttctcttctratctggaatgg^^ 
atgttacggacactgcacaaagatattgctagatataatcagatggactctacggaagatgccca 
ggaagaatttggctggaaacttgttcatggtgatatattccgtcctccgagaaaagggatgctgct 

• atcaatctttctaggatccgggacacanarrttaattatgacctttgggactct^^ 
tggggatttttgtcacctgccaaccgaggaaccctaatnac^^^^^^gctgg 

TNGGGCCCCCrrGNAGGCTTrrGTn^GCCCAAATTTTTATA^ 
NGAAAAAAC 

SEQ ID NO: 4130 ACCGTGTCCCGTTCTTAGTGCTCGAATGTCCCAACCTGAAGCTGAAGAAGCC 
GCCCTGGTTGCACATGCCGTCGGCCATGACTGTGTATGCrCTGGTGGTGGTGTCITACrrC 
ACCGGAGGAATAATITATGACGTTATTGTTGAACCTCCAAGTGTCGGTTCTATGACTGATGAACAT 
GGGCATCAGAGGCCAGTAGCTTTCrrGGCCTACAGAGTAAATGGACAATATATTATGGAAGGACr 
TGCATCCAGCTTCCTATTTACAATGGGAGGTTTAGGTTTCATAATCCTGGACCGATCG^^ 
AAATATCCCAAAACTCAATAGATTCCTTOTCTGTTCATTGGATTCGTCTGTGTCCTA 
TrrrCATGGCTAGAGTATTNATGAGAATGAAACTGCCTATrGAAACGGGAGTCTCGCTCTG^^ 
CAAGGCTGGATGCAGTGGCGTGATCTTGGCTACTACAAACTCTGCCTCCCAAGGGCTATCTGATGG 
GTANAATTGCCrmGANAAAAAAATAAGNGGATACTGGANTTTGCCCCTGTCAA 
AAGGGTTGNACT 

SEQ ED NO: 4131 ACCAGAGCTTGAAGAACAGGATTCCACCCAGGCAACCACACAACAAGCX^CA 
GCTGGCGGCAGCAGCTGAAATTGATGAAGAACCAGTCAGTAAAGCAAAACAGAGTCGGAGTGAA 
AAGAAGGCACGGAAGGCTATGTCCAAACTGGGTCTTCGGCAGGTTACAGGAGTTACTAGAGTCAC 
TATCCGOAAATCTAAGAATATCCTCmGTCATCACAAAACCAGATGTCTACAAGAGCC^ 
AGATACTTACATAGTTITrGGGGAAGCCAAGATCGAAGATTTmCCCANCAA^ 
CTGCTGANAAATTCAAAGTTCAAGGTGAAGCTGTNTCAACCNTTCAAAAAACCACCCLVGA 
ANTGTACOTGCCCGGGGCGGGCNGTTTAAAANGGNCAATTTCCNACNCAC^ 

SEQ ED NO: 4132 ATGNCATTGCTTCGNAGCCGGGCCCGCCCAGTNGTGGNATGGGNATTATTCT 
NGCAGNAAATTTCGGCCCCCTITNAGCCGGGTOGGGGTNCCGCGGGCCCCGGAGGGGTTACCm 
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NAAACCAAGNAATTTATTTANCCANCCAAGGAAAACCANTTTNCCCCACCCCCCAAACCA^ 
CCAACCAAGOAAAA>nrTANTTANCCANTTTTCCTTANTTCCCCAAACC^ 

AAAACCriTlTCriCCCCAAAGGAATAGGGACCCANTATTGGAATAGGGGCCCACCAAAAACCGG 

AGCCCTCAATTAAAATTTAAAGAAAATTTGGAATTAATGTCAAGGCACTCT 

GAATAAACTGGAAATTACTCCAAAAGGACCTTGAACCTCAATCATGGAAATTNATAACCTGTCCT 

GATGTCArTGGGTCAAAAATGAAATCAAGATAGAAATCAAAAAATTCrrCTAACTGAATGC^ 

GTAACACAACCTATCAAAACCTCTGTGATCAGCAAAGGCAGTGCTAAGAGGAAAGGTTCATAGCC 

CTAAACACCTACATTAATAAGGTCTGAAAGAGCACAAGACAATATAAGGGTCACACCTCAAGGGG 

AAGTAAAAAAACAAGAACAAATCAAACCCAAACCCAGCAGAAGAAAGGGAAATAANGAAAGGA 

TATTGAGCAGGAATTAAA 

SEQ ID NO: 4133 ACTTrrTnTTTiTmTr^^ 

ATTCAGTTCCCAAAGCTGGGClTCTCTTCCTCTCCCTTTCTNATGTCTCrrCCTC^ 
TCCCTNACTNTTCCT^r^T(XTCCT^^^GCTm 

CTTGTTCTCCTTTTTCCACTTNATGCGCCGGTTCTNAAACCAAAT>^ 

CAAANCGTGTGCGCNATCTCTATGCGCCGCCG(XGC^TOAGGTANCGATTGTAGTGAAAATTC^^ 

CTCCAGCTCCAGGGTCTNGGTANCGGGTGTAAGGT^^r^GGCGGCCCTC^^r^ 

GGAGNTTAA 

SEQ ID NO : 4 1 34 ACGCGGGGAAACGACAGGGG AAAGGAGGTCTCACTGAGCATCGTCCXIAGC A 
TCCGGACACCACAGCGGCCCTTCGCn€CACGCAGAAAACCACACTTCTCAAACCTTCACT^ 
TTCCTTCCCX^AAAGCCAGAAGATGCACAAGGAGGAACATGAGGTGGCTGTGCTGGGGGCACCCCC 
CAGCACCATCCTTCCAAGGTCCACCGTGATCAACATCCACAGCGAGACCTCCGTGCCCGACCATGT 
CGTCTGGTCCCTGTTCAACACCCTCTTCTTGAACTGGTGCTGTCTGGGCTTCATAGCATTCGCCTAC 
TCCGTGAAGTCTAGGGACAGGAAGATGGTTGGCGACGTGACCGGGGCCCAAGCCTATGCCTCCAC 
CGCCAAGTGCCTGAACATNrrGGGCCCTGATTCTGGACATCCTCATGACCATTGGATTCATCCTGrr 
ACTGGGTATTCGGCTCrrGTGACAGTCTACCATNTrATGTTACAGATAATACAGGAAA^ 
TACTAGTGGC(>IGCCATANCTGCAACCTTTGCACTTCACTGTGCAATGCTGGCCCTGCACG 
GACTGTTGCCCTTGCCCCNTTGGTCCTGCCCTTAATTACAA 

SEQ ID NO: 4135 ACTATrrCATGGTCCAAACCTGrrGCCATAGTTGGTAAGGCTTTCCTTTAAGT 
GTGAAATATTTAGATGAAATTTTCTCTTITAAAGTTCTrrATAG^ 
ATATTAATAAATCTGTAGTGTTTTGTGTTTATATGTTCAGAACCAGAGTAGACTGGATO 
GGACTGGGTCTAATTTATCATGACTGATAGATCTGGTTAAGTTGTGTAGTAAAGCATTAGGAGGGT 
CATTCrrrGTCACAAAAGTGCCACTAAAACAGCCrrCAGGAGAATAAATGACTTGCTm 
CAGGTITATCTGGGCTCTATCATATAGACAGGCTTCTGATAGriTGCAACTGTAAGCAGAAACC^^ 
CATATAGrrAAAATCCTGGCmCTTGGTAAACAGATTTAAAATGTCTGATATAAAACATGCCACG 
AGAArrCGGGGATTTGAGrrTCTCTGAATAGCATATATATGATGCATCGGATAGGGTCATTATGAT 
TTmACCATTrrCGACTTACATTAATGGAAACCCAATTTCATrmAAATA^^ 
TTGGNAAGGTTGGNGGGAAAAAAGCCTAAATTGGNAGTTTTTNCATrrAATO 
AATA 

SEQ ID NO: 41 36 ACTAGCCGGACITGGATTTTCTGGAAAGATTrCAGTTGAGGAACGGGAACAA 
AGATTATGATAGCTTTCCGACCACCACCAACITCAATTTCCTTAGCTGCCGTAATATT^ 
GAGCTCAGCCTTGAGGTCCGAGTTCATCTCCAGCTCCAGAAGAGCCTGGGAGATGCCGGACTCGA 
ACTCGTCOGGCrrCTCGCCATTGGGCTTCACGATCTTGGCGCrCGAAATGAACATGGCm 
GGAGAACTTGCCGAGCGCCGGCTTAGGAAGAGACCCAAATCTCGCGAGAGCCCCCGCGTACTCTG 
CTATGGTGCTGGCrrCCmAAACTCAKGATAGATCCAGGTGGGGCTCCG^TTCCTAAAACTC 
CTCNAGCTCGNATCAGACCA>rrTCCTANCTTCCTGAAGTAACCATAAGA^ 
A 

SEQ ID NO: 4137 ACGCGGGGAATGAAGGTGATAGAAAACCGGGCCATGAAGGATGAGGAGAAG 
ATGGAGATTCAGGAGATGCAGCTCAAAGAGGCCAAGCACATCGCGGAAGAGGCTGACCGCAAAT 
ACGAGGAGGTAGCTCGTAAGCTGGTCATCCTGGAGGGTGAGCTGGAGAGGGCAGAGGAGCGTGC 
GGAGGTGTCTGAACTAAAATGTGGTGACCTGGAAGAAGAACTCAAGAATGrrACTAACAATCTGA 
AATCTCTGGAGGCTGCATCTGAAAAGTATrCTGAAAAGGAGGACAAATATGAAGAAGAAATTAAC 
rrCTGTCTGACAAACTGAAAGAGCTGANACCCGTGCTGAATTTGCAGAGAGAACGGTTGCAAAAC 
TTGOAAAAGACAATTGATGACCTGGAAGAGAAACnTGCCCAGGCCAAAGAANAAAACGTGa 
ACATCANACCTTGGATCAGACCTAACAACITACTGTmTrAACCAAAACAGAA^ 
ANAACTCTGGACTTCTTGGGTCTT 
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SEQ ID NO: 4 1 3 8 ACACAATGGTTTATTAAAGGAATGTGTGGCCCACATCAACCTANCAAGGATT 
CTACTGGTAAA.COTCCCATGGCCAAAGGAAAAACAAGCAGGAGTTGAGTGGCTGGGGTGGGGTG 
CANGCAATGGAGANAGGGCAGAAGGGTGTTNTAAGCTGAAGGGGGCTAGAAGCTTACTCCTGAG 
TNTCTTCCCTTCTGTmrCAAATm'TTACTTCTTTATGC^ 

SEQ ID NO: 4 1 39 ACTGTATGTGTTTCTTGAAAAAGTOTGTATGCATCTGCGAATTrCI^^ 
GTTTAGAAATATGATCATAGTATrGTGTGTAGTCTCTTTGAGAAATACCAATAAGCTTGT^ 
GACAAAACATCGAAACn^AGCCCCAGGAATCAATTCACACCATTITCGGANAACGAGCT^ 
CTATACATGGATCTGGAGAATCATCAGT 

SEQ ID NO: 4140 AaSCGGGGAGACATTCACAGCCAAAAGCCTGGGACTCTTrGTGAAGGTCCTC 
CTCACCTCTATCTTTCirrCTCTCTCTCTCAAACTTrCCTTAAAGTrCT 

CTGTGAACAGTCTTTGTCTCCTCCCCACCTTTGGTGGGAAGTGCGGGGCAGTCCTGGTCAAGACAC 

TCATGCCCTGGCAATOTGGCTGCCAGAGAATGTTOTTGCTAACCCACCAGTTTCrrGTTGAT^ 

AGAGGTCAAGGCCAGGCCCCCACTTGGCTTGAAGGGACArrTTCAGACrmCTTTCT 

GAGTGTCTATGCCTCrCATATTTCCCTAATAAACTCCTCAACTTTGGAAAAAAA^^ 

AAAAGTACATAGGAATCTAGCAAATTCAGGAACCAAGGGGAAATGTTGTGAGATAACATTTACAT 

TGTCAACCTTTATTGACTITGTTTTTACAATAAAAAATATTn'AC^ 

SEQ ID NO: 4141 ACTTTTTTTTT l l Ulll^lU ' l ^ lUll - lTr ACACAAAAACACm 
ACAATTTTCCAAAATATArnrrGTAANAAAATGCAATAATrATrAACT^ 
GTrrCTCAGTAAATTCCAGNGTACnrriTrTTn^^ 
TTTmTGAAACCCCACCAACTGCAAAATOTGTTCCTGGCATTAANCTCC^ 
GTCTTTOTTCAGNGGNCCCATNAATGCTTOnTOTCCTCCATG 

cttogagggggngthaatoaacttaaggncaatnttctccaaancccgccgt™ 

ACAAGGACrrGNGGAGGGNGANCNCCCGNrrTTTGGrrCCCAa:ACANAGCCnT^ 
AAGTam-GGTCACTTCLVCCATAGNGGACAAAGCCCCCAGAGGGTrGATGCTCTTGC 
ATAOTCAGGGGAGGCATTGTrCTrGATCANCTTGCCCGCCrrGATAAGGTACCCCTGGCC^ 
ATAAAT 

SEQ ID NO: 4142 ACTrGGCCAAGCGCTCAGATCGGCAAGGGGCACCAGTCTTGATCTGCCCAGT 
GCACAGCCCCACAACCAGGTCAGCGATGAAGGTATCTTCAGTCTCCCCCGAACGATGAGACACCA 
TGACGCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGAAGAGACTCGGTCACGGAGCCAATC 
TGGTTGACTTTGAGCAGGAGGCAGTTGCAGGACTTCTCGTTCACGGCCTTGGCGATCCT^ 
TTGGTCACTGTGAGATCATCCCCCACTACCTGGATTCCTGCACTXjGCTGTGAACri^ 
CCCCAGTCATCCTGGTCAAAGGGATCTTCGATAGACACCACTQGGTAGTCCrrGATGAAGGACrrG 
TACAAATTCTTAGATGAAGACTTTGTGTTCGATATATACAGAGACAGTAGGTGGAAGGTGTGGNT 
CATTGACTTTAATCATTTNGGTGAAGTCACNANATNCACTGNTGNTCACCTG^ 

SEQ ID NO: 4143 ACQACGTGATCGTGCTGGGCACCGGCCTGACGGAATGTATCCTGTCAGGTAT 
AATGTCAGTGAATGGCAAGAAAGTTCTTCATATGGATCGAAACCCTTACTACGGAGGAGAGAGTG 
CATCTATAACACCATTGGAAGATTTATACAAAAGATTTAAAATACCAGGATCACCACCCGAGTCA 
ATGGGGAGAGGAAGAGACTGGAATGTTGACTTGATTCCCAAGTTCCTTATGGCTAATGGTCAGCT 
GGTTAAGATGCTGCTTTATACAGAGGTAACTCGCTATCTGGATTTTAAAGTGACTG^ 
TGTCTATAAGGGTGGAAAAATCTACAAGGTTCCTTCCACTGAAGCAGAAGCCCTG^^ 
AATGGGATTGTrTGAAAAACGTCGCrTCAGGAAATTCCTAGTTTATGTrGCCAAOT 
AGATCCAAGAACrmGAAGGCATTGATCCTAAGAAGACCACAATGCGAGATGTGTATAAGAAAT 
TTGATTTGGGTCAAGACGrrATAGATTTTACTGGTCATGCTCTTGCACTTTACAGAACT^ 
CTTAAATCAACCGTGTTATGAAACCNTTAATAGAArrAAACTTrACAGTGAATCTTGGCCA^ 
TGGCAAAA 

SEQ ID NO: 4144 ACrTATTTCAACAATTCITAGAGATGCTAGCrAGTGTTGAAGCTAAAAATAGC 
TTTATTTATGCTGAATrGTGATnrmATGCCAAAATTT^^ 

GGAAATAAATAATTATGCCATGGCATTTGACAGTTCATTATTCCTATAAGAATTAAATTGAGm 

GAGAGAATGGTGGTGTTGAGCTGATTATTAACAGTTACTGAAATCAAATATTTATTTGm 

TTCCATTTGTATTTTAGGTTrCCTrrrACATTCTTm 

GACTATGGAAATAATTTAAAGATTTAAGCTCTGGTGGATGATTATCTGCTAAGTA^ 

GTAATATrrTGATAATACTGTAATATACCTGTCACACAAATGCTTTTCTA^^ 

ATTGCAGTTGCTGCTTTGT 

SEQ ID NO: 4145 ACTGCTACTTCTATAAACGGACAGCCGTAAGACTAGGCGATCCTCACTTCTAC 
CAGGACTCTTTGTGGCTGCGCAAGGAGTTCATGCAAGTTCGAAGGTGACCTCTTGTCACACTGATC 
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GATACirrrccrrCCTGATAGAAGCCACATrrGCrGCm 

GCAAACAGCTGGACTITCCAAGGAAGGTTCAGACTAGCTATGrrCAGCArrCAAGAAGG^^ 

CTCCCTCrrGCACAATTAGAGTGTCCCCATCGGTCTCCAGTGCOGCATCCOT^ 

TCTGTTCCACCCCCTTTCCTTCCTTTCCTCrCTGT 

SEO ID NO- 4146 ACTGTTTCmCATGCGAATCTACAATTATCTCAAAATAAAAAGTCTAATTTA 
AAAATATGTGTTCTGGCCAGGCAOTGTGGCTCACATCTGTAA^ 

ATGGGCGGATCACCTGAGGTCAGGAGATCGAGACCAGCCTGGCCAACAGGGTGAAACCCTGTCTC 

TACTAAAATATACAAAAAriTGCCAGGCGTAGTGGCGGGTGCCTGTAATTCCAGCTAC^^ 

GCTGAGGCAGGAGAATTGCTTGAATCrGCAAGGTGGAGGrrGCAGTGAG^ 

GCACTGCAGCTTGGGTGGCCAAGTGAGACTCTGTTTNAAAAAAAAAAAAAAAAAAA^ 

ATGTCCCAGGCTTGGAGTATTTAGAAAGATAAAATTNATACOTGCflLCA^^^ 

TrCACNACAGTTGCAGGGTTC\NCCCACCACAGNGAGTACCA(nTTT^ 

GGTTAATNAGNCTATCNGGCCTACAGGTNTGACTTAAA 

SEO ID NO- 4147 Acrn" n i i "i 1 1 u u u 1 1 1 i TNCTmr rn - i 1 1 1 1 1 itaggtccaatggtagt 

TTITATTCCCCAGGAACAAATATAATGTGGANAANAAACTCCAAAAGG^ 

cagaaatgacaccacggnagcctggctcaggagtcaggtaaatagatggctgctccacagtggac 

CTGGTCATGGCCCTGGTrrCTTGGATCCACATn"GCCCCGTNTCTTAAGTGTCCATCT^ 
AGGATGGAAACAGTGAGGTGGAAAANGTNCrTGCrCTACACAAGAGACAAATGGNTC^^ 
CCCANAGGACCANGGACACCAAAACCTNCCTCTTCANAATGGAGGTATGACAAGATAAAAATGA 
GAGGGGNTTGATTGNCANANACAATOTGAAAGTAATAATAGGGGCTCACTGAGGCn^ 

TGGTCCTGCCNGGGC 

SEO ID NO- 4148 ACCTATITGACTTACCATGGAGrrAACATCATGAATTTATTGCACATTGTO 
AAAGG/^CCAGGAGGTTTTTITGTCAACATTGTGATGTATATO 
TGGAAAAACrrGTGCTATAAAGCTAGATGCTTTCCTAAATCAGA^^ 

TCAGTATAGGTAGGGAGATAmAAGTATAAAATACAACAAAGGAAGTCTAAATATTCAGAATCT 

rrGTTAAGGTCCTGAAAGTAACrCATAATCTATAAACAATGAAATATTGCTGTATAGCT^^ 

ACCrrCATITCATGTATAGTTTTCCCTArrGAATCAGTrrCCAATTATTO 

TTGAACCTATGAAGCAATGGATATTTGTACATnGGGCACAGTATGATATTTGACACGAAAAi^^^ 
rrGTATCAGAANTGAGCATCANAAANATAGCA(>rTCATCTTACACATAATAACTANAAAATA^^ 
TTrCTOTCTGATCATTTAAGTGrrACAATATCCCGAGATGTTATACATTGGAACAAATTATC^ 

TGCCNAGGCCGGNCGTTAAGGGCN 

SEO ID NO' 4149 ACTGCCATTCCTTAAATTCATTTAGATTACAGTGTGTAATCATAACTm 
CATCAGCTCCOTTGTCAAACACTGGTCATACTGCATGAGrrGATTTGCTTCA^^ 
CTGArrCCCrCCCATCCTGTGGCAGGGTCCTAGTTCAACAAAGCCTCCATTTGTTm 
TCAATGCAGTAAGCAGrrTCGAAGCCTCTGATrrCTCCCCAGTCAACATTm 
TAGTGTGAGGTGATATCATAAGCTAmcrrCCATGAACCACTrAAAACm 
CTCGAAATITmCAGCTCCGATATATCCCCATATGGTAATGCCTGCGATTCAGGACGACTAGCAT 
AGAAGTAGTCrn'ATATTCATCCCACCAAACCTCCACAACrCTAACATAATTCTT^^ 
AAGACCCAACATAAATGGGCGGAGGATTTCCTTGCCAGCCCTCAAGACGGTAGATATGTCCAACA 
CGAGAACAAGGAACAAAATAATAATTTGCCCCACACTGCCATATCTTGTATGAGi^^ 
rirCACCCCCCAAATCTGGAGACCTGGATCATANAGACCAATTCAAAGAANACTCTCGTTC^^ 

GCAA 

SEO ID NO- 4150 acttcttcagatcgccaggattcaagaagccatagttc^agaagaagttctc 

CAGAGTCAGATCGACAGGTCCATTCAAGATCTGGGTCATTTGATAGCAGAGACAGG 

CGAGATCGATATGAACACGACAGAGAGGGAACGTGAACGGGATCGGGAAAGAGAAi^^ 

AGAACTAGAAAGAGAGCGTGCTAGGGAACGGGAGAGAGAAAGAGAAAAAGAGAGAGATC^ 

AAGGGATAGAGACCGAGACCACGATCX3AGAGCGGGAAAGAGAGAGGGAAC0AGACA^^^ 

AGAACGGGAACGAGAAAGAGAAGAGAGAGAGAGGGAGAGAGAGCGAGAACGGGA^^^ 

GAGAGCGAGAACGGGAACGAGAAAGAGCGAGAGAAAGGGATAAAGAACGAG/^C(£C^ 

ArrGGGAAGACAAAGACAAAGGACGAGATGACCGCAGAGAAAAGCGAGAAGAGATCCCGA^^ 

GATAGGAATCCAAGAGATGGCATGATGAAAGAAAATCAAAGAAGCGCTATAGAAA^ 

TCCCACCCTAACAGTCCCCGAAGCGCCGCGTGAACATTCTCCGGACAGTGATGCCTTACAACAGT 

GGAGATGATAAAAATTGAAAAACACAGAC 

SEO ID NO- 4151 ACCAGTAAATCAAAAAAAGAGGGAGTATGTCCATITAACTmATTCAA^ 
TATGTAGGAGGTCrrCAGAAATCAAATGTGAGCATGAAGATATGGCCAAACATAAAAT^ 
ATTCAGTGAAAAGTrGCTGACTTTAAATAGTAGrrGTAAGAATGACC^^ 
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CAACCCCTCACTCTATAGTAAAAAAAAAAAAAAAAAAANTAAAATAAACATTCAGTAAACA^^ 

TTCCrAAGGTAGrrrCCCTTATATAGCAGTGATITATCCATTrCriTGTA^ 

CATACCAATCAGTGGTTCTCACAATTGAAAAATAAAAGCACAAAA^ 

SEO ID NO: 41 52 ACCAGCGCCGGAAGTTGGTCTCGACACCTGGACTAGCCGGGTTGTArrTGGA 
AACGCGGAGTGAGTTrrrCCGTGCTGTGTAGGGGCTAACAATGGACACCCAGAAG 
CTCCAAAGCAGCAACCAATGATATATATCTGTGGAGAGTGTCACACAGAAAATGAAATAAAATCT 
AGGGATCCAATCAGATGCAGAGAATGTGGATACAGAATAATGT 

SEO ID NO- 4 1 53 ACCTTTTCATCTTCTCTGTGGCCAACATGAGGAACAGCAAGCTGAAGGACAT 
CCGGAACGCCTGGAAGCACAGCCGGATGTTCTTTGGCAAAAACAAGGTGATGATGGTGGCCTTGG 

gtcggagcccatctgatgaatacaaagacaacctgcaccaggtcagcaaaaggttgaggggtga 

GGTGGGTCTCCTGTTCACCAACCGCACAAAGGAGGAGGTGAATGAGTGGTTCACGAAATACACAG 
AAATGGACTACGCCCGAGCTGGTAACAAAGCAGCrrrCACTGTGAGCCTGGATCCAGGGCCCCTG 
GAGCAGTTCCCCCACTCCATGGAGCCACAGCTCANGCAGCTGGGCCTGCCCACCGNCCTAA 

SEO ID NO: 4154 ACrmTTTTTITITITI^^ 
TTCTCTTCTTrmCTTTITT^ 

TGCTGCTGCTCCAGCGTATGGGATGCTGGGAGGAGGGCCANATGTCACTGTGACCTCTCCCACTGG 

CACGGCANAAAGTCCTAAACTTCIXnTGGACTTGGAGTGTCGCTTCTCm 

rrcrrCTITITGCTCTTCTTTGACTrCTTTTTCrrCT^ 

TCCCCAGCCTCTAGTTCCCCACCAGAGGATGAGTCTGATTCTACCAGAATAGGTTCAAGCCCTG/^ 

AGATCAAGGTTAGCACTGTGGGACTCTGCGAACTGGGAGGCGTCAGACCCACAGCCTTCAGGGCC 

AGGAGATGAATGTGCGTCTGAGGATGACTTGTGCTTTTTCCGGGCTGTmCAAA^ 

CTCATGTCCTANGAGTAAACACCCTGCTCATCCCGAGCrGArrrCTTTGAGGATI^^ 

CTTGTTGGGAGGGATmTGAAAAAGAGTNCTNATCAAAAAACCTTGm 

SEO ID NO: 4155 ACAAATGTITmATTCAAAAATACAAAATAAATTATCTGTAGGCATGGACA 
ATGACAGCAGTAAACCATTATATArmGTC^CTGAAACCAGTAACTGATGGTTATAGTGAT^ 
CAGCCAGCCTITITCTTCATTTTCTCCAACTGACriTCTCT^^ 

GGGCITCCTGTCACAGTTCATTAATAAAGGTAAAGCACTAGTCTAGGAGTTAGAACATGCCACCTC 

CCATACCACCTCCCATTCCACCCATTGCACCCATTCCAGGGTCCTTCTCTTCTTrAGG^^ 

GACTACAACITCTGCTGTAGTTAACAGAGAGGCCACACCAGCAGCATCCAATAAAGCAGTTCTCA 

CAACCTTTGTTGGGTCAATGATrCCnTnCCACCATAT^ 

ACCAACTTCTGAGGAACTTTGCATAATTTTCTCAACTATCA^ 

SEO ID NO* 41 56 ACGCGGGGAGGAGGGGCTGCTGAGATATCCTGTGCCCTGGCAGTTAGCCAAG 
AGGCGGATAAGTGCCCCACCrTAGAACAGTATGCCATGAGAGCGnTGCCGACGCACTGGAGGTC 

atccccatggccctctctgaaaacagtggcatgaatcccatccagactatgaccgaagtccgagc 

CAGACAGGTGAAGGAGATGAACCCTGCTCTTGGCATCGACTGTTTGCACAAGGGGACAAATGATA 

TGAAGCAACAGCATGTCATAGAAACCTTGATTGGCAAAAAGCAACAGATATCTCrrcCAACAC^ 

ATGGTTAGAATGATTTTGAAGATTGATGACATTCGTAAGCCTGGAGAATCTGAAGAATGAAGACA 

TTGAGAAAACTATGTAGCAAGATCCACTTCTGTGATrAAGTAAATGGATGTCTCGTGATGCGTCTA 

C^GTrATTTATTGTrACATCCTrrTCCAGACACTGTANATGCTATAATA^ 

CCATAGnTCACTTGTTCAAAGCTGrrAATCGNGGGGGT 

SEO ID NO: 4157 ACTrTTTrTTTri' rri - l 1 1 1 ill 1 1 1 1 1 GC^TCAAAAAGCnTArrTCCATTTGGT 
CCAAGGCTTCTrAGGATAGTTAAGAAAGCTGCCTATTGGCTGGAGGGAGAGGCrrAG^^ 
CCTAmCTTTGCAAGGGGCCCTTCAAAAGTCGCTGGGCTCANAAGGCTCTTAGTCGTGOT 
GTGAGCCTTTCGAANAGATACTCGCCCAGCCCAGCCTCCGGGCCACCCAGCCTGTGGAGGTTGGT 
CAGGTGGTCACCCATCTTCrnGATAAGCTTCACTTCCTCATCTAGGAAGTGAGTCTCCAGGA^^ 
ACAGAGATGGGGGTCCGTGCGGGCAGAACCCAGGGCATGAAGATCCAAAAGGGCXn'GGTTCAGC 
TTITTCrCCAGGGCCATGGCAGCTTTCATGGCGTCTGGGGTTTTACCCCACTC^^ 
TCTTGATGTCCTGNAAGAGAGCGCGGCCGCCACGCTGGTTTTGCATCrrCAGGAGACGCTCGTA^^ 
CCTCCGCTTm'CTCGGCCAATTCGCGGAAGAAGTGGCTCACGCCTTCATAGCCANAT^^ 
CGAAATAAAAGCCCAAAAAAGAGGTAGGTGTAGGAAGCCTGCAAGTACTnGTNAAC^^ 

ANAATOGGCANG 

SEO ID NO* 4158 ACGCGGGGCCCAGTAACAGGCATCGAACGGTGCAGACTGAAGACGCCCTCC 
GTCAGCGACGCCGTCGCAATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCm 

cggtgctggaacgaacatcccggtgctaggggtgcaggaggaggacggcagcaaccgcagcagc 

AGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAG 



647 



wo 02/29086 



PCT/USOl/30732 



CCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAG 

TTCTTTTGATTCTGGAGCITCAACTAACAGGAAGGAAGGCTTTGG 

TTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAy^ 

AGGTTTGGGAATCATCAGGGCAGTGGATGTTmCTGTTTATTCACCAGTGAAA^ 

ATH'CAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTC^ 

AATAACTTACAGAGTATCTAAATTCTGTCCAACCNTTAATAAATCAATGGGAGGAACAC^ 

TGAACTGAAAAAGT 

SEQ ID NO: 41 59 ACCCTGGCCACTCCTTTCXnTITGGCrGGCCAATGTCTCCTCTGTAGGCTCC^ 
GAAGGCTCTCAGGGATGCAGGCGGCCTCCTGCAGGGTTGAGTTGCAATGGGAACAAAGACAGCTG 
TGGTCCCATAGCACCCTCATCTGGTGACATCCTGCTACTGACAGTCAAAAGAAGCCTTCCCAGATG 
AAATTn-AGTCCTCTGCGCAGCCATGCTCTTCTTCCAGCAAAAGAGCCATGTGCAGTCGGGTCTGC 
TCCCCATGGGGGCTTTGATGTGGGCCCAGCAGTGGATCAGCCTTCCAGACACGCTCAACTCTGC^^ 
ACTCrrCCTGCCGCCTCAGGCTTTCCAGGACCCTCCCGAGCCITATCAGAGTCC^^ 
CTACTGATACCTTGCTGGGTGACCTTGGACAGATTCACTTACCTGGACTCAGTTTCA^ 
AATGATAGGGTTGGGCTACGTGATrrTCACGTTGTGCTTCAGATTGTTAGAAGTTAGGGGCTGAAA 
GGCATTACCTGTCCCTCTCTCAATTTCACTCATGCTACrTTGTTTT^ 
GCTTAAAAAAAAAGGGTTCTGGGACAAAACCTGAAATTACTGGGCTCCAGGAGCTTTOT 
ACTAAC 

SEQ ED NO: 4 1 60 ACCAGTGGAGGAAGGCCTTCCGGCGGAACATGGCAGTGAACTGCTCCGAGAT 
GCGCTTGAAGAGCTCCTGGATGGCTGTOCTATTGCCAATGAAGGTGACTGCCATCTTGAGGCC^^ 
AGGTGGGATGTCACAGACGGCTGTCITGACATTGrrGGGGATCCArrCCACAA^ 
CTTGTTCTGCACGTTAAGCATCTGCTCATCGACCTCCTTCATGGACATCCGACCACGGAAGACAGC 
AGCCACGGTGAGGTATCGGCCGTGGCGGGGGTCACAGGCAGCCATCATGTTCTTGGCATCGAAGA 
CCTGCTGGGTGAGTTCCGGCACTGTGAGAGCTCGATACTGCTGGCTTCCACGGCTGGTGAGAGGG 
GCAAAGCCAGGCATAAAGAAATGGAGACGTGGGAAGGGGACCATGITGACTGCCAACITGCGGA 
GGTCAGCATTGANCTGGCCAGGGAAACGGAGGCAGGTGGTGACACCACTCATGGTGGCTGAGAC 
AAGGTGGTTCANATCCCCXJTAAGGrrGGGTGTGGTCAGCnTCAGAATGCNGGAAGCANATATC^^ 
ANAGGGCCTNTTGTCAATGCAATAGGTCTCATCANTATTCTCTACCAACTGATGGACGGA 
TGGCATTGTANGGG 

SEQ ID NO: 4161 ACTITITTrrTTTTTTm 

AATCAGGCAGCCACTGAAGATAAAATACGGTCCCTGGGAGTAATCACAATGCTGTTTTCT^ 

AAGTGGACATAAGAGTAG lU - ri - iU ' illTl TATTAGCGCAAGTGGTCAAAAGTTGTCAAAATTGTCC 

TCATTCCTCGATTGTCTCTTTTTTACCAGTCrCITGCCOT 

TCAGCCCATGTGATGTTGCCATTGGCTAGGTCTTGGACTATGCTGGGCAGCTCAGAGATCTCTGCT 

CTTATCTGCCGCATTGAGTCACGGTCCCTCANAGTTGCAGTGTGGGGGGTCTTGTTCACT^ 

AAGTCAATGGTGACACCAAAAGCCACG(XAATCTCATCAGTCCTGGCATANCGCCTTCCGATTGA 

CCCAAAGGAATCGTCTACTTTGTGAGATACTCCATGCCTGGTCAAGGGCTTCCGATAATO 

CAAAATGGCATGAACTCCTGGTTTTGGCTCAATGGGAGGACGGGAACOTTrGAATGGAGCAACT^ 

CAGCAGGGAAACTGAAGAATGTTCTCTGTTCATCTCCTTCTCGTACCACT^ 

GGACGTG 

SEQ ID NO: 4162 ACCACTCAATCTTTTAAAAAAATGAAAAAGAAAAAAAAAGCGGCTCCAAAG 
ATAGTCITATACCATTCTTTAAAAAAGGAAACTGrrCCTTTTAACm 
ATTTCAAAACATCATTTAATTGTCTTGGTCATGGACATTTCCAAGATGAGAT^ 
CACAGCTTCTGGTTATCAGAAAACCCATGCnTrCCTTTATTGAAGGAGm 
GGTGTATCCCTTCCAATATTCTTCATCCTCATOTrGCTrCTGGAGGCATTTCCT 
TTCATCTTCATTAAAAGCTGCrcCTACTGAAAGAGTTT^ 
AGGCTTACTTGATCCAAGTTTGATGGATATGGCTGATGCTTTCTTTGTCGTC^^ 
AATCCAAACTTGGAGATCTTTGTAGGCrrrGTTGGGAGGTCGGCAGOT 
TTCTCAGCGCTGCGACTGGAACTITCCCCTCATTACTGGAAAGAAACAGTCrrAGTm 
TTTTCTGCTTCTTCTTCAGGTCCTCCGGCGGCTCCAGCTCGCTTGCGACT^ 

cc 

SEQ ID NO : 4 1 63 acgcgggggggaaaggaggtctcactgagcaccgtcccagcatccggacac 
cacagcggcccttcgctccacgcagaaaaccacacttctcaaaccrrcactcaacacttcot 
caaagccagaagatgcacaaggaggaacatgaggtggctgtgctgggggcaccccccagcacca 
tccttccaaggtccaccgtgatcaacatccacaga}agacctccttgcccgaccatgtctr 



648 



wo 02/29086 



PCT/USOl/30732 



SEQ ID NO: 41 64 ACGCGGGACTOTCAGCAAATGCAAAAGAACTAAAATCATAACAGTCTCTAG 
GGCCACAGCACAATCAAArrAGAATTCAAGATTAAGAAACTCACTCAAAACTATACAACTACA 
GAAACTGAACAACCTGCTCCTITATGACTCCTGGGTAAATAATGAAAT^ 
AGTTCTTTGAAACCAATGAGAACAAAGATATTTGATATAATGTACTCTGACAGCTGTGC^^ 
CATCCTTGAGGACrrGGTCTTCCCAAGCGAAATTGTGGGCAAGAGAATCCGCGTCAAACT^ 
GCAGCCXSGCTCATAAAGGTTCATTTGGACAAAGCACAGCAGAACAATGTGGAACACAAGGTTGA^ 
ACTrmCTGGTGTCTATAAGAAGCTCACGGGCAAGGATGTTAATTTTGAAT^^ 
TTGTAAACAAAAATGACTAAATAAAAAGTATATATTCACACTAAAAAAAAAAAAAA^ 

SEO IDNO- 4165 acttgcagccctcggccaaacggccagacoccgacgtcgaccagcagagact 

GGTAAGAAGmGATAGCTGTAGGACTGGGTGTTGCAGCTCnTGCATTrGCAGGTCGCTAC 

tcggatctggaaacctctagaacaagttatcacagaaactgcaaagaagatitcaactcc t^^ 

tttcatcctactataaaggaggatttgaacagaaaatgagtaggcgagaagctggtc^ 

gtgtaagcccatctgctggcaaggctaagattagaacagctcataggagagtcatgattttgaat 

CACCCAGATAAAGGTGGATCTCCTTACGTAGCAGCCAAAATAAATGAAGCAAAAGACTTGCTAGA 
AACAACCACCAAACATTGATGCnTAAGGACCACACTGAAGGAAAAAAAAAAAAAAAAAi^ 

AAAGT 

SEO ID NO: 4 1 66 ACGCGGGGAGCTGGAGGCGGCGGAGCGGAAGCCCCACCATGGCTGCAATCC 
OAAAGAAOCTGGTGATCGTTGGGGATGGTGCCTGTGGGAAGACCTGCCTCCTCATCGTCTTCAGC 
AAGGATCAGTTTCCGGAGGTCTACGTa:CTACTGTCTTTGAGAACTATATO 
GACGGCAAGCAGGTGGAGCTGACTCTGTGGGACACAGCAGGGCAGGAAGACTATGATCGACTGC 
GGCCTCTCTCCTACCCGGACACTOATGTCATCCTCATGTGCTTCTCCATCGACAGCC 
TGGAAAACATTCCTGAGAAGTGGACCCCAGAGGTGAAGCACTTCTGCCCCAACGTGCCCATCATC 
CTGGTGGGGAATAAGAAGGACCTGAGGCAAGACGAGCACACCAGGAGAGAGCTGGCCAAGATGA 
AGCAGGAGCCCGTrCGGTCTGAGGAAGGCCGGGACATGGCGAACCGGATCAGTGCCTTTGGCTAC 
CTTGAGTGCTCAGCCAAGACCAAGGAGGGAGTGCGGGAGGTGTTTGAAATGGCCACTCGGGCTGG 
CCTCCANGTCCGCAAGAACAAGCGTOGGANGGGCTGTCCCATTCTCTGAGATCCCCAAGGCCT^ 
NCTACATGCCCCTT 

SEQ ID NO: 4167 ACTTTTITITITlTnT^^ 

GCAATrrGTTCTTCATCATCCrrCTGACTmCTTTGTmCCCATOTOT 

ATCACCTTGTAGCrCCTCTTCTGTCTCCTCTTCATCrrCACTAGAGGAAGCT^^ 

TCCATGTCCAAAAGCTCTTCCTTTITAAACTrCCTGTTGAGCATTGTAATTC^ 

CATCCCAAGTGATrrCCACCGTTCATGTTCCCATTGCAGCANAAGTGAAATATrr^ 

CTGTTAAATTCACrrCTGAGGCTACATCCTTAGGCTCATCATCAAAAGTAATATCATCT 

ACCTTAGATCTATCAAAGAACAACTACrrrCAAATTCCAGGCCATCACAATCCTCAT/^ 

TAGCTGTTTCCGGAGAATCACAGTCTACTACTGCATAATAGT 

SEQ ID NO' 4168 ACAGCCTTCGCTTCCCCAAACTCCACAGTCTCAGTGCAGAAAGATCATCTTCC 
AGCAGTCAGCTCAGACCAGGGTCAAAGGATGTGACATCAACAGTTTCTGGTTTCAGAACAGGTTC 
TACTACTGTCAAATGACCCCCCATACTTCCTCAAAGGCTGTGGTAAGTTTTGCACAGGTGAGG^^^ 
GCAGAAAGGGGGTAGTTACTGATGGACACCATCTTCTCTGTATACTCCACACTGACCITGCCATGG 

gcaaaggcccctaccacaaaaacaataggatcactgctgggcaccagctcacgcacatcactgac 

AACCGGGATGGAAAAAGAAGTGCCAACTTTCATACATCCAACTGGAAAGTGATCTGATACTGGAT 
TCTTAATrACCITCAAAAGCITCTGGGGGCCATCAGCTGCTCGAACAC^ 

GAACCATGAGGCCACAAAAGCGGTCAAAGGrrCTGGGAATTCGGGTCTGGGGATTCACTTAATCA 

GAACATTCTTCTGTGTATGGATATAAACCTGTACAAGCCAGCTCGGTTCAGGGGACTATCCATC^ 

CATCACCAAACTCTGGTGGGTGATATCTGGCCGCGCTTCCCCANGGTCCCGTCCATTCTTO 

TATAAACTTG 

SEQ ID NO; 4169 ACAAAAAGACAGCCAGAGGTGTGCGGAGAGGGTGAGGTGGCCGCGTGGACG 
TGGGTAGATAATCGCATGCAGCACTGGAACTCCTGATGAGGGGTGGGGTCCCCACITCrCC^^ 
GGTTTGAGGGATTGCGGGGAGGGGGTCAGCTGACTCAGAGAAGTAGGATCTCrrGCACTGGAAOT 
GAGGCTTCCTGTCTCCTCCATGGGCCrcACCAGTCACAGGGACATGAAATCCGTGGCCTGGAGGA 
GGGAGAGGGAGAGCAGGAGCAGCAGCAGCCACNAGGTGTTCTGAGCCAGCAGGCTGATGCCCTC 
ACACrrGACCAGTTTGTCTCTGAGCACTGTGACGTTCTGGGAGGAGATGGGTGGGGAATGGCCAG 
AGTGGTGGAGTGCACACGTGTAGGTGCCCTCGTCCTTGCTAGTGAAGGCGGATAAGTAGAGGACC 
TTCATGTTGTATTTGCrGGTGAAGTTGGTTCGGGAGMOTATGTGTGCTCAGGCACCCCCACAOTG 
CCAAAKAGCACGTGCTrcrrrGTCTCACGGGTCAGGCTGAACTCGTACCTCGGC 
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SEQ ID NO: 4170 ACGCGGGGCTCTCAGAACCTTCCTGCCGTCGCGTTTGCACCTCG CTGCT CC^^ 
CCTCTGGGGCGCATTCCAACCTTCCAGCCTGCGACCTGCGGAGAAAAAAAATTAOT 
CCCCATACATACCTTGAGGCGAGCAAAAAAATTAAATTTTAACCATGAGGGAAATCGTGCACATC 
CAGGCTGGTCAGTGTGGCAACCAGATCGGTGCCAAGTTCTGGGAGGTGATCAGTGATGAACATGG 
CATCGACCCCACCGGCACCTACCACGGGGACAGCGACCTGCAGCTGGACCGCATCTCTGTGT 

SEQ ID NO: 4171 acttgcagccctcggcx:aaacggccagacgccgacgtcgaccagcagagact 

GGTAAGAAGTTTGATAGCTGTAGGACTGGGTGTTGCAGCTCTTGCATTTGCAGGTCGCTACGC^^ 

TCGGATCTGGAAACCTCTAGAACAAGTTATCACAGAAACTGCAAAGAAGAmCAACTCCTAGCT 

TTTCATCCTACTATAAAOGAGGATTTGAACAGAAAATGAGTAGGCGAGAAGCTGGTCrrAT^ 

GTGTAAGCCCATCTGCTGGCAAGGCTAAGATTAGAACAGCTCATAGGAGAGTCATGATTTTGAAT 

CACCCAGATAAAGGTGGATCTCCTTACGTAGCAGCCAAAATAAATGAAGCAAAAGACrTGCTAGA 

AACAACCACCAAACATTGATGCTTAAGGACCACACTGAAGG AAAA AAAAGAGGGGACTrC^ 

AAAAAAAAGCCCTGCAAAATATTCTAAACATGGTCTTCTTAArmCT^ 

CTTATCrrCCACCATTAAGCTGTATAAACAATAAAATGTTAATAGTCrrGCTTm 

AAGATCTCCTTAAAATTCTATAACTGGATCTlTmCTTATm 

AAGATTTT 

SEQ ID NO: 4 1 72 ACGCGGGGGGAGTTAGGCGACCAAACCAGTGAGAGCCCCAATCCCTGCAGTT 
TTGTGGCTTCAAGTGTGGGTGGACAGTCCTAATGGGGATCTCCAGCTCCTrCCTGTGGGCTGCCAC 
AGACAGCTACCCCCAGAAGGGTCAATGTTGGGAGTGGTrGTGGCTCTGAGCTGCTCTACAGAGCT 
TCAGTGTGAGAGGATCGAGCCATTGAAAGCrCATTACCAGTAGGACATAATTrTTGGCTCT^ 
TTCACAACCAGTGCACAGTTTGACACAGTGGCCTCAGGTTCACAGTGCACCATGTCACTGTGCTAT 
CCTACGAAATCATITGTTlXn'AAGTTGTGTrTATTCCTGGAGTGACATC^ 
TrrCACTGAGGATGCTGTCCTCTGArrTAGCTGCrGCCTCCAGCCTCT 
GGCACrrCCTrcCTGTTAAACCCCTGTTAACTCrCCATAAATT^ 
ATTTTGAGTTAACATCTCTTGAAGCCAAACTCCACCTTCTGTGCn'Tm 
TTTCnTTAAAAACAGTCCCAAGAATGACAAGATATTAAAAAAAAAAAAAAAAA 
TCCTT 

SEQ ID NO: 4173 acc aacagacgtggataagtggttccatcaccagaaaaactaatgagatttc 

TCTGGAATACAAGCrGATATTGCTACATCGTGTTCATCTGGATGTATTAGAAGTAAy^ 
rmCAAAGCTTTAAATTTGTAGAACTCATCTAACTAAAGTAAA 

ACTCAGAATGTTATCCATCTAAAGCATITITCATATCTCAACTAAGATAACTmAGC^^ 
AATATCAAAGCAGTTGTCATTTGGAAGTCACTTGTGAATAGATGTGCAAGGGGAGCACATATTC 
ATGTATATGTTACCATATGTTAGGAAATAAAATTATTTTGCTGAAAAAAAAATAAATANAAATAA 
ACAGATAAANANAAACCCTOTCAAAAAAAATAANAAAANANATA>mX3NTT^ 

SEQ ID NO: 4 174 ACGTGCAACCTTGTTGCTACGGCCACTATTCTTCATTTTGGTAGAQCCGAACT 
TGGTGGCAGCAAAAGTGGCAGTGGTAAAGGTTAAAGGCTGAGTGGACCGGGGCATGAAGGTAGG 
GCTTACAGCAGCAGAGTQACCAAAAGGAGGCGATGGAGAGCTGGACACTGATGAGGCTGGGTCA 
CTTATGGGCATAAAAACCTGGGCCrCTGGGTTAAAGCTGTTTITGATCTCOT 
CATTTTCATTArrATCATCCACGTAAAGCACCrrTCACTGGTCCCT^ 

CTCAAATGGGTCGATCCAAACACTAAGATCCTGTGGCAGATTGCCACOAACATCATCAATGTCCA 
AACX:ACTCTCTTrGGATGCTTGTTCAATCACTGGGTCCACTTTCTCC^^ 
CCCCAATCCTTTGTATGGCTTTTCAGGATACCAGTGCCCTTCATAm 
AGTTCTmACCAAAAATGTTGACACCGTCTNCTGGGAAGCTTATTGGT 

SEQ ID NO: 4175 acgcggggagacgcgcgggcgggaagatggcggctgggttcaaaaccgtgg 

AACCTCTGGAGTATTACAGGAGATTTCTGAAAQAGAACTGCCGTCCTGATGGAAG AGAACTTGGT 

GAATTCAGAACCACAACTGTCAACATCGGTTCAATTAGTAC'rrirrri'J-l 1 iTTrrTTITriTTTTTO 

GGCANAATGANAAAATAThrmGCAAACCAAACATNTGACAAGGGGTTAATATCCAA^ 

AGGAATTGAAACAACTCAATAGTAANAAAGCAAATAACCTAArrAAAAACTGGCCANACGACTT 

AATAGAATTTCTCAAATGAANACATACAAATGGCTAGTAGGTNTATGAAAAAAATATTCA^^ 

ACAAATCATCAGGGAAATGCAAATTAAAACCACAATGAGATATCACTNCATGCCTATTAGAATC^ 

CTACTATCAAAAACACAGAAGATAATATGTGTTGGCAAGGACCGTGCAAAAAATGGAACCCTTAT 

TCCTGTOGGTGGGAGTGTAAATTAGT 

SEQ ID NO* 4176 ACGCGGGGGCTGACTCTCrmCAGACTCAGCCCACTTGCACCCAAGTGAATT 
AACAGCCirGTTGCTCACACAAAGCCTGTrrAGGTGGTCTrCTATACGGACATGC^ 
TGCCAAAATCTGGGCCAGGGGGACTCOTCGTQAGACCGGCCCCCTGTCCTGGCCCTCATTCCGT^ 
AAGAAATCCACCTGCGACCTCGGGTCCTCAGACCAGCCCAAGGAACATCrCACCAATTTCA^ 
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GGATCTCCrCGGCTTAGTGGCTGAAGACTGATGCTGCCCGATCGCCTCAGAACK:CCCTTGGACCAT 

CACAGATGCCGAGCTTCGGGTAACTCnTACGGTGGAGGATrCCCAGCCATATGAAGACACCCT^ 

CTGGACGATCAGTCCTTGTCAAAAGTCTGACCCCTCAAACTCTACAGCCTCAATGGACCAGACCCT 

ACCCGGTCATTTATAGCACACCAACTGCCGTCCATCTGCAGGACCCTCTCCATTGGGTTCACCAT^ 

CCAGAATAAAGCCATGCCCATCAGACAGCCAGCTTGATCTCTCCrCTTCCTCCTGGAACCACAAGA 

TTAGGCCXjAGAGCCGATCAAGACAAACAACCTACAACCCTTAACTCCTGGCAGCGCCCACCAAGG 

CCATGC 

SEQ ID NO : 4 1 77 ACTAATGTGATCACAGAAAAATAGCAAACTGAAGTCAAAAAGCACAGGCTG 
CCCATCATAGAGTCCCCAGTCTATTTAATATTTTCGACCTGCTCTGAGGATGGAGTGTGTCAACAC 
CTTAGAATACCATQATQGGCmAAAACrGGGTAAAGAATCATCCCTAGGGrrAGGGCAAATG^^ 
ACirCACTTATTGCrrACACACAGGAAAAAATCTGrrrCCTTTTCCAAGATA^ 
CTTGACAGTGGTTCTTTTAAAGGTATCAAAAGATGTCCTAAAAAAAAAAAAAAAAA 
TGOAATTCTCTCGGAATTCCAGAAAGArrATa-GGCAATTCATTCCTCCACCCCATCAAAArT^ 
CmGAAAAATTACAACTGGCAACTCACATAAGATTTAATCAACTTACTTAAAA^^ 
TTCTCAATAATGTTTCCCCACTAGAAAACTAAGAAATTCAGTGATACTCAGNGCCAGTAATTAAGT 
GGGTACGGAGAATTCTATAAAGAGCrTGACCAATGACTGGGAAGATCACTTGGGCAGGGGAAGC 
ATTTTTCAGTrrGAAGGACAim?^GAATTCAGAACCCTTTCTTTTGTCCC^ 
GATCTGTTTGA 

SEQ ID NO: 4 1 78 ACTTATTTCAACAATTCTTAGAGAraCTAGCTAGTGTTGAAGCTAAAAATAGC 
TTTAmATGCTGAATTGTGATTTTTTTATGCCAAAATT^^ 

GGAAATAAATAATTATGCCATGGCATTTGACAGTTCATTATTCCTATAAGAATTAAATTGAGTTTA 

GAGAGAATGGTGGTGTTGAGCTGATTATTAACAGTTACTGAAATCAAATATTTATTTGTO 

TTCCATTTGTATTTTAGGTTTCCTTTTACATTCTrm 

GACTATGGAAATAATTTAAAGATTTAAGCTCTGGTGGATGArrATCTGCTAAGTAAAGTCT^ 

TGTAATATTTTGATAATACTGTAATATACCTGTCACACAAATGCTTTrCTAATC 

TATTGCAGTTGCTGCTTTGT 

SEQ ID NO: 4 1 79 ACCi'rin'il'CTGAGCTCrGGTTTGCCTTTCTTGACTGTGGCCATCACCATGTCA 
CCCACACCAGCAGCGGGAAGTCTGTTCAGCCGTCCCTTGATCCCCTTCACGGAGATGATATACAG 
GTTTTTGGCTCCTGTGTTGTCAGCACAATTGATTACAGCTCCTACCGGAAGACCCAAGGAAATCCG 
GAATITCGCACCANAGGACCCACCACGTCCTCGCTTCGACATCTTGAACGCCGGAAAAAAAAAAA 
AGCAAAAAAAAAAAAAAAGTACAGAATGATACTTAGTTTTCTTATACCTCAATACACAAT^^ 
CrrAGCAACAAAATATATGTTmGAAAAGACATGAGTTTTAGAATAATCAATGCTCCT^^ 
CTGTATATTATTAAAAATAATATCCAGTGTATrmAACACTACTGAAAGAAGT^ 
GAATCCTGTAAGTCACAGACTTACAGGATATTTCTCACAGATTTTGCTAAGGCAGAAATAAGCTCA 
TOTGCTAmATTGAGTAAAATAAAACAACATTAAAAACAATAATACCAAGTGTTATrrA 
CTTCTGNGTTTTTCTATGAAAAATTGATTANAACATAGAGTCAirrCAm 
TAGAC 

SEQ ID NO: 4 1 80 ACATCCCAGTGGGGACATAATCATTGCCGATTATGGTAATAAATGGGTCAGC 
ATTTTCTCCTCCOATGGGAAATTTAAGACAAAAATTGGATCAGGAAAGCTGATGGGACCCAAA 
AGTITCTGTGGACCGCAATGGGCACATTATTGTTGTGGACAACAAGGCGTGCTGCGTGTTTAT^ 
CCAGCCAAACGGGAAAATAGTCACCAGGTTTGGTAGCCOAGGAAATGGGGACAGGCAGTTTGCA 
GGTCCCCATTTTGCAGCrGTAAATAGCAATAATGAGATTATTATTACAGATTrCCATAATCATTCT 
GTCAAGGTGTTTAATCAGGAAGGAGAATTCATGTTGAAGTTTGGCTCAAATGGAGAAGGAAATC^ 
GCAGTTTAATGCTCCAACAGGTGTAGCAGTGGATTCAAATGGAAACATCATTGTGGCCGACTGGG 
GAAACAGCAGGATCCAGGTTTTTGATGGGAGTGGATCATTmGTCCTACATTAACAC^^^ 
ACCCACTCTATGGCCCCCAAGGCCTGCCCTAACn'CANATGGTCATGTTGTGGTTGCAGACTCTGG 
AAATCACTGTITCAAAGTCTATCGATACTrACAAGTAATGGTGGGCAGGTGGATACCCNCTTCATG 
GTCTTGCACTATA 

SEQ ID NO: 4181 ACTTTTrmTTTTTITrT^^ 

TTTTGGTCTTACAAATGATCACTTTTAAATGGACTTTTCTGTAAG^^ 

CAAGTATGTATCTGATCCACACAAATCCTTANAAAGGTTTTCTGTGTAGTCTTCATTAACGC/^ 

CTTTGTGAATGTTTCACTCTTACTGTAGGATCTTGAATATGTTrrACAATAATGAAGCTACA 

TTTATGCAGTGCATTCATTGTAAACTATAAATAACATTTGTATTAAAAAGAAAGC^^ 

AAAATAGGAGAGACTCTGAGGAGCAGGCAATCTGTrGAGGCTCAGTATATCTTATTTGC 

TCTGCTGTCATTCCTTTCAAAGAGCACACAGCACATGAGGCATAGTAATCATCTGTCTTGTT^^ 

TCTCTAAriTAAAAATCTGTrGTCGTAAAACTGGACCAATTATCACAAACTATCAm 

AACCCGCAAGTCTGTTACAAAGCArmGTTAGTGGGTATCAACAAATGCAGGANGATGGACATT 
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TATAAAAAAATGGGTTTTGTTAAGCTTGCCAACTtXTTAAATATCATGGACTATACTGCCm 
GAIT 

SEQ ID NO: 4 1 82 ACAAAmAATAAAGCCTTTGATCACATCTCAATCCATAAGTTAGCTACAAAA 
ACAAAAAATCCTACAACTTrrTAGAGCCATCAAGAGCCATCAAGGTGTCAGTGAACACCAGTTAC 
AAAAGGAGATTCAGTGTTTCAGATAGTGCAATATTTCCAGCTGCTGAAAAGAATT^ 
TACAAACTACCCCTCCTTGTCAAAAATCCACATGAAGTTGATATTGGTGTTTATAAATCACTCTCTC 
CCAGTCCCTCACTGGTTCCAACCTTCAGGTGATAAAAATTAGGATGGGATCCATCCTCCCTGTGCT 
GACAGTCTGGGGTCCCCGCATOTATGCACAAACCCQCCCAGCGTGCGCACACACGTTCAGAAGAA 
ATCITCAAAGGAACCGAGCGTTTGGAGAAAGTGGCAAGTCCACAGAATCAGAGGTTACGAACACA 
CXnrCAATAATATTAATACATTCCTGTCTTTAAATTCCTTGCCATGm 
CATTGTTTTCCAAAAGCCTGGGGGCTCGACCTGNGTGGGAC^CCAGGATGCAGCTGA^ 
AGGACTGTGGAAACCGGGTGGGCCCCTGCrriTGCTGCCACTGGGACAAGCTC^^ 
GTTTGCGT 

SEQ JD NO: 4 1 83 ACTTTTTTTTTTTTTT^^ 

GGAAAAAAAAAATACANAANAGGTmGTTCrCATGGCTGCCCACCGCAGCCTGGCACT/^^ 

GCCCAGCGCTCACTTCTGCTTGGAAAAATATTCrrrcCTCTITTGGAC^^ 

CTGCCAGGTTTCCAGCCAGCTGGGCACACTTCCCCATGTTTGTCAGTGAACTGGAAGGCC^^ 

AGTCTCAAAGTCTCATCCACAGAGCGGCCAACAGGGAGGTCATTTACAGTGATCTGCCGAAAAAT 

ACCCTTATCATCAATGATAAAAAGGCCCCTGAACNANATGCCITCATCAGCCITrAA 

ATCCTGANCAATGGTGCGCTTCGGGTCTGATACCAAAGGAATGTTCATGGGTCCCAGTCCTCCT^ 

TTTCTTAGGTGTATTGACCCATGCTAGATGACAGAAGTGAGAATCCCAGAAGCACCAATCAC^ 

CAAGTTGAGTTTCTTAAATTCTTCTGCCCTATCACrGAAAGCANTGATCTCCGTG^ 

GGTGAAGTCAAAAGGGTAAAANAANAACACAACATTTTTCCTTTTAGTCAAACAGGCT^ 

TAAACTG 

SEQ ID NO: 4 1 84 AC(nX}GGTGTTCCCCACCrrGGGCATCATGCACCACAACAAACAGGCCACTG 
AGAATGCAAAGGAGGAAGTGAGGCGAATTCTGGGGCTGCTGGATGCTTACTTGAAGACGAGGACT 
TTTCTGGTGGGCGAACGAGTGACATTGGCTGACATCACAGTTGTCTGCACCCrrGTTGTGGCT 
AAGCAGGTTCTAGAGCCnrCTTTCCGCCAGGCCTTTCCCAATACCAACCGCTGGT^^ 
ATTAACCAGCCCCAGTTCXGGGCTGTCnTGGGGGAAGTGAAACTGTGTGAGAAGAT^ 
TGATGCTAAAAAGTTTGCAGAGACCCAACCTAAAAAGGACACACCACGGAAAGAGAAGGGTTCA 
CGGGAAGAGAAGCAGAAGCCCCAGGCTGAGCGGAAGGAGGAGAAAAAGGCGGCTGCCCCTGCTC 
CTGAGGAGGAGATGGGGGTCATTAAAGGAAACTGAACATTGGATAA 

SEQ ID NO: 4 1 85 ACAAGTAACTGTTTAGAGCAAGTAAGTAGTTTGGTCCAATATTATCTTAATGT 
AATGGTTTTTCGCTCCCTCCCTTITATGTGACGACATACTATGACATACAAAACATGC^^ 
CACrrATAAACAAAGAAGTAAGACACCACTCAGACTGACATTTGTATACCCCAAGGGTCAACCAC 
ACAACTAATAAGGCTATAATACAGCTATGCAGCATAAATGGTCAACACTGCTGATCCTAAAGTGA 
GCACCATATATACATCTGAATATAAAAAACCATTGAAACCAAACACATATGGGrrAGTTAAAAGG 
TGCCATTCCAAGGCTATACATAATCACAGGAAAAAGTATTTACACTGCTTGGAAAACAGCTGCAC 
AAAGAGTTATTGCCAATGCCCTTATAAACTTCTCTCCTATGTTAATACAGATGTTACGCTCATATCT 
ATGTGAACCACACATCTATTCTGGCCCTTATAAANCCNCTAAGATTCAGCAAATTN^ 

SEQ ID NO: 4 1 86 ACGTGCAGACGGTGGTAGTTCTGGAGTCCTGGAAGCCACGAGGTGCTCATCC 
ATCACAAGGCCATCACAGCCGGGTAGAAATGCCTGAGGAAAGCAGCGGAGCTGACCGTGCCAGC 
ATTTACACAGGGAACACTTCTTGGGCAGCCAGGTGTCATGGGGCACAGACCCACAGTTCTCm 
GCACATCGTGCTCACAGTTCCGTCCGTAGAAGGAGGGAGGGCAGGCACAAAAGGACCCCAGCAT 
GCAGGrrCCCCCATrCAGGCAGCAGGrrCTGTTrAGCTCCrrACTGTGCTGTATCCCCATGGGCGG 
CACACGCTGGGAAGACCGAGGCCGAATTGCAGGCTCCrCCrrGGGGCCAAATGCTGTCATCTCTCC 
CCGCGT 

SEQ ID NO: 4 1 87 ACTTTAGGGGCrGCCATTGGGTATTCTrCTGGAAGGAATAGTTCAAGTTTAAA 
AGTCCCTCCCTCAAAGGGQGAATCCTGAGGGCCAGCAATGACCACATGAAAATAACGGGCGTTGC 
TCTCATCTGGTTCGGCTITGATGCCAGGAACrGGTTCTGCCAGCAAACGCTGGGTTTCCT^^ 
TCCTGCGGGGCAGCCCGGCCATCTTGTCAGAACCCGAGTTCCCCCGCGTACTCTTATTAA^ 
AATAGGACAAGCAAAGAAGCAAATGAACTACATGTGTTGACCCTAC TATTTCCT AAATCT^ 
TATCTGCTCATCTACACTGTAAAGAATGCTCTAAATTTCATGGTCAATTT^^ 
AGCTAmTCTTGTCCATATCCACCACCAATGTGCTAAAATmGTATANATTCACTGArrANTO^ 
GTCTCCATGGANCAAATCAT^AGG^^^CNC^™CCCCTrGTTAC^rrAAAATTAm 
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SEQ ID NO: 4188 ACGCGGGGGGATGGTTCCATCATGGCGTCAATGCAGAAACGACTACAGAAA 
GAACTGTTGGCTTTGCAAAATGACCCACCTCCTOGAATGACCTTAAATC 

TTCAATTACACAGTCATGTTTACTGGTGAAAATATTCCTGTTCATCCTCATGTTTATAGCAATGGTC 

ATATCTGriTATCCATTCTAACAGAAGACrrGGT(XCCAGCGCTCTCAGTCCAATCAGTTT^^ 

CATTATTAGCATGCriTCCAGCrGCAAGGAAAAGAGACGACCACCGGATAATTCTrTTTATGTGC 

AACATGTAACAAGAATCCAAAGAAAACAAAATGGTGGTATCATGATGATACTTGTTGATGCCCTG 

TTATCATCCTCCTAGCAGAAGATAGTCCTACTGAGAAATGANACirrGATCATTCAGTC^ 

TTTACCTTGCTGGAATGACCTrAGCAATGAAACTCTTCCTTTACTGATr^ 

CGCTGTGATCGTGGTTCATCCAGNCACTNNATGCTnTTATOTCANTT^ 

SEQ ID NO: 4189 ACATGATCTAAATGTTTAATGCTAAAGGTATATCGTAAGGGTAGTGTTTGTTT 
NNGAACGATAATT(:M.GAAGTTCTCATAGAAAGCGTATAACATAGGTCrrCAGAAACTATAAA^ 
ATTTTCATATAGTATTAAAATCCATAGACTAAAATCTGAGAATTTTTT^ 
AAACATAAGCrACCAAAATAAAGAGCAATGNGTTCTGGCTGTTTTATACTTCAACA>r^ 
AAGTGGTAAGCTATTACTATAAAACATATNTTTANAAACATTGGTATNGGGAGCTGCTGTO 
GCCAGTTNTCCTNGCACACAATGAGGCTAGGNTTT 

SEQ ID NO: 4190 ACTTCGTCTTCTAATTTCAAAAATATAACTTAAAAATGTAAATATTCT^ 
AATTTAAATATAATTCTGTAAATGTGTGTAGGTCTCACTGTAACAACTATTTGTTACTA^ 
ACTATAATAim 

GCAGTTTTATTT rCCTG TAGTTGGAACTACTAAAATTTAGGAAAATGCT/^ 

TGCTTTGAAGTATTTrrATGCTCrcAATGTTTAAATGT^ 

ATACAACCTGGCTAAAGATGAATATTITrCTACTGGTATriTAAmTC^ 

CGGATGAGAAAACTATACAGATTGANAAATGATGCTAAATTATAGTTTCAGTACTTAA^ 

ATGAGACNTGCCAAAATTGCTAAGCTACAAGATCAAGGCTGNCCGCACAGGGAAACAGTTTGAAA 

TTATGACTTCTATTTAGGAGGITGGAAACTTTGCTAGNAATCTATNCTTGNCA^ 

TGCTATCTGGTTCAATTGATTAAA 

SEQ ID NO: 4191 ACITGTCrAGCTCCTCTCGGTTCTTCCGAGCCAGCTCGTC 

ATGTCTGCCATGATCTTGGCGAGGTCCTGAGATTTGGGGGCATCTACCTCCACGGTCAACCCAGAG 
CTGGCAATCTGGGCTTGTAGGCCTTTTACTTCCTCTTCGTGGTTC^ 

CCTTGAGAGCCTCGATCTCTGTCTCCAGCTGCAGTCGTGTGATATTGGTGTCATCAATGACCTTGC 

GGAGCCCATGGATGTCGTTCTCCACAGACrrGGCGCATGGCAGCTCTGTCTCATACTTGACTCT 

GTCATCAGCAGCAAGACGGGCATTGTCAATCTGCAGAACGATGCGGCATTGCCACAGTATTTGCG 

AAGAT CTGAGCCCTCAGGTCCTCGATGATCTTGAAATAATGGCTCCAGTCTCTGCCTGGGGTCCTT 

CTTTTCAAATGCTCCGGATTTGTCTCAACCn'CCGGTCTCGTCTCAAGCTCCCa^ 

CACCTAGGGNAATCACAACTGGCGCCGTTCTATGGACCACTCGTNCACTGG 

SEQ ID NO: 4192 TCGGCCGCCGGGCAGGTACATTTCAGAAGACAACAAATAAAATTACTCTCAG 
AAAGCTGCAAAGATGGACACATATAATCTAAGAATGTGGTAATGGCCAGAGGGAGTACATTCATT 
TGAAAGACTAAAATTAGGTATAAGAACATGATTTGAAAGGATATTTATAGCCrGGATATACATGA 
AAAAG TAAGT AACTATATAAATGAAGAGTCATAAGTTTACATAAATATAAGAAACATTAAATTCT 
AAAATATTTTCTCTATGGTATTCATGAATTTTTCACTAATNAANAACT 
GGTCCATATAACAAGTATTANCANNATTAAAGATTCCACTGGATTCCAAANTGNAGTCTTO 
GATCAAATGCTGCATNATCAT 

SEQ ID NO: 4193 ACATGATCTAAATGTTTAATGCTAAAGGTATATCGTAAGGGTAGTGrTTGTTT 
T TGAA CGATAATTTAGAAGTTCTCATAGAAAGCGTATAACATAGGTCTTCAGAAACTATA>^ 
ATTTTCATATAGTATTAAAATCCATAGACTAAAATCTGAGAArnTTT^ 
AAACATAAGCTACCAAAATAAAGAGCAATGTGTTCTGGCTGTTrrATACTTCAACAAT^^ 
TAAGTGGTAAGCAATTACTTTAAAACATArrmAAAAACATCGGTATCGGGAGCT^ 
CGQCCGGTTGCCTGNCACACAAGGAGGCGAGGCTATGCGTTCGAGGCCAACCTAGGCAAAAAAA 
AAAAAAAAAAAAAAAAGTACGCGGGGTGGCGGCGACNCCNATTCAAACGTCTGCCCTOT 
CNTGG 

SEQ ED NO: 4194 CGGCCGCCGGGCAGGACGCGGGGACAAAATGGATACATAAAGACTAAGTAG 
CCCATAAGGGGTCAAATTTTGCTGCCAAATGCGTATGCCACCAACTTACAAAAACACTTCGT^ 
AGAGCrmCAGATTGTGGAATGTTGGATAAGGAATTATAGACCTCTAGTAGCTGAAATGCAAGA 
CCCCAAGAGGAAGTTCAGATClTAATATAAATTCACTTTCATTm 

mGGTTGGCACTAGACTGGTGGCAGGGGCTTCTAGCTGACTCGCACAGGGATTCTCACAATAGCC 
GAGATCAGAATTTGTGTTGAAGGAACTTGTCTCTTCATCTAATATGATAGCGGGAAAAGGAGAGG 
AACTACTGCCTTTANAAAATATAAGTNAAGTGATTAAAGTGCrCACGTTCCTTGACACATA 
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AGCTATGGGTTAGrrcrrANATGGCAAGCATGTACTTATATTAATAGTATTGNAAAGTTGGTGGAT 

ANCTTCCTGTGCAGGTCATGGTTCTrCTTATAAAATTGThm'ACAAA^^ 

TTTCTTGAANGTT 

SEQ ID NO: 41 95 ACCTAAGTATATGATTGCGAGTGGAAAAATAGGGGACAGAAATCAGGTATTG 
GCAGTTTTTCCATTrrCAmGTGTGTGAATTmAATATAAATGCGGAGAC 
AAGT TAAAA TGTTTCAGTGAACAAGTTTCAGCGGTTCAACTTTATAATAATTATAAA 
TAAATTirrCTGGACAATGCCAGCAmGGATTTTTTrAAAACAAGTAAAm 
AAAAAAAAAAAAAAAAAAAGT 

SEQ ID NO: 4196 actttttttttttttitit^^ 

ACACCATGGCTCTGTCACAATAGGGACATCTAAGCTCTACATACCATITCTTTO 

GGCCCCATAAAAGCCTCCAAGCCAOGTOTAGATGTGANACACATTTTACAGAACAm 

AGCTTATAATGATGCCATrrCTCCAACTGTGCAAGGGCTTTACAAAAACTGNGCC^ 

TGANGCTGGATTGmTGATTCATGTTTTATGAGCCCCAAAATTCTGAAC^ 

CATANCA 

SEQ ID NO: 4197 AClU"nil"i"i riTril-riUU-14"lUl"l-lTTGCACAATGGTrrATTAAAGGAATGTA 
TGGCCCACATCAACCTANCAAGGATTCTACrGGTAAACCITCCCATGGCCAA^ 
AGGAGTTGAGTGGCTGGGGTGGGGTGCAGGCAATGGANAGAGGGCAAAAGGGTGTAAAAGCTGA 
AGGGGGCTANAAGCTTACTCCTGANTTTCTTCCTTCTGTCTCAAATOT 

CCACTGTTTNATANGCTGGANATNACTCTNATAACTGCTTAGACAGCCAGAAACAGGGGAGANGG 
AAAAAGGATACTGNGGNAAGGGNTGGCGGGGCAANATTThmAACTTANAAACCCT^ 
TTCTAAAGTTTTGTCTNTAANCTAAAAAACAGTTATTAGGCChrrr^ 
NCTGAANTTGACANATTAA(>IANCnTCNAAAAGGA 

SEQ ID NO: 4198 ACmAGGGCTGAGGCAAAACTGNTTTrTAAGTTGGTAGATATGOrCATGAGC 
AAAGGTTCCCTNCATGTAATCATAGCCnTCCAGCTGNCAAGTTACAC 

SEQ ID NO; 4 1 99 acagggaaaaggggaagggagaaggaaacaaaaccctttacaaatcctgct 
aatactgtcttacccaaaaagaccataatgctttgtcccatattcagtcacritatataatai^ 
caaataaaatcttctctggaagatacattgctcctcmgagatggcagggaaaagggac 

seq id no: 4200 acagccaacggtttcccttgggggctttgaaataacaccaccagtggtctta 
aggttgaagtgtggrrcagggccagtgcatattagtggacagcacttagtagctgtggaggaaga 
tgcagagtcagaagatgaagaggaggaggatgtgaaactcttaagtatatctggaaagcggtcrg 
cccctggaggtggtagcaaggttccacagaaaaaagtaaaacrrgctgctgatgaagatgatgac 
gatgatgatgaagaggatgatgatgaagatgatgatgatgatgattttgatgatgaggaagctga 
agaaaaagcgccagtgaagaaatctatacnagatactccacx:aaaaatgcacaaaagtcaaatc 
agaatggaaaagactcaaaaccatcttcacccaagatcaaaaggcaanaatccttcaa 

GAAAAACTCTAAACCCAAAAGGCCThrrTTTTrrAANCT^ 

GTGGT^^TITCCAANTGGAACCAANTT^^'CATTITNGANA^ 

AATTTNCTTGGGGATTTTCTANACACCCCNCX;CCCCX:CCC^ 

SEQ ID NO: 420 1 ACGTGGGCCTQTAATGCACCTTCTCAGGCACCTCCAGCTGCCCCCGGCCGGG 
GGATGCGAGGCTCGGAGCACCCTTGCCCGGCTGTGATrGCTGCCAGGCACTGTTCATCTCAGC^ 
TCTGTCCCITTGCTCCCGGCAAGCGCTTCTGCTGAAAGTTCATATC^ 

ATAAAGGTCCCATGCTCCACCCGAGGACAGTTCTTCGTGCCAGAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAGT(>rCGGGAAATGGGTTCTAGTTCAATTTGTTTANTATAAATTGTCATA^ 
TGAAAACAAACACATTTAAAArrGGTTTACCTCAGGATGACGTGCANAAAAATGGGTOAAGGAT^ 
AACCGTmAGAC>nnrGCCCCACTNGTAGGATGGCCTTTm 

SEQ ID NO : 4202 ACGCXjGGCATCACTGAQTGCAAGGAGGAGGACATCATGTGCATGTATGAAGC 

cgaaatgcagtggaagagggactacaaagtcgaccaagaaattatcaacattatgcaggatcgg 

ctcaaagcctgtcagcagagggaaggacagaactactagcagaactgtatcaaggaagtggagc 

agttcacccagotggccaaggcctaccaggaccgctatcaggacctgggggcctacagttctgcc 

aggaagtgcctggccaaacagaggcagaggatgcrgcaagagagaaaagctgcaaaagaggccg 

cgctgccacctcctgaggcagctgtgggtgcccctgctgtgtggcrctgtatgactgttgctgy^ 

tataaagccctgcacctaaaaaaaaaaaaaaaaaaaaaaaaagtacrrrcanca^ 

TrCAAACCTTCNTTTTAAAATNTTCGTArr>m'ACTT/^ 
AACTTCCrGANNCTGAGANCTTANTCCrrrCTOTGTCATOT 
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ThrmAOTGATGTCAACCANCTATATCTTTGAAACCTT 

SEQ ID NO: 4203 ACGCGGGCACAGTCAAGCTTTAAAGAAAGTGTTTGCTGAAAATAAAGAAATC 
CAGAAATTGGCAGAGCAGTTTGTCCCCCTCAATCTGGTTTATGAAACAACTGACAAACACC^ 
CCTGATGGCCAGTATGTCCCCAGGATrATGTTTGTTGACCCATCTCTGACAGTTAGAGCCGATATC 
ACTGGAAGATATTCAAACCGTCTCTATGCTTACGAACCTGCAGATACAGCTCTGTTGCTTGACAAC 
ATGAAGAAAGCTCTCAAGTTGCTGAAGACTGAArrGTAAAGAAAAAAAATCTCCAAGCCCTTCT^ 
TCTGTCAGGCCTTGAGACTTGAAACCAGAAGAAGTGTGAGAAGACTGCTAGTGTGGAAGCATAGT 
GAACACACTGATTGGGTTATGGTTTAATGTACAACAACTATTTITAAGAAA^ 
TGGnTAAQGTCCTCGGCCCGACACCCTANGGC 

SEQ ID NO : 4204 ACTACTCGTAAAGAGCTTCAAGAACTITCTrCATCCATTAAAGACCTTGTrCT 
CAAATCTAGGAAATCATCTGTGACGGAAGAGTAATGATCTTAAmACATTTGTCATATAGTAAGC 
ATTTTCCCCCAAGGnGAAGGTGAGTGGTCACAAAAAAGTAGTCACTATACAACTCCCCrCTCCCT 
GCAAAAACCACCTCATACACACACAATTCAGTTAAAACAGTAGTATTGTATTAAATGTAAATCTTA 
AAAAGATGTGAATTmGTAAATTGGGTTCTTCATGGAAGrm^ 
TACTATATGAAATTmCACATTATTTTCACATAATTITAAAAATrACAT^ 
TTCCAAATGTTGAATGAAAAACAAATTTTCAATCCATTTATCCCTGGGGAANGATTCAT^ 
AGTGCCAGTATACTAAATGTNACTTATATNATTAAGCAGAAATTAAATGGGGCl'nri-nTAAAGA 
ACCTTTGNGCAGAAATAAATAATNTGTTTATTTTAATNGAAGCCTCAAACT 
ANAAAATTTTCTA 

SEQ ID NO: 4205 Aa-lTrirrri"Jl'ri"lTriTITGCCAGGAAATATTTTATTGACAACCAGGGACA 
CAGTCATAAGAGAGGGAAGCACACAGGACTGCAAACTAACACCCAGTAGCCAGCAAGGGCCCTC 
TGGGCCAGGAATACTGAATCCTGGGATCCTCACAGTCTCCCACCAGTANACATACATTACTGGGC 
ATCCAGGGGAGGGGGCAGTGGCTNTGGTGTCCCATAGAGTGAGAGGATATNTGATGCCTCA1TAT 
GAGCGANANGGTTATG 

SEQ ID NO: 4206 ACnTGCCCTCAGCATCTCCCTTCATGTCTGGGTCATGAACTTACGCTGGAAA 
AACTCCAACAGCTTCATATGGACAGATGGACTTCAAATGCTGTTTGTATTCCAGAGACTAGTTAC^ 
GGGCmGATTCCTGGGTCTAAGAAGGACAGCAAGTCCTGCTAAATGTTAAACACTGACGGCAAT 
TAAAGCa^CATCTTCACGCCCGGTAGAAGATGCCAATCAAAATAAACrrCATTCCrGAAG 
GGCTGGAAAATCAAAGCTATTCAACTCCTCAAGGCCCAGGGACTATCAAGAAAGAGGCGGACAG 
ATGAGArrGTAAAGGCTOATTTTGAGAGATAAAATAAGTTCAGTTTCCCTATAAATTAATCAT^ 
TGTCAAAGGCACACTGAAGCAAGACCAGCATATGGGCCTCTGTGTCAGATAACANGTTTCTTGAA 
CAGTAACTGCTCCTTATAAAGGTTTTAAAGCAAAAAAAAAAAAAAAAAAAAAGm 
ACACCTAGGCA 

SEQ ID NO: 4207 ACATGTTTGGAAATGAGTTAGATACTTGAAAAGTCTAAACACACTGATTTAG 
GAGTGCGTATGTTGCTGTCTCAATNAACAATGGGTCAATANGTTCATATGGACrAAATNAATC 
GGGTAGCACCGTATGTGTGTAAATATAAAATTATATTAGACArrANCACCANTGCANAACATGCT 
CAGCNTTCATAAGATCC 

SEQ ID NO: 4208 ACGCGGGGCTCTTrrTCCGGCTGGAACCATGGAGGGTGTAGAAGAGAAGAAG 
AAGGAGGTrCCTGCTGTGCCAGAAACCCTTAAGAAAAAGCGAAGGAATTTCGCAGAGCTGAAGAT 
CAAGCGCCTGAGAAAGAAGTTTGCCCAAAAGATGCTTCGAAAGGCAAOGAGGAAGCTTATCTATG 
AAAAAGCAAAGCACTATCACAAGGAATATAGGCAGATGT 

SEQ ID NO: 4209 ACGCGGGTATGCGGGCCAATATCACAGCCATCCGCAGAGTCCGGAAGACAG 
ACAATAATCGCATTGCTAGAGCCTGTGGGGCCCGGATAGTCAGCCGACCAGAGOAACTGAGAGAA 
GATGATGTTGGAACAGGAGCAGGCCrGTTGGAAATCAAGAAAATTGGAGATGAATACrTTACnT^ 
CATCACTGACTGCAAAGACCCCAAGGCCTGCACCArrCTNCTCNGGGGGCTAGCAAAGAGATTCT 
CTCTGGAGTAGAACTCAACCNCANGATGCCNTTGCAAGTOTGNCGNAATGTTOT 
CTGGTTGCTAGGGNGGTGGGGTCTCCNAGArrGT 

SEQ ID NO: 4210 ACI"riU"llU"riTrilUl"riUl"lU'rilG1U-lll'lUlUllUUUllU"l-l'l-l"l"l'l'rAGGTT 
CACTAAArrTATTrrAAAAATCATAAAACGTTTCTTACAAAANANC^ 

CTGAACAAATGCCAGGGACATGTGGACTATTGOTACTTTTCCTCCCTGTCCCACCCCCCAAATGT^ 

ACAGTGACCACAAAGCAAGGNGTTCACAATAArrACATGGGGGGAATTTTTTAAACCACCAAC>^ 

TAACNAAAAATAAAATCCACrCACTCTGCTGCTGTTTCAAAATTTCAATGT^ 

CTTCCCCCCCCCAACCCTGTTTGTAAGGAACTAAAACATTACATCTTGGTGAACAGCAAAGATTC^ 

CTCACCTTAAATGCANAACCCTATGAAGCAAAGGAATGTTGGCTTTTTAACAGAAGCAGATAA 
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AAAAAAATNCNGNCTCTTCAGTTTTACTAGTCTTAAAAAACTTCCAA/^ 
AAAANNTTTTChnrAAAAANTTAAATTTGATCAATT 

SEQ ID NO: 42 1 1 ACTCAGGCTCATCATCTAACCAGCCTTCGGGTTTTGTGGCCnX^ 

ATCTTAGCAGGGGCATCTTCATCCCAGTCATCTGGCTTGACAGCnTCTGGATCrGGGAT^^ 

TTrCATCCCAATCCTOTGGCTTCCGGTOTCTGGQTCCTCAATTTCACGTG^ 

GAGTCATGTCATrGAGCAGATTTarACTArrCACCACAGATTGGTCAACCAGTATTTCAAAACT^ 

TATCTGGATTCAAGATTAGTGTGTAAAGATGTGTTTTCTTATCAGTAAAATAGGTC^ 

CATCTGGCCTCTTAGCATGTTTTTCTTCATANATACCCGTTTTGGGGTIT^ 

NTGCAGTTrATAGTCCTCTCCACATTTATCTGGACAAACATAATCNTATAAGGGGOT 

TGATCCAGGTTGAGTCTGGTGTITANAAAGCAmTCAATAGCACACCACATCTATTCAT^ 

TTAACCTATCTGAACAATGAGAGCTTGGGTAAACAN 

SEQ ID NO: 4212 ACGCGGGGCCCTGGAGCTGTCTCTGGAAAGTAGCTGGCGAGGTTACCTTAAC 
TATCACTGAAGAAAGAAATTTTCTGACACACTGATGGCATGTGACTTGTCTCCT 
CATCACTTTGTITGCATAAAGTATACGGTrrGrrAAGGCCTTTGrrOT^ 
GCTAGTCTGCAACCTAGTTTTCCCTCTCACCrrTTAACTGACGr^ 

CTANAGTACCCTTCAAACCTCACAAGATGAAACTCAGAAGAGATAATGGATTTGGAAGCATTT^ 
CGTGGCrrATATTATGCCACGCCAGTGTrACTGTTGAAGTrCTGTTTGTTCTGTGT^ 

SEQ ID NO: 42 1 3 ACTGGAAGGGTAGTGGAACGAGTAAAGGCCTTACCACATTGTTTACACTTAT 
ATGGTITCACACCAGTGTGAATTCGTTCATGGATAAGACATAAATTGAGAAAATAGAAGGCTrrC^ 
CACAAAACITACATTTATAAGGTCCATCTCCACTGTGCATTACCCTGTGTCm 
GGGAAATGAAGGTTTCTGTACCACATATCCCATTTGACrTCrATTTGTGTGAAATGGCCT^ 
GGTCAAGCCAGCACCTGATGAAACriTCCTTCAGTGAGGCX:TTGCTGAAGAGGA^^ 
TC(XAATTCrGCTGAACAGGCATCTATCCmCTCTGGTGCAAANATAAACAATG^ 
TCrGATTGNGCTCCAGNGACKriTGAAGTGCAAArrGATAAGTCGCAGGTGGGOTCCm 
GGGGNCAATGATTNANGACACATGTGGCTGACTGTGGTGA 

SEQ ID NO: 4214 ACGCGGGOTCAGAAGCITGGACCGCATCCTAGCCGCCGACTCACACAAGGCA 
GAGTTGCCATGGAGAAAATTCCAGTGTCAGCATTCrrGCTCCnTGTGGCCCTCTCCTACACTCT 
CCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCTCGACCCAAACTGCC 
CCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAGACATATGAAGAAGCTCTAT 
ATAAATCCAAGACAAGCAACAAACCCrrGATGATTATTCATCACTTGGATGAGTGCCCACACAGT 
CAAGCTITAAAGAAAGTGTTTGCrGAAAATAAANAAATCCANAAATTGCAGAGCAG 
CTNATCTGGTTTTNAAACAACTGACAAACCCriTCT(XTGTGGaSfAT^^ 

SEQ ID NO: 42 1 5 ACCTATGGGCTTCCTTGCCACTGTCCCTTCAAAGAAGGAACCTACTCACTGCC 
CAAGAGCGAATTCGTTGTGCCTGACCTGGAGCTGCXCAGTrGGCTCACCACCGGGAACTAC^^ 
TAGAGAGCGTCCTGAGCAGCAGTGGGAAGCGTCTGGGCTGCATCAAGATCGCTGCCTCTCTAAAG 
GGCATATAACATGGCATCTGCCACAGCAGAATGGAGCGGTGTGAGGAAGGTCCCTTTO 
TTTTGTGTTTGCCAAGGCCAAACT(XCANTCTCTTGCCCNCTTTi^ 
NCACTACCCTTACTTAAAATCATTrGGNCChrrGCXGGGGCNGCTCT^ 
CTGCGGCCGTTCTTNOTGGATCAANTCGNACj^ANCTTGCGTANT 
A 

seq id no: 42 1 6 accgcctattcarmcttgaacttctcataatgatagtcatcagtrgcct^ 
cgttagcaggacgcttgcggttaggaggggaccgctcaggctctggcttctccgtgtcacctactc 
tcaagggccgggccttgggctcttctttgttrctccgtatgggcgcgttgagctc 
atctgttqtgctgcacataattcacagccatgttggtaggcacgaaggaggtctc gctgtctttct 
tcttgttctgctgctcnxjccaacagacgggccitggcatcctccgtggaaatgat^ 
agcatcgatgcccaggtccacctcaggaatgccactcagcatctggtggaaagcatctctcggtct 
tctttgctgangaaacacggatgtttctggaagttcataaaacagcctctgc^ 

TTCTGTTCTATGTTCACGANCCCTTCCTTTTTTTACTCT^ 

ATGATCTCGANNGAATTAAAATACNATGGCCGAATAGGGGGAGGGTCCQNTAC 

SEQ ID NO: 4217 ACGCGGGACCAAACGGAAAAAAACGGAATTATCGAATGGAATCGAAGAGAA 
TCTTCGAACGGACCCGAATGGAATCACCTAATGGAATGGAATGGAATAATCCATGGACTCGAATG 
CAATCATCATCGAATGGAATGGAATGGAATCATCGAATGGACTCGAATGGAATAATCATTGAACG 
GAATCGAATGGAATCATCATCGGATGGAAACGAATGGAATCATCATCGAATGGAAATGAAAGGA 
GTCATCATCTAATGGAATTGCATGGAATCATCATAAAATGGAATCGAATGGAATCAACATCAAAT 
GGAATCAAATGGAATCATTGGACGAAATTGAATGGAATCGTCATCGAATGATTGACTGCAATCAT 
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NNAATGGCTCGAATGGAATCATCTCAAATGOATGGAATGGAATCATCGCATAGAOTCGATTGG 

TATCATCGNTGGATTCGNTGNATCAAATCAACCGGAAAAAAACGAATTTTCGAATGAATTC^ 

AACOTa^ATGGNCAANTGNATCrmATTONAAGGATGGATATNCTGGACT^ 

T 

seq ed no: 4218 acgcggggggagcgggcggcggcgitggcggcttgtgcagdaatggccaag 
atcaaggctcgaoatcitcgcgggaagaagaaggaggagctgctgaaacagctggacgaccrga 
aggtggagctgtcccagctgcgcgtcgccaaagtgacaggcggtgcggcctccaagctctctaag 
atccgagtcgtccggaaatccattgcx:gtgnctcacagttattaaccagactcanaaaga^ 
tcaggaaattctacaagggcaagaagtccrcngc 

seq id no: 4219 tcgaagccggccgccgggcaggtacgcggggacaganggaggaaggacagc 
anggccaacahrrcacagcagccctgaccagagcancctggagctcaagctcctctacaaagagg 
tggacagagaagacagcattagacx:atgggacccccctcagcccotccctgcagattgcatgtcc 
cctggaaggaggtcctgctcacatam'cacttctaaccttctngaacccatccanc^ 

SEQ ID NO: 4220 actaaagcattcatggaggctcttcaggctggtgcagacatctccatgattg 
ggcagtttggtgttggcttttattctgcctacttggtggcanagaaagtggttgtgatc^ 
acaacgatgatgaacagtatgcttgggagtcitctgctggaggttccitcactgtg 
atggtgagcccattggcaggggtacaacagctccaagacaacacttaattccagctatacgtggc 
aaaagatgttatggcagggaatagagaggrrraaatacagatgaaataaagggtcaccatctcct 
caggcacaaggaacagotacttttt 

CCCGCGGCCnTCGCTTITAGCTCCrCXjAGTTTCTrCTGCTCCT 
ATCirCCTCGTCCATCTNCTTGGCCTGCTTCTTGGGCTGTTTCAGTGG^ 

GGCCGGACATGGCGCCTGCCGCCGCrmCCCCCCCGTGTACTTTTCAGGAAGACTGACTTAAA^^^ 

TNCGGGGTGAGTAAAATAANTTGGGGTATAAAAATCTGAACTTTTACATCTGGCANAAGG 

AAAATATTTGACATTGNTAACITGACTGGGGAAANANGATGGNrGCATTGT^ 

SEQ ID NO: 422 1 ACATCAAAGATTACATGAAATCAATCAAAGGGAAACTTGAAGAACAGAGAC 
CAGAAAGAGTAAAAOTTriTATGACAGGGGCTGCAGAACAAATCAAGCACATCCTTGCTAAm 
AAAAACTACCAGTTCTTTATTGGTGAAAACATGAATCCAGATGGCATGGrrGCTCTATTGG 
CGTGAGGATGGTGTGACCCCATATATGATTTTCTTTAAGGATGGTTTAGAAATGGAA 
CAAATGTGGCAATTATTTTGGATCTATCACCTGTCATCATAACTGGCrrCTGOT 
ACACCAGGACirAAGACAAATGGGACTGATGTCATCTTGAGCTOTCATT^ 
TATTTGGAGTGGAGGCATTGTTTTTAAGAAAAACATGTCATGTAGGTTGTCTA/^^ 
TITAAACTCATTTGAGAGAATGCCITTTAGTTTAATGCATATTTAAACT 

TCCTGGAGAAGCTAGAGCCTGArTGTAGGCTCTACTCATCAATTAACTTCTACAGTGGAGACTACT 
TCTGGGACTGGAATATAAAAAANAATCAAAGGTCTGATTTTGAGTTGCAATAAAGGGAAAGACCA 
TGCTCATAGCAGTGGCCAACTTNTGAAGTGTGGAGCCTTACCCTTTTATTAACCTACA 

SEQ ID NO: 4222 ACTATGGTAGAAATGGCGCCATCTTCCTTTTCATGCTTAAAAACCACC^^ 
AGTATCTCCAACTTCmAGAGGTAAACACTGTCACAGTGTCCCATTTCTTCAAGGACAG 
CAATCTAGGAAGAGCTGTAGCACXIAACAGTTTTACAAAGTCTTCGGAGATCCCATmGAG^ 
CCTCACrAACATGATATTATATTTATITGCATAATGAAGAGCCATGTCTGCC 
ACTACGACATTTGCACCAGTATCAGCAATAGCTTTGACTTGTGCATCCATGAGGTnTOT 
TACTAAAATTCATCAATTCTTCAGCAGTCTITATCAACACTGTrC 
ATCAAAAGGACAAGAGT 

SEQ ID NO: 4223 ACACAATGGTTTATTAAAGGAATGTATGGCCCACATCAACCTAGCAAGGATT 
CTACTGGTAAACC^CCCATGGCCAAAGGAAAAACAAGCAGGAOTTGAGTOGCTGGGGTGGGGTG 
CAGGCAATGGAGAGAGGGCAGAAGGGTGTAGAAGCTGAAGGGGGCTAGAAGCrTACTCCTGAGT 
TTCnrCCTTCTGTCTTCAAATCTTTACTTCnTATGGC^ 

TGCACTCTTCTAGACTGCTCGAGACAGCCAGTGTAGTAGGGCCCTTATCACTCTTAGTTTGCTAGG 
TTTCCCCTCTGAAATAATGAGCAGATTTAGCCAGGCTAGCAGAAAGGAAGAGGACGGGGCTGTGC 
AGGAGTTAGCAGAATCTTGATTCTTGCTCTATGGTCGGT 

SEQ ID NO: 4224 acgcggggctcactgagcaccgtcccagcatccggacaccacagcggccctt 
cgctccacgcagaaaaccacacttctcaaaccttcactcaacacttccttcc 
tgcacaaggaggaacatgagotggctgtgctgggggcaccccccagcaccatccttccaaggtcc 
accgtgatcaacatccacagcgagacctccgtgcccgaccatgtcgtctggtccctgttcaacacc 
ctcttcttgaactggtgctgtctgggotcatagcattcgcctactcr^ 
ggttggcgacgtgaccggggcccangcctatgcctccaccgcaagtgcctgaacatctgggccct 
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GATTCrGGGCArcCTATGACCATrGGATTCATCCrcTTACTGGTATTCGGCTCrG 
ATATTATGTTACAGATAATACAGGAAAAACXjGGGTrACTAGTANCCCGCCCATAGCCTGCAACCT 

TTGCACT(XACTGTGCAATGCriX3GCCCTGACGCTTGGGGCTGTTGCCCCrGCCCCC^ 

COTAAATCAGCANrmATACCCANACACCTGTCTAAGTGTNTTTCAAT/^ 

AAAAAAAAAAAAAAAAAAAAAAAGNCCTT 

SEQ ID NO: 4225 ACANAATGGATTrTGGAAGAGGGAGTCACCACTGGACCTCCAA GGAA GCCAC 
GTGCAGACATCTACAACCTTCGATCTCCTGACGAGTTTATTGTTGGCCAAAACCAGGCT^ 
AACCAGGATGAATGCGGGTGTTGGAAGTAGAATATATATATACAT ATAAA ATTGAAACTGGCGAT 
GGAATATGAGAGGAGCCCTCTGGAAAGAAAAGGACAGACCCTGTGCTTTCATGAAAGTGAAGATC 
TGGCTGAACCAGTTCCACAAGGTTACTGTATACATAGCCTGAGTITAAAAGGCTGTGCCCACTT^ 
AGAATGTCATTGTTAGACrrrcAAATTrCTAACTGCCrACCTGCATAAAGAA^ 
AATCAN 

SEQ ED NO: 4226 ACTGTGTGGAAAAGTGGTGTCGTGATTACTTCCTCAACTGTTCAGCACTCAGA 
ATGGCAGATGTTATTCGAGCTGAACTCTTAGAAAmTCAAGCGAATCGAGCTTCCCTA TGCA GAA 
CCTGCTTTTGGCTCCAAGGAAAACACTCTAAACATAAAGAAAGCTCTTCTGTC^ 
CAGATTGCTCGGGATGTTGATGGATCAGGTAACTACTTAATGCTGACACATAAGCAGGTTGCTCAG 
CTGCATCCCCTGTCTGGTTACTCAATCACCAAGAAGATGCCAGAGTGGGTCCTCITCCATA^ 
AGCATTTCTGAGAACAACTACATCAGGATTACCTCAGAAATCTCTCTGAACTATTTATGC^ 

T 

SEQ ID NO : 4227 ACAGATTTTGAATATGGTGACTAGGCAAGGGGCACTTTGGGCTAATACTCTA 
GGTTCTCTGGCnTrGCTCTATAGTGCATTrGGTGTCATCArrGAGAAAACACGAGGTGCAG^^ 
GACCTTAACACAGTAGCAGCTGGAACCATGACAGGCATGTTGTATAAATQTACGTGCCTCGCGCC 
AGCATGCCCTTGGGAGCCTCCTCAACTGGAGTGAGGAACTCATACTCCTCAGGCCGAGGTCCATA 
GCTGCCAACCATAAATGTTGCTTTATCCACnTrCACCCCAGTCCTGTAGGTC^ 
AGGCCTGACACAATATCCCTGTTCACrrTGAAGTGAATTTTGACTCTATATTCAGA^^ 
ACACAATGGTrrCCTITTTGAGGGCTTCCAGATCTCCAGTAAGGTCCA^ 

CACTCTCACAAACCANGGTGAGCCGGGCGACAACGACATTGGGGGCTTTCGGATCTGTCACCACA 

GGACCATCTCCCANCAGCCGTTTTCTTGTACGCAGAGCCCACCTGAATGACCTTGAA^ 

CCATTrCTTGGAATTGGCCTCCTGTATTTCCTTGAGTGGTCCC 

SEQ ID NO: 4228 ACGGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCrTCTGCTGAGGGGGCAG 
GCGGAGCTTGAGGAAACCGCAGATAAGTTTTTTrCTCTTrGAAAGA T^^ 
TAAAAAATATAGTCAATAGGTTACTAAGATATTGCTTAGCGTTAAGTTITrAAOT 
GCTTAAGATTTTAAGAGAAAATATGAAGACTTAGAAGAGTAGCATGAGGAAGGAAAAGATAAAA 
GGTTTCTAAAACATGACGGAGGTTGAGATGAAGCITCTTCATGGAGTAAAAAATGT Am 
AAAATTGAGAGAAAGGACTACAGAGCCCCGAATTAATACCAATAGAAGGGCAATGCTTTTAGATT 
AAAATGAAGGTGACTTAAACAGCTTAAAGTTTAGmAAAAGTTGTAGGTGATTAAAATAA 
AAGGCGATCTTTTAAAAAGAGATTAAACCCGAAGGTGArrAAAAGACCTTGAAATCCATGACG 
GGGAGAATTGCGTCATTTAAAGCCTAGTTAACGCATTTACTAAACGCAGACNAAAATGGAAAGAT 
TAATTGGGAGTGGTAGGATGAAACAATTTGGAGAAAATAGAAhrrTTGAGGGTGGAAAACTGGAA 
AACAGAA 

SEQ ID NO: 4229 AC lUl ' Jlll ' i ' 1 i T TTTTTTrTTITmAAGm 1 1 1 TAGACA 

ACCTACATGACATGTrmGTTAAAAACAATGCXn'CCACTCCAAATAAAT^^ 
GAAGAGCrCAAGATGACATCAGTCCCATTTGTCTTAAGTCCTGGTGTTGTGTGGATGACAAGC^^ 
AGCCAGTTATGATGACAGGTGATAGATCCAAAATAATTGCCACATTTGrrAACATTmCCAT^ 
TAAACCATCCTTAAAGAAAATCATATATGGGGTCACACCATCCTCACGGTAGTCCAATANAGCAA 
CCATGCCATCTGGATTCATGTTITCACCAATAAAGAACrGGTAGTrmGAAA^^ 
GOTGATITCTTCTGCAGCCCCTGTCATAAAAGGTTTTACTCnTrCTG 
CTTTGATTGArnCATGTAATCTTTGATGTCCCCCATGCAATATATGGCT 
TTAATCGAACCrrGTTGAGCTTCACAAAGGGTTCCATTGAAGATTTGACGAAGGCGAAAA^ 
CAACACCTTTCGAACCTTTGGGCTCACTCCATTGATACCTCTGATTCT^ 
QGGTTNTTCAGGT 

SEQ ID NO: 4230 ACCAACAGAATTATTTGTGAGAAGAATGAACAAATTTTGATAAAGTATGAAT 

ttgttttatmaaaaagcaaacatactaaatttttttat^ 

ttacacctotataaggatttcatatatacattgtatgtgtgtatatataaatacatatatgactgcc 

TAAArrGTrrATAAATTTAATTmCTTTAATAGGTrrCAT^^ 

aaaatgaaatatagattagtttaaatgtgaattcagtgacictagggccaaagaatat^^ 
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GTTrGGAAAGAATTTTTGTATTTATTCCTGTTACAGTTTTCAC^^ 
GAAGTCCTGGTAAAGGATCTAACATmTTATTCCCTTCnTrCCT 

SEQ ID NO: 423 1 ACATGGCCACAGATCATCAAACCAACATTTTTTGTGAAATCATGAACAAGAT 
GAAGTAAAGCTGGACGTGAGTTTGGAGCACCTGTCATAACAAGACACroTGGCCTAAAGT^^ 
ACGTGGTCrrCCACTCCAGAAAGACGAATTGAATGCTGCAGTGCATTCAGGTAAGTCAGGGCTTGT 
GTAGAGGATCCCCAATTCACATCTGGTTTTTTGTAGGTAACATAAATATACAGCCCAAGGACTATC 

acatatgttagcaatgcagcccacx;agttaatgacgaacattactatgcaacaaagaattgctcc 
aagaagtgatatccacatgttgtagtatttgaatocaggacgccatcctggagattttgcaagtga 
tgcatggaatactgaaaaattgatcaatgcatatgatgcaaggaagaagtttgagataattggtg 
caataacattcagttcagcaattaagatgaatccaagtgcaattaagaatgttaagatgtagcca 

CNAAAGAGGTTCATTATTTOTCCCATAACCTTTAGCAAACATCTGGAAAGCT^ 

ttacatagagcctgaaatatitrgggagcactcactagggatgctaatgctgaaaa^ 
ttgaaatatacctgcanaattaatggggtaaatcctgacccatactcattacctggaagtgtt 

seq id no: 4232 actgctaccattacatggttccttattaaattrgaaaagtgcctgaaagtttg 
ggcaccagaaagacaccccaacaacatgtgtctaaactgcaacttcaggttaatatgactaaac^ 
agttacattgtgagaagtgctgaaggtatgtgatgtcmcccggcacaaaggtggcct^ 
tca(xagatggtgtagccaccatcrgaaccacccatgaagaagttccccttccgctgagttacgag 
gacgttggct<x;ctgcatgactgcaagctgagcagcattgggaoggcatccaggaggtggaggag 
gaatgttgccagcagtaccccactccaaatctggccctgcatcatacccntcttccaccagcactg 
tggaaccaggtggatagatgggaccgactggataataagccatggggattggtggaacctaaag 
gccaacagcccacanactgggccatgggaagatacagaaaggctccangaaatgccgctgacat 
ggtggggactgnggcanccctgggtgcacaaanctcggaccataganctctgantaggcaggtg 
gagcatcggtatagggtggancctgangaaaatccaogncttaaggtatactggattcccangan 
gctgcacanggtanggtggctgngtngatattgacctttgctgtcatggggggtgngggtccggt 

TT 

SEQ ID NO: 4233 actccttactcaggttctccatatatitctgtagtgctttatgcttccaga 

TCnTGGTGTCTTGATGGCAATATCTCCTGGACCAArrcrrACrrAACAGATGAAGAATGCy^ 

tgttttgtctaaaaacttgcagccttgtatcaagatgtatctgtctaaa™ 

ggatgtgacactggaggtaaccattaagagatggatgtctaaaaanccaaacacacaggtgacc 

gattcatctcagcataaacttaaaggcggttcataaangtcatrrtctaaaagm 

seq id no: 4234 acccagcatcaggtcaaaaacctgaatcgttgcagtgtgccagtaoatcagg 
cctctoagtcactgctgaaaagcaaagcctgctttggagctgaaaacctcatgagggrrct 
aactattg(xgccttggtgaagtgcgcacccacatt(xtgtgggtgttgtggggtm 
tgggaanagcagcctgatcaatancctgaaacgcaaccgcccatcancgtgggaacirrtcct^ 

ATTTrcAAArTTTTGCNGGGAGGTTTCCTGGANAAGTrrATC 
CCCAGGGCCCAACTTAAAAGGGGGGCCCATTCTTGGTTAAATTGGNTCCAC 

SEQ ED NO: 4235 ACGOlGGGGTCTGCAGGrrGTGCTTCCGGTGCGGAGGTCAGGGACAAGATGG 
TGCCACCGGTGCAGGTCTCTCCGCTCATCAAGCTCGGCCGCTACTCCGCCCTX3TTCCT^ 
CCTACGGAGCCACGCGCTACAATTACCTAAAACCTCGGGCAGAAGAGGAGAGGAGGATAGCAGC 
AGAAGAGAAGAAGAAGCAGGATGAACTGAAACGGATTGCCAGAGAATTGGCAGAAGATGACAGC 
ATATTAAAGTGAGTGACCCTGCGACCCACTCTTTGGACCAGCAGCGGATGAATAAAGOT 
TTGTGTAAAAAAAAAAAAAAAAAAAAAAAAGTACTGTGAGGGATAGACAAGCCTCCATCC^^ 
CTCCCTrCAGGGCACCAAAAACTTTATTGCCAGTGGTAGTTCTGGCAAGGCCTGCATC^ 
AGGTGAAGGCACCTGCTGACCATCAATGCTTTCCACATTTGTATrCATCACCAGTCA^ 
GGCCTTCATAGATCTTGTCX:ATGCCAAACCTATTGAAAAACCTCGGGCCACCANCAGGrc 
AAAATACCmAGCCAAAATANAAACTGACATTCAAAAAGTOTGTGTTGTT^ 
AAC^IATTTOATTArrAACCCAGGTTATAAAAATACAArriTTmT^^ 

SEQ ID NO: 4236 ACrrGCrCCCCACCTTAGTTCTrCACAACTAACATAGAAAATTGTTGAAAAGTA 
GGGGCAAGCATTTGCAAAAACAAACAAAAAATCCCCAGCTTATTATAAGCATG 
GGAATTTCTTCCCAGCAATAGACTTCCAAACCATCAAGAA^ 

ATTTTGCCTATGTCCAACAAGAGATGCACTTATATGTCCAACAGAGGAGCTGAAAATTAA^ 

TGTTrAGTTTGGGGGTGGGTTGGGAAATTCAGCCACCATTTTAAAATGACTGTCT^ 

TGACCACCAAAAATAGGTTCACTAAATITATTTTAAAAATCATAAAACGTTTC^ 

TTACATTCTGCACACTGCrCTGAACAGATGCCAGGGACATGTGGACTATTGTrACTm 

TCCCACCCCCCAAATGTTACAGTGACCCAAAGCAAGGTGTTCACAATAATTACATGGGGGGAATT 

TTTTAAACCCCAACAATAACGAAAAATAAAATCCXn'CACTCTGCTGCTGTTTCAAAATTO 
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AGTTTITGCCGCCCTrCCCCCCCCAACCCTOT 

AANATTTCACTACX:CCTAAATGCNAACCCTTTNAGCANAGGAATGTTGGCT^^ 

SEQ ID NO: 4237 ACGCGGGQGTCATAGCGACrmGGGATAGTTTGCTATCGACAAAGGGAGAC 
AAAGTrAAGGGGTCAAGGGAAAGGAGGGCCAAGTAGACCCTCCACGACCCTCGGCTTCCTCCTCA 
CCAGCTCCCCCTCCCTCCAAGTCCAGTAAGAAGTTGGGCCAAGCTGGAAGGGATTGACCGGGTTG 
CrrGTCTGGAGATGGAGCCACTACAGGCGGGCCTGGGCGCCTGGAGTCGGGCATGAAAGAAAATA 
GCGCCTCATCGCTCTTATCTAAGCCCAGAGGTAGACrrCGCrrGAAAAGATCGAGAA 
GGGATCGGCACTCACTTGCAGACATCAACAGTITCCAGAAAATrCGGTTCAATTCCTTC^^ 
ACCCACCAACCAAGGAGCTGGCAAAACCCACTAACCAACCANGAGAACACAGACCCGGTTTTGTC 
TTTTGACAACCCGAGTGGAACTCACGGGGCCCGGCX^CTGCCCTGCGGGTCCCOT 
NAAGAAAACCAAATAAACCirrGAACAGCCTTGAAAAAAAA 
CGCNACCCC 

SEQ ID NO: 4238 ACCGGGGGTTAGAGGTATATGAAAAGATGAGAAGCTCACACTGGGCTTCTTC 
AGTTACAAAACCCCGTTGGGATCACACCCACTCTTCnTCGTGCCCTCTTrATGGTGACATTGGm 
GGGTGCAGATCTTTAGGTGCTAAAGGGCAACTAGACTTGCCGATGGAGGGGTTTGGGGAAAATTA 
AGGGCAACCTGCACTGAGTTTTCAAGGCTATATAACTTCAGTAGCAGTTGTATGCCCATA^ 
CATGACCACTAGGAGAGGGGCATGGAATGACAAGTCCTAGTGTGATGGGAGTGGGACATTTGGTT 
AAGTGTTGGTTCAATGATCTAGACTCAATGAGACACrrCTGAAAGCrACAAACTG 
CCTACAGCACGAAGCCACGTTTATTACTGAACTCCTAACCAGCCTNCANTGCCCAAGAACGGTCA 
ANAAGATTCCTTGCTGTTCACANAGACCANACAGATGATATCGGTCCCGGAAATGCnTr^ 
AGCAGTGTCACAGAAGGCrrCCrrTCCATCCGTTTTTCTTGGAGGGGAANTGATC^ 
ATAAGA 

SEQ ID NO: 4239 ACGCGGGGCACAAAATACATGCAGAAGATGGTGGCAGATCTGGTGGAAAAT 
AGCTATTCAATITCTGTTCCGATCrTCAAACAGTTTCACAAGAAT^^ 

AACATTAAAAAGArrCGTGAAGAAAGCAACACCAAAATCGACCTTCCAGCAGAGAATAGCAATTC 

AGAGACCATTATCATCACAGGCAAGCGAGCCAACTGCGAAGCTOCCCGGAGCAGGATTCTGTCTA 

TTCAGAAAGACCTGGCCAACATAGCCGAGGTAGAGGTCTCCATCCCTGCCAAGCTGCACAACTCC 

CTCATTGGCACCAAGGGCCGTCTGATCCGCTCCATCATGGAGGAGTGCGGCGGGGTCCACATTCA 

CTTTCCCGTGGAAGGTrCANGAAGCGACACCGTTGTTATCAGGGGCC(m'CCTCGGATC^^ 

GGCCAAGAAGCAGCTCCTGCATCTGGCGGAGGAGAAGCAAACCAAGAGTTTCACTGTTGACATCC 

GCGCCAAGCCAGAATCCACAAATTCCTCATCGGCAAGGGGGGCGGCAAAATTCCAAGGTGCGCG 

ACAGCACTTGGACACGTGTCATCTTCCCTGCGGCTGANGACAAGGACCAGGACCTGGATCACCAT 

NNTrGGAAAGGAGGACCCCGTCCGAAAGGCACANAAGGACCTGGANGCCTTGATCCAAAACCTG 

GATAA 

SEQ ID NO: 4240 ACTTTTITTTITrrrTT^^ 

TCTGAAAAGCTACAAAAGTGCATTmACAAACTTAGGGGAAGTTGAGGTTCCTGG^ 

CGGTTCTCCCAGCrrTGCTCTGTAATGCAGGTCTCTGCAAGGGTCCCTGT^ 

NAGCNCAGGTTGGCCTGTGCCTCCACCCCACTCCCGATTCAAGCTCACAGCCCACCTm 

CCCATCTOA>rrcCAACAGGTGATCGANAACCACATCCTCAAGCTCriTCCANAGCA^ 

CGCTGACCCTGAGTGAAGGCCGCCTGCCGGGGACTCANACACTCAGGGAACAAAATGGTCACCCA 

NAGCTGGGGAAACCCAAAACTGACTTCAAAGGCAGOTCTGGACAGGTGGTGGGAGGGGACCCTT 

CCCAAAAAGGAACCAATAAACCTTCTOTGCAAAATGAGGGACTTTNOT 

TTCTGGAAGGAAACAACCCCTANAGGTTTACACCTTTCTCCCCAAACNCC 

GACCACACCGGCCAAAANT 

SEQ ID NO: 424 1 A Cl ll" J lll l " ril " rriU - il T ri'I4-r'i i l INGCATCAAAAAGCTITAT^ 
GTCCAAGGCITGTTAGGATAGTTAAAAAAGCTGCCTATrGGCTGGAGGGAGAGGCTTAGG^ 
NCCCTATTACTTTGCAAGGGGCCCTTCAAAAGTCGCTGGGCTCAAAAGGCTCrrAGTCGTG 
NAGTGAGCCTTTCNAAAANATACTCGCCCAGCCCAGCCTCCGGGCCACCCAGCCTGTGGAGGTO 
GTCAGGTGGTCACCCATCTTCTTGATAAGCTTCACTTCCTCATCTAGGAAGTGAGTCTCCAGGAAG 
TCACAAAGATGGGGGTCCGTGCGGGCAGAACCTAGGGCATGAANATCCAAAAGGGTCTGGTTCA 
NCTTTrrCTCCANGGCCATGGCANCrrrCATGGCGTCTGGGGTTrTACCC 
CTTCTTTGATGTCCTGGAA 

SEQ ID NO: 4242 ACGCGGGGGAGGAGCCTGAGGAAGAGGGCGGCGACGGTGGTGGTGACTGAG 
CGGAGCCCGGTGACAGGATGTTGGTGTTGGTATTAGGAGATCTGCACATCCCACACCGGTGCAAC 
AGTTTGCCAGCTAAATTCAAAAAACTCCTGGTGCCAGGAAAAATTCAGCACATTCTCT^ 
AAACCmGCACCAAAGAGAGTTATGACTATCTCAAGACTCrrGGCTGGTGATGTrCATA^^ 
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AGGAGACirCNATGAAAATCTNAATTATCCANAACAGAAAGTTGTGACTGTTGGACANTr^ 

TTGGTCTGATCCATGGACATNAANTTATTCCATGGGGAGATATGGCCAGCTTACCCTGTTGCNTAG 

GCAATTTNATNTGGACATTCTTTCTT 

SEQ ID NO: 4243 ACGCGGGGCTOTTCAGGATCAAATACAAAATTAATATACTAGGCCGAGCGCA 
TrG(XTCAGGCCTGTAATCTGAGCACGCTGAGAAGCCGACGTGGGTGGATCACTrGAGTTCAGGA 
GTTTGAGGCCAGCGTGGTCAACATGGTAAAGCCGTCTCTACTAAAAATACAAAATTTGGCC^^ 

atagcggcacacxsccagtagtcccagctacitgggaggctaaggccxgagaatcgcttga^ 

ggaggtggaggttgcagtgagctgagatcacaccactgcactccagcctgggtoacagagtgaga 

ctccatctcaaaaaataaacaaacaaataaataaataaaagaaattaaggtgctggcatgcacct 

gtagtcccagctctcaggaggctgaagtgggagaatcacttgagcccaggaaagcggaggtttgc 

agtgagccaagatcataccactgtgctccagcaggtgacagaacaacactcttgtcrcaaaaaaa 

aaaaaaaaaaagtggagggataccaagttttgacaaggacatggancactanaactgtatanaa 

aaaatrmaaattctgtcttgaaagaaaacattttcatcaaaggangcatgtaci^^ 

aacaaccctaangggcnaattncaaccacactggcnggccgtncttantggatcot 

CCA 

SEQ ID NO: 4244 actctctctgaaacagctacaaacatcttgtttttgcaaaatatacaatgttt 

CrCAATCTTTCTGTCCTTATCrCAATTTGCAAAAATAlTTTGAAACAATCTCCTTTA 

TGTTAATGAGGGCAAATCTTITAAAATCCACATGCrAGATCTTGAAAACGCTTGAGAAGAA^ 

AACTGTGAAAGGAGTGGTTATTTAAATACrrCAAACTGCACATGTGAAACAGACCCACTATTTAAT 

CAGAATTCrATGCAAGrmCATCCAGTATTATGATGTCATTTAATCTCTGCATAAATO 

CAAATATCrATTTCCATAAAnCATTCATGGATACTATACAAATAGATrCACAGCAGTATTTGGCA 

CCCACCCATAAAATTTATITACTCTATCAGGGTTGCAAAGCTCTAATATCAGCTGATAGCA ACrT 

AATTATTGGCCITTGAATGACAGATTCAAACAAGAGATAAATTACATCAAAATCACTATCAl'^ 

ACATGAATTAACTTGAAAATGTAACAGGTAAACCITACTCrrCAAGAAGAAAAATCAAATAGCAT 

AGTTTAGTGTTATTTATCTTCCTTCCAGCmCACCAAGGGCCAATTACAAGTC 

SEQ ID NO: 4245 ACITATACCCCCTAAATATATAAAACATTITrAAAAGAAAAAAAGGAAGA^ 
CTATTCATACATGCAACAACTTGGATGGATTTCAAGGGAATrATGCTGAATGAAAAAAGATCAGC 
CrCGTAAGATTACATTCTGTATGATTCCATTCATACAACATTCITGAAATGACAAAATTACAG 
TGGAGGACAGAACAGTGGTAGCCACAGGTTGGGGTGAGGGTATAAGAAAGGGATGTGGCTGCGG 
CTGTAAAAGGGCAGTGCAAGGGATCCATGTGACAGAACTGTTCTGTCTCTTGTGATGGTGGTCACA 
TGAATCTACACATGTGATAATATTGCATAGAATTAAATACACATACACGAAAAAAGTTCAAGCAG 
TTGAGCACAAATATTTTAATTGTCTAAAATGACATTTTCTTTAAGAGTTATCTACAGTTCAA^ 
ACirnATGAGGTGTCACATCCATCACCATTTTAAGAGATATAAAATCATGAAAAGATTCCCAG^ 
GCTATGTAAACATTTCANCTAAGGGTAAANAGAAAGTTAANGGGGGTTTTTACAGGGAAA^^ 
AGAAGGCAATCCCAATGAAGTCAACATTGGTCCACAAAATCITGGTAAAAGAACTANAATGGGAG 
CCCCANCTTGTTGANCAAGTGOGANAAANAAAGNAAACCTTTTNCCAACCGGATCCra 

SEQ ID NO: 4246 ACATAAGCATAATCAGTTATGGACAGCTTCTTGTATAAATTGCTATTCAGCAA 
TACATAAACTGCCTCAAAGATITATGCrrACAGGTAGACATTCAATTrACCAATAAAACAGCATGT 
TCTGAAAATATGGGCACATTTTAAAACATATTAAGACAGTTCTGTTAACCATAATAGTCCCACAGT 
ATGACTGAGTAATAAGAATCrACTTCAAAAGAAAAAAAAAAArrAATCAGTATAGTGCATGATTG 
A1TCAACATAGTTCCCAGGGAACAGACCAGTCACTCCGATTGCANACTCCTTCATACCA GCCAT CA 
TCATTCTTCTTTATAACATAAATGATTGCCCCrCCATAAATGACAGCTCATCATCCTTGTCTT^ 
ATAATCATATATTGCAACAACmCTCAATATAATTCTTGGGGGCCCAAGCAAGGATCCCCATCTG 
GANTATGGGANCATTATACrrGAACTACTGCAGCCTNCTNATCTTCATAATCCACTGGTGGGTGGT 
GGNGGGGGA 

SEQ ID NO: 4247 ACCAGGGCGGCGCGTGGTCTACGCCGAGTGACAGAGACGCTCAGGCTGTGTT 
CTCAGGATGACCGAGTGGGAGACAGCAGCACCAGCGGTGGCAGAGACCCCAGACATCAAGCTCT 
TTGGGAAGTGGAGCACCGATGATGTGCAGATCAATGACATTTCCCTGCAGGATTACATTGCAGTG 
AAGGAOAAOTATGCCAAGTACCTGCAGGCCTCCTACACCTACCTCrCTCTGGGCTTCTATTTCGAC 
CGCGATCATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGA 
GGGCTACGAGCCGTCTCCTGAAGATGCAAAACCAGCGTGGCGGCCGCGCTCTCTTCCAGGACATC 
AAGAAGCCAGCTTGAAAGATGAGTGGGGTAAAACCCCAGACGCCATTGAAAGCTOCATGGCCCTT 
GGAAAAAAAGCTGAACCAGGCCCTTTTGGATCrrCATGCCCTGGGGTTCTGCCCGCACGGACCCC 
CATCrCTGTGACTTCCTGOAACTCACrTCCTAGATGAAGAAAGTGAAGCTTATCAAGAAGATOGGT 
GACCCCTTGACAACCTTCACAGGCTGGGTGGGCCCGGAGGNTGGGCTTGGGC 



661 



wo 02/29086 



PCT/USO 1/30732 



SEQ E) NO: 4248 ACAGCCGCCACAGCTACACTCCAACCACGTCCCGCTCTCCCCAGCATTTCCAC 
AGACCTGATCAAGGAATCAACATTTACCGAAAGCCACCCATCTACAAACAGCATGCTGCCTTGGC 
AGCCCAGAGCAAGTCCTCAGAAGATATCATCAAGrrTTCCAAGTTCCCAGCAGCCCAGGCACCAG 
ACCCCAGCGAGACACCAAAGATTGAGACGGACCACTGGCCTGGTCCCCCCrCATTTGCTGTCGTA 
GGACCTGACATGAAACGCAGATCTAGTGGCAGAGAGGAAGATGATGAGGAAOTCTGAGACGTC 
GGCAGCTTCAAGAAGAGCAATTAATGAAGCTTAACTCAGGCCTGGGACAGTTGATCTTGAAAG^ 
GAGATGGAGAAAGAGAGCCGGGAAAGGTCATCTCTGTTAGCCAGTCGCTACGATTCTCCCATCAA 
CTCAGCTTCACATATTCCATCATCTAAAACTGCATCTCTCCTGGCTATGGAAGAAAATGG 
CCGGCCTGTTTCTACCGACTTTCGCTCAGTATACAGCTATTGGGGATGTCAACGGGGGAGTGCGAG 
ATTACCAGACACTCCCAGATGGCCCArrGCCTGCAATGAAAATGGACCCGAGGAGTGGTTTATGN 
CCCAACTNTTGGAACCAAAGAAATTTCATTTGAAATGCTCATGGGGACC^ 
A 

SEQ ID NO: 4249 ACi"llUl"riU"ll-in-lU-ril-lUl"lU'lUUUllGGANATGGAGTTTCTCCTTGTAAGC 
CAGGCTGGAGTGCAATGGTGTGAGCTCGGCTCACTGCAACTTCCACCTCCCAGGTTCAAGTGAGTC 
TCATGCCTCANCTTCCCAAGTAGCTGAGTTTACAGGCATGTGCCACCACACCCAGCrAATlTrTGT 
ATTTr^AGTGGANACGGGA^T^CGCCAT^m'GGCCAAGGCTGGTCTCGAACTCCT^ 
ATCCACCCACCTNGGCCTNCCAAATGCTGGGATTNCAGGCATGAGCCACCGCNCGCAGTCTATTA 
TTrATTAAATACTAACGTTTACATCTTTNATTCTANAAGGGCAAACCC^^ 

SEQ ID NO: 4250 ACCATATCAATGCCAACCTTTTATTATAAATGAATAACCAAAAAAATAAGTG 
AAAAATGAGGGCACOATCTCrrCTTACACCTCTTCAACTCTCCAGCATCTAA^ 
AAGTCTGCTCTGGAGACTTTTTTCATCTGCAAGGACAGTGCTTTAT^^ 
TGCCTCTGGATAATGGTTGGTACTATAACCnTGAAAATTITCGACCTGGTGTCTAGT^ 
AAGTAATGCATTTTTTTTAAGTGAAAAGCrTCITACATTAT^ 
ATTATGAATAGTTAGATATATTTTATGTACirrmTTTTTm 

TTATTCTCTTGGNAATCrCTTCCAGTTAACTACTTCCGNTGGAGGTGATGAAATGGGTAAG^ 

CACTTCAAATGTTGGCACTGCTATGANCATGGTCTTCCCTTTATTGCACTCAAAATCANAACC^^ 

ATTCTTTTTTATATTCCAGCCCAAAAGTAGCTCACTGGNAAAAGTAAATTGATGAAG^^ 

SEQ ID NO: 425 1 accagctgtaaccaatacgattctggggcaggttgtgggcgagtagaagaac 
ctccttcccctctgcgacattgaacggcgtggattcaatagtgagcttggcagtggtgggtgggtt 
ccagaaggttagaagtgaggctgtgagcaggacctccttccaggggacatgcaatctgcagggag 
gggctgaggggggtcccatggtctctgctgtcttctctgtccacctctttgtagaggagcctgagc 
tccaggaatgctctggtcagggctgctgtgactgttggccctgctgtccttcctccttctgtccc 

CGT 

SEQ ID NO: 4252 ACGCGGGGCCTTTCCGGCGGTGACGACCTACGCACACGAGAACATGCCTCTC 
GCAAAGGATCTCCnrCATCCCTCrcCAGAAGAGGAGAAGAGGAAACACAAGAAGAAACGCCT 
TGCAGAGCCCCAATTCCTACrrCATGGATGTGAAATGCCCAGGATGCTATAAAATCAC^ 
TTAGCCATGCACAAACGGTAGTTTTGTGTGrrGGCTGCTCCACTGTCCTCTGCTAGCCTAC^^ 
GAAAAGCAAGGCTTACAGAAGGATGTTCCTTCAGGAGGAAGCAGCACTAAAAGCACTCTGAGTCA 
AGATGAGCGGGAAACCATCTCAATAAACACATTTTGGATAAAAAAAAAAAAAAAAGTACGCGGG 
GGAACCCAAAGAGCCCTCCTTAGCCAACACGCTAACTCCGAAGCCTCCCTTACCCCCCAACCA^ 
AAGGCGGCGACACCTGATTCAGCGCACAAACACAGGTCCCTTCTGTCCCGGATACAATTACGCGG 
GAAAACACACTTAACTTGCGCCGGGGCACCAAAAACAGCTATGAGTCTTACACTCCATAm 
CTCCTGGGAATGGCTGTTGGGGATACAAGGCTGCAAAATCATCATCGTGCCGCCAA'riM-rrrri'A^ 
AGCCATTTTTT 

SEQ ID NO: 4253 ACTCTCCTACnCTGGGCATGGGGTGACTTGAGGAATGTTGAAGCCATTCTGA 
CCACCATCCTGATCGGCCATGCTGTCAAAGAAGAGCCAGGCAGAATCGTCCTrCCCATACTTCAC 
AAAAGCAACATAGTGGCTTGTTTCTATGCAGAGAACAGCAAATAACTCCATATTCTGGCAAGGGA 
TGCAGCCGTGTCTCCAGTCCCAGTCGGGTAAGTCmGGGAAGTGACACTGGGTTATATT^ 
TCAGCCIXrrrCGGATGAAGGTGGACITGAGTGTl'GCAGGTTTTACAAAACT^ 
CTGAGATGTCCGGATCGTCGTAGCATTCTCTACACrcATACATTGCAAGCCCTCCACATATCCGGC 
ACTGTCTGGGAGTGTCirCAAGTAAATCTGTTATATTTAATTCCAGAGAAGGAAAA^ 
ATAGTTTAAAGTCnriTCCAAATCGAGGCATCTGAATAATCAGACATGATGGTGCCTCTGCA^ 
TCANGTTACTGTTGATAAAAAOACCATTCTAACAACTGCTGAArrGTGGGAACGCCAACT^ 
TTTirmCATAAAAATTTGATAGAAATAACAATCTTGG 

SEQ ID NO: 4254 ACTCTCCrACTTCrGGGCATGGGGTGACTTGAGGAATGTTGAAGCCATTCTGA 
CCACCATCCCGATCGGCCATGCTGTCAAAGAAGAGCCAGGCAGAATCGTCCTTCCCATACTTCAC 
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AAAAGCAACATAGTGGCTTGTTTCTATCCAGAGAACAGCAAATAACTCCATATTCTGG^ 

TGCAGCCGTGTCTCCAGTCCCAGTCGGGTAAGTCmGGGAAGTGACACTGGGTTATATTTATGAT 

TCAGCCTCTTCGGATGAAGGTGGACTTGAGTGTTGCAGGTTTTACAAAACTGCTTGAT^^ 

CTGAGATGTCCGGATCGGTCGTAACATTCTCn'ACACTCATACArrGCAAAGCCCTCCACATATCC^ 

GCACTGTCTGGGAGTGTCTTCAAGTAAATCTGTTATATTTAATTCCANAAAAGGAAAAAT^^ 

AAATAGTTAAAAGTCTrn-CNAATCGAGGCATCTGAATAATCAAACATTGATGGGTGCCI^ 

AAATTTTCANGTTAACTGTTGATAAAAAACCATTCTAACAACTG 

SEQ ID NO: 4255 ACGCGGGGATAAGATAAGGGACCTGTTAGATGTTTCAAAGACCAACCTTTCA 

gttcatgaagacaaaaaccgagttccctatgtaaaggggtgcacagagcgttttgtatgtagtcc 
agatgaagttatggataccatagatgaaggaaaAtccaacagacatgtagcagttacaaatatga 

ATGAACATAGCTCTAGGAGTCACAGTATATTTCTTATTAATGTCAAACAAGAGAACACACAAACG 
GAACAAAAGCTGAGTGGAAAACTTTATCTGGTTGATTTAGCTGGTAGTGAAAAGGT^^ 
TGGAGCTGAAGGTGCTGTGCTGGATGAAGCTAAAAACATCAACAAGTCACmCTGCTCTTGGAA 
. ATGTTATTTCTGCTTTGGCTGANGGTAGTACCTNGGC 

SEQ ID NO: 4256 ACGAATACACAGAGTGGTCrmCAACACTCCTCCCCCTACTCCACCGGACCT 
CAACCAGGACTTCAGTGGATTTCAGCTTCTAGTGGATGTTGCACTCAAACGGGCTGCAGAGATGG 
AGCTTCAGGCAAAACTTACAGCTTAACCCATTTTCAAGCAAAACAGTTCTCAGAAATGTCATGA^ 
GCCGGGGTGAAGGCAAGAGATOAATrGCATTATTrrATATATTTmATTAATATTTGC^^ 
ATTGCTAAAACAGCTTCCTGTTACTGAGATGTCTTCAATGGAATACAGTCATTCCAAGAACTATAA 
ACTTAAAGCTACTGTAGAAACAAAGGGTTITCTTTTTTi^ 
TGAGATGGTTCCCGATATCATGTGATTTTTTTTTCCTCCCCTTCCCTT^^ 
GTGCAATACTTAGAGAACCTATAGCATCTTCTCATTCCCATGTGGAACAGGATGCCCACATACTGT 
CTAATTAATAAATTTTCCATTTTTTTCAAACAAAAAAAAAAA 

SEQ ID NO: 4257 acgcggggggatcgctgctcctctctggggtcctggcggccgaccgagaacg 
cagcatccacgaaatgccacgggtgacctggccaccagcaggaatgcagcggattcctctgtccc 
aagtgctcccagaaggcaggattctgaagaccactccagcgatatgttcaactatgaagaatact 

GCACCGCCAACGCAGTCACTGGGCCTTGCCGTGCATCCTrCCCACGCTGGTACU"llU"l"114"iU"l'ri'r 
llU"ll-lM"ri-lU'GGGNNUlU"lU-141-ll-lU"ll']'lU-l"ri'rri"l'l"riGGGCTCTAAAGGGGGTA 

SEQ ID NO: 4258 ACCACATCATCCATGCTGACATCTACCGCTGGTTTAACArrrCGTTTGATATTT 
TTGGTCGCACCACCACTCCACAGCAGACCAAAATCACCCAGGACATTTTCCAGCAGTTGCTGAAA 
CGAGGTTTTGTGCTGCAAGATACTGTGGAGCAACTGCGATGTGAGCACTGTGCTCGOT 
GACCGCTTCGTGGAGGGCGTGTGTCCCTTCTGTGGCTATGAGGAGGCTCGGGGTGACCAAGTGTG 
ACAAGTGTGGCAAGCTCATCAATGCTGTCGAGCTTAAGAAGCCTCAGTGTAAAGTCTGCCGATCA 
TGCCCTGTGGTGCAGTCGAGCCAGCACCTGTTTCTGGACCTGCCTAAGCTGGANAAGCGACTGGA 
GGANTGGTTGGGGAGGACATTGCCTGCAGTGACTGGACACCCAATGCCCAGTTTATCACCCGTrCT 
TGGCTTCGGGATGGCCTCAACCACGCTGATACCCGAGACTTAAATGGGGAACCCCTGTCCTNGGC 
CGCNACAA 

SEQ ID NO: 4259 ACAGATCAGCAGAGCAGGACAGTTGGCAGCAGTGACCTCAGTAGGGAACAT 
GTCCGTCTACCCTCTCGCACTCATGACACCTCCCCCTACCAGCrCTCCTCTTCCTCCTCCT^ 
CCTGTGGGAGGTGGTCAGTGGGACTrAGGGATCTTTCACCTGCTGTGCCCAGTAGTTCTGAAGTCT 
GCTTGTGGAGCAGTGTmATGTTTATCCCTGTTTACTGAAGACCAAATACT^ 
TTCCATGTCITGCTCTrCTACCTCCCTAGTrAGTGGAAATTTGGATAAGGGAACTGTAGGGCCC^ 
ATTCTGGAGGTTTTATGTCATTGGCCACAGAATAACTGTCTCTAAGCTATCCATGGTCCAGTGGT^ 
CCTGCCAATTCTGTAGACTTCAGAAAGCACTTCTCTCTTATGGGGTTCATGG^ 
GTGACTTGCTTGGTGGCCTCATTCCATGTGTGCCTGTGCCTTGGGGCATGGACTTTG 

SEQ ID NO: 4260 ACATACCCirrCACTAGTGTTrGAGTGGCACAAGCCACATTTACCATGAGAAA 
GAAGAOATTTATTCATAAAAATGAATTTAGGCTGAATGGCTTATTATTGGAGGTAAGGGATT^ 
ATCTCTGTTCGTCACTGTCCACTGACAAGAAAAATCACCCTGAATGGTTTTACATCCAAT^^ 
AAGTCCATrGCTGGTACrrTTTITITITT^^ 

ACACATATATACAATGTATTrrAAAAATGGGCirrACAATATGTAGTITGATCACT^ 

CTAAATATATTGTGAACATTTTGTCTTCTACAACAGTTAAAAGAATTGAA^ 

CAArrTATrAANCAATCTTGTTGGGGACATTGAGGTATAATTTTTTTTCTAAGGAGGC^ 

TTTATAATGCCTTTGGGAAAAAAAGGGGGAGTTCNTGGCOTATATANCTTTCT 

CCTTGNCCTTCCrrrACCCTTTTA 
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SEQ JD NO; 4261 ACCX3CCAGCTCTCTGCTCTCCACAGGGCTCCCCGCCCCACCCGGCCTGATAAA 
GCGCGCCGACTGGGCTACAAGGCCAAGCAAGGTTACGTTATATATAGGATTCGTGTTCGCCGTGG 
TGGCCGAAJWVCGCCCAGTTCCTAAGGGTGCAACrrACGGCAAGCCTGTCCATCATGGTGTTAAC^ 
AGCTAAAGTTTGCTCGAAGCCTTCAGTCCGTTGCANAGGAGCGAGCTGGACGCCACTGTGGGGCT 
CTGAGAGTCXn'GAATrCTrACTGGGTTGGTGAANATTCCACATACAAATTTTI^ 
ATTGATCCATTCCATAAAGCTATCAGAAGAAATCCTGACACCCAGTGGATCACCAAACCAGTCCA 
CAAGCACAGGGAGATGCGTGGGCTGACATCTGCAGGCCGAAAGAGCCGTGGCCTTGGAAAGGCC 
ACAAGTTCCACCACACTATTGGTGGTmCGCGGCAGNTTGGANAAGGCCAATACTCr^^ 
CCGTTCCGCTAATATAANTAAAGTTGNAAAATTATACTTAATAAAAAATTTAGGACAGrc^ 
TGCTTACAGGTGTNATTTGTCTNrTAAAACNATOTGCAANGNTTNT^ 
A 

SEQ ID NO: 4262 ACGCGGGGGAGGTGACTTCCTGGTCTATCCTGNTGACCCCCTNCGNTTCCAC 

gcccattatatcgntcagngcrrggg(xcctgaggacaccatcccactccaagacctggttgctgcr 

gggcgccttggaaccagcgtcagaaagaccctgctcctctgttctccncagtctg 

gtctacacctccctgcaatgggccaacctgcagagaactccacagacctaggggatgtggctgtg 

TCGGCAGCAATANCCTTTOTGGATGTTCCCCAGCTCTTCTCTGGGAGT^^ 

TTCrCCGCGGTTAGTTTTTGATTCCAGGriTrCCAACACTACATC^^ 

AAGCACTTATTGGCTGNGTTTITGTAGTTACCTATTrrCACACTGTGAGCr^ 

GGGTTTGATTNATCTGNTTTCTA(>GGGTrrAAAGTCTCANGANGTCTN^ 

AATGTCAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 4263 ACTTTTTTTrrrTTiTrTTrrr^^ 

QAATTTGGGGTGGCGAQGGTAGCCACACCCTCCAGGAGGATGAGGTAGGTTCAGTGCITCCAGCT 

CACACCTTTCCTGGGTCTTCCnTCAOTGTGACAGTTCGGTTAAAGACAANCnT^ 

CTGGATCCACGGAAAAATATCCAAGAOGCTCAAACTGGAAACTTGTCGAAGGGTTTTGCC^ 

CCACAAANCATTNCCTAAATGCTTGCATCCACCACGTOTAANTNATGCCAGGTTAAGGTC^ 

AAATNCACCAAGGCACCT 

SEQ ID NO: 4264 ACACTAAAGGGTGTTTCCCAAGAATAGAGGTGAAGATATTTTCATTTTGTTTA 
ACCCACAAACTATTTGGTCAAAGGAATATGTAAAGCTAAATAAAAGCACATCTGGTAGAAATTCA 
TGGCAATGCATGTTGACAAGATGTGCTTGGACCTCGCTTGCAGCATCTGGCAGTGGGTAGCAGAA 
CAAAGGTAGGAATCTCACAGGCTCTCCmTGTCTTTTCTGGAAGAGGCTCCGTO^ 
GCATTTGCAGCCGAATTCAAGACTTCTCCAAAATCACCACCAGCAGGCITGGTTCCCCCTA 
GACTTTAAGrmCGACCTTTTCTTCAAATGATTTAAAAGTTGGGGA 

TTTGGTGATGACTGACCAACAGACGAAAAAGCAGCTGAGGCCTTCTGTCCACCTGGGATAAGG^ 

TCANATGTCTTCirGTAACAGATGTTGCTGTCACGTCTTGCCACCCTITGGCAATG^ 

TTCCTGTANAAAATTGATTCCAAGTTTCCCOTrrGATCTCTGCTTAA^ 

ACACTTGAA 

SEQ ID NO: 4265 AC rrrilU - ri " rri - ri Tl'ri"ll Trt 'GANACAGGGTCrAGCTCTATGGCCCAGGCT 
GGAGTGCAGTGGCACAATCATAOCTCACTGCAGCCTCAACmCTGTGCTCGA GTGA TT 
CTCAGTCTCCTGAGTAGCTGGGA<XACAGACGTGCACTACCACACrrGGCTAATT rTAl 1 1 1 1 iCA 
NANATCACATGTGGTTTGAGGGAGGATAGAQAGGACAGGGTGTCAAGAACAGCAirii iACCA AG 
TTGATATAAAAATGAAACAGGATCCAGGCATGGGTCTITACACTGATAGTTACAGTAAGT^ 
ATCTGAGCAGCAATTTGAATCACrrCrrGAAAAACACACAAAATATATTTT^ 
TCTTGCTCTOTTGCCCAGGCTGGTCTTGAACTCGAGATTCTCCTGCTTTGGCCT^ 
GGATTACAGGCGTGAGCTACTGCACCCGGCCTG 

SEQ ID NO: 4266 ACGTGCAACCTTGTTGCTACGGCCACTATTCITCATTTTGGTAGA^ 

TGGTGGCAGCAAAAGTGGCAGTGGTAAAGGTTAAAGGCTGAGTGGACCGGGGCATGAAGGTAGG 

GCTTACAGCAGCAGAGTGACCAAAAGGAGGCGATGGAGAGCTGGACACTGATGAGGCTGGGTCA 

CTTATGGGCATAAAAACCTGGGCCTCTGGGTTAAAGCrOTTTrrGATCTC^ 

CArnrCATTATTATCATCCACGTAAAGCACCTTCACTGGTCCCTTTTCACCAAm 

CrCAAATGGGTCGATCCAAACACTAAGATCCTGTGGCAGATTGCCACGAACATCATCAATGTCCA 

AACCACTCTCnTrGGATGCTTGTTCAATCACTGGGTCCACTTT^ 

CCCAATCCTTTGTATGGCTTTTCANGATACCAGTGCCCrrCATATTTCTTOT 

GTTCTTCACCAAAAATGTTGACACGTCTCCTGGGAAGCITATTGT 

SEQ ID NO: 4267 actatggcctcttcaatacactgtagccagtgcactgggcccttgggqatgcc 
cagtgatatacacgttgaagcagtaatgtaagtctgtgagccactgttcctgaacacttccactct 
tcaatgtcgtgatattcttgatgatctgctgcaccaagatcgcattggtcaacatatgagtcctga 
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GGGOTGCATGCTGAAGCAGCAGTCGAAGCAGCTGGCCCACAACAGGCAGCTCCTCAGGCCCAACT 
CTGTTCACCCGCAGGCTCTGCAGCATX^AACCTGAGGACTCGAGGCAAGAGAGCCAGGATGGAGAC 
ACAGACCCCCCACAGCTTGTCCCTCTTTCTTAOATGnCTGCTTCCCCm 

AAATAAAGAAGACTGTAGCAACAAGAAGCCAGAAATGGCAGTAAGAGTCTCACTAACTACACAA 

GTATCATCCAGGAGCTTACAGATAACACCAAAAACACAGCCAGCCGTGCTGTCAGGAATTACAGT 

CAACCCAACCAAATGCITACTGATGGCACTTrGCAAGATGTCAAAGTTCTGACT^ 

AACCAATAAACTGTGAAAAACTGCTCTGTTACATNCCAGTGGTGTAAAAATTGATCCAAAT^ 

AGGTAAA 

SEQ ID NO: 4268 ACrGrrCTGCrCTATAAGAAGGTCGTTTAGTGGAATAAAAGTCrACATAATTT 
GTGGTCGATTTCAGTCCATACTGTGTTGAGTAATCAGTAGAAGCTGGAAAGTGAACTCGGTC^^ 
AAATAAGGATCTTCATAGCTATGAGGAGACnjCAAATACAATCTGTATGCATCAAAGTTCm 
TTGGAGTCATCn'GACTATAATACAGCTGTTGATGCTGTAGCCGTCTATTTTGTTCTCT^ 
AGGAATAGGAACTGATGTAAArrGGTGAAGGTTTGCTGGAGCCTGACTGAATGATGGGTGACATC 
TGTTGGTTGGTGGTAGACAAGGAAGGATGTGATTTGAATCGGTCTCGCTCCAATGTCGACACAGGT 
GTAATAAAATGGTTCTGATTCCACCCATCCTTTrrATAAATGCTCCGGAGGTCCCGATA 
AATGTATTCAAGACCTGGGCTGCTGCCTTCACCACTTTCAGAGATGATCTGCNCCCCTGCCm 
TTATGTTCACCAGCTTCTCTATGCCTNCTGAGTCGGCCAGGGCTTrTGCGT^ 
GOTGACCTCGTGCAGAACACANCANATGCTGCCATGOTCTCATCANACAAACACTGGGGCCATTT 
GCC 

SEQ ID NO: 4269 ACCAAAAGAAAAAGAAAAGGAAAAGGTTTCTACTGa'GTATTATCTATAACT 
GCCAAGGCTAAAAAGAAGGAAAAAGAAAAGGAAAAAAAGGAGGAGGAGAAAATGGAAGTGGAT 
GAGGCAGAGAAAAAGGAGGAAAAAGAGAAGAAAAAAGAACCTGAGCCAAACTrCTAGTTATO 
ATAACCCAGCCCGAGTTATGCCTGCCCAGCTTAAGGTCCTAACCATGCCGOAGACCTGTAGATAC 
CAGCCTTTCAAACCACTCTCTATTGGAGGCATCATCATTCTGAAGGATACCAGTGAAGACATTGAG 
GAGCTGGTGGAACCTGTGGCAGCACATGGCCCAAAAATCGAGGAGGAGGAACAAGAGCCAGAAC 
CCCCAGAACCATTTQAGTATATTGATGATTAAGGGCCAGAGGATCT(:j\CTTGCr^ 
ATTGTCCAGGCTCATATTGGGAATGCTTATGANGAAATTCATGCCGAGACCTGCTATTCAATGC^^ 
GTATCGTTGCCTCTGCACTGCCTGAAGAACCCTGTCTCCAAGTCTTTGGrrGAAGAGAAGAT^^ 
GACTGTTGAGTGTGCTCTTTCACAGACTTGGTTTTCAAATAAATATTAANAra^^ 
AAAAAAAAAAAA 

SEQ ID NO: 4270 ACGCGGGGmCAACTGACCTCTGGACGCAGAACrrCAGCCATGAAGGTAAC 
AGGCATCTTTCTTCrCAGTGCCrrGGCCCTGTTGAGTCrATCTGGTAACACT 
GGAAGAGAGGCCAAATGrTACAATGAACTTAATGGATGCACCAAGATATATGACCCTGTCTOT^ 
GACTGATGGAAATACTTATCCCAATGAATGCGTGTTATGTTTTGAAAATCGGAAACGCCAGACTTC 
TATCCTCATTCAAAAATCTGGGCTTTGCTGAGAACCAAGGTTTTGAAATCCCATCAGGTCAC<^ 
AGGCCTGACTGGCCTTATTGTTGAATAAATGTATCTGAATATCAAA 

SEQ ID NO: 4271 ACTGTCACCTACTCATGCACAAAACTGCCTCCCAAAGACTTTTCCCAGGTCCC 
TCGTATCAAAACATTAAGAGTATAATGGAAGATAGCACGATCTTCTCAGATTGGAC^^ 
CAAACAAAAAATGAAGTATGACTTTTCCTGTGAACTCTACAGAATGTCTACATATT^ 
CGCCGGGGTGCCTGTCTCAGAAAGGAGTCrrGCTCGTGCrrGGTITTTATTAT^ 
CAAGGTCAAATGCTTCTGTTGTOGCCTGATGCTGGATAACTGGAAACTAQOAGACAGTCCTATO 
AAAGCATAAACAGCTATATCCTAGCTGTAGCTrrATTCAAGAATCTGGTTTCAGC^^ 
CCACCTCTAAGAATACGTCTCCAATGAGAAACAGTmGCACATTCATTATCTCCCACC^ 
ATAGTAGCTrGrrCAGTGGTTCTTGCTCCACCTTTCTCCAAACCCTCTTAATTC TAGAGC A 
GACATCTCTTCATCGAGGACTAACCCCTACAGTTATGCAATGAGTACri"IUl'inTi-riUUlU-llMUlU^ 
TTTTTGGATANTTTTATTTCTCCAAATTGTTArn'ATCAGGAGTQACT^ 

SEQ ID NO: 4272 ACGCGGGGAGGCCCCAGCCAGCTCAGGCTACACTATCCCAGGATCAGCATGG 
CCGTCCGCCAGTGGGTAATCGCCCTGGCCTTGGCTGCCCTCCTTGTTGTGGACAGGG^ 
TQGCAGCAGGAAAGCTCCCTITCTCAAGAATGCCCATCrGTGAACACATGGTANAGTCTCC^ 
GrrCCCANATGTCCAACCTGGThrrGCGGCACTGAWGGCTCACATATACAAATGAATGCCAGC^^ 
GNTTGGCCGNATAAAAACCAAACAGNACATCCA 

SEQ ID NO: 4273 ATACGCGGGGAGTCACGGGGGAGCGAGGCCTGCTGGGCTTGGCAACGAGGG 
ACrCGGCCTCGGAGGCGACCCAGACCACACAGACACTGGGTCAAGGAGTAAGCAGAGGATAAAC 
AACTGGAAGGAGAGCAAGCACAAAGTCATCATGGCTTCAGCGTCTGCTCGTGGAAACCAAGATAA 
AGATGCCCATTTTCCACX^CCAAGCAAGCAGAGCCTGTTGTTTTGTCC;^^ 
CCACAGAGCAGAGATCTCAAAGATTATGCGAGAATGTCAGGAAGAAAGTTTCTGGAAGAGAGCTC 
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TGCCirmCTCTTGTAAGCATXjCTTGTCACCCAGGGACTAGT^ 

TTCTAGATTTGGATCATTGCCCAAAGTTGCACrrGCTGGTCTCTTGGGATTTGGC 

ATCATACATAGGAGTATGCCAGAGTAAATTCCATTTTTITCAAGATCAGCT^ 

TGGTCCACAGCATAACAGGCACTGCCTCCTTACCTGTGAGGAATGCAAAATAAAGCATGGATTAA 

GTGANAAGGGAGACTCTNACCTTCAACrrCCTAAATrCTC 

ACCTCTGAATTTG 

SEQ ID NO: 4274 ACCTCTCTAAGTTTTCATGCCCTGCTATCTGGAACACTTCCCCACT^ 

AGAAOTCCTCTAGCTCTOCATTAAGGATCTTGTTGTCCCTGACTGACACCCACATATGGAACAT^ 
GGCATGCCTCCATGGCAACACGGGATCCCTGAAGTTTATGAAAGCGrrCCATGTGAGAGAGGATG 
CTCATATACTCCTGCCCCACCCTCCAAAGATGTACTTrrmTT 

SEQ ID NO: 4275 ACCAGCTGTGGGATTTCGTCrrCGGATTCATTTGTTGCTTTAACrtGG 
CCTCrGCCACGTTTTTTCTTCCCATGCTCTGGGGTCTGTTCCT^ 
AGTATCACCACTTTCAGGTGCCACATCATCTTTACTAAGAACTGATQCAGTCTTCCT 
CClCITCTTCTTCCTCn'CCrmGTTTITCAAAATTTC^ 
TTCITAtTAAGCAAAGATCTmGGTGGCITCATCCCAATrGCTGA 
AGTGCAGCCGAriTCTCAGTTTTCACAAACAGGAGTTrCACGCTCTCCCAC^ 
CTGAAAGTCCrrrGGTGACAGCAACAATGTTTTCAATGATGTGCTCAATTT 
CAATACGTATAGCACTGCAAGAACCACTTTTAGAAATGTTTAAGACCGTT CCACC TAT^^ 
TGATCTCTCTTGATAAATTCTTGGACAGAGGGTTACAGATACTGGAACmcrJl' 
ATGTCTCCCAATGAGTGAGGGTAAAGAGCCGCTAATTCTGGCATCAGTAAGGAAGAAATCAAA 

SEQ ID NO: 4276 ACACCAGATCAaJAGACATCGTTrCATACTTCCCAAATAGTTITATATm 
CTTTGAAGGTCAGTTACCAGAGCCAAACITGTTCTrAACAAGCAGAATITrATCT^ 
GTCTCITACACCTrrcrGGGCCTArrCACTTGCAGAGAGGAGTCGAi^ 
TATCGTGCATTCATATGTGATGTCCTGTCrrCATATGCTGTCCAATTTCm 

ACTCGCAGGAGGGCCACGTAGATGAGAAAGTTCCGTATGGAGGAGATCACATCCTCGCGCATCTT 
CCGTTTCATCrCCTCCGGGTCCAGCTTCGGCGCGAGGCCCrCAATCCGGA ACAT CGCGGCTCTCCC 
CGCGTACCCAGAATACTTATTGTTCATTTTGAAAAGACTTTGTTCrm 
TTTGTGACCAGAGAAGrrAGGGAGGAGGTTATTTTTGTGTTTTGGGGTTGG 

SEQ ID NO: 4277 ACTACCAGAGCGAGGAGCAGGCAGAGGAGGAGCTCCTGGACATGGCGGTX5C 
TAAAGGACTACATTGCCrACGaK:ACAGCACCATCATGCCGCGGCTAAGTGAGGAAGCCAGCCAG 
GCTCTCATCGAGGCTTATGTAGACATGAGGAAGATTGGCAGTAGCCGGGGAATGGTTTCTGCATA 
CCCrCGACAGCTAGAGTCArrAATCCGCITAGCAGAAGCCCATGCTAAAGTAAGATTGTCTAACA 
AAGTTGAAGCCATTGATGTGGAAGAGGCCAAACGCCTCCATCGGGAAGCTCTGAAGCAGTCTGCA 
ACTGATCCCCGGACTGGCATCGTGGACATATCrATTCrrACTACGGGGATGAGTGCCACCT 
AAACGGAAAGAANAATTANCTGAAGCATTGAAAAAGCrrATTTTATCTAAGGGCAAAACNrc 
TCTAAAAATACCANCAACTTTTTGAAGATATTCNGGGGACAATCTGACATANCAATTACT 
TATGTTTGAANAAGCACrGCTTCCCTGCANATGATGAATT 

SEQ ID NO: 4278 ACCTAATGAAGAAGGrTATGAAATATTAGGTGGTTCGAGTTTCnTAAATCTG^ 
CACCAGACCATGACmGAAGGTGTTAATGTTGAAGTGTCGAT GCAA AAATGTC TATTAA CAA^ 
CTGCirAACACTGTTTGAGAAACAGGACAAAATAATTGTTTTTCTT^ 

AATTCTCTACAGACTCTCTCCCATTCAGAATCAGTTGGGTGGACTGATGAGGCAAAAAATAi^ 
CnTrAAAAAACAAAACrGGAGCCTATTTACAAAACATGCAAA 

ACTGCAGAACTGCTCANACGTGAATACAGCTGAGTGACAGAATATACCnTrACTTCTACAAATA^ 

GGTCCTTCCTCCAGACTTTXTGGAAGAAATACATTrrCAGGGTGTGGACTAT^^ 

TGCAGAGACAGAAACAAAGAAGTTTGCAAATCTTTTATATTTCCAGCTGTTGAGACAGTA'r^ 

AGGGCrGATX3TTACCTCTAGCGGCGAAACCAGAGa:AGCTArrAAGCAGCCAGAAGCTACAGTAA 

TTGAATACATGACCATTTTNTITmAGCACGTTCrrTG^ 

CTATTT 

SEQ ID NO: 4279 ACTrmrriTTriTiTr^^ 

GTCTANAATTANATCTGTTTTCTAAATATGCTGTTTCrGAACCTGAATCCT 

CCTCAGCAAANAAGCACAACCATCAAGCCCGTGTACOTCAAGCAGGAGTGCAATCGCACCCACAA 

CCGCOTGTGCGAATGCAAGGAAGGGCGCrACCrrGANATANAGTTCTGCTTGAAACAT^ 

GCCCTCCTGGATITGGAGTGGTGCAAGCTGGAACCCCANAGCGAAATACAGTTTGCAAAANATC^ 

CCANATGGGTTCrrCTCAAATGANACGTCATCTAAAGCACCCTGTANAAAACACACAAATTGCAG 

TGTCTITGGTCTCCTGCrAACTCATAAAGGAAATGCAACACACGACAACOTATG^^ 
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NGAATCAACTCAAAAATGTGGAATANATG1TACCCT 

SEQ ID NO; 4280 ACCCCTCAACCCCTTCTCCTTCACCCnrTAGCGGCAAGTCCCGCTTTCCT^ 
GCAAGAACCTCCCAATCGCTTAmCCGCACCCCAACCTCTTATCTCTGTGCCC^ 
CCGTGCCCCAACCCTTTCTCTGCTTTTCTGGAGGGCAAGAACTCCCCACCC(nT 
CTCTrrrCTCTGGGCnTGCCTCCTTCACTATGGGTAGGClTCCACCTO 
TTAGCCIXjTGTTCTCAAAAACTTAAAACCTOTCAACTCACACCTGA 
ATTTTCTTCTGCAATXjCCGCTTGACCCCAATACAAACTCGACAGTAGTTCCAAAT^ 
GGCGCTTTCAATTTTTCCATCCTACAAGATCTAAATAATTCTTGTCGT/^^ 
GAGGTGCCTGACGTCCAGGCATTCTTTTACACATCAAGTCCCTrCXTA 
ACTCGTCCCAAATCTTCCrrCTTTrcCTCCrrCCTGTCCCCTCAGTCCCA^ 
GTCTTTCTAATCirCCTCTTCTACAGACCCATCTGACCTCTNCCCTNCGOT 

SEQ ID NO: 4281 ACTGGTATTACTTGAAAAAATAAAAATTAAAAATATATTGTCAATCATGCATT 
CCACAATTTCAAGAACTAAGATATAACTCATAAATACTGTTATGTCTTTGCAGGTTAT^ 
CATCAGGCCTCAGCTGAAAGCrCATCTGCCAAAACACATCAGCACrrCATGTAAAATAAAAGGAGT 
AGGGAGGTGACTTTCTACACAATACTGTACGCGGGGAGAGGGAGGAAGGACAGCAGACCAGACA 
GTCACAGCAGCCTTGACAAAACGTrCCTGGAACTCAAGCTCTTCTCCACAGAGGAGGACAGAGCA 
GACAGCAGAGACCATGGAGTCTCCCTCGGCCCCTCCCCACAGATGGTGCATCCCCTGGCAGAGGC 
TCCTGCTCACAGCCTCAOTCTAACCTTCTGGAACCCGCCCACCACTGCCAAGCTCACT^ 
CCACGCCGTTCAATGTCGCAGAGGGGAAGGAGGTGCrrCTACnTGTCCACAATCTGCCCCAGCATC 
TTTTTGGCTACAGCTGGTACACTCACGTGCCTTACAGGATATTAAACCAAAAAG^^ 
AACATGCCAAATGTTTTCACITrGAATCGTAAACACAGCrCCTATAm 
ATGTTTCnsrA 

SEQ ID NO: 4282 TTATCATCCTAATGTAGACAAGTTGGGAAGAATATGTTTAGATATTTTGA/^ 
ATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATCGATCCAGGCCTTGTTAAGTGCTC 
CCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTGGAAGACCAACGAAGCCCAAGC 
CATANAAACAGCTAGAGCATGGACTAGGCTATATGCCATGAATAATATTTAAAATTGATACGA 
ATCAAGTGTGCATCACTTCTCCTGTTCTGCCAAGACTTCXrmC^^ 
GTCTTANAAACATTACAGNANTAAAAAGCCCAAGACATmrrCAGNCCCnTr^ 
ACAT^^AGCAAA^^'CTAATGT^m'GCCTGATTCCTGCCr^AAAAGNATGAG^^ 

SEQ ID NO: 4283 ACCGGGATGTrCCAACACTACATCCGACATCAGAAGAGCTAACAATTGCTGG 
AATGACCTTTACAACITITGATCrTGGTGGGCACGAGCAAGCACG^ 
CCCAOCAATTAATGGGATTGTCmCrGGTGGACTGTGCAGATCATrCrCGCCrCGTC 
AGTTGAGCrTAATGCTTTAATGACTGATGAAACAATATCCAATGTGCCAATCCTTAT 
CAAAATTGACAGAACAGATGCAATCAGTGAAGAAAAACTCCGTGAGATATTTGGGCTTTATGGAC 
AGACCACAGGAAAGGGGAATGTGACCCTGAAGGAGCTGAATGCTCGCCCATGGAAGTGTTCATGT 
GCAGTGTGCTCAANAGGCAAGGTTACGGCGAAGGGTTTCCGCTGGCTCTTCCAATATATTTGACT 
ATGTTTGGACGGNGGAAAATA^LAAAAAGTlTrACTTNTCT 

SEQ ID NO: 4284 ACril-l-in-l-ri-i-rrL-l-l'ri-i rrri lTGCAGAAAGAGTAGCCAGGTGTTAGCCAC 
TTTAATAGAAAATATGATCAAAACTCGATTACAAGAGTTCAAAAAGACATAGAAAACCAGTGAGT 
TTCAATTTTATTACAAGTTTrCAAATCTGGGACTAGTTTCTT^^ 
TCAGCCCAGGGTTTTTTCACAACCAAACTAAAAATGACTTACTACATGGGA^ 
AGTAGAAmGTAAACTCAAGCCACAAACTTAGTTAATAATCATGGTTAAGGOACATTGCCA^ 

agcaattgatgcctcagtgaagtttgaaagaaactctgctttctgtgacggcagag;^ 

gcaagcaattctgcttcaaagaaatttgcatagaaatggaaaaatgccagagccttt^ 

tgaaattgcaaagcctcaacacgttcaactcaatcccagacaccaaatgtttaatgggagcc 

gtaggactgagcattgaacnccagctntgcaactcgcagggcacaatrrcaagtgtggaaacca 

tctgtnggcaagctctttaaaaaattgaatttttanacaccgtaaam 

gcattantg 

seq id no: 4285 acacttggatgagaaganaagctttacaaccagaagtatccaggactactg 
aaaataatacggcatcctgaaaggaaaggtcttgctcaagcccgcaacactggctgggaagctgc 
cacagcagacgtggtcgccatcttggatgctcacattgaagtcaatgttgggtgggcaga gcca a 
tcrrggctcggattcaggaggaccgcactgtgattgtgtctcctgtgritgacaacattcgt^ 
caccttcaaactggataagtatgaactggcagttgatgggtttaactgggaactctggtgc^ 
cgatgcactgccacaagcctggattgatctgcatgatgtcactgccccagtgaagagtcct^ 
catgggcatcctggctgctaacaggcacttctgggagagatcgggtctctggatggtggaatgct 
catctatggaggagagaacgtggagcttaacctgagggtgtggcantgtggagggaaggtcgag 



667 



wo 02/29086 



PCT/USOl/30732 



ATTTTGCCCTGTTCCCGGATTGCCCACCTAAANAAACACCACAAGCrCTACCCOT 

GCITGCCTTGAAANCGCAATGCn'CTGCGANTGGCCCNAAATTTGGATO 

CNTGGTCTT 

SEQ ID NO: 4286 ACTGCAGAAGAAAAAGCAATCGITCAGCAGTGGTTAGAATACAGGGTCACrC 
AAGTAGATGGACACTCCAGTAAAAATGACATCCACACACTGTTGAAGGATCTTAAT^^ 
AAGATAAAGTCTACCTTACAGGGTATAACTTTACATTAGCAGATATACTATTGTAOT 

'riiTii"i"iTfTn"mTT 

SEQ ID NO: 4287 ACGAGCCGGTAGAGGAATCCTGmGATCTGGAAATTTTCCGTGGAGAGCCC 
AAAAGGTCGGAGAACCAAGTTCCCAAGATCTTTTAATTTACCTAACATCTCr 
TTACGTTCTTCAATTTGCTTAGGTAATCTCATACAAGCTTCTCTTG 

TTCTAATATAGATTTATAGTCTTCCAGGGCTTCATCTAGCTTGTCCGT(nTCTCATACAAC^ 

CTCCTCAATATTGCCCTGATATAGCTGGGGTTTAArrGAATTGCTTTGCTGCAGTCATTGAT GGCC A 

TTTCmCTTGTCCTGTTTCATCCITGCTGCAGCrCTAm 

GAAGCAGGATGGGCACATTTCGAGGGCrCGACTATAAGAACTITCAGCITCTATAT 
CTTAAACrGTTCATTTNCCTCCTCCrrTAAGTCTAATGCT 

CGACATGTTTTTTCCAGTTCTATTAGGTATTCTTCATCTAGTTCAGAGGAATTCACATC^ 
TTGTTCCTAAACTTTGTCCGCTCCTGGCTCCTCCTCAAAATGAGGCACT^ 

SEQ ED NO: 4288 ACACTGAAACATAAATCCGCAAGTCACCACACATACAACACGCGGCAGGAA 
AAAACAAAAACAGCAAGrrrACATGATCCCTGTAACAGCCATGGTCTCAAACTCAGATGCTTCCT^ 
CATCTGCCAAGTGTGTTCTGGATACAGAGCACATCGTGGCITCTGGGGTCACACTCAGCTTAGGCT 
GTGGGTCCACAGAGCACTCATCTGGCTGGGCTATGGTGGTGGTGGCTCTACTCAAGAAGCAAAGC 
AGTTACCAGCACATTCAAACAGTGTATTGAACATCTTTTAAATATCAAAGTGAGAAACAAGAAG 
CAACATAATAATGTTATCAGAAAGATGTTAGGAAGTAAGGACAGCTATGTAAAGCTTGAGGCTGA 
AAAGTAAGCTTGCCAGCITCATTrcrrTGGTTTCrrGGGTAGTGGGCGCCGGAACAGCAAGATGTG 
AGGTTCTGGTTCATGGATCATATAATGGACCCATCCCTGACTCTGCTGAACGCCAAGATTCCTCCA 
TTCANATTCAAACATCANATGGGTTTTAGGGACCACTTGGCTATGTCCTTGGGCACATGACATGTC 
GATCTCAAACTCCTCGTCGTCGTATTTGTrCGAATAGTAAATTTGGTTGTGCCACATG 
NTTGGTTAG 

SEQ ID NO: 4289 ACAAATGTTTTTTATTCAAAAATACAAAATAAATTATCTGTAGGCATG GACA 
ATGACAGCAGTAAACCATTATATATTTTGTCAACTGAAACCAGTAACTGATGGTTATAGTGATTTT 
CAGCCAGCCTTTTTCTTCATTTTCTCCAACTGAOTCTCTGAAG^ 

GGGCTTCCTGTCACAGTTCATTAATAAAGGTAAAGCACTAGTCTAGGAGTTAGAACATGCCACCTC 

CCATACCACCnx:CCATTCCACCCATTGCACCX:ATTCCAGGGTCCTTCTCTTCm 

GACTACAACTTCTGCTTGTAGTTAACAGAGAGGCCACACGAGCAGCATCCAATAAAGCAGTTCTC 

ACAACCTTTGTTGGGTCAATGArrCCTTTTTCCACCATATTCATAAAATCT^ 

ACCAACTTCTGAGGAACTTTGCATAATTTCTCAACTATCAAAQATCCTT CAC^^ 

AATGGTCATTGCTGGAArrTTGAGTGTTCTTTTAATAATTTCTAT^^ 

CTGGGAGTCAATGCCCGCGTCCTTG 

SEQ ID NO; 4290 ACATAAGTGGCTATCAGAGAAGCCAGCCGATATGG ATTGGCCTGCACGACCC 
ACAGAAGAGGCAGCAGTGGCAGTGGArrGATGGGGCCATGTATCTGTACri'rri"J-rrrrJ"l"l"l'l"rn' 
TTTTTTITAACTGAAAACTTmATTGCKr^^ 

TGATCCCATCATACTTCTGCTGGAACXAGCNCATGGCCTCCTCTTTGCTGAT^ 
AATGCAGa:TGTCCTGCGCTT^m'GTm'GCGATGCTGAAACCTGGCCTACCCA 
GTCCAGGCCGTATATACCAATGCTTGGGTCATATTTGATACCCAAATCGAATGTGTTCCTGGATCC 
CAAACCAAAGGTTTCANTTT 

SEQ ID NO: 429 1 ACTGTTCAAAAAGGAATGCCCCACAAGTGTTACCATGGCAAAACTGGAAGAG 
TCTACAATGTTACCCAGCATGCTGrrcGCATTGTTGTAAACAAACAAGTTAAGGGCAAGATTCTTG 
CCAAGAGAATTAATGTGCGTATTGAGCACATTAAGCACTCTAAGAGCCGAG ATAGCTTCCTGAAA 
CQTGTGAAGGAAAATGATCAOAAAAAGAAAGAAGCCAAAGAGAAAGGTAC TTTTT l'ri^ 
TTTGGAATACCTGGTTTATTGGGAAAACTrCATAATGAAAACTACAATTAGCIT^ 
ACAAAATAATAATCTGATAmAAAATGAATTGGTTTTCATTATGTAAGTCGAAATGGTAAAAA^ 
CATAATGACCTATCCGATGCAtCATATATATGCTATTCAGAGAAACTCAAATCC CCGA ATTCTCCT 
GTGGCATGTTTTATATCAAACATTTAAAATCTGTTTACCAAAGAAAGACCAGGATT^ 
GTAGGTTTCTGCTTACAGTTGCAAACTATCAAAAGCCTGTCTATATGATAGA^ 
TGAAATTTAAAAAAGCAAGTCATITArrCTCCTGAGGCTGTTAAGTGGCACTm 
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GACCT 

SEQ ID NO; 4292 ACmATGTGTCAmCCATGACTACTAACCACATATATAGAGACTAGGGGAT 
ACTTAGTGAAGACAGCmCCAGGAAGTTTACAAAATTGAAGAAAAAAAAGTATGTCCAAATATG 
GCATAAGTAGCAATGGGGCCCTGAATTTACTTTGGAAAGATTGCCTAAGGCACCACTATAATGAA 
CAGTTGTAAATCTCTGATCAGACTrCAAATTTTTCTGGTCTTAAAGTO 

TAGGTGACTAGATTAGCTGGTATATTATTTCCTCTGTGCCAGCATCrGTAAGAAATATTACATCTAT 
CCTGGGCACCGTCACTCCGATATATATTTGTAATAGATCTGArrATATGAGGTGTGAAAGTCAATA 
TGGGTAATTTTTCTTTTAAATGCATTTTAAGAACAAA 

GGAAAAGGACTATTTTAATGTAAAACAATArrAAAATCANCGTATGATGAGGTTGAGTTGCT^ 
ACTGGGAAAACTGATAATTTAAGAAATTGTAAATGCrmATrCAAAACTGTAA/^^ 
ATTCCCTAGTTNCTGGAATCACAGTTCTTATGAATAATANCCTTCTACTGATGAAC 
AATTAA 

SEQ ED NO: 4293 ACGCGGGGTTTTTATATACCCTGTAAGCTAAGAATGGTTTCCGCATT^ 
TGGTTGGGAAAAGAAATCAAAGACTAATAATTCATGACGTGAAAATTATCAGAATTCACAAATAA 
AGCmATTGGAACTAGCTATACTCATCTGTTTATATATTATCTGTGGCTGCm 
TGCAATAGAGATGGTAAAGCCTACAAAGCCTAATTATTTACTGTCTGGTTITTGTCAGAAAAAAGT 
TTGTCAATCCTTGTTrrAGAAGATGGAAAAATGTGAAGATCTTTGGAGATTCTCITGAGTGGTATA 
TCTAATTGAAATGGGATOTCGTTTTGGCITGTATGTTGATGAAATCAAOT 
AAAAAAATTAAAGAC<>rrGAAAATTGTTTTTGAAAANAAAAATA 

SEQ ID NO : 4294 ACTGACGAAGTCCAACACAAAGGTATAATACAGCCTGTTGTCTCAAGCCAAG 
GAGTCATAAAACCATGAGAAATAAATAGGMTCAATAGTTAGTAGTGACATTGOTGCTCTCTAGA 
AATCrCAGCATGAGCTGCTATAGAATACCCTCCCAGCAACAAAACCTAATCAGTAAGGCCAGCTA 
GACa:AATGGCTCATGCCr^GT^^TCCCAACACT^'GGGAGGCCAAGGTGGGAGGATGGC^ 
CCAGAAGTTCGAGACCAGCCTGGACAACATAGTGAGATCCTATCTCTATAAAAAATCAAAAATTA 
GCCAGGCATGGTGGTGCATACCTATAGTCCTGGCTATTTGGGAGGCTGAGGCAGGAGGArrGCTTT 
AGTCCTGGAGGTCGAGGCTGCAm'AAAGCCATGATTGCNCCACTTGACTCACCCGGGTGACAAAG 
CAAGCCCCTGTCTCANAAAAAAAAGAAAATTCAAGGCCAGTTAAGACAAATGCTATGACm 
ATTTACAGAAAGAAATTACAAGTTTA 

SEQ ID NO: 4295 ACGCGGGGGGTGAATGCTGCTGCCCCTGCTGGCAGCCACCTTGAGACCTCAC 
CGGGCCTGTGATATTTGCTCTCCTGAACTCTCACrCAATCCrcnTCCTCTCCTCT^ 
TTATTGTCCCCTAATGATAGGATATTCCCTGCTGCCTACCTGGAGATTCAGTAGGATCrm 
GAGGTGGGTAGAGAGAGCAAGGAGGGCAGGACACTTAGCAGGCACTGAGCAAGCAGGCCCCCAC 
CTGCCCTTAGTGATGTTTGGAGTCGTTTTACCCTCrTCTArrGAATTGCCTTGGGATT^ 
TTCCTGCCACCCTGTCCCCTAAAmGTGCTTCTGAATTGAGGAGCCTTCACCT^ 
AATGGTANAATGCTGCCTATCACCTTCAGCACAATCCCAGTGAAAAAGGTGTGAAGCACCCACCA 
TGTTCTTGAACAATCANGTTTCTAAATNAACAACTGGACCANCAAAAAAAAAA^ 
AAAAGTCCTNGGNCGCGACCACCCTTAGGGCG 

SEQ ID NO: 4296 ACGCGGGAAGGACTTAAAGGAGAAGAAGGAAGTTGTGGAAGAGGCAGAAA 
ATGGAAGAGACGCCCCTGCTAACGGGAATGCTAATGAGGAAAATGGGGAGCAGGAGGCTGACAA 
TGAGGTAGACNAANAAGANGAAGAAGGTGGGGAGGAAGAGGAGGAGGAATAAGAAGGTGATGG 
TGAGGAAGATGATGGAGATGAAGATGAGGAATCTGATTCANCTACGGGCAAGCGGGCNATCTTG 
ANGATGATNAGGATTACNATGTCGATACCCAANAAGCTTANTACCCGAC 

SEQ ID NO: 4297 ACTl"lUTi"i"llU"ll-iU"ll"ri"IG'niTrrilANAACAANACAGGTNJ"lU-l'l'h™^ 
AAACTACCTGATACAAAAAATANTGTGGTrrGGATGGATAGTNTGAANGACAAATAATAC 
TATTITATTGAAATAAACAAAAATGCm'ACACANCrCAATGGGTCACCTGNAACA^ 
AOTTOm-ACTGAGCANAAACAAAACTGAAGCACAAACACCTTTCTT^^ 
NATAGGTAAGAAAACACTACTTGG^^ITITITAAAAAA^^^GACC^ 
AATOTAAANAATOT 

SEQ ID NO: 4298 ACTTTrrrTTTTITrmTn^ 

ATTGAAGAGTCTAAAAACTAGAGGTCTGTAACAGAATACTAATCCTOCAGAACAAGGGACAGAA 

AAGTCAGGCGAGGCAAGCTTGTCTTCGTAACCTGGGCTCTCAATl^GGGGTGCTCTCTTACATTTCA 

ATTGAATATCTGTTTTTAAAOTCCCTTCAACAGTCAGGGTGTCTGTCTATT^ 

TTCTAACTGCAAAATAAATGACCATClTATCATGTTrrCCX:ATGGCTOT 

TTTATAGATTACTACATGGNCAAAAATCTACCCATCGGATGNATrCTrTTCTTCrrC^ 

GCArrcAAAAGCAACCACTGOATTTTTTTCAATGTAAAAAANAAAAATAGATACCCATGCC 
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TTCATGGGTGTAAACAGCCAACAAANCCAAACTTACAAAGAAAATGAAAAAAOT 
AGTGGTTTTATTGGNAAAAAATTAAAANATATTGGGCTNTm^ 

SEQ ID NO : 4299 ACTTGGATGAGTCCAATGCCCTGGGGAAGAAGTTCATCATTCAAGACATTGA 
TGACACTCACGTCTrrGTAATAGCAGAATTGGTTAATGTCCTCCAGGAGCGAGTGGGTGAATTAAT 
GGACCAAAATGCTTTTTCCCTTACCCAGAAATGAAAATACTCAATAT^^ 
AGCAGCAACTGTGAAAGACTTGCCACTCAATATCITAGCT 
TANGAGCATGCCACGGGAAAACTGAGGGATCnTGATCnTTGT 

SEQ ID NO: 4300 CrmCGAGCCGCCGCCCCGGGCCAGGGTCCCCAACO^TNGCTTTCATT^ 
GGGAACANGGAACAAGGGAGAACCNGTTGCCCCTCCCAGGAATGAACTTAGGGGCAGAACCCTG 
CCCAGCCAATAAAGAATGGCAGGGGCCAAACTCATACTTAATTGTTGGTAGGGGATCAAAGGGGT 
TATAAAAGTCTTGTGACAATCTGATGGGCCATACCANGAGCAANGCTACCAAGGGCGGGCAAGAC 
CrrGCCACGATGAAAATTATGCCTCCACX:CATGGCTATACGGGGCCTTCTTCACT^ 
CCACAAGCGCGTOCACTTNATGCCCATCGTGGCCACAAACATGGCCAGGAAGCCCANCACCANGG 
GAGACCACCATTAGGGCTCGAGTGGCCTGCAAGGGCCGCGGGACANGGGCGAANCACCGAGTCG 
TACTTGCTTTTACTTGTATCCCATACGTTTTANTATATTACATTTCCATTATC 
AATGTTCAANTmCTTCTTAATTTTACCATTTGGCCCACTGATCATTCAGG^ 
CATGGTGTTTGGTATAAGTTITCAAAAATTCCTCTTCTTATTGATT^ 
GGGGCAAAANAAAAATACT 

SEQ ID NO: 4301 ACGCCTCCACCGCCTCCTGCTCTGACGTGTCAGCCTGCTGCGAAGTGGAGTCC 
CGAGGTCATGATGAGTGACTATGAGAGCGGGGACGACGGCCACTTCGAAGAGGTGACGATCCCGC 
CCCTGGATTCCCAGCAGCACACGGAAGTCTGACTCTCAACTCCCCCCAAAGTGC CTGACTTTA GTG 
AACCTAGAGGTGATGTGAGTAATCCCGCGCTGTTCTTTGCAGCAGTGCTTCCAAGCTTT^^ 
GAGCCGAATGGGCV^TGGCTGCGCTGGATCCTGCGCCTCTOGACGTGCTj^ 
ACTACTGTCATCGTGAGGTrnCATCGGCrGTGCCATTrCCAACGTCTTTTGGGATTT 

tgtgttaaaataatcaaacgaaaaatcantcctgtgttggcagcatgattcatgtatttatat 

ATTTGATTATTTTAATTTTCCTGGCTCTTTTm 

SEQ ID NO : 4302 ACTATGTCATAGGAAtnTAAAGTTAGATGTATCTAAATCCCTTCACTTTrCTA 
AGACrrCTCTGTGCTCCCACCCCTGAAGTCTTCAAATCATTITr^^ 

AAAGATGACTATAAACAAGATGCAGCCCTCGGTTTCCATGAACAGCACACTATTACAGTAAACCA 

AGTTTATATTCCACCATCAAGTGTGGCTCTCCCATGACTTCGCTTTGTGATGGATCATTAAGAAT^^ 

CCTCAAATCCAATAGTCTCATCATTACCCCTCAAAACATCCAGTGAAAGArrTGAGCTTOAAAGAA 

ATGGAAGACGCTGAACCTGCTGCACTGCCTTGAATTCCATCIXjTAATTTTAGC GGAGC 

CCTGAATGrrrCTCANTGTGGAAAAATTCATTTTATOTGGNTGANCTGGA^ 

TTCAAGGGGATGACTAGGCAAAAGTTCATTmCACACAAGAAAAACCTTrCCGAAG 

GACTTTCAAAAGTCCACTTGCTGAAAGTTCAGTAACTGGAATCTGTCCTTTANCT^ 

TCTCTGOCATTCATCTrCCGANCTNTGCGAACAGCCTTTTTTCCCCGNGTACC^ 

SEQ ID NO: 4303 ACAGGTTTCACTATTACAAATGTATGATGTTAAACTAACAAACrCATGACC^ 
CAAAGATGTCTTCGTCCCACGCACACACATTTGTAATTTGTGTCCATTTGCTAmCCOT 
TAATCTTCAAATTATATAGTTATGCATTGAGTTCCCTATGCATCTCACCCATCTCCTTTAT^ 
CrrCTCATACnTGCCATTCTCrrCTTTCTGGAAATAACCAGCACAACAAT^^ 
TCACCACAACCACAATAACAGCAATAACACCAGCTTTAAACCCTGCATTGANAATT^ 
TTCATCAACATAATAAATTAAAGTTGACCAGGATCCAGATCCAGTTGTCCCCATTACTGTCAGGTC 
CATTTCTTAGAATCCCGCGTCCTGCCCGGGCCGGCGTCGAAAGGGCG 

SEQ ED NO: 4304 ACACTGCCTTATATTAGTCCATrrGTCCCATGTTTTCATCACTGAATAAACT^^ 
TTAAATGACrmGGTCTGGATCrCACACCTATATTACTTCAm 

GATAACATCATTTTTATATCCTAGGGCATGTAGTTCCGAGCCCCACAGAAAGTAATCACCATTCAA 

GTAAGCCAATAGTTCATTCCTATCTGTATAGAACTGANGCTTGTAAATCTACACATAGATCT^ 

TGTAGTTCAATAATGATAATAAATGTTTGTGCCCCCAGTTGTTATCTCTAAGGATAAGAGTAAT^^ 

ATGATCATTCAGCAGATATAGCITATATATGGOTGGCAGTTTACAAATTATAAATTGATTTACCAC 

ATTTTCTTCTGCTCCCAAGACACAATAAAGATGGTTTATCTTTGCA^^ 

GATTGGCACACAATAGATACTACTATAGNGGCATTCTTTAAACACCAATCCTTGCGTATGC^^ 

CAAAANATAATCATTGGTATTCAAArmCCTACTTTTATrTGCACAATAT^ 

TTTTTATAACCTANTACCCCGGGCATTTT 

SEQ ID NO: 4305 ACTTTTCrmGGTTTTGTTTTGTTTTGT^ 

AAGAGATTmATTACAAAGAAAAAAATrcCAGTGAATTGTGCAGAAATGCTGGTT^ 
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CCTAAAGAAAACTTTACAANGGGTGTTTNGGAGTAGAAAAAAGGTTATAAAGT^ 

ATGGTAAAATAACCATTGAGTGTCAAAGTCTAAAANCAGACCrcATTTTGTGCAT 

AAAGACTACTGATAGGGTlU'nUTl-lTlTCTCCITmAAATNAAAAAANC^ 

TACTTTTATTGAAGTAAATCTGAATGACCTACTCCTTTGGAGTAAAAC^^ 

TGTNTTTACCTCTGGTTGGAATTTGAAAAAAAAGGAAAAAANGAANCGAAACCCTA 

GGGAAAGTCCCTGCTNTCNGGTGAITCCC 

SEQ m NO: 4306 ACGCGGGGGACCTTCAGCAGGGCTGTGGCTACCATG'ITCTCTCGCGC GGGT G 
TCGCTGGGCTGTCGGCCTGGACCTTGCAGCCGCAATGGATTCAAGTTCGAAATATGGCAACm 
AAGATATCACCAGGAGACTAAAGTCCATCAAAAACATCCAGAAAATTACCAAGTCTATGAAAATG 
GTAGCGGCAGCAAAATATGCCCGAGCTGAGAGAGAGCTGAAACCAGCTCGAATATATGGATTGG 
GATCTTTAGCTCTGATGAAAAAGCTGATATCAAGGGGCCTGAAGACAANAAGAAACACCTOT 
TGGTNGTGTCTCANAACAAAGGACTG 

SEQ ID NO: 4307 ACTGAAATAGATGTATAGACCAATGGAACAGAACAGAGGCCTCAGAAATAA 
CACCATACATCTGCAACCATCTGATATTTGACAAACCTGACAAATACAAGGAATTGGGAAAGGAT 
TCCCTATTTAATAAGTGGTGCTGGGAAAACTGGCTAGCCATATGTAGAAAGCTGAAACT^ 
CTTCCTTACAACTTATACAAAAATTAATTCAAGATGAATTAATGACTTAAATGTTAGAC^ 
TTAAAAACCCCAGAAGAAAACCTAGGCAATATCATTCAGGACATAGGCATGGACAAGGGCTTCAT 
GACTAACCACCAAAAGCAATGGCAACAAAAGCCA 

SEQ ID NO: 4308 acttgctggtctcaaatttccacaaggagatatcaatggtgataccacgttca 

CGCTCAGCnTrCAGTTTATCCAAGACCCAGGCATACTTGAAGGAGCCCTTTCCCATCT 
TCCTTCTCAAATTTTTCAATGGTTCITTTOTCGATGCCACCGCA^ 

TTGGTGGACTTGCCCGAATCTACGTGTCCAATGACGACAATGTTGATATGANTCTTTTNC^ 
TTTTGGCTTTTA 

SEQ ID NO: 4309 ACCACATTATAGTAAAAGTATTAGAAAAGTGACCCTCAAGGTGTATCAATTA 
TAAAGCAGATGAAAACTTGAATGACAAATATCI^AGTAAAATTCTCTAGTAAAAAAGTCGAT^^ 
CCCATGCTACAAAATAAAAGTGAGAAAGCCATATAAATAAAGCAGAATAATGrrCTAGGCTTT^ 
GGTAATGAAGTrAGTCAAGTTGAAAAAATAAAAATAAAAAAAGAAAATCCTNAATTGGAATTA 

SEQ ID NO: 43 10 ACCTGTGAACCAAGTGnTGGGCAGGATGAGATGATCGACGTCATCGGGGTG 
ACCAAGGGCAAAGGCTACAAAGGGGTCACCAGTCGTTGGCACACCAAGAAGCTGCCTCGCAACA 
CCCACCGAGGCCTGCGCAAGGTGGCCTGTATTGGGGCATGGCATCCTGCTCGTGTAGCCTrCTCTG 
TGGCACGCGCTGGGCAGAAAGGCTACCATCACCGCACTGAGATCAACAAGAAGATTTATAAGATT 
TGCCAGGGCTACCTTATCAAGGACNGGAAAGCTGATCAAGAACAAATGCCTCCACTGACTATGAC 
CTATCTGACAAGAGCATNAACCCTNTGGGTGGCTTTTGTCCNTATGGTGAAGGACNAATGACTTT 

SEQ ID NO: 43 1 1 ACTCGAAGATCTAGATTTCATTCTCCATCTACAACTTGGTCACCCAACAAAGA 
CACTCCACAAGAAAAGAAGCGGCCCCAGTCTCCATCTCCCAGAAGAGAAACTGGGAAAGAAAGC 
AGGAAGTCTCAATCACCATCTCCTAAGAATGAGTCAGCCAGAGGCCGGAAAAAATCCCGTTCTCA 
GTCCCCAAAAAAGOATATTGCAAGAGAAAGGAGGCAATCTCAAGTCTCGGTCTCCAAA^ 
ACTACTAGGGAAAGCAGAAGATCTGAATCACrGTCCCCAAGAAGAGAAACTTCTAGAGAGAACA 
AAAGATCTCACCAAGAGTGAAAGATTCTTCCCCAAGGAQAAAAATCCANQTNCCAGANCAGAGA 
ACGAAGAAAGTGATNGATATGGGCAGAAGGAGAAAAGAGAAAGGAGAACCAGAAAAGTGGTCr 
ANGTCCAAGATCTTATTTCTAAGGTC<XCrmAGATGTANAACNAAAAAGTAANGAGT^^ 
TGGTA 

SEQ ED NO: 43 1 2 ACTAAAATTGTGTTGGGAGCAGGGATTrGGAAATTTCTGAGAGATGTGTAGT 
TAATTTAGTAATTCTGTTTCATGAGATATGATCTGTTATGCTAGTGGmAATAGGCTTGCTATGTA 
AGTAGAACGTGGCrCAACTAGATATtnrrATATGTATGGGCATTACrCTTAGTGATATT^ 
GTCCTTTGTTGCTCATGCTGmAAGTGCAGGCTGAGACCCACCTCTTTGTAA^ 
AGGCAATCCGGCCCTTTCCGGTGGAGGACCX:TTCCCCCrGGGTTTGATC^ 
GGTGTTrrGACrCGACTCrrrCCCCTrmCCTCCGCAGAAOT 

TTACTGGAACCCTGCATTCrAAACTTGCTGGAAACCCACX:CTCTGGAATTGGCCTGC^ 
CTGA 

SEQ ID NO: 43 1 3 ACTTAGCATCGGAACTGCAATGTTAGGAGCAGGGTTGTGTGTTGGA'nTGAC 
ATAGATGAAGACGCATTGGAAATATITAATAGGAATGCAGAAGAGTTTGAGrrAACAAATATTGA 
CATGGTTCAATGTGATGTGTGOTATTATCTAACAGAATGTCCAAGTCATTCGATA^^ 
GAATCCTCCCTrTTGGGACCAAAAATAATAAAGGGACAGATATGGCrmCTAAAGACTGC^ 
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AATGGCAAGAACAGCAGTATATTCCTTACACAAATCCTCAACTAGAGAACATGTTCAAAAGAAAG 
CTGCAGAATGGAAAATCAAGATAGATATTATAGCA 

SEQ ID NO: 43 14 ACCAGTTITCTGAGGACCATGATGTGOAATATTTTCTTCGGCTGGCTCATGAG 

ctgggactgctggttatcctgaggcccgggccctacatctgtgcaga<jtgggaaatgggaggatr 

acctgcrrggcrgctagagaaagagtctattctrctccgctcctccgacccaoat^^ 

tgtggacaagtggttgggagtccttcn'gcccaagatgaagcctcrcctctatcagaatggagggcc 

agttataacagtgcaggttgaaaatgaatatggcagctactttgcctgtgatmgact^^ 

cttctgcagaagcgctttcgccaccatctgggggatgatgtggttcttgrrtac cact 

cataaaacattcctgaaatgtggggcx:ctgcagggcctctacaccacggtggactttggaacag 

cagcaacatcacanatgctttcctaanccananoaaagtgtgancccaaaggacccit 

tctgaattctatactggctggctaaatcactggggccacctcactncacaatcaanaccgaa^ 

gtggcrrcctcctotatgatatacttgccx:cgtggggcgagtgtgaacttnt 

seq id no: 43 15 acaagtctggctagggcraaaatgtgaagaatgagaagatgttacctgggaa 
aagggaggcagggggtagaagtggcctggaagagcaggtgcaaaggtcccaaggagggagggg 
tggcatgcctgggcacagagggctgtixsataagtggtctgagatgragctggaggtgtaggccag 
gggacaagtggtactgcatgttctgttgtggtgagggaaagaaacatgot 
tcaacagaatgtgtgtctgtatctgtgtattgcgcatgtattcatatatmaaagtm 

NGTTmGCrGACANTGTTGGGAACCTCACATGCTTCraAAGCAT^ 

SEQ ID NO: 43 16 ACTTCAAGAACCrGCACAGATTTACTCATATT(XTTCAGGAAAGTGm 

CGCTCAGAGGTCCTGCATCAAGTATTCATCTCCAATTGTGACTCCAGTAAAACGACTCAAAAATGG 
GAAATGAATNACATCCATAGTGTrTAGAGAGAAAAAAATAAACCAATAACCTACCTACTGACAAG 
T 

SEQ ID NO: 4317 ACTr r iUlUU l HUll l ' li ril'lU'ri^lllll'l ACAGGAAATCTGCCTATTITAm 
CArrAAAAAAATATTATNTATATAACATTTTGTTCCACTATCTTCTCCTTGATC^ 
CTTTAATITrACTCAAGTAAAANCANAATCACATAACGGACATCAAAACTAAATAG^^ 
TAGTTTAAAT 

SEQ ID NO: 43 1 8 GTACGCGGGGACrrCTGAGAAANTGAAACGACAGGGGAAAGGAGGTCTCAC 
TGAGCACCGTCCCAGCATNCGGACACCACAGCGGCCCrrCGCTCCACNCANAAAACCA^ 
NAAACCTTCACTGAACACTTCCTTCCCCAAAGCCANAAGATGCACNAGGAGGAACATGAGGTGG^ 
TGTGCTTGGGGG 

SEQ ID NO : 43 1 9 ACGCGGGGGGCCTTGCAAATTCTCTTGCAATTGAAGGCAGGAAAAGC AACAT 
TCATTX3TAACACCATTGCTCTTAATGCGGGATCACGGATGACTCAGACAGTTATGCCTGAAGATCT 
TGTGGAAGCCCTGAAGCCAGAGTATGTGGCACCTCTTGTCCTTTGGCTTTGTCACGAG 
GGAGAATGGTGGOTGrrTGAGGTTGGAGCAGGATGGATTGGAAAATTACGCTGGGAGCC^ 
TTGGAGCTATTGTAAGACAAAAAAAATCACCCAATGACTCCTGAGGCAGTCAANGCTNACTGGAA 
GAAAANCTGIGANTTTTGAGAATGCCAGCTA 

SEQ ID NO: 4320 ACTGCATTCTTGAAGGAAAAAAACTGCAGCCAAGGCAAGAACTCTGAAGTT^ 
GCACTCAGAGTTrAAAAGACAGACCCCTACTCTGCAAACTGAAGACTGCCACTCTC 
CTCCAGCCTGCCACATTTTACITCAArrGGTAAAGCACrCTGCTGAGAATATAA^ 
AGGCAATCATAAAAGATTAAATATTGGTAGGCATGTAAGCirCCrmA ATAA AAA^ 
TCAATGTCATCTATTTAAATAAGTATCCTTTGTTATAGAGTGTrCCAGTTTm 
TAAAAAACCCAACrGTCTACATOTrAAANCCTACCCCTI^ 
NCACG 

SEQ ID NO: 432 1 ACATTrCAAGTGAAATAAGTAATTCrAGATAGGACAATTT AAATT GGATAAT 
mAAAGTGTCTATAATTGCAGTGGmAmGCAAAATrCCTAAAAGGAAAAATrTTATCACT^ 
C^TCACAGCAGGTTrCCTCATCCAGATGAGGAAACTAGACAAATGCTAGTGTGTTTTj^ 
AACAAAACTAANTTAAATGAACATTTAAAAGTTTCCCTANCGGGCCATTCCTTATCA/^ 
AATCCCTGTNGCTACATTGACTAAAAGGTCATTATGAATGGAATATGTAAGACTO 
ACCTAATCAGATGGTTAGAGGTG 

SEQ ID NO: 4322 ACCTCCCATAGTTTGTCCCCTGTCTCCACTTCAGCAATGTCAA^ 

rrCCGATGCTGACGCCAGCATTTTCCCATCATGGCTGAAACTGAGGGTTCTTACAGGCCAATCCAQ 
CCTGGAAAAGCACCGAACACACACTAACTCATCCACATCCCAGAGGCTGACCAAAGCATCTGCAC 
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